Nvidia unveils new GPU designed for long-context inference

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU calledthe Rubin CPX, designed for context windows larger than 1 million tokens.

Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part ofa broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.

Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales inits most recent quarter.

The Rubin CPX is slated to be available at the end of 2026.

Source: Techcrunch

Scroll to Top