Nvidia unveils new GPU designed for long-context inference
At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens. Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated … Read more
https://www.profitablecpmrate.com/nsirjwzb79?key=c706907e420c1171a8852e02ab2e6ea4
Skip to content