Nvidia announced that its latest AI server can improve the performance of mixture-of-experts AI models by up to ten times, including high-profile models from China such as Moonshot AI’s Kimi K2 Thinking and DeepSeek’s open-source models. Mixture-of-experts models divide tasks among specialized components within a model, an approach that improves efficiency and gained popularity in 2025.
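The routing idea behind mixture-of-experts can be illustrated with a toy sketch. This is a minimal, hypothetical example of top-k expert routing in NumPy, not the actual architecture of Kimi K2 Thinking or DeepSeek’s models; all names and sizes are illustrative:

```python
import numpy as np

def moe_forward(x, gate_w, expert_ws, k=2):
    """Route one token through the top-k experts of a toy MoE layer.

    x: (d,) token embedding; gate_w: (d, n_experts) router weights;
    expert_ws: list of (d, d) per-expert weight matrices.
    All names here are illustrative, not drawn from any real model.
    """
    logits = x @ gate_w                        # one router score per expert
    topk = np.argsort(logits)[-k:]             # indices of the k best experts
    weights = np.exp(logits[topk])
    weights /= weights.sum()                   # softmax over chosen experts only
    # Only the selected experts run -- the source of MoE's efficiency:
    # most of the model's parameters sit idle for any given token.
    return sum(w * (x @ expert_ws[i]) for w, i in zip(weights, topk))

rng = np.random.default_rng(0)
d, n_experts = 8, 4
out = moe_forward(rng.normal(size=d),
                  rng.normal(size=(d, n_experts)),
                  [rng.normal(size=(d, d)) for _ in range(n_experts)])
print(out.shape)  # (8,)
```

Because only k of the n experts execute per token, compute cost scales with k rather than with total parameter count, which is why the approach is attractive for large models.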
Nvidia’s server integrates 72 of its top GPUs with high-speed links, giving it an advantage in serving AI models to end users, even as such models may need less Nvidia hardware for training. The company emphasizes that the performance gains stem both from the number of chips and from the fast interconnects between them, areas where Nvidia maintains an edge over competitors.
While Nvidia dominates the AI training market, it now faces growing competition in inference and deployment from firms like Advanced Micro Devices (AMD) and Cerebras. AMD is developing a multi-chip AI server expected to launch next year. Nvidia’s new data highlights its continued focus on scaling AI infrastructure to handle increasingly complex models efficiently.