Amazon and Cerebras Combine AI Chips for New AWS Inference Platform
Amazon and Cerebras have partnered to combine their AI chips in a new AWS service designed to accelerate inference for chatbots, coding tools, and other generative AI applications.
Palantir and Nvidia unveiled a sovereign AI OS reference architecture designed to deliver turnkey AI data centers. The platform integrates Nvidia Blackwell systems with Palantir’s enterprise AI software stack.
Meta introduced four new in-house MTIA chips designed for AI training and inference as the company accelerates data center expansion. The chips aim to improve performance and reduce reliance on external hardware suppliers.
Nvidia and Nebius have formed a strategic partnership to build hyperscale AI cloud infrastructure, with Nvidia investing $2 billion to support gigawatt-scale AI computing capacity.
Nvidia and Thinking Machines Lab have formed a multiyear partnership to deploy next-generation Vera Rubin systems for frontier AI training. The collaboration aims to expand access to customizable AI models and large-scale compute infrastructure.
Nscale raised $2 billion in a Series C round led by Aker and 8090 Industries, valuing the Nvidia-backed AI infrastructure company at $14.6 billion.
Broadcom expects artificial intelligence chip sales to surpass $100 billion by 2027 as demand surges from major technology companies building AI infrastructure.
Xiaomi plans to release a new smartphone processor each year and launch an AI assistant for international markets as it expands its chip design and AI ecosystem.
Core Scientific sold over 1,900 BTC in January and plans to liquidate the remaining 600 BTC in Q1 to support liquidity and capital expenditures for its AI colocation expansion.
Dell shares jumped 19% after the company topped fourth-quarter expectations and issued strong revenue guidance, driven by surging demand for AI servers amid a memory shortage.
Nvidia reported record Q4 revenue of $68.1 billion and net income of $43 billion, driven by strong AI data center demand and continued hyperscale investment.
Meta plans to buy up to $100 billion in AMD GPUs and CPUs, issuing performance-based warrants as it ramps AI data centers and diversifies compute infrastructure.
Amazon announced a $12 billion plan for Louisiana data centers that would create 540 jobs and fund water and power infrastructure to support AI-driven growth.
India’s AI Impact Summit in New Delhi delivered more than $250 billion in investment commitments, spanning data centers, AI infrastructure, and global expansion plans.
OpenAI will become the first customer of Tata Consultancy Services’ data center unit, securing 100 megawatts of capacity under the Stargate AI infrastructure initiative.
Analysis of data centers, chips, semiconductors, edge computing, and AI infrastructure powering the digital economy. Understand how next-generation computing resources, connectivity, and hardware ecosystems support the growth of online services and artificial intelligence worldwide. This category tracks the unseen backbone of modern technology.