AWS Expands Tools for Enterprise AI, From Custom LLMs to On-Prem AI Hubs
AWS introduced new tools for enterprise customers to build and fine-tune custom large language models, alongside AI infrastructure offerings for on-premises deployments.
AWS introduced new tools for enterprise customers to build and fine-tune custom large language models, alongside AI infrastructure offerings for on-premises deployments.
AWS AI Factories bring dedicated AI infrastructure directly into customer data centers, enabling enterprises and governments to rapidly develop and deploy AI applications while meeting data sovereignty and regulatory requirements.
Nvidia is investing $2 billion in Synopsys to integrate its AI hardware with Synopsys’s chip-design software, enhancing GPU-based workflows and solidifying its influence in semiconductor design.
The surge in AI and cloud computing is straining data center cooling systems, highlighted by an outage at CyrusOne affecting CME Group trading. Companies are exploring liquid cooling and water-efficient designs to manage heat.
Baidu is scaling up its in-house AI-chip business via its Kunlunxin unit, unveiling a five-year roadmap with M100 and M300 chips and aiming to fill the gap left by restricted Nvidia GPU access in China.
Elon Musk’s xAI plans an 88-acre solar farm next to its massive Memphis data center as the company faces backlash for operating unpermitted gas turbines linked to rising air pollution.
Nvidia asserts it is “a generation ahead” in AI chips after reports that Google may supply Meta with its processors, spurring market competition.
Google and Accel will co-invest up to $2M in early-stage Indian AI startups through Accel’s Atoms program, offering capital, compute credits, and deep technical mentorship.
Meta is in talks to spend billions on Google’s TPUs for its data centers, a shift that would position Alphabet as a stronger competitor to Nvidia. The discussions include Meta potentially renting Google Cloud chips as early as next year.
Amazon is investing up to $50 billion to expand AI and supercomputing capacity for U.S. government agencies through AWS’s secure cloud regions, adding nearly 1.3 gigawatts of advanced computational infrastructure.
OpenAI has partnered with Foxconn to co-design and prepare U.S. manufacturing for next-generation AI data center hardware, strengthening domestic supply chains and accelerating deployment of advanced compute systems.
Google executives told employees the company must double its compute capacity every six months to keep pace with surging demand for AI services, underscoring the rising cost and competition in cloud infrastructure.
SoftBank plans to invest up to $3 billion to convert an Ohio EV plant into a facility producing modular data center equipment for OpenAI’s expanding U.S. infrastructure.
Nvidia reported strong third quarter results driven by soaring demand for its data center GPUs and Blackwell architecture. The company raised its outlook as AI infrastructure spending accelerates.
Elon Musk announced a 500-megawatt xAI data center in Saudi Arabia, built with HUMAIN AI and powered by Nvidia chips, expanding the company’s global compute footprint.