Skip to main content
Back to Pulse
AI News

NVIDIA and Google infrastructure cuts AI inference costs

Read the full articleNVIDIA and Google infrastructure cuts AI inference costs on AI News

What Happened

At the Google Cloud Next conference, Google and NVIDIA outlined their hardware roadmap designed to address the cost of AI inference at scale. The companies detailed the new A5X bare-metal instances, which run on NVIDIA Vera Rubin NVL72 rack-scale systems. Through hardware and software codesign, this

Cited By

React

Newsletter

Get the weekly AI digest

The stories that matter, with a builder's perspective. Every Thursday.

Loading comments...