AWS Introduces New Trainium AI Chip and Graviton 4, Expands Collaboration with Nvidia

The Graviton 4 chip, left, is a general-purpose microprocessor used by SAP and others for large workloads, while Trainium 2 is a special-purpose accelerator for very large neural networks, such as those behind generative AI.

Amazon AWS

At its annual AWS re:Invent developer conference in Las Vegas, Amazon on Tuesday announced Trainium 2, a new version of its dedicated chip for training neural networks. Trainium 2 is tuned specifically for training so-called large language models (LLMs) and foundation models, the kinds of generative AI programs that include OpenAI’s GPT-4.

The company also unveiled a new version of its custom microprocessor, Graviton 4, and said it is extending its partnership with Nvidia to run Nvidia’s most advanced chips in its cloud computing service. 

The Trainium 2 is designed to handle neural networks with trillions of parameters, or neural weights: the numerical values a program learns during training, which, generally speaking, give it its scale and power. Scaling to ever-larger parameter counts is a focus of the entire AI industry.
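To give a rough sense of what "trillions of parameters" means in hardware terms, here is a back-of-the-envelope sketch in Python. It assumes each parameter is stored as a 16-bit (2-byte) number, a common choice that Amazon has not confirmed for Trainium 2, and it counts only the weights themselves, not the extra memory that training requires. The function names are illustrative, not part of any AWS tool.

```python
def parameter_memory_bytes(num_parameters: int, bytes_per_parameter: int = 2) -> int:
    """Bytes needed just to hold the weights (assumed 2 bytes each, i.e. 16-bit)."""
    return num_parameters * bytes_per_parameter


def to_terabytes(num_bytes: int) -> float:
    """Convert bytes to decimal terabytes (1 TB = 10**12 bytes)."""
    return num_bytes / 1e12


if __name__ == "__main__":
    one_trillion = 10**12  # a hypothetical one-trillion-parameter model
    weights_tb = to_terabytes(parameter_memory_bytes(one_trillion))
    print(f"Weights alone: about {weights_tb:.1f} TB")
```

Under those assumptions, a one-trillion-parameter model needs about 2 terabytes for its weights alone, which suggests why such models are typically spread across many accelerator chips rather than run on one.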

// Additional content left