Skip to content

TorchTPU: PyTorch Now Runs Natively on Google’s TPU Systems

The Bottom Line: Google introduces TorchTPU, a solution enabling native PyTorch on TPUs. The system allows developers to migrate their ML workloads to Google’s supercomputing infrastructure with minimal code adjustments and to fully leverage TPU resources.

Google introduces TorchTPU – a solution that enables developers to run PyTorch models directly on Tensor Processing Units. The system was designed for minimal code modifications and maximum performance on Google’s supercomputer-like infrastructures.

The requirements for modern AI infrastructure have fundamentally changed. At the frontier of machine learning, distributed systems are deployed that span thousands of accelerators. While models run on clusters with approximately 100,000 chips, the underlying software systems must meet new demands for performance, hardware portability, and reliability.

At Google, proprietary Tensor Processing Units (TPUs) form the foundation of the supercomputing infrastructure. These custom-designed ASICs accelerate both training and inference for Google’s AI platforms such as Gemini and Veo, as well as for large workloads from cloud customers.

Since many developers implement their models in PyTorch, seamless and high-performance integration was required. This is precisely where TorchTPU comes in. The engineering team pursued the goal of creating a technology stack that places user-friendliness, portability, and outstanding performance at its center. Developers should be able to transfer their existing PyTorch workloads with minimal code changes while gaining access to APIs and tools to harness the full computing power of the hardware.

Share on: