Uber expands use of AWS chips for AI workloads

Large companies are rethinking how they run artificial intelligence workloads in the cloud. Uber is one of the latest examples, expanding its use of AWS chips to support its AI systems.

At the centre of this change are AWS-designed chips like Graviton and Trainium. Reuters reports Uber is increasing its use of the hardware to power AI models and backend systems for its ride-hailing and delivery platforms. Uber’s AI models work on core functions like matching riders with drivers, estimating trip times, setting prices, and managing food delivery routes. Such tasks rely on large volumes of data and constant updates, which can push up cloud costs.

Custom chips offer a way to manage price pressure. AWS says Graviton can improve price-performance compared to traditional x86-based instances, while Trainium is designed to lower training costs. The hardware may help companies like Uber run more AI tasks without a similar rise in spending.

How custom chips change cloud use

The decision to explore alternative hardware ties closely to scale for Uber. The company operates in dozens of countries and processes millions of transactions each day. Even small gains in efficiency can matter in a network of that size.

According to Reuters, Uber is using AWS chips to improve both training and inference workloads. Training refers to how AI models learn from data, whereas inference is how those models make decisions in live systems. Both stages can be costly, but inference often runs continuously in production, making efficiency particularly important.

Chips like Trainium are designed for high-throughput machine learning tasks, which can help minimise the time and cost needed to train models. Graviton, which is built on ARM architecture, is often used for general workloads that benefit from lower power use and better cost control. Together, they give enterprises more options in how they run AI systems in the cloud.

Balancing cost and flexibility

Cloud strategies are also changing. Companies are taking a more active role in how workloads are structured, from choosing instance types to tuning models for certain chips and balancing cost against performance.

This approach can add complexity, however. Developers need to adjust software for ARM-based processors or specialised AI chips, and it may require closer coordination with cloud providers.

Uber’s move comes at a time when AI workloads are expanding in many industries. From finance to retail, companies are using machine learning for tasks like fraud detection, demand forecasting, and customer support. As these systems grow, so does the need to manage the cost of running them.

Custom silicon is one response. Cloud providers like AWS are building their own processors, which gives them more control over pricing and performance. It also raises questions about flexibility. Companies that build around specific cloud chips may find it harder to move workloads between providers.

Uber’s use of AWS chips shows how these trade-offs are playing out in practice. Rather than moving away from the cloud, the company is using more specialised cloud hardware. Reuters does not detail the exact scale of Uber’s deployment, but it says the chips support important AI-driven functions in the platform.

Rising cloud costs are forcing more companies to rethink how they run workloads. Custom chips may not replace general-purpose compute, but they are becoming part of the mix.

Uber’s move reflects a broader change in how enterprises use the cloud. The focus is increasingly on running workloads more efficiently. Companies will need to balance cost and flexibility, and custom silicon is likely to play a larger role.

(Photo by Erik Mclean)

See also: Cloud costs rise as AI moves into core business systems

Want to learn more about Cloud Computing from industry leaders? Check out Cyber Security & Cloud Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events, click here for more information.

CloudTech News is powered by TechForge Media. Explore other upcoming enterprise technology events and webinars here.

Source link

What's Hot

Microsoft named a Leader in The Forrester Wave™

Non-Obvious Patterns in Building Enterprise AI Assistants

A philosophy of work | MIT News

Uber expands use of AWS chips for AI workloads

AWS targets AI agent sprawl with new Bedrock Agent Registry

Making AI Trustworthy and Observable in Real-Time: Cisco Announces Intent to Acquire Galileo

Cisco Celebrates The Smart Industry Industrial Transformation Award Winners

Microsoft named a Leader in The Forrester Wave™

Non-Obvious Patterns in Building Enterprise AI Assistants

A philosophy of work | MIT News

javascript – React Native iOS app crashes on language change with react-native-restart (works on Android)

Don't Miss!

Microsoft named a Leader in The Forrester Wave™

Non-Obvious Patterns in Building Enterprise AI Assistants

Subscribe to Updates

What's Hot

Uber expands use of AWS chips for AI workloads

How custom chips change cloud use

Balancing cost and flexibility

Related Posts

Subscribe to Updates