NEW YORK, May 21, 2026 (GLOBE NEWSWIRE) -- Global technology intelligence firm ABI Research forecasts that AI inference workloads will grow at a 42% CAGR to surpass 46 Gigawatts of capacity consumption by 2035, overtaking training workloads by 2033 as the dominant force in cloud AI infrastructure. The finding signals a structural shift in the market, as enterprise adoption, better token economics, and stronger production Return on Investment push operators to prioritize inference-optimized capacity.
“Inference is the commercial engine of the AI market, and its market activity is accelerating at an incredible pace with better model capabilities and the computational demands of agentic systems,” said Larbi Belkhit, Senior Analyst at ABI Research. “For the last several years, the cloud demand and build-out centered predominantly on training frontier models, but the next wave of competition will be won by providers that can deliver inference at scale with the right balance of performance, latency, cost, and compute utilization.”
Training demand will still expand aggressively, reaching 36 GW by 2035, but its composition is changing. Fine-tuning workloads are set to surpass foundation model training by 2032 and grow to 21 GW by 2035, while foundation model training climbs to nearly 13 GW, underscoring how enterprise customization is becoming a major contributor to cloud AI capacity consumption.
Among inference workloads, code generation is the standout category in the enterprise market, scaling to roughly 24 GW by 2035 and accounting for more than half of total inference capacity consumption. Text generation will continue growing, but only to about 7 GW by 2035, while audio generation is projected to become the fastest growing inference workload at a 42% CAGR as model capabilities and economics improve.
The market shift will also reshape competitive dynamics across cloud providers. ABI Research expects neocloud providers to nearly catch hyperscalers in total inference capacity consumption by 2035, reaching 15 GW and 16 GW respectively, as they expand infrastructure built specifically for inference-heavy AI services.
“These forecasts show that AI infrastructure strategy is moving into a new phase, where success depends less on headline model size and more on operationalizing AI across real-world workloads,” Belkhit said. “Cloud providers, enterprises, and the wider AI supply chain are preparing for a far more heterogeneous compute landscape shaped by long-running autonomous agentic systems, multi-modal workloads, and scalable inference demand.”
These findings are from ABI Research’s AI Cloud Workloads market data report, part of the company’s AI & Machine Learning research service, which includes research, data, and ABI Insights.
About ABI Research
ABI Research is a global technology intelligence firm uniquely positioned at the intersection of technology solution providers and end-market companies. We serve as the bridge that seamlessly connects these two segments by providing exclusive research and expert guidance to drive successful technology implementations and deliver strategies proven to attract and retain customers.
ABI Research是一家全球性的技术情报公司,拥有得天独厚的优势,充当终端市场公司和技术解决方案提供商之间的桥梁,通过提供独家研究和专业性指导,推动成功的技术实施和提供经证明可吸引和留住客户的战略,无缝连接这两大主体。
For more information about ABI Research’s services, contact us at +1.516.624.2500 in the Americas, +44.203.326.0140 in Europe, +65.6592.0290 in Asia-Pacific, or visit www.abiresearch.com.
Contact Info:
Global
Jason Scheer
Tel: +1.516.624.2558
pr@abiresearch.com