# inference cost
Latest news and articles about inference cost

China Overtakes US in AI Calls as Four Homegrown Models Dominate Global Top Five — Experts Point to Inference-Efficiency Strategies
NetEase reports that China’s aggregate AI API call volume has surpassed that of the United States for the first time, with Chinese large models taking four of the global top five usage slots. Experts attribute the surge largely to engineering choices that reduce inference costs, enabling mass deployment across consumer and enterprise services.

Alibaba’s Qianwen Open-Sources an 80B Coding Model Optimized for Agents and Local Development
Alibaba’s Qianwen has open-sourced Qwen3-Coder-Next, an 80B-parameter model designed for coding agents and local deployment that combines hybrid attention with a mixture-of-experts (MoE) architecture to lower inference costs. The release aims to accelerate enterprise adoption in China by enabling on-premises use and customization, while raising questions about IP, safety, and the infrastructure needed to realize the claimed efficiency gains.