# inference cost

Latest news and articles about inference cost

Total: 2 articles found

A digital representation of how large language models function in AI technology.

China Overtakes US in AI Calls as Four Homegrown Models Dominate Global Top Five — Experts Point to Inference-Efficiency Strategies

NetEase reports that China’s aggregate AI API call volume has surpassed the United States for the first time, with four Chinese large models filling four of the global top five usage slots. Experts attribute the surge largely to engineering choices that reduce inference costs, enabling mass deployment across consumer and enterprise services.

NeTe2026年2月26日 13:47

#China AI#large language models#inference cost

Senior male perfumer sitting among fragrance bottles in a rustic setting, creating unique scents.

Technology

Alibaba’s Qianwen Open-Sources an 80B Coding Model Optimized for Agents and Local Development

Alibaba’s Qianwen has open‑sourced Qwen3‑Coder‑Next, an 80B parameter model designed for coding agents and local deployment that combines hybrid attention with MoE to lower inference costs. The release aims to accelerate enterprise adoption in China by enabling on‑premise use and customization, while raising questions about IP, safety and the infrastructure needed to realize claimed efficiency gains.

NeTe2026年2月4日 01:50

#Qwen3-Coder-Next#Alibaba Qianwen#Mixture of Experts