# inference
Latest news and articles about inference

The Token Wars Begin: Nvidia’s Vera Rubin vs China’s Low‑Cost Inference Push
At GTC 2026, Nvidia declared that the AI era has shifted from training models to continuously generating tokens, and presented Vera Rubin, a full‑stack platform it says can cut token costs dramatically. At the same time, Chinese large‑model providers are already undercutting foreign counterparts on token prices and capturing high API volumes, creating a global contest over who will set token pricing and infrastructure standards.

Midea’s MevoX Pushes Smart Homes from Remote Control to Cognitive Spaces
Midea unveiled MevoX, a self‑evolving home intelligence agent, and pledged over RMB 60 billion to AI and embodied intelligence over three years. The company is targeting two persistent technical gaps—reasoning (inference) and memory—to move smart homes from device control to proactive, context‑aware spaces, while signalling a strategic pivot from hardware sales to platform and service revenue.

Microsoft Unveils Maia 200 — A 3nm AI Inference Chip Aimed at Denting NVIDIA’s Dominance
Microsoft has launched Maia 200, a TSMC 3nm AI inference chip the company says outperforms Amazon’s Trainium v3 and Google’s TPU v7 on low-precision workloads while improving inference cost-efficiency by about 30% versus its current fleet. The release underscores hyperscalers’ push into custom silicon to reduce reliance on Nvidia GPUs, but success will depend on software tooling, ecosystem adoption and independent benchmarking.