# inference

Latest news and articles about inference

Total: 3 articles found

Detailed close-up of a laptop keyboard featuring Intel Core i7 and NVIDIA GeForce stickers, highlighting technology components.
Technology

The Token Wars Begin: Nvidia’s Vera Rubin vs China’s Low‑Cost Inference Push

At GTC 2026 Nvidia declared the AI era has shifted from training models to continuously generating tokens and presented Vera Rubin, a full‑stack platform it says can cut token costs dramatically. At the same time, Chinese large‑model providers are already undercutting foreign counterparts on token prices and capturing high API volumes, creating a global contest over who will set token pricing and infrastructure standards.

NeTe2026年3月17日 10:32
#Nvidia#Vera Rubin#inference
Philips smart hub beside a leafy plant in a stylish indoor setting, showcasing modern home automation.
Technology

Midea’s MevoX Pushes Smart Homes from Remote Control to Cognitive Spaces

Midea unveiled MevoX, a self‑evolving home intelligence agent, and pledged over RMB 60 billion to AI and embodied intelligence over three years. The company is targeting two persistent technical gaps—reasoning (inference) and memory—to move smart homes from device control to proactive, context‑aware spaces, while signalling a strategic pivot from hardware sales to platform and service revenue.

NeTe2026年3月12日 20:57
#Midea#MevoX#smart home
Close-up of a digital assistant interface on a dark screen, showcasing AI technology communication.
Technology

Microsoft Unveils Maia 200 — A 3nm AI Inference Chip Aimed at Denting NVIDIA’s Dominance

Microsoft has launched Maia 200, a TSMC 3nm AI inference chip the company says outperforms Amazon’s Trainium v3 and Google’s TPU v7 on low-precision workloads while improving inference cost-efficiency by about 30% versus its current fleet. The release underscores hyperscalers’ push into custom silicon to reduce reliance on Nvidia GPUs, but success will depend on software tooling, ecosystem adoption and independent benchmarking.

NeTe2026年1月26日 22:40
#Microsoft#Maia 200#AI accelerator