# inference

Latest news and articles about inference

Total: 1 articles found

Close-up of a digital assistant interface on a dark screen, showcasing AI technology communication.

Microsoft Unveils Maia 200 — A 3nm AI Inference Chip Aimed at Denting NVIDIA’s Dominance

Microsoft has launched Maia 200, a TSMC 3nm AI inference chip the company says outperforms Amazon’s Trainium v3 and Google’s TPU v7 on low-precision workloads while improving inference cost-efficiency by about 30% versus its current fleet. The release underscores hyperscalers’ push into custom silicon to reduce reliance on Nvidia GPUs, but success will depend on software tooling, ecosystem adoption and independent benchmarking.

NeTe2026年1月26日 22:40

#Microsoft#Maia 200#AI accelerator