Latest news and articles about test‑time scaling
Total: 1 articles found
Alibaba has launched Qwen3‑Max‑Thinking, an inference model that combines adaptive tool calling and test‑time scaling to improve reasoning, factual accuracy and alignment. Alibaba claims benchmark parity with leading models such as GPT‑5.2‑Thinking, and has deployed the capability in Qwen Chat, signalling rapid commercialisation within its cloud and consumer ecosystem.