China’s Yushi Predicts a Breakthrough Year for Humanoid Robots — But the ‘ChatGPT Moment’ for Embodied AI Is Still Years Away

Yushi Technology founder Wang Xingxing says humanoid robots have made rapid progress in mobility and shipment volumes, with lighter, cheaper models driving adoption and public-facing demonstrations like kung-fu performances gaining global attention. However, core AI generalisation for robots — an ‘‘embodied ChatGPT’’ that reliably follows language instructions in novel environments — is still likely two to three years away, and industrial use remains in pilot stages.

A futuristic humanoid robot in an indoor Tokyo setting, showcasing modern technology.

Key Takeaways

  • 1Yushi estimates about 5,000 humanoid robots were shipped globally in 2025, with its 1.3m lightweight model dominating deployments.
  • 2Robotic athletic milestones include a 1,500m time of just over six minutes; Wang predicts sub-10-second 100m robots by mid-year.
  • 3Major limitation remains AI generalisation; a robust embodied-AI able to perform 80% of tasks in 80% of unfamiliar scenes is likely 2–3 years away.
  • 4Yushi plans to deploy several thousand to 10,000 robots this year to collect daily embodied-data streams and accelerate training.
  • 5Industrial applications are still experimental; widespread commercial uptake depends on improvements in perception, world models and reliability.

Editor's
Desk

Strategic Analysis

Wang’s remarks are important both technically and strategically. Technically, they highlight a pragmatic roadmap: cheaper, lighter hardware for scale; massive real-world data collection to solve generalisation; and synthetic-video–based world models to bridge sim-to-real transfer. Strategically, they signal how China’s robotics firms aim to leverage domestic manufacturing scale, public showcases and tight integration with AI research to establish a lead in embodied systems. That combination could compress global competition cycles and shift where investments in robotics infrastructure, regulation and workforce planning gravitate. Yet the claim that sprint times and demo feats will quickly translate into reliable industrial labour — or social acceptance — understates persistent challenges in safety, human-robot interaction and task transfer. Policymakers and companies should prepare for parallel outcomes: rapid consumer and research proliferation of humanoids alongside slow, uneven adoption in safety-critical or productivity-focused industry roles.

NewsWeb Editorial
Strategic Insight
NewsWeb

At the Aabori forum this month, Wang Xingxing, founder of Shanghai-based Yushi Technology, painted an assertive near-term picture for humanoid robots while warning that truly generalised, language-driven robotic intelligence remains some way off.

Wang said Yushi’s second-generation machine, launched in 2024, has become one of the top-selling humanoid platforms worldwide: the company estimates roughly 5,000 humanoid robots shipped globally last year, with its 1.3-metre, lightweight model accounting for most deployments. The smaller size and a reported entry price of about RMB 30,000 (roughly $4,000) have encouraged wider adoption for research, performance and pilot commercial uses.

Sporting benchmarks have captured public imagination. At a Beijing humanoid-robot athletics meet in August, Yushi machines won the 1,500m, 400m and the 4×100m obstacle relay; Wang said the 1,500m prototype ran in just over six minutes, faster than most of his staff. He predicted that by mid-year humanoids — especially Chinese models — would be able to outpace humans over sprint distances, with 100-metre times dipping under ten seconds.

Yushi’s team also emphasised cultural projection: its robots performed a kung-fu routine on the Spring Festival Gala that included complex martial-arts moves and acrobatics. Wang framed the program as both a technical showcase and a form of cultural diplomacy that resonated abroad, where clips generated substantial international attention.

On the technical front, Wang was candid about limits. Current systems lack robust generalisation: machines trained for a handful of scenes succeed reliably there but fail when environments change. He set out a concrete test for an embodied-AI ‘ChatGPT moment’ — a robot that, when dropped into an unfamiliar scene, can carry out roughly 80% of language-directed tasks about 80% of the time without prior mapping or rehearsal. He thinks that moment is likely two to three years away, a more cautious estimate than some in the industry.

His favored route to that leap is the ‘‘world model’’ approach built from video generation. Wang highlighted ByteDance’s January release of Seedance 2.0 as a landmark in video-generation quality and argued that aligning generated video with robot motion could produce the generalized embodied models robots need.

Data scarcity is the immediate operational constraint, he said. To address it, Yushi plans to deploy thousands — potentially up to ten thousand — robots by year-end and collect around ten hours of telemetry per day per unit. Those data, Wang argued, would rapidly accelerate training and close a major bottleneck for embodied learning.

Despite showy demos and rising shipments, industrial deployment remains largely pilot-stage. Wang acknowledged that efficiency and success rates for factory or logistics tasks lag what is needed for large-scale commercial rollout. He therefore positions Yushi’s strategy as parallel development of locomotion and manipulation: movement capability must be solved before robots can reliably “do the work” at scale.

Wang ended with a market hypothesis: if embodied intelligence crosses a critical threshold, demand could spike from thousands to millions of units a year. He said China’s current first-mover position gives domestic companies a rare window of opportunity to shape that market.

Share Article

Related Articles

📰
No related articles found