Latest news and articles about inference optimisation
Total: 1 articles found
Haiguang’s DCU has completed Day‑0 adaptation and deep tuning for the Qwen3.5‑397B‑A17B model, offering immediate, pre‑optimised deployment for developers and enterprises. The achievement shortens deployment times and bolsters China’s domestic AI hardware–software stack, though independent performance validation and production testing remain necessary.