Latest news and articles about compute bottleneck
Total: 1 articles found
Zhipu has limited daily sales of its GLM Coding Plan to 20% of previous volumes as a temporary measure after GLM‑4.7 triggered heavy demand that strained compute resources and slowed model responses during peak hours. The cap, beginning Jan 23 and refreshed daily, aims to protect existing users while Zhipu expands capacity and tightens control over malicious traffic.