Chinese AI lab Zhipu releases GLM-5 under MIT license, claims parity with top Western models
Summary
Chinese AI company Zhipu AI has released GLM-5, an open-source model with 744 billion parameters under an MIT license. The company claims GLM-5 matches Claude Opus 4.5 and GPT-5.2 on coding and agent tasks, featuring a Mixture-of-Experts architecture with 40 billion active parameters at any time and trained on 28.5 trillion tokens. It utilizes Deepseek Sparse Attention (DSA) to reduce deployment costs and supports document generation into .docx, .pdf, and .xlsx files. Crucially, GLM-5 runs on both Nvidia GPUs and Chinese chips from Huawei, Moore Threads, and Cambricon, addressing US export restrictions. The model also integrates with frameworks like OpenClaw and popular coding agents, and its release significantly narrows the performance gap with Western counterparts.
Key takeaway
For NLP Engineers and CTOs evaluating large language models for deployment, GLM-5 offers a compelling open-source option with competitive performance on agent and coding tasks. Its compatibility with Chinese domestic hardware like Huawei Ascend chips provides a strategic advantage for operations in regions affected by US export controls. Consider testing GLM-5 for applications requiring document generation or complex agentic workflows, especially if hardware flexibility is a priority.
Key insights
GLM-5 is an open-source, 744B-parameter Chinese AI model claiming parity with top Western models on agent and coding tasks.
Principles
- Mixture-of-Experts architectures scale model size efficiently.
- Hardware diversification is critical for geopolitical resilience.
Method
GLM-5 uses a Mixture-of-Experts architecture, Deepseek Sparse Attention, and an asynchronous reinforcement learning framework (slime) for training, supporting vLLM and SGLang for inference.
In practice
- Deploy GLM-5 on Huawei Ascend or other Chinese chips.
- Use GLM-5 for direct document generation (e.g., .docx, .pdf).
- Integrate GLM-5 with OpenClaw for cross-app workflows.
Topics
- GLM-5
- Mixture-of-Experts
- AI Agents
- Open-source Models
- Chinese AI Hardware
Code references
Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, AI Researcher
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.