z.ai's open source GLM-5 achieves record low hallucination rate and leverages new RL 'slime' technique

· Source: VentureBeat · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Advanced, medium

Summary

Zhupai (z.ai), a Chinese AI startup, has released GLM-5, an open-source large language model under an MIT License, featuring a record-low hallucination rate of -1 on the Artificial Analysis Intelligence Index v4.0. This represents a 35-point improvement over its predecessor, surpassing major U.S. competitors in knowledge reliability. GLM-5, with 744B parameters and a Mixture-of-Experts architecture (40B active per token), is designed for high-utility knowledge work, offering native "Agent Mode" capabilities to generate professional office documents like .docx, .pdf, and .xlsx files directly from prompts. The model is disruptively priced at approximately $0.80 per million input tokens and $2.56 per million output tokens, making it significantly more cost-effective than proprietary alternatives like Claude Opus 4.6. Its training leverages a novel asynchronous reinforcement learning infrastructure called "slime" to enhance efficiency for complex agentic behaviors.

Key takeaway

For CTOs and VPs of Engineering evaluating LLM adoption, GLM-5 offers a compelling open-source option for building autonomous office solutions. Its low hallucination rate, native document generation, and aggressive pricing make it ideal for enterprises ready to move beyond simple copilots. However, you must weigh the substantial hardware requirements, geopolitical considerations for a China-based model, and the need for robust agent-specific permissions to mitigate governance risks associated with autonomous operations.

Key insights

GLM-5 sets new benchmarks for open-source LLMs in hallucination reduction and agentic document generation at a competitive price.

Principles

Method

The "slime" RL infrastructure uses independent trajectory generation and system-level optimizations like Active Partial Rollouts (APRIL) to accelerate training for complex agentic tasks, integrating Megatron-LM and SGLang.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, Entrepreneur, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by VentureBeat.