The New Open-Weights Leader, Big AI’s Political Influence, Predicting Illness, Faster Reasoning

· Source: The Batch | DeepLearning.AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Intermediate, long

Summary

Z.ai has released GLM-5, a 744 billion-parameter Mixture-of-Experts transformer model with 40 billion active parameters per token, designed for long-running agentic tasks. It achieved top performance among open-weights models on Artificial Analysis' Intelligence Index, 𝜏²-Bench Telecom, Vending Bench 2, and Chatbot Arena Code, nearly matching proprietary leaders like Claude Opus 4.6 and GPT-5.2. GLM-5 supports up to 200,000 input tokens and 128,000 output tokens, features function calling, reasoning, and context caching, and is available via a free web interface, Hugging Face (MIT license), and API. The model was pretrained on 28.5 trillion tokens and utilized Z.ai's open-source reinforcement learning software, "slime," for post-training, along with DeepSeek sparse attention for efficient long-context processing. Additionally, Liquid AI introduced LFM2.5-1.2B-Thinking, a 1.17 billion-parameter hybrid transformer-convolutional neural network model optimized for on-device reasoning, operating under 900MB RAM and twice as fast as Qwen3-1.7B on CPUs. This model, pretrained on 28 trillion tokens, excels in agentic tasks and data extraction, supporting eight languages, but shows higher hallucination rates in knowledge-intensive tasks. Separately, researchers developed SleepFM, an AI system using CNN, transformer, and LSTM architectures to classify over 130 diseases, including heart disease and psychiatric disorders, up to six years before symptom onset, based on one night of sleep study recordings. SleepFM achieved higher AUC scores across various disease categories compared to systems without pretraining. Finally, top tech and AI companies, including Meta, Amazon, Alphabet, and Microsoft, spent over $100 million on lobbying in 2025, influencing policies like the reversal of the advanced AI chip ban to China and limiting state-level AI regulation, while also supporting large data center projects.

Key takeaway

For CTOs and VPs of Engineering evaluating AI model adoption, the emergence of high-performing open-weights models like GLM-5 and efficient edge models like LFM2.5-1.2B-Thinking presents compelling alternatives to proprietary solutions. You should explore these options for cost-effectiveness, customization, and deployment flexibility, especially for agentic workflows or on-device applications. Be mindful of the evolving regulatory landscape influenced by tech lobbying, which may streamline compute infrastructure but could also favor larger players.

Key insights

Open-weights AI models are rapidly closing the performance gap with proprietary frontier models, offering high-performance, customizable alternatives.

Principles

Method

GLM-5 uses a Mixture-of-Experts transformer architecture, pretrained on 28.5 trillion tokens, with post-training via "slime" reinforcement learning and DeepSeek sparse attention. LFM2.5-1.2B-Thinking employs a phased training approach for its hybrid transformer-convolutional architecture, including step-by-step reasoning data and merging specialized RL models. SleepFM combines CNN, transformer, and LSTM in a two-stage training process for sleep data encoding and disease classification.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, Machine Learning Engineer, AI Engineer, Director of AI/ML, Executive

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Batch | DeepLearning.AI.