Latest open artifacts (#19): Qwen 3.5, GLM 5, MiniMax 2.5 — Chinese labs' latest push of the frontier
Summary
The latest intelligence brief highlights a busy month for open-weights AI, with new flagship model releases from Qwen, MiniMax, Z.ai, Ant Ling, and StepFun, alongside anticipation for DeepSeek V4. The report introduces "Relative Adoption Metrics (RAM)" to normalize model downloads, identifying GPT-OSS as a top performer and noting DeepSeek V3.2's underperformance compared to earlier 2025 releases. Featured models include Qwen 3.5 (0.8B to 397B-A17B, multi-modal, Qwen-Next architecture with GDN layers), Step-3.5-Flash (196B-A11B MoE, strong in math), GLM-5 (744B-A40B, leading to price hikes), and MiniMax-M2.5 (rivaling larger models). OpenThinker-Agent-v1, Tri-21B-Think, and MiniCPM-SALA are also noted for their specific capabilities.
Key takeaway
For AI Scientists evaluating new open-weight models, utilize the Relative Adoption Metrics (RAM) framework to assess true adoption and identify high-potential models like GPT-OSS. Pay close attention to the early adoption of models with hybrid architectures, such as Qwen 3.5 dense models, as their performance relative to established brands can indicate future trends. Consider the specific strengths of models like Step-3.5-Flash for specialized tasks.
Key insights
New open-weights AI models are rapidly emerging, with adoption tracked by Relative Adoption Metrics (RAM).
Principles
- RAM scores >1 indicate a model is on track to be a top 10 all-time downloaded model in its size class.
- Hybrid model architectures can push the limits of open-source tools.
Method
Relative Adoption Metrics (RAM) normalizes model downloads against peer models in their size class to gauge adoption and identify high-performing or underrated models.
In practice
- Consider Qwen 3.5 models for multilingual and general tasks, disabling reasoning for smaller models if overthinking occurs.
- Evaluate Step-3.5-Flash for math-intensive benchmarks.
- Explore OpenThinker-Agent-v1 for agentic reasoning tasks and evaluation.
Topics
- Open-weights AI Models
- Relative Adoption Metrics
- Qwen 3.5
- Mixture-of-Experts
- Agentic Reasoning
Best for: NLP Engineer, AI Scientist, Research Scientist, Machine Learning Engineer, AI Engineer, AI Product Manager
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Interconnects AI.