Snowflake CEO finds GLM-5.2 competitive with Opus 4.7 at a fraction of the cost

2026-06-24 · Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Intermediate, short

Summary

Snowflake's CEO, Sridhar Ramaswamy, conducted a real-world programming benchmark comparing China's GLM-5.2 AI model with Anthropic's Opus 4.7. The test involved 103 coding tasks, run three times, requiring code compatible with both DuckDB and Snowflake. Results showed GLM-5.2 solved 66% of tasks, nearly matching Opus 4.7's 67%. While Opus 4.7 demonstrated higher first-attempt accuracy (53.7% vs. 47.6%) and better efficiency, requiring 80 iterations and 439 million tokens compared to GLM's 99 iterations and 860 million tokens, GLM-5.2 presents a significant cost advantage. GLM-5.2 is priced at \$4.40 per million output tokens, substantially cheaper than Opus 4.7's \$25.00 and GPT-5.5's \$30.00, creating considerable price pressure on Western AI companies.

Key takeaway

For Directors of AI/ML evaluating large language models for coding tasks, you should critically assess the total cost of ownership beyond raw performance metrics. While Western models like Opus 4.7 offer higher first-attempt accuracy, GLM-5.2's significantly lower token pricing, at \$4.40 per million output tokens, can offset its higher token consumption. Your team could achieve comparable task completion rates at a substantially reduced operational cost, potentially impacting your budget allocation for AI infrastructure.

Key insights

Chinese AI model GLM-5.2 offers competitive coding performance at a fraction of Western models' cost, intensifying market price pressure.

Principles

High iteration counts do not guarantee correctness.
Price-performance ratios can disrupt market valuations.
Cross-platform validation is a key model strength.

Method

Snowflake's benchmark involved 103 coding tasks, each run three times, requiring models to generate code functional on both DuckDB and Snowflake platforms.

In practice

Evaluate models on real-world, multi-platform coding tasks.
Consider total cost of ownership, including token usage.
Explore non-Western models for cost-effective solutions.

Topics

Large Language Models
AI Benchmarking
GLM-5.2
Claude Opus 4.7
AI Pricing
Code Generation

Best for: CTO, Machine Learning Engineer, Entrepreneur, AI Engineer, Director of AI/ML, Investor

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.