Alibaba's open Qwen 3.5 takes aim at GPT-5 mini and Claude Sonnet 4.5 at a fraction of the cost

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Alibaba has released an expanded Qwen 3.5 model series, including Qwen3.5-Flash, Qwen3.5-35B-A3B, Qwen3.5-122B-A10B, and Qwen3.5-27B. These models accept text, images, and video as input, generating text output, and are designed for stronger performance with reduced computational requirements. The smaller Qwen3.5-35B-A3B notably surpasses its larger predecessor, Qwen3-235B-A22B, indicating architectural and data quality improvements. Benchmarks show Qwen 3.5 models matching or exceeding Western counterparts like OpenAI's GPT-5 mini and Anthropic's Claude Sonnet 4.5 in areas such as agent-based tool use (BFCL V4, 72.2), web search (BrowseComp, 63.8), visual reasoning (MMMU-Pro, 76.9), and document recognition (OmniDocBench, 89.8). The models are available on Hugging Face and ModelScope under the Apache License 2.0, with Qwen3.5-Flash offering a 1-million-token context length and an API cost of $0.10 per million input tokens and $0.40 per million output tokens.

Key takeaway

For NLP Engineers and CTOs evaluating cost-effective, high-performance multimodal LLMs, Alibaba's Qwen 3.5 series presents a compelling open-source alternative. Its competitive benchmark scores against GPT-5 mini and Claude Sonnet 4.5, coupled with an Apache 2.0 license and low API costs for Qwen3.5-Flash, suggest a viable option for deploying advanced AI capabilities without the premium associated with proprietary models. Consider integrating Qwen 3.5 for agent-based applications requiring multimodal input.

Key insights

Alibaba's Qwen 3.5 models achieve competitive performance against top Western LLMs at lower computational cost.

Principles

In practice

Topics

Best for: NLP Engineer, Computer Vision Engineer, CTO, AI Engineer, Machine Learning Engineer, AI Product Manager

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.