Now in Foundry: Qwen3.5 Medium Model Series

2026-03-02 · Source: Microsoft Foundry Blog articles · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure · Depth: Intermediate, short

Summary

The Qwen3.5 Medium Model Series, now available in Microsoft Foundry, comprises three Vision Language Models (VLMs) featuring early-fusion multimodal training, a 262K native context window, and support for 201 languages under Apache 2.0. These models incorporate unified vision-language training, Gated Delta Networks for linear attention, and scalable reinforcement learning for post-training. The series includes a 27B dense model optimized for latency-sensitive applications, a 35B total parameter MoE model activating 3B parameters for high-throughput and cost-efficiency, and a 122B total parameter MoE model activating 10B parameters, offering frontier-class multimodal performance and expert-level knowledge depth. Each model is designed for specific use cases, from real-time visual inspection to complex financial research.

Key takeaway

For AI/ML Directors evaluating multimodal models for production, the Qwen3.5 Medium Series in Microsoft Foundry offers specialized options. Your choice should align with specific operational needs: Qwen3.5-27B for predictable low-latency tasks, Qwen3.5-35B-A3B for cost-optimized high-throughput scenarios, or Qwen3.5-122B-A10B for maximum capability in complex reasoning. Consider deploying directly via the Hugging Face collection in Foundry for streamlined integration and secure inference.

Key insights

Qwen3.5 models offer diverse VLM capabilities, from low-latency dense to cost-efficient MoE architectures.

Principles

Early-fusion VLM training enhances reasoning.
Gated Delta Networks improve inference throughput.
MoE routing balances performance and cost.

Method

Qwen3.5 models use unified vision-language training, Gated Delta Networks for attention, and scalable RL across multi-agent environments for post-training.

In practice

Use Qwen3.5-27B for real-time visual inspection.
Deploy Qwen3.5-35B-A3B for high-throughput contract review.
Leverage Qwen3.5-122B-A10B for complex financial analysis.

Topics

Qwen3.5 Models
Vision Language Models
Mixture-of-Experts
Multimodal AI
Microsoft Foundry

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.