Now in Foundry: Qwen3.5 Medium Model Series
Summary
The Qwen3.5 Medium Model Series, now available in Microsoft Foundry, comprises three Vision Language Models (VLMs) featuring early-fusion multimodal training, a 262K native context window, and support for 201 languages under Apache 2.0. These models incorporate unified vision-language training, Gated Delta Networks for linear attention, and scalable reinforcement learning for post-training. The series includes a 27B dense model optimized for latency-sensitive applications, a 35B total parameter MoE model activating 3B parameters for high-throughput and cost-efficiency, and a 122B total parameter MoE model activating 10B parameters, offering frontier-class multimodal performance and expert-level knowledge depth. Each model is designed for specific use cases, from real-time visual inspection to complex financial research.
Key takeaway
For AI/ML Directors evaluating multimodal models for production, the Qwen3.5 Medium Series in Microsoft Foundry offers specialized options. Your choice should align with specific operational needs: Qwen3.5-27B for predictable low-latency tasks, Qwen3.5-35B-A3B for cost-optimized high-throughput scenarios, or Qwen3.5-122B-A10B for maximum capability in complex reasoning. Consider deploying directly via the Hugging Face collection in Foundry for streamlined integration and secure inference.
Key insights
Qwen3.5 models offer diverse VLM capabilities, from low-latency dense to cost-efficient MoE architectures.
Principles
- Early-fusion VLM training enhances reasoning.
- Gated Delta Networks improve inference throughput.
- MoE routing balances performance and cost.
Method
Qwen3.5 models use unified vision-language training, Gated Delta Networks for attention, and scalable RL across multi-agent environments for post-training.
In practice
- Use Qwen3.5-27B for real-time visual inspection.
- Deploy Qwen3.5-35B-A3B for high-throughput contract review.
- Leverage Qwen3.5-122B-A10B for complex financial analysis.
Topics
- Qwen3.5 Models
- Vision Language Models
- Mixture-of-Experts
- Multimodal AI
- Microsoft Foundry
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Engineer, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.