Latest open artifacts (#17): NVIDIA, Arcee, Minimax, DeepSeek, Z.ai and others close an eventful year on a high note

· Source: Interconnects AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Advanced, quick

Summary

The open ecosystem continues its rapid expansion into 2026, with several significant model releases. LLM360, a project from MBZUAI, introduced K2-V2, a 70B dense model with fully open pre-training (12T tokens) and SFT data, generated using GPT-OSS 120B. NVIDIA updated its Nemotron series with NVIDIA-Nemotron-3-Nano-30B-A3B-BF16, featuring a Mamba2-Transformer MoE architecture and plans for larger 100B and 500B models in H1 2026 utilizing Latent MoE and multi-token prediction. Arcee-ai released Trinity-Mini (26B-A3B MoE) and Trinity-Nano (6B-A1B MoE), with a 420B-A13B MoE Large model planned. Zhipu's GLM-4.7, released before its January 8th IPO, demonstrates strong performance in tasks like GPVal-AA and DesignArena, despite not leading on academic benchmarks. Other notable releases include Llama-3.3-8B-Instruct, ServiceNow-AI's Apriel-1.6-15b-Thinker, XiaomiMiMo's 309B-A15B MoE MiMo-V2-Flash, and DeepSeek-V3.2, including a "Speciale" version claiming gold-medal performance on IMO and IOI.

Key takeaway

For AI Architects evaluating new open-source models, prioritize those offering full data transparency like LLM360's K2-V2 or advanced architectures such as NVIDIA's Nemotron-3 MoE series. Your decision should weigh specific task performance, as seen with GLM-4.7 excelling in practical applications over academic benchmarks, against resource requirements like GLM-4.7's slower inference. Consider the upcoming larger models from NVIDIA and Arcee for future scaling needs.

Key insights

The open model ecosystem is rapidly evolving with diverse architectures, data transparency, and specialized performance.

Principles

Method

LLM360 generated SFT data using GPT-OSS 120B across three reasoning levels. ServiceNow-AI used GSPO with length and verbosity penalties to reduce token usage.

In practice

Topics

Best for: AI Architect, NLP Engineer, AI Scientist, AI Engineer, Machine Learning Engineer, AI Researcher

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Interconnects AI.