NVIDIA Nemotron 3 Nano 30B MoE model is now available in Amazon SageMaker JumpStart

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure · Depth: Intermediate, short

Summary

The NVIDIA Nemotron 3 Nano 30B model, featuring 3 billion active parameters, is now generally available in the Amazon SageMaker JumpStart model catalog. This small language hybrid Mixture-of-Experts (MoE) model is designed for high compute efficiency and accuracy, excelling in coding, scientific reasoning, and math. It leads on benchmarks such as SWE Bench Verified, GPQA Diamond, AIME 2025, Arena Hard v2, LiveCodeBench, BFCL, and IFBench. Nemotron 3 Nano is fully open-source, providing open weights, datasets, and recipes for customization and deployment on various infrastructures. It supports a context window of up to 1 million tokens and functions as a text-based foundation model for both inputs and outputs.

Key takeaway

For AI Engineers building generative AI applications, Nemotron 3 Nano 30B offers a powerful, open-source MoE model with strong performance in coding and reasoning. You should consider deploying it via Amazon SageMaker JumpStart to leverage its managed capabilities and accelerate development without managing complex infrastructure, while also benefiting from its customization options.

Key insights

NVIDIA's Nemotron 3 Nano 30B, an open-source MoE model, offers high efficiency and accuracy for agentic tasks on AWS.

Principles

Method

Deploy Nemotron 3 Nano via Amazon SageMaker JumpStart by selecting the model in SageMaker Studio, deploying to an endpoint, and interacting using AWS CLI or SageMaker SDK.

In practice

Topics

Code references

Best for: AI Engineer, Machine Learning Engineer, Software Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.