Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

2026-04-23 · Source: Hugging Face - Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Software Development & Engineering · Depth: Intermediate, long

Summary

NVIDIA released Nemotron 3.5 Content Safety on June 4, 2026, a 4B-parameter model built on Google Gemma 3 4B IT. This model unifies multimodal input, multilingual reach across 12 explicitly trained languages and approximately 140 zero-shot languages, custom enterprise policy enforcement, and auditable reasoning into a single inference call. It processes user prompts, optional images, and assistant responses together to catch policy violations arising from interactions between modalities. A key architectural addition is custom policy enforcement, allowing the model to reason over user-defined policies instead of a fixed taxonomy. Nemotron 3.5 also offers an optional "THINK mode" for step-by-step reasoning traces, aiding compliance and human review. The accompanying multimodal, multilingual safety dataset, including reasoning traces and 99% real photographs, is also released. Benchmarks show Nemotron 3.5 achieves approximately 85% average accuracy on multimodal harmful-content tests and 92.7% on combined Aegis and RTP-LX multilingual benchmarks, while maintaining low latency.

Key takeaway

For AI Architects designing global enterprise AI systems, Nemotron 3.5 Content Safety offers a unified solution to complex moderation challenges. You can enforce custom, domain-specific policies across multimodal and multilingual inputs, ensuring consistent safety posture without deploying separate regional or modality-specific models. Utilize its "THINK mode" for auditable reasoning, crucial for compliance and refining policy language, while maintaining low latency for real-time decisions. This streamlines safety integration and reduces operational overhead.

Key insights

Nemotron 3.5 Content Safety unifies multimodal, multilingual, and custom policy enforcement with auditable reasoning in a single, efficient model.

Principles

Multimodal safety requires evaluating interactions, not just individual components.
Customizable policies are essential for diverse enterprise risk profiles.
Auditable reasoning enhances compliance and policy refinement.

Method

Nemotron 3.5 fine-tunes Google Gemma 3 4B IT with a LoRA adapter, using a 2-step process for concise reasoning traces generated by larger teacher models (Qwen 397B, Qwen 80B).

In practice

Integrate Nemotron 3.5 for unified multimodal and multilingual safety checks.
Define custom policies using the provided skill for domain-specific rules.
Enable "THINK mode" for auditable reasoning traces in regulated environments.

Topics

Nemotron 3.5
Multimodal Safety
Multilingual AI
Custom Policy Enforcement
AI Safety Auditing
Content Moderation

Code references

NVIDIA-NeMo/Nemotron

Best for: CTO, VP of Engineering/Data, Machine Learning Engineer, MLOps Engineer, AI Engineer, AI Architect

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Hugging Face - Blog.