Introducing Gemma 4 models on Amazon Bedrock

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure, Software Development & Engineering · Depth: Intermediate, extended

Summary

Amazon Bedrock now offers the Gemma 4 family of open-weight models, developed by Google DeepMind under the Apache 2.0 license. This family includes three instruction-tuned variants: Gemma 4 31B (30.7B dense), Gemma 4 26B-A4B (25.2B total / 3.8B active MoE), and Gemma 4 E2B (5.1B total / 2.3B effective dense). These models feature built-in reasoning, native function calling, and multimodal input across text and image, supporting over 35 languages. Gemma 4 31B achieved an Intelligence Index of 39, significantly above the 4B–40B open-weights class median of 15. Amazon Bedrock provides these models as a fully managed service via the "bedrock-mantle" endpoint, ensuring data protection and operational control. The service offers Standard, Priority, and Flex tiers, with context windows up to 256K tokens. Gemma 4 models are available in four AWS Regions: US East (N. Virginia), US East (Ohio), US West (Oregon), and Europe (Frankfurt).

Key takeaway

For AI Engineers deploying open-weight models, Amazon Bedrock's Gemma 4 family offers a secure, managed inference solution. You can select variants like Gemma 4 31B for reasoning-heavy tasks or Gemma 4 26B-A4B for cost-sensitive, high-throughput needs, leveraging native function calling and multimodal input. Implement exponential backoff for transient errors and gradually ramp up traffic to avoid 503s. Consider the Priority tier for latency-sensitive applications to ensure consistent performance without upfront commitment, optimizing your operational costs and reliability.

Key insights

Gemma 4 models on Amazon Bedrock offer flexible, open-weight AI with multimodal capabilities and managed inference for diverse workloads.

Principles

Method

Access Gemma 4 via the "bedrock-mantle" endpoint using OpenAI SDKs, configure IAM permissions, and utilize the console playground for testing.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.