Kubernetes v1.36: Security Defaults Tighten as AI Workload Support Matures

· Source: InfoQ · Field: Technology & Digital — Cloud Computing & IT Infrastructure, Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Advanced, medium

Summary

Kubernetes v1.36, codenamed Haru, was released on May 14, 2026, introducing 70 enhancements, with 18 stable, 25 beta, and 25 alpha features. This release significantly tightens security defaults, matures support for AI and machine learning workloads, and improves API scalability. Key security advancements include User Namespaces and Mutating Admission Policies reaching General Availability, enhancing container isolation and simplifying policy definition. For AI/ML, Dynamic Resource Allocation (DRA) enhancements are now enabled by default, replacing the integer-GPU model with primitives for partitioned and shared accelerators. Workload-Aware Preemption, an alpha feature, addresses partial preemption failures for distributed training jobs by treating PodGroups as single preemption units. Additionally, sharded list and watch streams are introduced as an alpha feature to alleviate watch stream bottlenecks in large clusters.

Key takeaway

For CTOs and VPs of Engineering managing large-scale Kubernetes deployments, v1.36 offers critical security and AI/ML workload improvements. You should prioritize upgrading to leverage User Namespaces and Mutating Admission Policies for enhanced security, and enable DRA features to optimize GPU utilization for AI workloads. Be aware of removals like `gitRepo` and IPVS mode in kube-proxy, planning migrations before upgrading to avoid disruption and ensure continued operational stability.

Key insights

Kubernetes v1.36 enhances security, AI/ML workload management, and API scalability through new stable and beta features.

Principles

Method

Mutating Admission Policies use Common Expression Language (CEL) for native mutation logic, eliminating external webhook servers. SELinux Volume Labeling applies labels at mount time via `mount -o context=XYZ` to reduce pod startup delays.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, AI Architect, MLOps Engineer, AI Engineer, DevOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.