Deft Scheduling of Dynamic Cloud Workflows with Varying Deadlines via Mixture-of-Experts

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure · Depth: Expert, quick

Summary

DEFT (Deadline-perceptive Mixture-of-Experts) is a novel deep reinforcement learning (DRL) policy architecture designed for dynamic cloud workflow scheduling, addressing the challenge of allocating graph-structured workflows with varying deadlines onto changing virtual machine resources. Published on 2026-05-31, DEFT introduces and validates a Mixture-of-Experts (MoE) architecture, where individual experts are trained to manage specific levels of deadline tightness. Its core innovation is a graph-adaptive gating mechanism that encodes workflow deadlines, DAGs, task states, and VM conditions, utilizing cross-attention to activate the most appropriate expert. This adaptive routing enables DEFT to meet a broad spectrum of deadline requirements, significantly reducing execution cost and deadline violations compared to multiple advanced DRL baselines in dynamic cloud workflow benchmarks.

Key takeaway

For Machine Learning Engineers or Cloud Operations Managers designing dynamic cloud workflow schedulers, DEFT's Mixture-of-Experts DRL architecture presents a compelling approach. You should investigate this method to significantly reduce execution costs and minimize deadline violations for complex, graph-structured workflows. Its adaptive, deadline-perceptive gating mechanism offers a more robust solution than traditional single-path DRL schedulers, enhancing overall system efficiency and reliability in variable cloud environments.

Key insights

DEFT uses a Mixture-of-Experts DRL architecture with a graph-adaptive gating mechanism to optimize dynamic cloud workflow scheduling.

Principles

Method

DEFT employs a DRL policy with a Mixture-of-Experts. A graph-adaptive gating mechanism encodes workflow deadlines, DAGs, task states, and VM conditions, using cross-attention to route decisions to the most appropriate expert for dynamic cloud workflow scheduling.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Architect

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.