From Specification to Execution: AI Assisted Scientific Workflow Management

2025-05-18 · Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Data Science & Analytics · Depth: Expert, extended

Summary

An AI-assisted approach to scientific workflow management is presented, combining specification-driven generation, automated debugging, and distributed execution. This method introduces a structured specification phase, separating workflow intent, design, and implementation for validation prior to code generation. An LLM-based debugging agent diagnoses and resolves failures across multiple system layers. The system integrates Pegasus WMS with a Model Context Protocol (MCP) layer for unified interaction. Evaluated with a federated learning workflow for medical imaging, it generated and executed large-scale workflows with thousands of jobs, reduced debugging effort, and enabled non-expert users to construct expert-level designs. Claude Code, using the "pegasus-ai" plugin, produced a feature-complete workflow in 2 sessions for ~\$15-20, significantly outperforming OpenAI Codex and Kimi K2.6 in completeness and reproducibility, and reducing development time from an estimated 3-4 months to days.

Key takeaway

For MLOps Engineers designing complex scientific pipelines, adopting AI-assisted workflow management can drastically cut development time from months to days. You should prioritize systems that offer specification-driven generation and integrated LLM-based debugging, like the Claude Code with "pegasus-ai" plugin, to ensure reproducibility and autonomous fault recovery. This approach allows non-experts to achieve expert-level designs, freeing up specialized talent for optimization.

Key insights

AI-assisted scientific workflow management enhances reproducibility and reduces development effort through structured generation and autonomous debugging.

Principles

Separate workflow intent, design, and implementation.
Use structured specifications for validation.
Ground LLMs with domain-specific skills/plugins.

Method

The proposed method involves a three-stage process: prompt/dataset description, structured specification generation, and code generation. It integrates an LLM-based debugging agent and Pegasus WMS with an MCP layer for distributed execution.

In practice

Employ specification-driven workflow generation.
Integrate LLM-based debugging agents for fault recovery.
Utilize WMS like Pegasus for scalable execution.

Topics

AI-assisted Workflow Management
Scientific Workflows
LLM-based Debugging
Pegasus WMS
Federated Learning
Model Context Protocol

Code references

Best for: Machine Learning Engineer, Research Scientist, AI Scientist, AI Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.