MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop

· Source: cs.MA updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Operations & Process Management · Depth: Advanced, extended

Summary

MADP is a multi-agent architecture designed for sustainable enterprise document processing, combining deep learning classification, large language model (LLM) extraction, and human-in-the-loop (HITL) validation. The system integrates five specialized agents: Classificator, Splitter, Parser, Extraction, and Validator, along with a novel Prompt Fine Tuning with Feedback Inheritance (PFTFI) mechanism. Operational analysis on a production use-case of 100,000 invoices per year projects a 70% reduction in Full-Time Equivalent (FTE) requirements. Production deployment on 955 real-world documents through January 2026 achieved a 97.0% full-pipeline automation rate. An ablation study on a 100-document subset demonstrated 98.5% document-level accuracy with HITL supervision. The hybrid AI+HITL approach also reduces CO2 emissions by 69%, energy consumption by 69%, and water usage by 63% compared to traditional manual processing.

Key takeaway

For AI Architects and CTOs evaluating document automation solutions, MADP demonstrates that a hybrid AI+HITL multi-agent system can achieve high accuracy (98.5%) and significant operational savings (70% FTE reduction) while also reducing environmental impact. You should consider implementing modular, agent-based architectures with integrated human feedback loops to enhance reliability and sustainability in your document-intensive workflows, especially for mission-critical applications like invoice processing.

Key insights

A multi-agent AI pipeline with human oversight significantly boosts document processing accuracy and sustainability.

Principles

Method

MADP orchestrates five agents (Classificator, Splitter, Parser, Extraction, Validator) with a PFTFI feedback loop, using human corrections to update prompts for continuous improvement without model retraining.

In practice

Topics

Code references

Best for: AI Architect, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.MA updates on arXiv.org.