MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop

2026-05-19 · Source: cs.MA updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Operations & Process Management · Depth: Advanced, extended

Summary

MADP is a multi-agent architecture designed for sustainable enterprise document processing, combining deep learning classification, large language model (LLM) extraction, and human-in-the-loop (HITL) validation. The system integrates five specialized agents: Classificator, Splitter, Parser, Extraction, and Validator, along with a novel Prompt Fine Tuning with Feedback Inheritance (PFTFI) mechanism. Operational analysis on a production use-case of 100,000 invoices per year projects a 70% reduction in Full-Time Equivalent (FTE) requirements. Production deployment on 955 real-world documents through January 2026 achieved a 97.0% full-pipeline automation rate. An ablation study on a 100-document subset demonstrated 98.5% document-level accuracy with HITL supervision. The hybrid AI+HITL approach also reduces CO2 emissions by 69%, energy consumption by 69%, and water usage by 63% compared to traditional manual processing.

Key takeaway

For AI Architects and CTOs evaluating document automation solutions, MADP demonstrates that a hybrid AI+HITL multi-agent system can achieve high accuracy (98.5%) and significant operational savings (70% FTE reduction) while also reducing environmental impact. You should consider implementing modular, agent-based architectures with integrated human feedback loops to enhance reliability and sustainability in your document-intensive workflows, especially for mission-critical applications like invoice processing.

Key insights

A multi-agent AI pipeline with human oversight significantly boosts document processing accuracy and sustainability.

Principles

Modular agent design improves robustness.
Structured preprocessing enhances LLM extraction.
Human feedback refines AI without retraining.

Method

MADP orchestrates five agents (Classificator, Splitter, Parser, Extraction, Validator) with a PFTFI feedback loop, using human corrections to update prompts for continuous improvement without model retraining.

In practice

Use ResNet-18 for fast document classification.
Employ Docling for structured markdown parsing.
Benchmark LLM backends for optimal F1/precision/latency.

Topics

Multi-Agent Systems
Document Processing Automation
Human-in-the-Loop
Prompt Fine Tuning with Feedback Inheritance
Sustainability Analysis

Code references

Best for: AI Architect, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.MA updates on arXiv.org.