LandingAI’s DPT-2 in 2026: Why Agentic Document Extraction Finally Makes Sense

2026-04-17 · Source: Towards AI - Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Software Development & Engineering · Depth: Intermediate, long

Summary

LandingAI has released its Agentic Document Extraction (ADE) system, powered by the DPT-2 (Document Pre-trained Transformer-2) model, with a Python SDK (`landingai-ade`) launched on March 11, 2026. ADE addresses the challenge of extracting structured data from visually complex documents like contracts, lab reports, and invoices, which often defeat traditional OCR and generic LLM approaches by discarding layout and visual context. DPT-2, announced September 30, 2025, processes both visual and textual elements jointly, introducing features like agentic table captioning with cell-level grounding, expanded chunk ontology for elements like checkboxes and barcodes, and refined figure captioning. The system operates via a three-API pipeline (Parse, Split, Extract) and employs an agentic loop for self-verification and replanning, enhancing accuracy on complex tables and long documents. LandingAI reports 99.16% accuracy on a subset of the DocVQA benchmark.

Key takeaway

For CTOs or VP of Engineering evaluating document intelligence solutions, LandingAI's ADE with DPT-2 offers a robust, vision-first approach for complex document extraction. You should prioritize the Zero Data Retention (ZDR) mode and EU data residency for GDPR compliance, and secure a Business Associate Agreement (BAA) for HIPAA-regulated data. Consider the Snowflake Native App for maximum data isolation if cloud processing is a concern, but be aware of the proprietary model and lack of self-hosting options.

Key insights

LandingAI's ADE system uses a vision-first, agentic approach to extract structured data from complex documents.

Principles

Documents are visual objects, not just text.
Agentic loops improve complex parsing accuracy.
One-size-fits-all models fail on diverse document types.

Method

ADE employs a Parse-Split-Extract pipeline, using DPT-2 to jointly process visual and textual elements, and an agentic loop to plan, decide, and self-verify extraction tasks.

In practice

Use `landingai-ade` SDK for document parsing.
Define Pydantic schemas for field extraction.
Configure Zero Data Retention for sensitive data.

Topics

Agentic Document Extraction
DPT-2 Model Architecture
Document AI
Computer Vision
Enterprise Compliance

Code references

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, MLOps Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Towards AI - Medium.