COTCAgent: Preventive Consultation via Probabilistic Chain-of-Thought Completion

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Medical Devices & Health Technology · Depth: Advanced, quick

Summary

COTCAgent, a Probabilistic Chain-of-Thought Completion Agent, is introduced as a hierarchical reasoning framework designed to improve longitudinal electronic health record (EHR) analysis. Current large language models struggle with EHRs due to issues like hallucinating clinical trends from implied quantitative evidence and failing to capture long-range temporal dependencies from non-uniform time series and scarce labels. COTCAgent addresses these by using a Temporal-Statistics Adapter for standardized trend output, a Chain-of-Thought Completion layer with a symptom-trend-disease knowledge base for risk evaluation, and a bounded completion module for structured evidence acquisition. This framework decouples statistical computation, feature matching, and language generation, reducing reliance on complex multi-modal inputs and lowering computational overhead. Powered by Baichuan-M2, COTCAgent achieved 90.47% Top-1 accuracy on a self-built dataset and 70.41% on HealthBench, surpassing existing medical agents and mainstream LLMs.

Key takeaway

For AI Scientists and Machine Learning Engineers developing clinical decision support systems, COTCAgent offers a robust approach to overcome current LLM limitations in longitudinal EHR reasoning. You should consider adopting its hierarchical framework, particularly its decoupled statistical computation and probabilistic chain-of-thought completion, to improve diagnostic accuracy and reduce computational overhead in your medical AI applications.

Key insights

COTCAgent enhances longitudinal EHR reasoning by decoupling statistical computation and leveraging a probabilistic chain-of-thought.

Principles

Method

COTCAgent uses a Temporal-Statistics Adapter for trend output, a Chain-of-Thought Completion layer with a knowledge base for risk evaluation, and a bounded completion module for structured evidence acquisition via iterative scoring.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.