EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models

· Source: cs.AI updates on arXiv.org · Field: Health & Wellbeing — Health & Medical Research, Medical Devices & Health Technology, Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

EasyLens is a training-free, plug-and-play subtle-lesion representation amplifier designed for medical vision-language models (VLMs). It addresses the limited sensitivity of frozen medical VLMs to subtle lesions, which are often sparse, low-contrast, and underrepresented in global image embeddings. EasyLens comprises three components: EasyBank, a pathology-anatomy prototype space; EasyTag, a counterfactual prototype reasoning module for selecting lesion-relevant patches; and EasyAmplifier, which strengthens these selected patch representations through morphology-guided residual enhancement. Experiments on ReXGroundingCT, LIDC-IDRI, and AbdomenAtlas 3.0 Mini, using frozen MedGemma1.5, LLaVA-Med, RadFM, Lingshu, and MedGemma backbones, demonstrate consistent improvements in subtle-lesion detection, region selection, and report generation without model fine-tuning.

Key takeaway

For AI Scientists and Machine Learning Engineers working with medical VLMs, EasyLens offers a critical solution to improve subtle lesion detection without costly retraining. If your current frozen models struggle with sparse, low-contrast abnormalities, you should consider integrating this training-free, plug-and-play amplifier. EasyLens enhances lesion-relevant visual evidence at inference time, enabling more reliable diagnostic support and report generation for micro-lesions, thereby increasing the clinical utility of your existing VLM deployments.

Key insights

Training-free EasyLens amplifies subtle lesion cues in frozen VLM patch representations, improving detection and report generation.

Principles

Method

EasyLens constructs EasyBank (prototype space), uses EasyTag for counterfactual patch selection, then EasyAmplifier applies morphology-guided residual enhancement to selected patches.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.