EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models
Summary
EasyLens is a training-free, plug-and-play subtle-lesion representation amplifier designed for medical vision-language models (VLMs). It addresses the limited sensitivity of frozen medical VLMs to subtle lesions, which are often sparse, low-contrast, and underrepresented in global image embeddings. EasyLens comprises three components: EasyBank, a pathology-anatomy prototype space; EasyTag, a counterfactual prototype reasoning module for selecting lesion-relevant patches; and EasyAmplifier, which strengthens these selected patch representations through morphology-guided residual enhancement. Experiments on ReXGroundingCT, LIDC-IDRI, and AbdomenAtlas 3.0 Mini, using frozen MedGemma1.5, LLaVA-Med, RadFM, Lingshu, and MedGemma backbones, demonstrate consistent improvements in subtle-lesion detection, region selection, and report generation without model fine-tuning.
Key takeaway
For AI Scientists and Machine Learning Engineers working with medical VLMs, EasyLens offers a critical solution to improve subtle lesion detection without costly retraining. If your current frozen models struggle with sparse, low-contrast abnormalities, you should consider integrating this training-free, plug-and-play amplifier. EasyLens enhances lesion-relevant visual evidence at inference time, enabling more reliable diagnostic support and report generation for micro-lesions, thereby increasing the clinical utility of your existing VLM deployments.
Key insights
Training-free EasyLens amplifies subtle lesion cues in frozen VLM patch representations, improving detection and report generation.
Principles
- Subtle lesion cues are often preserved at patch-level but diluted in global VLM embeddings.
- Prototype spaces can differentiate pathological patterns from normal anatomical variations.
- Morphology-guided residual enhancement strengthens lesion signals without disrupting context.
Method
EasyLens constructs EasyBank (prototype space), uses EasyTag for counterfactual patch selection, then EasyAmplifier applies morphology-guided residual enhancement to selected patches.
In practice
- Integrate EasyLens as an inference-time adapter for existing medical VLMs.
- Build pathology-anatomy prototype banks for fine-grained lesion discrimination.
- Apply morphology-guided residual updates to enhance subtle lesion representations.
Topics
- Medical Vision-Language Models
- Subtle Lesion Detection
- Representation Amplification
- Prototype Learning
- Training-Free Inference
- Radiology AI
Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.