EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models

2026-06-06 · Source: cs.AI updates on arXiv.org · Field: Health & Wellbeing — Health & Medical Research, Medical Devices & Health Technology, Artificial Intelligence & Machine Learning · Depth: Expert, extended

Summary

EasyLens is a training-free, plug-and-play subtle-lesion representation amplifier designed for medical vision-language models (VLMs). It addresses the limited sensitivity of frozen medical VLMs to subtle lesions, which are often sparse, low-contrast, and underrepresented in global image embeddings. EasyLens comprises three components: EasyBank, a pathology-anatomy prototype space; EasyTag, a counterfactual prototype reasoning module for selecting lesion-relevant patches; and EasyAmplifier, which strengthens these selected patch representations through morphology-guided residual enhancement. Experiments on ReXGroundingCT, LIDC-IDRI, and AbdomenAtlas 3.0 Mini, using frozen MedGemma1.5, LLaVA-Med, RadFM, Lingshu, and MedGemma backbones, demonstrate consistent improvements in subtle-lesion detection, region selection, and report generation without model fine-tuning.

Key takeaway

For AI Scientists and Machine Learning Engineers working with medical VLMs, EasyLens offers a critical solution to improve subtle lesion detection without costly retraining. If your current frozen models struggle with sparse, low-contrast abnormalities, you should consider integrating this training-free, plug-and-play amplifier. EasyLens enhances lesion-relevant visual evidence at inference time, enabling more reliable diagnostic support and report generation for micro-lesions, thereby increasing the clinical utility of your existing VLM deployments.

Key insights

Training-free EasyLens amplifies subtle lesion cues in frozen VLM patch representations, improving detection and report generation.

Principles

Subtle lesion cues are often preserved at patch-level but diluted in global VLM embeddings.
Prototype spaces can differentiate pathological patterns from normal anatomical variations.
Morphology-guided residual enhancement strengthens lesion signals without disrupting context.

Method

EasyLens constructs EasyBank (prototype space), uses EasyTag for counterfactual patch selection, then EasyAmplifier applies morphology-guided residual enhancement to selected patches.

In practice

Integrate EasyLens as an inference-time adapter for existing medical VLMs.
Build pathology-anatomy prototype banks for fine-grained lesion discrimination.
Apply morphology-guided residual updates to enhance subtle lesion representations.

Topics

Medical Vision-Language Models
Subtle Lesion Detection
Representation Amplification
Prototype Learning
Training-Free Inference
Radiology AI

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.