From Boundaries to Semantics: Prompt-Guided Multi-Task Learning for Petrographic Thin-section Segmentation

· Source: Computer Vision and Pattern Recognition · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computer Vision · Depth: Expert, quick

Summary

Petro-SAM is a novel two-stage, multi-task framework designed for high-quality joint grain-edge segmentation (GES) and lithology semantic segmentation (LSS) on petrographic thin-section images. These two tasks are crucial for quantifying rock fabric and composition but traditionally suffer from separate treatment and high annotation costs. Petro-SAM addresses challenges like severe domain gaps due to extinction-dependent color variations and ultra-fine grain boundaries, which hinder direct adaptation of foundation models like the Segment Anything Model (SAM). The framework integrates seven polarized views using a Merge Block to resolve extinction issues and employs multi-scale feature fusion and color-entropy priors to refine segmentation detection, building upon SAM's robust boundary alignment capabilities.

Key takeaway

For Computer Vision Engineers developing geological imaging solutions, Petro-SAM offers a robust approach to joint grain-edge and lithology segmentation. You should consider its multi-view integration and prior-guided refinement techniques to overcome domain gaps and improve accuracy in petrographic image analysis, potentially reducing reliance on extensive expert-annotated datasets.

Key insights

Petro-SAM enables high-quality joint grain-edge and lithology segmentation in petrographic images by adapting SAM.

Principles

Method

Petro-SAM uses a two-stage, multi-task framework based on SAM, incorporating a Merge Block for multi-view integration and multi-scale feature fusion with color-entropy priors for refinement.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.