Interview with Deepika Vemuri: interpretability and concept-based learning
Summary
Deepika Vemuri, a PhD student at IIT Hyderabad, is researching interpretability and concept-based learning in AI models, particularly for high-stakes applications like criminal justice and healthcare. Her work aims to guide model learning through human-understandable concepts, similar to how humans learn by identifying shared features. She has explored two main aspects: enhancing concept-to-class mapping beyond linear layers by introducing a logic module with differentiable fuzzy logic gates, which improved accuracy and interpretability, leading to a WACV 2026 publication. Additionally, she investigated aligning neural network depth-wise representations with semantic hierarchies, guiding models to learn general concepts in early layers and specific ones in deeper layers using Formal Concept Analysis, resulting in more interpretable embeddings and hierarchically structured concept representations. She is also exploring concept representation in videos and developing new metrics for concept-based models.
Key takeaway
For research scientists and computer vision engineers developing interpretable AI, consider integrating logic modules or hierarchical concept learning into your models. This approach can improve both model accuracy and the clarity of concept-to-class associations, which is vital for deploying AI in sensitive domains. You should also explore new evaluation metrics beyond task accuracy to better assess the robustness and reliability of concept-based models against misleading information.
Key insights
Concept-based learning enhances AI interpretability by aligning model representations with human-understandable concepts.
Principles
- Interpretability is crucial for high-stakes AI applications.
- Logic operations can model complex concept-to-class relations.
- Semantic hierarchies improve concept learning in deep networks.
Method
A logic module with differentiable fuzzy logic gates learns predicates for classes. Formal Concept Analysis constructs a concept lattice to provide supervisory signals for hierarchical concept learning across network layers.
In practice
- Use fuzzy logic gates for non-linear concept associations.
- Structure concept learning hierarchically across network layers.
- Develop worst-case metrics for concept-based model evaluation.
Topics
- Interpretability
- Concept-Based Learning
- Logic Modules
- Formal Concept Analysis
- Video Language Models
Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Student
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by ΑΙhub.