GRAFT: Auditing Graph Neural Networks via Global Feature Attribution

· Source: Takara TLDR - Daily AI Papers · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, medium

Summary

GRAFT is a novel post-hoc global explanation framework introduced on May 5, 2026, by Rishi Raj Sahoo and Subhankar Mishra, designed to interpret Graph Neural Networks (GNNs) by identifying class-level feature importance profiles. While existing GNN explainers focus on structural motifs, GRAFT uniquely explains model behavior globally at the input node attribute level. The method integrates diversity-guided exemplar selection, Integrated Gradients-based attribution, and aggregation to create a comprehensive view of feature influence for each class. It further translates these insights into natural language rules using a self-refining large language model. Evaluated across various datasets, architectures, and experimental setups, GRAFT effectively captures model-relevant features, aids in bias analysis, and supports feature-efficient transfer learning. A structured human evaluation protocol was also developed to assess the interpretability of the generated rules, confirming GRAFT's practical utility in bridging quantitative attribution with human-understandable explanations for GNNs.

Key takeaway

For research scientists developing or deploying Graph Neural Networks, understanding which input features drive predictions is critical for trust and debugging. GRAFT offers a practical solution by providing global, class-level feature importance profiles and translating them into natural language rules. You should consider integrating GRAFT into your GNN development pipeline to enhance model interpretability, facilitate bias detection, and improve transfer learning efficiency.

Key insights

GRAFT provides global, class-level feature importance explanations for GNNs, translating them into natural language rules.

Principles

Method

GRAFT combines diversity-guided exemplar selection, Integrated Gradients-based attribution, and aggregation to construct class-level feature importance profiles, then uses an LLM with self-refinement to generate natural language rules.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.