Explaining Neural Networks in Preference Learning: a Post-hoc Inductive Logic Programming Approach

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A new approach proposes using Learning from Answer Sets (ILASP) to approximate black-box Neural Networks (NNs) in user preference learning. The method specifically explores ILASP's application to preference learning systems via weak constraints. Researchers created a dataset of user preferences for recipes to train NNs, which are then approximated by ILASP. Experiments evaluated ILASP as both a global and local approximator, addressing the challenge of high-dimensional feature spaces. To maintain transparency and manage computational time, a preprocessing step utilizing Principal Component Analysis (PCA) is introduced to reduce dataset dimensionality. This work is under consideration for publication in Theory and Practice of Logic Programming (TPLP).

Key takeaway

For Research Scientists developing explainable AI for preference learning, this work demonstrates a viable path to approximate complex Neural Networks using Inductive Logic Programming. You should consider integrating ILASP with a PCA preprocessing step to achieve transparent explanations without excessive computational overhead, especially when dealing with high-dimensional user preference data.

Key insights

ILASP can approximate black-box Neural Networks for preference learning, even in high-dimensional spaces.

Principles

Method

The method involves training NNs on preference data, then using ILASP with weak constraints to approximate the NN's behavior. PCA is applied as a preprocessing step to reduce feature space dimensionality.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.