Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Neural Natural Language Inference (NLI) models frequently overfit dataset artifacts rather than performing genuine reasoning, as evidenced by a hypothesis-only model achieving 57.7% accuracy on SNLI. These artifacts contribute to 38.6% of baseline model errors. To address this, researchers propose Product-of-Experts (PoE) training, a method that downweights examples where biased models exhibit high confidence. PoE training nearly maintains accuracy, with a slight drop from 89.30% to 89.10%, while significantly reducing reliance on bias by 4.71%, moving bias agreement from 49.85% to 45%. An ablation study identified a lambda value of 1.5 as optimal for balancing debiasing and accuracy. Despite these improvements, behavioral tests indicate persistent challenges with negation and numerical reasoning.

Key takeaway

For AI Engineers developing NLI models, consider integrating Product-of-Experts (PoE) training to mitigate dataset artifact overfitting. This approach can reduce bias reliance by 4.71% while maintaining high accuracy (e.g., 89.10% on SNLI). Evaluate your model's performance on negation and numerical reasoning, as these areas may still present challenges even with debiasing techniques.

Key insights

Product-of-Experts (PoE) training reduces NLI model reliance on dataset artifacts while preserving accuracy.

Principles

Method

PoE training downweights examples where biased models are overconfident, balancing debiasing with accuracy preservation. A lambda of 1.5 was found optimal.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.