[D] How do you add theoretical justification to an AI/ML paper?
Summary
An AI/ML researcher, primarily experienced in empirical modeling, seeks guidance on integrating theoretical justifications, such as theorems, lemmas, and proofs, into their papers. The researcher notes difficulty in framing intuitive ideas rigorously, citing an example of measuring uncertainty in attention mechanisms by analyzing different attention head outputs. They also observe that many papers reference advanced theory not covered in their postgraduate studies. The discussion highlights varying perspectives on the necessity of theoretical backing, with some suggesting that empirical observations are sufficient, while others recommend collaborating with co-authors from pure math, statistics, or physics backgrounds to develop formal sections. One comment points to the "Troubling Trends in Machine Learning Scholarship" paper (arXiv:1807.03341) as relevant to the debate.
Key takeaway
For AI scientists focused on empirical modeling, you should evaluate whether formal theoretical justification truly enhances your paper or if robust experimental validation and clear intuition are sufficient. Consider collaborating with mathematicians or statisticians for theoretical sections, or explore existing theoretical frameworks that align with your empirical observations, rather than forcing a justification where it isn't naturally supported. This approach can save time and improve clarity.
Key insights
Theoretical justification in AI/ML papers is often not strictly necessary, with empirical validation frequently sufficing.
Principles
- Empirical observation can precede theoretical conceptualization.
- Avoid adding unnecessary theoretical machinery to empirical results.
Method
To bridge empirical work with theory, consider collaborating with co-authors from pure math, statistics, or physics, or explore existing theoretical frameworks like ensemble disagreement for uncertainty quantification.
In practice
- Consult existing work on ensemble disagreement for uncertainty.
- Seek co-authors with strong mathematical backgrounds.
Topics
- Theoretical Justification
- Empirical Modeling
- Attention Mechanisms
- Uncertainty Quantification
- Academic Collaboration
Best for: AI Scientist, AI Researcher, AI Student, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.