A data model is not just a “likelihood”
Summary
A common misconception in Bayesian modeling is the interchangeable use of "data model" and "likelihood." A Bayesian model is fundamentally defined by the joint distribution p(y, \theta), which factorizes into the data model p(y | \theta) and the prior p(\theta). While p(y | \theta) as a function of the observed data y is the data model, the same expression p(y | \theta) as a function of the parameters \theta is the likelihood function. This distinction is crucial because, for example, a discrete Bernoulli data model yields a continuous likelihood function. Furthermore, different data models, such as Poisson and Gamma, can produce identical likelihood shapes, making it impossible to infer the data model solely from the likelihood. The article emphasizes that in complex scenarios like hierarchical or missing data models, the boundary between data and parameters blurs, making precise terminology even more critical for clarity.
Key takeaway
For AI Scientists and Research Scientists developing or interpreting Bayesian models, understanding the precise distinction between a "data model" and a "likelihood function" is critical. Misusing these terms, especially in complex hierarchical or missing data scenarios, can lead to fundamental misunderstandings about model structure and implications. Ensure your model specifications clearly differentiate between the generative process for data and the parameter-dependent likelihood function to avoid conceptual errors and facilitate accurate prior predictive checks.
Key insights
Distinguishing between a data model and a likelihood function is crucial for accurate Bayesian modeling.
Principles
- Data model is p(y | \theta) as a function of y.
- Likelihood is p(y | \theta) as a function of \theta.
- Different data models can yield identical likelihoods.
Method
When defining a Bayesian model, explicitly distinguish between the generative data model p(y | \theta) and the likelihood function, especially in Stan's `y ~ normal(mu, sigma)` vs. `target += normal_lpdf(y | mu, sigma)`.
In practice
- Use "data model" for prior checking.
- Recognize that likelihoods can be continuous for discrete data models.
- Be precise with terminology in hierarchical models.
Topics
- Bayesian Modeling
- Data Model
- Likelihood Function
- Generative Models
- Prior Predictive Distribution
Best for: AI Scientist, Research Scientist, Data Scientist, AI Researcher, AI Student
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Statistical Modeling, Causal Inference, and Social Science.