Probability is not Likelihood
Summary
The core distinction between probability and likelihood hinges on which variable is fixed in the formula P(X|θ). Probability quantifies the chance of observing specific data (X) given a known model parameter (θ), where X varies and θ is constant. For example, with a coin having P=0.5 for heads, the probability of getting seven heads in 10 flips is approximately 0.117, and all possible outcome probabilities sum to one, forming a probability distribution. Conversely, likelihood assesses the plausibility of different model parameters (θ) given observed, fixed data (X). If seven heads are observed in 10 flips, the likelihood function plots the plausibility of various P values, peaking at P=0.7. Unlike probability distributions, the area under a likelihood function does not sum to one, as it indicates how well each parameter value explains the data, not the probability of the parameter itself. The maximum likelihood estimate (MLE) identifies the parameter value that maximizes this likelihood function, such as P-hat = 0.7 for the coin example.
Key takeaway
For Data Scientists or Machine Learning Engineers working on model parameter estimation, understanding the fundamental difference between probability and likelihood is crucial. Your choice of interpretation for P(X|θ) dictates whether you are predicting data outcomes or inferring model parameters. Always remember that likelihood functions, while indicating parameter plausibility, are not probability distributions for the parameters themselves, guiding your statistical inference correctly.
Key insights
Probability fixes model parameters and varies data outcomes; likelihood fixes data and varies model parameters.
Principles
- Probability distributions sum to one.
- Likelihood functions do not integrate to one.
Method
The Maximum Likelihood Estimate (MLE) identifies the parameter value that maximizes the likelihood function for observed data.
In practice
- Use probability for predicting outcomes given a model.
- Use likelihood for estimating model parameters from data.
Topics
- Probability vs. Likelihood
- Conditional Probability
- Probability Distribution
- Likelihood Function
- Maximum Likelihood Estimate
Best for: AI Student, Data Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by DataMListic.