Probability is not Likelihood

· Source: DataMListic · Field: Technology & Digital — Data Science & Analytics, Artificial Intelligence & Machine Learning, Mathematics & Computational Sciences · Depth: Novice, quick

Summary

The core distinction between probability and likelihood hinges on which variable is fixed in the formula P(X|θ). Probability quantifies the chance of observing specific data (X) given a known model parameter (θ), where X varies and θ is constant. For example, with a coin having P=0.5 for heads, the probability of getting seven heads in 10 flips is approximately 0.117, and all possible outcome probabilities sum to one, forming a probability distribution. Conversely, likelihood assesses the plausibility of different model parameters (θ) given observed, fixed data (X). If seven heads are observed in 10 flips, the likelihood function plots the plausibility of various P values, peaking at P=0.7. Unlike probability distributions, the area under a likelihood function does not sum to one, as it indicates how well each parameter value explains the data, not the probability of the parameter itself. The maximum likelihood estimate (MLE) identifies the parameter value that maximizes this likelihood function, such as P-hat = 0.7 for the coin example.

Key takeaway

For Data Scientists or Machine Learning Engineers working on model parameter estimation, understanding the fundamental difference between probability and likelihood is crucial. Your choice of interpretation for P(X|θ) dictates whether you are predicting data outcomes or inferring model parameters. Always remember that likelihood functions, while indicating parameter plausibility, are not probability distributions for the parameters themselves, guiding your statistical inference correctly.

Key insights

Probability fixes model parameters and varies data outcomes; likelihood fixes data and varies model parameters.

Principles

Method

The Maximum Likelihood Estimate (MLE) identifies the parameter value that maximizes the likelihood function for observed data.

In practice

Topics

Best for: AI Student, Data Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by DataMListic.