| Bayes' theorem | |
|---|---|
| Name | Bayes' theorem |
| Field | Probability theory |
| Introduced | Thomas Bayes (1763, posthumous) |
| Notable users | Pierre-Simon Laplace, Ronald Fisher, Harold Jeffreys |
Bayes' theorem is a mathematical result in probability theory that relates conditional probabilities of events and provides a rule for updating degrees of belief in hypotheses given evidence. It underpins statistical inference and decision-making across fields, connecting work by Thomas Bayes, Pierre-Simon Laplace, and later contributors in statistics and philosophy. The theorem has influenced developments in machine learning, medicine, and jurisprudence and remains central to debates between frequentist and Bayesian schools.
The theorem originates with Thomas Bayes and was communicated posthumously to the Royal Society of London, later popularized by Pierre-Simon Laplace, who applied related ideas in celestial mechanics and demographics. Subsequent contributors include Adrien-Marie Legendre in astronomy, Carl Friedrich Gauss in geodesy, and later statisticians such as Ronald Fisher and Harold Jeffreys, who debated methodology in contexts involving the Royal Statistical Society and institutions such as the University of Cambridge and the University of Oxford. Philosophers such as David Hume and John Stuart Mill influenced the interpretation of inductive reasoning, while 20th-century applications were driven by groups at Bell Labs, the Massachusetts Institute of Technology, and Stanford University, where Bayesian ideas intersected with work by Alan Turing during World War II efforts at Bletchley Park and with artificial intelligence research at Carnegie Mellon University and the University of California, Berkeley.
In formal probability, the theorem relates P(A|B) to P(B|A), P(A), and P(B), a relation used in derivations within the measure-theoretic treatments of Émile Borel and Andrey Kolmogorov at institutions such as the École Normale Supérieure and the Steklov Institute. A common proof employs the product rule for joint probability, as found in texts by Kolmogorov, and appears in treatments by Harald Cramér and Jerzy Neyman from the Institute of Mathematical Statistics. The proof is elementary: from P(A ∩ B) = P(A)P(B|A) = P(B)P(A|B), solving for P(A|B) (assuming P(B) > 0) yields the canonical ratio used in inference. Variants and rigorous formulations appear in the works of Bruno de Finetti and Leonard Savage in the context of subjective-probability debates at Princeton University and the University of Chicago.
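The elementary proof described above can be written out in full; the second line adds the law of total probability, which is the usual way P(B) is expanded in applications:

```latex
\begin{align}
P(A \cap B) &= P(A)\,P(B \mid A) = P(B)\,P(A \mid B) \\
\Rightarrow \quad P(A \mid B) &= \frac{P(B \mid A)\,P(A)}{P(B)}, \qquad P(B) > 0, \\
\text{where} \quad P(B) &= P(B \mid A)\,P(A) + P(B \mid A^{c})\,P(A^{c}).
\end{align}
```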
The theorem is applied in diagnostics and decision support in medicine at institutions such as the Mayo Clinic and Johns Hopkins Hospital, in legal evidence assessment in courts including discussions at the Supreme Court of the United States, and in signal processing in projects at NASA and the European Space Agency. In engineering, groups at General Electric and Siemens have used Bayesian updating for reliability; in finance, traders and researchers at Goldman Sachs and BlackRock employ Bayesian risk models. Theorem-driven methods inform spam filters developed by teams at Microsoft and Google, and underpin modern machine learning algorithms at corporations such as OpenAI and research labs at DeepMind. Applications also span ecology in studies by the Smithsonian Institution, epidemiology in outbreaks investigated by the Centers for Disease Control and Prevention and the World Health Organization, and policy modeling in United Nations analyses.
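The diagnostic use described above is the textbook application of the theorem: given a test's sensitivity and specificity and a disease's prevalence, Bayes' theorem gives the probability of disease after a positive result. A minimal sketch, with prevalence and test figures invented purely for illustration:

```python
def posterior(prior, sensitivity, specificity):
    """P(disease | positive test) via Bayes' theorem."""
    # P(positive) by the law of total probability:
    # true positives among the diseased plus false positives among the healthy
    p_positive = sensitivity * prior + (1 - specificity) * (1 - prior)
    return sensitivity * prior / p_positive

# Hypothetical figures: 1% prevalence, 95% sensitivity, 90% specificity
p = posterior(prior=0.01, sensitivity=0.95, specificity=0.90)
print(round(p, 4))  # → 0.0876
```

The result illustrates the base-rate effect: even a fairly accurate test raises the probability of disease only from 1% to roughly 9%, because false positives from the large healthy population dominate.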
Bayesian inference uses the theorem to update a prior distribution into a posterior distribution, as formalized by statisticians at Columbia University and Yale University and implemented in software from projects originating at the University of Washington and Carnegie Mellon University. Methods include Markov chain Monte Carlo, developed in part by researchers at the University of Toronto, and the Metropolis–Hastings algorithm, linked to work at Los Alamos National Laboratory, as well as variational inference techniques pursued at Google Research and Facebook AI Research. Model comparison, hierarchical modeling, and Bayesian nonparametrics draw on contributions by Jerzy Neyman, Egon Pearson, and later John Tukey, with practical use in genetics at the Broad Institute and neuroimaging at Massachusetts General Hospital.
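The Metropolis–Hastings algorithm mentioned above draws samples from a posterior known only up to a normalizing constant, by proposing moves and accepting them with a probability ratio. A minimal random-walk sketch; the standard-normal target and the step size are illustrative choices, not taken from the source:

```python
import math
import random

def metropolis_hastings(log_target, x0, n_steps, step=1.0, seed=0):
    """Random-walk Metropolis: propose x' ~ N(x, step), accept with
    probability min(1, target(x') / target(x))."""
    rng = random.Random(seed)
    x, samples = x0, []
    for _ in range(n_steps):
        proposal = x + rng.gauss(0.0, step)
        # Accept or reject using the log acceptance ratio
        if math.log(rng.random()) < log_target(proposal) - log_target(x):
            x = proposal
        samples.append(x)
    return samples

# Target: standard normal log-density (up to an additive constant)
samples = metropolis_hastings(lambda x: -0.5 * x * x, x0=0.0, n_steps=20000)
mean = sum(samples) / len(samples)
print(round(mean, 2))  # close to 0, the target mean
```

Note that the symmetric proposal makes the proposal densities cancel in the acceptance ratio; an asymmetric proposal would need the full Hastings correction.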
Extensions include Bayesian networks, developed by Judea Pearl and colleagues at the University of California, Los Angeles, and causal inference frameworks used in analyses by researchers at Harvard University and Princeton University. Generalizations to continuous spaces and to measure-theoretic conditional expectation are central in work at the Institute for Advanced Study and in textbooks by Kolmogorov and Sergei Sobolev. Decision-theoretic integrations connect to von Neumann and Morgenstern game theory at Princeton University and to statistical learning theory advanced at Bell Labs and AT&T Labs. Quantum variants and quantum Bayesianism have been explored by researchers at the Perimeter Institute and the University of Waterloo, while reliable computational extensions underpin probabilistic programming languages developed by teams at Stanford University, the University of Oxford, and Imperial College London.
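A Bayesian network in Pearl's sense factors a joint distribution into conditional probability tables, and inference then amounts to repeated applications of Bayes' theorem. A minimal two-node sketch; the Rain → WetGrass structure and the table values are invented for illustration:

```python
# Hypothetical conditional probability tables for the network Rain -> WetGrass
p_rain = {True: 0.2, False: 0.8}
p_wet_given_rain = {True: 0.9, False: 0.1}  # P(WetGrass=True | Rain)

def posterior_rain_given_wet():
    """P(Rain=True | WetGrass=True) by enumerating the joint distribution."""
    # Joint P(Rain=r, WetGrass=True) for each value of Rain
    joint = {r: p_rain[r] * p_wet_given_rain[r] for r in (True, False)}
    # Bayes' theorem: condition on the evidence by normalizing
    return joint[True] / (joint[True] + joint[False])

print(round(posterior_rain_given_wet(), 3))  # → 0.692
```

Exact enumeration like this scales exponentially with the number of variables, which is why larger networks rely on message passing or the sampling methods discussed above.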