Bradley–Terry model explained
The Bradley–Terry model is a probability model for the outcome of pairwise comparisons between items, teams, or objects. Given a pair of items and drawn from some population, it estimates the probability that the pairwise comparison turns out true, as
where is a positive real-valued score assigned to individual . The comparison can be read as " is preferred to ", " ranks higher than ", or " beats ", depending on the application.
For example, might represent the skill of a team in a sports tournament and
the probability that wins a game against . Or might represent the quality or desirability of a commercial product and
the probability that a consumer will prefer product over product .
The Bradley–Terry model can be used in the forward direction to predict outcomes, as described, but is more commonly used in reverse to infer the scores given an observed set of outcomes. In this type of application represents some measure of the strength or quality of
and the model lets us estimate the strengths from a series of pairwise comparisons. In a survey of wine preferences, for instance, it might be difficult for respondents to give a complete ranking of a large set of wines, but relatively easy for them to compare sample pairs of wines and say which they feel is better. Based on a set of such pairwise comparisons, the Bradley–Terry model can then be used to derive a full ranking of the wines.
Once the values of the scores have been calculated, the model can then also be used in the forward direction, for instance to predict the likely outcome of comparisons that have not yet actually occurred. In the wine survey example, for instance, one could calculate the probability that someone will prefer wine
over wine
, even if no one in the survey directly compared that particular pair.
History and applications
The model is named after Ralph A. Bradley and Milton E. Terry,[1] who presented it in 1952,[2] although it had already been studied by Ernst Zermelo in the 1920s.[3] [4] Applications of the model include the ranking of competitors in sports, chess, and other competitions,[5] the ranking of products in paired comparison surveys of consumer choice, analysis of dominance hierarchies within animal and human communities,[6] ranking of journals, ranking of AI models,[7] and estimation of the relevance of documents in machine-learned search engines.[8]
Definition
The Bradley–Terry model can be parametrized in various ways. Equation is perhaps the most common, but there are a number of others. Bradley and Terry themselves defined exponential score functions
, so that
[9]
Alternatively, one can use a logit, such that
\operatorname{logit}\Pr(i>j)=log
=log
=\betai-\betaj,
i.e. for