A major branch of social choice theory is devoted to the comparison of electoral systems, otherwise known as social choice functions. Viewed from the perspective of political science, electoral systems are rules for conducting elections and determining winners from the ballots cast. From the perspective of economics, mathematics, and philosophy, a social choice function is a mathematical function that determines how a society should make choices, given a collection of individual preferences.
This article discusses the methods and results of comparing different systems. There are two broad ways to compare voting systems:
Voting methods can be evaluated by measuring their accuracy under random simulated elections aiming to be faithful to the properties of elections in real life. The first such evaluation was conducted by Chamberlin and Cohen in 1978, who measured the frequency with which certain non-Condorcet systems elected Condorcet winners.[1]
The Marquis de Condorcet viewed elections as analogous to jury votes where each member expresses an independent judgement on the quality of candidates. Candidates differ in terms of their objective merit, but voters have imperfect information about the relative merits of the candidates. Such jury models are sometimes known as valence models. Condorcet and his contemporary Laplace demonstrated that, in such a model, voting theory could be reduced to probability by finding the expected quality of each candidate.[2]
The jury model implies several natural concepts of accuracy for voting systems under different models:
However, Condorcet's model is based on the extremely strong assumption of independent errors, i.e. voters will not be systematically biased in favor of one group of candidates or another. This is usually unrealistic: voters tend to communicate with each other, form parties or political ideologies, and engage in other behaviors that can result in correlated errors.
Duncan Black proposed a one-dimensional spatial model of voting in 1948, viewing elections as ideologically driven.[4] His ideas were later expanded by Anthony Downs.[5] Voters' opinions are regarded as positions in a space of one or more dimensions; candidates have positions in the same space; and voters choose candidates in order of proximity (measured under Euclidean distance or some other metric).
Spatial models imply a different notion of merit for voting systems: the more acceptable the winning candidate may be as a location parameter for the voter distribution, the better the system. A political spectrum is a one-dimensional spatial model.
Neutral voting models try to minimize the number of parameters and, as an example of the nothing-up-my-sleeve principle. The most common such model is the impartial anonymous culture model (or Dirichlet model). These models assume voters assign each candidate a utility completely at random (from a uniform distribution).
Tideman and Plassmann conducted a study which showed that a two-dimensional spatial model gave a reasonable fit to 3-candidate reductions of a large set of electoral rankings. Jury models, neutral models, and one-dimensional spatial models were all inadequate. They looked at Condorcet cycles in voter preferences (an example of which is A being preferred to B by a majority of voters, B to C and C to A) and found that the number of them was consistent with small-sample effects, concluding that "voting cycles will occur very rarely, if at all, in elections with many voters." The relevance of sample size had been studied previously by Gordon Tullock, who argued graphically that although finite electorates will be prone to cycles, the area in which candidates may give rise to cycling shrinks as the number of voters increases.[6]
A utilitarian model views voters as ranking candidates in order of utility. The rightful winner, under this model, is the candidate who maximizes overall social utility. A utilitarian model differs from a spatial model in several important ways:
It follows from the last property that no voting system which gives equal influence to all voters is likely to achieve maximum social utility. Extreme cases of conflict between the claims of utilitarianism and democracy are referred to as the 'tyranny of the majority'. See Laslier's, Merlin's, and Nurmi's comments in Laslier's write-up.
James Mill seems to have been the first to claim the existence of an a priori connection between democracy and utilitarianism – see the Stanford Encyclopedia article.[7]
Suppose that the i th candidate in an election has merit xi (we may assume that xi ~ N (0,σ2)[8]), and that voter j 's level of approval for candidate i may be written as xi + εij (we will assume that the εij are iid. N (0,τ2)). We assume that a voter ranks candidates in decreasing order of approval. We may interpret εij as the error in voter j 's valuation of candidate i and regard a voting method as having the task of finding the candidate of greatest merit.
Each voter will rank the better of two candidates higher than the less good with a determinate probability p (which under the normal model outlined here is equal to
\tfrac{1}{2}+\tfrac{1}{\pi}rm{tan}-1\tfrac{\sigma}{\tau}
Peyton Young showed that three further properties apply to votes between arbitrary numbers of candidates, suggesting that Condorcet was aware of the first and third of them.[9]
Robert F. Bordley constructed a 'utilitarian' model which is a slight variant of Condorcet's jury model.[10] He viewed the task of a voting method as that of finding the candidate who has the greatest total approval from the electorate, i.e. the highest sum of individual voters' levels of approval. This model makes sense even with σ2 = 0, in which case p takes the value
\tfrac{1}{2}+\tfrac{1}{\pi}rm{tan}-1\tfrac{1}{n-1}
A simulated election can be constructed from a distribution of voters in a suitable space. The illustration shows voters satisfying a bivariate Gaussian distribution centred on O. There are 3 randomly generated candidates, A, B and C. The space is divided into 6 segments by 3 lines, with the voters in each segment having the same candidate preferences. The proportion of voters ordering the candidates in any way is given by the integral of the voter distribution over the associated segment.
The proportions corresponding to the 6 possible orderings of candidates determine the results yielded by different voting systems. Those which elect the best candidate, i.e. the candidate closest to O (who in this case is A), are considered to have given a correct result, and those which elect someone else have exhibited an error. By looking at results for large numbers of randomly generated candidates the empirical properties of voting systems can be measured.
The evaluation protocol outlined here is modelled on the one described by Tideman and Plassmann.Evaluations of this type are commonest for single-winner electoral systems. Ranked voting systems fit most naturally into the framework, but other types of ballot (such a FPTP and Approval voting) can be accommodated with lesser or greater effort.
The evaluation protocol can be varied in a number of ways:
3 | 6 | 10 | 15 | 25 | 40 | ||
---|---|---|---|---|---|---|---|
FPTP | 70.6 | 35.5 | 21.1 | 14.5 | 9.3 | 6.4 | |
AV/IRV | 85.2 | 50.1 | 31.5 | 21.6 | 12.9 | 7.9 | |
Borda | 87.6 | 82.1 | 74.2 | 67.0 | 58.3 | 50.1 | |
Condorcet | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
Evaluations published in research papers use multidimensional Gaussians, making the calculation numerically difficult.[13] The number of voters is kept finite and the number of candidates is necessarily small.
The computation is much more straightforward in a single dimension, which allows an infinite number of voters and an arbitrary number m of candidates. Results for this simple case are shown in the first table, which is directly comparable with Table 5 (1000 voters, medium dispersion) of the cited paper by Chamberlin and Cohen. The candidates were sampled randomly from the voter distribution and a single Condorcet method (Minimax) was included in the trials for confirmation.
10 | ||
---|---|---|
FPTP | 0.166 | |
AV/IRV | 0.058 | |
Borda | 0.016 | |
Condorcet | 0.010 |
An alternative measure of accuracy is the average distance of voters from the winner (in which smaller means better). This is unlikely to change the ranking of voting methods, but is preferred by people who interpret distance as disutility. The second table shows the average distance (in standard deviations) minus
\sqrt\tfrac{2}{\pi}
James Green-Armytage et al. published a study in which they assessed the vulnerability of several voting systems to manipulation by voters.[14] They say little about how they adapted their evaluation for this purpose, mentioning simply that it "requires creative programming". An earlier paper by the first author gives a little more detail.[15]
The number of candidates in their simulated elections was limited to 3. This removes the distinction between certain systems; for instance Black's method and the Dasgupta-Maskin method are equivalent on 3 candidates.
The conclusions from the study are hard to summarise, but the Borda count performed badly; Minimax was somewhat vulnerable; and IRV was highly resistant. The authors showed that limiting any method to elections with no Condorcet winner (choosing the Condorcet winner when there was one) would never increase its susceptibility to tactical voting. They reported that the 'Condorcet-Hare' system which uses IRV as a tie-break for elections not resolved by the Condorcet criterion was as resistant to tactical voting as IRV on its own and more accurate. Condorcet-Hare is equivalent to Copeland's method with an IRV tie-break in elections with 3 candidates.
0 | 0.25 | 0.5 | 1 | 1.5 | ||
---|---|---|---|---|---|---|
3 | 87.6 | 87.9 | 88.9 | 93.0 | 97.4 | |
6 | 82.1 | 80.2 | 76.2 | 71.9 | 79.9 | |
10 | 74.1 | 70.1 | 61.2 | 47.6 | 54.1 | |
15 | 66.9 | 60.6 | 46.4 | 26.6 | 30.8 | |
25 | 58.3 | 47.0 | 26.3 | 8.1 | 10.1 | |
40 | 50.2 | 33.3 | 11.3 | 1.5 | 2.1 |
Sensitivity to the distribution of candidates can be thought of as a matter either of accuracy or of resistance to manipulation. If one expects that in the course of things candidates will naturally come from the same distribution as voters, then any displacement will be seen as attempted subversion; but if one thinks that factors determining the viability of candidacy (such as financial backing) may be correlated with ideological position, then one will view it more in terms of accuracy.
Published evaluations take different views of the candidate distribution. Some simply assume that candidates are drawn from the same distribution as voters. Several older papers assume equal means but allow the candidate distribution to be more or less tight than the voter distribution.[16] A paper by Tideman and Plassmann approximates the relationship between candidate and voter distributions based on empirical measurements.[17] This is less realistic than it may appear, since it makes no allowance for the candidate distribution to adjust to exploit any weakness in the voting system. A paper by James Green-Armytage looks at the candidate distribution as a separate issue, viewing it as a form of manipulation and measuring the effects of strategic entry and exit. Unsurprisingly he finds the Borda count to be particularly vulnerable.
The task of a voting system under a spatial model is to identify the candidate whose position most accurately represents the distribution of voter opinions. This amounts to choosing a location parameter for the distribution from the set of alternatives offered by the candidates. Location parameters may be based on the mean, the median, or the mode; but since ranked preference ballots provide only ordinal information, the median is the only acceptable statistic.
This can be seen from the diagram, which illustrates two simulated elections with the same candidates but different voter distributions. In both cases the mid-point between the candidates is the 51st percentile of the voter distribution; hence 51% of voters prefer A and 49% prefer B. If we consider a voting method to be correct if it elects the candidate closest to the median of the voter population, then since the median is necessarily slightly to the left of the 51% line, a voting method will be considered to be correct if it elects A in each case.
The mean of the teal distribution is also slightly to the left of the 51% line, but the mean of the orange distribution is slightly to the right. Hence if we consider a voting method to be correct if it elects the candidate closest to the mean of the voter population, then a method will not be able to obtain full marks unless it produces different winners from the same ballots in the two elections. Clearly this will impute spurious errors to voting methods. The same problem will arise for any cardinal measure of location; only the median gives consistent results.
The median is not defined for multivariate distributions but the univariate median has a property which generalizes conveniently. The median of a distribution is the position whose average distance from all points within the distribution is smallest. This definition generalizes to the geometric median in multiple dimensions. The distance is often defined as a voter's disutility function.
If we have a set of candidates and a population of voters, then it is not necessary to solve the computationally difficult problem of finding the geometric median of the voters and then identify the candidate closest to it; instead we can identify the candidate whose average distance from the voters is minimized. This is the metric which has been generally deployed since Merrill onwards; see also Green-Armytage and Darlington.
The candidate closest to the geometric median of the voter distribution may be termed the 'spatial winner'.
Data from real elections can be analysed to compare the effects of different systems, either by comparing between countries or by applying alternative electoral systems to the real election data. The electoral outcomes can be compared through democracy indices, measures of political fragmentation, voter turnout,[19] [20] political efficacy and various economic and judicial indicators. The practical criteria to assess real elections include the share of wasted votes, the complexity of vote counting, proportionality, and barriers to entry for new political movements.[21] Additional opportunities for comparison of real elections arise through electoral reforms.
A Canadian example of such an opportunity is seen in the City of Edmonton (Canada), which went from first-past-the-post voting in 1917 Alberta general election to five-member plurality block voting in 1921 Alberta general election, to five-member single transferable voting in 1926 Alberta general election, then to FPTP again in 1959 Alberta general election. One party swept all the Edmonton seats in 1917, 1921 and 1959. Under STV in 1926, two Conservatives, one Liberal, one Labour and one United Farmers MLA were elected.
Traditionally the merits of different electoral systems have been argued by reference to logical criteria. These have the form of rules of inference for electoral decisions, licensing the deduction, for instance, that "if E and E ' are elections such that R (E,E '), and if A is the rightful winner of E , then A is the rightful winner of E ' ".
The criteria are as debatable as the voting systems themselves. Here we briefly discuss the considerations advanced concerning their validity, and then summarize the most important criteria, showing in a table which of the principal voting systems satisfy them.
We now turn to the logical criteria themselves, starting with the absolute criteria which state that, if the set of ballots is a certain way, a certain candidate must or must not win.
Criterion | Description | |
---|---|---|
Majority criterion | Will a candidate who is listed as the unique favorite by a majority of voters always win? This criterion comes in two versions: | |
Majority loser criterion | if a majority of voters prefers every other candidate over a given candidate, then does that candidate not win? | |
Mutual majority criterion | Will the winning candidate always come from a group of candidates ranked above all others by a majority of voters? | |
Condorcet winner criterion | If a candidate beats every other candidate in head-to-head match-ups, will that candidate always win the election? (This implies the majority criterion, above.) | |
Condorcet loser criterion | If a candidate loses to every other candidate in head-to-head match-ups, will that candidate always lose the election? |
These are criteria that state that, if a certain candidate wins in one circumstance, the same candidate must (or must not) win in a related circumstance.
Criteron | Description | |
---|---|---|
Independence of Smith-dominated alternatives (Smith-IAA) | Does the outcome never change if a Smith-dominated candidate is added or removed, assuming votes regarding the other candidates are unchanged? Candidate C is Smith-dominated if there is some other candidate A such that C is beaten by A and every candidate B that is not beaten by A etc. Note that although this criterion is classed here as nominee-relative, it has a strong absolute component in excluding Smith-dominated candidates from winning. In fact, it implies all of the absolute criteria above. | |
Independence of irrelevant alternatives (IIA) | Does the outcome never change if a non-winning candidate is added or removed, assuming voter preferences regarding the other candidates are unchanged? For instance, plurality rule fails IIA; adding a candidate X can cause the winner to change from W to Y even though Y receives no more votes than before. | |
Local independence of irrelevant alternatives (LIIA) | Does the outcome never change if the alternative that would finish last is removed? And, could the alternative that finishes second fail to become the winner if the winner were removed? | |
Independence of clone alternatives (cloneproof) | Does the outcome never change if non-winning candidates similar to an existing candidate are added? There are three different situations which could cause a method to fail this criterion:
| |
Monotonicity criterion (monotone) | If candidate W wins for one set of ballots, will W still always win if those ballots change to increase support for W? This also implies that you cannot cause a losing candidate to win by decreasing support for them. | |
Consistency criterion | If candidate W' wins for one set of ballots, will W still always win if those ballots change by adding another set of ballots where W also wins? | |
Participation criterion | Is voting honestly always better than not voting at all? (This is grouped with the distinct but similar Consistency Criterion in the table below.[22]) | |
Reversal symmetry | If individual preferences of each voter are inverted, does the original winner never win? |
These are criteria which relate to the process of counting votes and determining a winner.
Criterion | Description | |
---|---|---|
Polynomial time | Can the winner be calculated in a runtime that is polynomial in the number of candidates and linear in the number of voters? | |
Can the winner be calculated in all cases−except exact ties by ballot count−without using any random processes such as flipping coins? That is, are exact ties, in which the winner could be one of two or more candidates, vanishingly rare in large elections? | ||
Summability | Can the winner be calculated by tallying ballots at each polling station separately and simply adding up the individual tallies? The amount of information necessary for such tallies is expressed as an order function of the number of candidates N. Slower-growing functions such as O(N) or O(N2) make for easier counting, while faster-growing functions such as O(N!) might make it more difficult to do the same. |
These are criteria that relate to a voter's incentive to use certain forms of strategy. They could also be considered as relative result criteria; however, unlike the criteria in that section, these criteria are directly relevant to voters; the fact that a method passes these criteria can simplify the process of figuring out one's optimal strategic vote.
Criterion | Description | |
---|---|---|
Later-no-harm criterion | Can voters be sure that adding a lower preference to a ballot will not harm any more preferred candidate already listed? | |
later-no-help criterion | Can voters be sure that adding a lower preference to a ballot will not help any more preferred candidate already listed? | |
No favorite betrayal criterion | Can voters be sure that they do not need to support any other candidate above their favorite in order to obtain a result they prefer?[23] |
Ballots are broadly distinguishable into two categories, cardinal and ordinal, where cardinal ballots request individual measures of support for each candidate and ordinal ballots request relative measures of support. A few methods do not fall neatly into one category, such as STAR, which asks the voter to give independent ratings for each candidate, but uses both the absolute and relative ratings to determine the winner. Comparing two methods based on ballot type alone is mostly a matter of voter experience preference, unless the ballot type is connected back to one of the other mathematical criterion listed here.
Criterion A is "stronger" than B if satisfying A implies satisfying B. For instance, the Condorcet criterion is stronger than the majority criterion, because all majority winners are Condorcet winners. Thus, any voting method that satisfies the Condorcet criterion must satisfy the majority criterion.
The following table shows which of the above criteria are met by several single-winner methods. Not every criteria is listed.
The concerns raised above are used by social choice theorists to devise systems that are accurate and resistant to manipulation. However, there are also practical reasons why one system may be more socially acceptable than another, which fall under the fields of public choice and political science. Important practical considerations include:
Other considerations include barriers to entry to the political competition[24] and likelihood of gridlocked government.[25]
Multi-winner electoral systems at their best seek to produce assemblies representative in a broader sense than that of making the same decisions as would be made by single-winner votes. They can also be route to one-party sweeps of a city's seats, if a non-proportional system, such as plurality block voting or ticket voting, is used.
Evaluating the performance of multi-winner voting methods requires different metrics than are used for single-winner systems. The following have been proposed.
The following table shows which of the above criteria are met by several multiple winner methods.
Proportional | Monotone | Consistency | Participation | No Favorite Betrayal | Semihonest | Universally Liked Candidates | ||||||||
Sequential Proportional Approval | ||||||||||||||
Re-weighted Range | ||||||||||||||
Proportional Approval | ||||||||||||||
Single Transferable Vote | ||||||||||||||
CPO-STV | ||||||||||||||
Schulze STV | ||||||||||||||
Phragmen | ||||||||||||||
Quota Borda System | ||||||||||||||
Single non-transferable vote | (not proportional) | |||||||||||||
Limited vote | (not proportional) | |||||||||||||
Cumulative voting | [27] | (not proportional) | ||||||||||||
Sortition, Arbitrary Winner | (not proportional) |