Participation bias or non-response bias is a phenomenon in which the results of elections, studies, polls, etc. become non-representative because the participants disproportionately possess certain traits which affect the outcome. These traits mean the sample is systematically different from the target population, potentially resulting in biased estimates.[1]
For instance, a study found that those who refused to answer a survey on AIDS tended to be "older, attend church more often, are less likely to believe in the confidentiality of surveys, and have lower sexual self disclosure."[2] It may occur due to several factors as outlined in Deming (1990).[3]
Non-response bias can be a problem in longitudinal research due to attrition during the study.[4]
If one selects a sample of 1000 managers in a field and polls them about their workload, the managers with a high workload may not answer the survey because they do not have enough time to answer it, and/or those with a low workload may decline to respond for fear that their supervisors or colleagues will perceive them as surplus employees (either immediately, if the survey is non-anonymous, or in the future, should their anonymity be compromised). Therefore, non-response bias may make the measured value for the workload too low, too high, or, if the effects of the above biases happen to offset each other, "right for the wrong reasons." For a simple example of this effect, consider a survey that includes, "Agree or disagree: I have enough time in my day to complete a survey."
Prior to the 1936 U.S. presidential election, The Literary Digest mailed out 10 million questionnaires, of which 2.38 million were returned. Based on these returns, they predicted that Republican candidiate Alf Landon would be elected president with 57.08% of the popular vote and 370 of 531 electoral votes: in the election, Landon received only 37.54% of the popular vote and eight electoral votes in an unprecedented landslide loss to Democratic incumbent President Franklin D. Roosevelt. Subsequent research published in 1976 and 1988 concluded that non-response bias was the primary source of this error, although their sampling frame was also quite different from the vast majority of voters.
Non-responders have been shown to be associated with younger patients, poorer communities and those who are less satisfied and subsequently could be a source of bias.[5]
There are different ways to test for non-response bias. A common technique involves comparing the first and fourth quartiles of responses for differences in demographics and key constructs.[6] In e-mail surveys some values are already known from all potential participants (e.g. age, branch of the firm, ...) and can be compared to the values that prevail in the subgroup of those who answered. If there is no significant difference this is an indicator that there might be no non-response bias.
In e-mail surveys those who didn't answer can also systematically be phoned and a small number of survey questions can be asked. If their answers don't differ significantly from those who answered the survey, there might be no non-response bias. This technique is sometimes called non-response follow-up.
Generally speaking, the lower the response rate, the greater the likelihood of a non-response bias in play.