Cronbach's alpha explained

Cronbach's alpha (Cronbach's

\alpha

), also known as tau-equivalent reliability (

\rhoT

) or coefficient alpha (coefficient

\alpha

), is a reliability coefficient and a measure of the internal consistency of tests and measures.[1] [2] [3] It was named after the American psychologist Lee Cronbach.

Numerous studies warn against using Cronbach's alpha unconditionally. Statisticians regard reliability coefficients based on structural equation modeling (SEM) or generalizability theory as superior alternatives in many situations.[4] [5] [6] [7] [8] [9]

History

In his initial 1951 publication, Lee Cronbach described the coefficient as Coefficient alpha[1] and included an additional derivation.[10] Coefficient alpha had been used implicitly in previous studies,[11] [12] [13] [14] but his interpretation was thought to be more intuitively attractive relative to previous studies and it became quite popular.[15]

\beta

,

\gamma

, etc.), but later changed his mind.

\rhoT

. Cronbach opposed the use of the name "Cronbach's alpha" and explicitly denied the existence of studies that had published the general formula of KR-20 before Cronbach's 1951 publication of the same name.

Prerequisites for using Cronbach's alpha

To use Cronbach's alpha as a reliability coefficient, the following conditions must be met:[17] [18]

  1. The data is normally distributed and linear;
  2. The compared tests or measures are essentially tau-equivalent;
  3. Errors in the measurements are independent.

Formula and calculation

Cronbach's alpha is calculated by taking a score from each scale item and correlating it with the total score for each observation. The resulting correlations are then compared with the variance for all individual item scores. Cronbach's alpha is best understood as a function of the number of questions or items in a measure, the average covariance between pairs of items, and the overall variance of the total measured score.[19] [8] \alpha = \left(1 - \right)

where:

k

represents the number of items in the measure
2
\sigma
yi
the variance associated with each item i
2
\sigma
y
the variance associated with the total scores

(y=

k
\sum
i=1

yi)

Alternatively, it can be calculated through the following formula:[20]

\alpha={k\barc\over\barv+(k-1)\barc}

where:

\barv

represents the average variance

\barc

represents the average inter-item covariance.

Common misconceptions

[7]

The value of Cronbach's alpha ranges between zero and one

By definition, reliability cannot be less than zero and cannot be greater than one. Many textbooks mistakenly equate

\rhoT

with reliability and give an inaccurate explanation of its range.

\rhoT

can be less than reliability when applied to data that are not essentially tau-equivalent. Suppose that

X2

copied the value of

X1

as it is, and

X3

copied by multiplying the value of

X1

by -1.

The covariance matrix between items is as follows,

\rhoT=-3

.
Observed covariance matrix

X1

X2

X3

X1

1

1

-1

X2

1

1

-1

X3

-1

-1

1

Negative

\rhoT

can occur for reasons such as negative discrimination or mistakes in processing reversely scored items.

Unlike

\rhoT

, SEM-based reliability coefficients (e.g.,

\rhoC

) are always greater than or equal to zero.

This anomaly was first pointed out by Cronbach (1943)[21] to criticize

\rhoT

, but Cronbach (1951)[10] did not comment on this problem in his article that otherwise discussed potentially problematic issues related

\rhoT

.[9] [22]

If there is no measurement error, the value of Cronbach's alpha is one.

This anomaly also originates from the fact that

\rhoT

underestimates reliability.

Suppose that

X2

copied the value of

X1

as it is, and

X3

copied by multiplying the value of

X1

by two.

The covariance matrix between items is as follows,

\rhoT=0.9375

.
Observed covariance matrix

X1

X2

X3

X1

1

1

2

X2

1

1

2

X3

2

2

4

For the above data, both

\rhoP

and

\rhoC

have a value of one.

The above example is presented by Cho and Kim (2015).[7]

A high value of Cronbach's alpha indicates homogeneity between the items

Many textbooks refer to

\rhoT

as an indicator of homogeneity[23] between items. This misconception stems from the inaccurate explanation of Cronbach (1951)[10] that high

\rhoT

values show homogeneity between the items. Homogeneity is a term that is rarely used in modern literature, and related studies interpret the term as referring to uni-dimensionality. Several studies have provided proofs or counterexamples that high

\rhoT

values do not indicate uni-dimensionality.[24] [7] [25] [26] [27] [28] See counterexamples below.
Uni-dimensional data

X1

X2

X3

X4

X5

X6

X1

10

3

3

3

3

3

X2

3

10

3

3

3

3

X3

3

3

10

3

3

3

X4

3

3

3

10

3

3

X5

3

3

3

3

10

3

X6

3

3

3

3

3

10

\rhoT=0.72

in the uni-dimensional data above.
Multidimensional data

X1

X2

X3

X4

X5

X6

X1

10

6

6

1

1

1

X2

6

10

6

1

1

1

X3

6

6

10

1

1

1

X4

1

1

1

10

6

6

X5

1

1

1

6

10

6

X6

1

1

1

6

6

10

\rhoT=0.72

in the multidimensional data above.
Multidimensional data with extremely high reliability

X1

X2

X3

X4

X5

X6

X1

10

9

9

8

8

8

X2

9

10

9

8

8

8

X3

9

9

10

8

8

8

X4

8

8

8

10

9

9

X5

8

8

8

9

10

9

X6

8

8

8

9

9

10

The above data have

\rhoT=0.9692

, but are multidimensional.
Uni-dimensional data with unacceptably low reliability

X1

X2

X3

X4

X5

X6

X1

10

1

1

1

1

1

X2

1

10

1

1

1

1

X3

1

1

10

1

1

1

X4

1

1

1

10

1

1

X5

1

1

1

1

10

1

X6

1

1

1

1

1

10

The above data have

\rhoT=0.4

, but are uni-dimensional.

Uni-dimensionality is a prerequisite for

\rhoT

. One should check uni-dimensionality before calculating

\rhoT

rather than calculating

\rhoT

to check uni-dimensionality.[3]

A high value of Cronbach's alpha indicates internal consistency

The term "internal consistency" is commonly used in the reliability literature, but its meaning is not clearly defined. The term is sometimes used to refer to a certain kind of reliability (e.g., internal consistency reliability), but it is unclear exactly which reliability coefficients are included here, in addition to

\rhoT

. Cronbach (1951)[10] used the term in several senses without an explicit definition. Cho and Kim (2015)[7] showed that

\rhoT

is not an indicator of any of these.

Removing items using "alpha if item deleted" always increases reliability

Removing an item using "alpha if item deleted" may result in 'alpha inflation,' where sample-level reliability is reported to be higher than population-level reliability.[29] It may also reduce population-level reliability.[30] The elimination of less-reliable items should be based not only on a statistical basis but also on a theoretical and logical basis. It is also recommended that the whole sample be divided into two and cross-validated.[29]

Ideal reliability level and how to increase reliability

Nunnally's recommendations for the level of reliability

Nunnally's book[31] [32] is often mentioned as the primary source for determining the appropriate level of dependability coefficients. However, his proposals contradict his aims as he suggests that different criteria should be used depending on the goal or stage of the investigation. Regardless of the type of study, whether it is exploratory research, applied research, or scale development research, a criterion of 0.7 is universally employed.[33] He advocated 0.7 as a criterion for the early stages of a study, most studies published in the journal do not fall under that category. Rather than 0.7, Nunnally's applied research criterion of 0.8 is more suited for most empirical studies.[33]

Nunnally's recommendations on the level of reliability
1st edition 2nd & 3rd edition
Early stage of research0.5 or 0.60.7
Applied research0.80.8
When making important decisions0.95 (minimum 0.9)0.95 (minimum 0.9)

His recommendation level did not imply a cutoff point. If a criterion means a cutoff point, it is important whether or not it is met, but it is unimportant how much it is over or under. He did not mean that it should be strictly 0.8 when referring to the criteria of 0.8. If the reliability has a value near 0.8 (e.g., 0.78), it can be considered that his recommendation has been met.[34]

Cost to obtain a high level of reliability

Nunnally's idea was that there is a cost to increasing reliability, so there is no need to try to obtain maximum reliability in every situation.

Trade-off with validity

Measurements with perfect reliability lack validity.[7] For example, a person who takes the test with a reliability of one will either receive a perfect score or a zero score, because if they answer one item correctly or incorrectly, they will answer all other items in the same manner. The phenomenon where validity is sacrificed to increase reliability is known as the attenuation paradox.[35] [36]

A high value of reliability can conflict with content validity. To achieve high content validity, each item should comprehensively represent the content to be measured. However, a strategy of repeatedly measuring essentially the same question in different ways is often used solely to increase reliability.[37] [38]

Trade-off with efficiency

When the other conditions are equal, reliability increases as the number of items increases. However, the increase in the number of items hinders the efficiency of measurements.

Methods to increase reliability

Despite the costs associated with increasing reliability discussed above, a high level of reliability may be required. The following methods can be considered to increase reliability.

Before data collection:

After data collection:

\rhoT

. For example,

\rhoC

is 0.02 larger than

\rhoT

on average.[41]

Which reliability coefficient to use

\rhoT

is used in an overwhelming proportion. A study estimates that approximately 97% of studies use

\rhoT

as a reliability coefficient.[3]

However, simulation studies comparing the accuracy of several reliability coefficients have led to the common result that

\rhoT

is an inaccurate reliability coefficient.[42] [43] [6] [44] [45]

Methodological studies are critical of the use of

\rhoT

. Simplifying and classifying the conclusions of existing studies are as follows.
  1. Conditional use: Use

\rhoT

only when certain conditions are met.[3] [7] [8]
  1. Opposition to use:

\rhoT

is inferior and should not be used.[46] [5] [47] [6] [4] [48]

Alternatives to Cronbach's alpha

Existing studies are practically unanimous in that they oppose the widespread practice of using

\rhoT

unconditionally for all data. However, different opinions are given on which reliability coefficient should be used instead of

\rhoT

.

Different reliability coefficients ranked first in each simulation study[42] [43] [6] [44] [45] comparing the accuracy of several reliability coefficients.[7]

The majority opinion is to use structural equation modeling or SEM-based reliability coefficients as an alternative to

\rhoT

.[3] [7] [46] [5] [47] [8] [6] [48]

However, there is no consensus on which of the several SEM-based reliability coefficients (e.g., uni-dimensional or multidimensional models) is the best to use.

Some people suggest

\omegaH

[6] as an alternative, but

\omegaH

shows information that is completely different from reliability.

\omegaH

is a type of coefficient comparable to Reveille's

\beta

.[49] [6] They do not substitute, but complement reliability.[3]

Among SEM-based reliability coefficients, multidimensional reliability coefficients are rarely used, and the most commonly used is

\rhoC

,[3] also known as composite or congeneric reliability.

Software for SEM-based reliability coefficients

General-purpose statistical software such as SPSS and SAS include a function to calculate

\rhoT

. Users who don't know the formula

\rhoT

have no problem in obtaining the estimates with just a few mouse clicks.

SEM software such as AMOS, LISREL, and MPLUS does not have a function to calculate SEM-based reliability coefficients. Users need to calculate the result by inputting it to the formula. To avoid this inconvenience and possible error, even studies reporting the use of SEM rely on

\rhoT

instead of SEM-based reliability coefficients.[3] There are a few alternatives to automatically calculate SEM-based reliability coefficients.
  1. R (free): The psych package[50] calculates various reliability coefficients.
  2. EQS (paid):[51] This SEM software has a function to calculate reliability coefficients.
  3. RelCalc (free):[3] Available with Microsoft Excel.

\rhoC

can be obtained without the need for SEM software. Various multidimensional SEM reliability coefficients and various types of

\omegaH

can be calculated based on the results of SEM software.

External links

Notes and References

  1. Cronbach. Lee J.. Coefficient alpha and the internal structure of tests. Psychometrika. Springer Science and Business Media LLC. 16. 3. 1951. 10.1007/bf02310555. 297–334. 10983/2196. 13820448. free.
  2. Cronbach. L. J.. 1978. Citation Classics. Current Contents. 13. 263. 2021-03-22. 2022-01-20. https://web.archive.org/web/20220120235644/http://garfield.library.upenn.edu/classics1978/A1978EQ39200002.pdf. live.
  3. Cho. Eunseong. Making Reliability Reliable. Organizational Research Methods. SAGE Publications. 19. 4. 2016-07-08. 1094-4281. 10.1177/1094428116656239. 651–682. 124129255.
  4. K.. Sijtsma. On the use, the misuse, and the very limited usefulness of Cronbach's alpha. Psychometrika. 74. 1. 107–120. 2009. 10.1007/s11336-008-9101-0. 20037639. 2792363.
  5. Green. S. B.. Yang. Y.. Commentary on coefficient alpha: A cautionary tale. Psychometrika. 74. 1. 121–135. 2009. 10.1007/s11336-008-9098-4. 122718353.
  6. Revelle. W.. Zinbarg. R. E.. Coefficients alpha, beta, omega, and the glb: Comments on Sijtsma. Psychometrika. 74. 1. 145–154. 2009. 10.1007/s11336-008-9102-z. 5864489.
  7. Cho. E.. Kim. S.. Cronbach's coefficient alpha: Well known but poorly understood. Organizational Research Methods. 2. 207–230. 2015. 10.1177/1094428114555994. 124810308.
  8. Raykov. T.. Marcoulides. G. A.. Thanks coefficient alpha, we still need you!. Educational and Psychological Measurement. 79. 1. 200–210. 2017. 10.1177/0013164417725127. 30636788. 6318747.
  9. Cronbach. L. J.. Shavelson. R. J.. My Current Thoughts on Coefficient Alpha and Successor Procedures. Educational and Psychological Measurement. 64. 3. 391–418. 2004. 10.1177/0013164404266386. 51846704.
  10. L.J.. Cronbach. Coefficient alpha and the internal structure of tests. Psychometrika. 16. 3. 297–334. 1951. 10.1007/BF02310555. 13820448. 10983/2196. free.
  11. C.. Hoyt. Test reliability estimated by analysis of variance. Psychometrika. 6. 3. 153–160. 1941. 10.1007/BF02289270. 122361318.
  12. L.. Guttman. A basis for analyzing test-retest reliability. Psychometrika. 10. 4. 255–282. 1945. 10.1007/BF02288892. 21007983. 17220260.
  13. Jackson. R. W. B.. Ferguson. G. A.. Studies on the reliability of tests. University of Toronto Department of Educational Research Bulletin. 12. 132. 1941.
  14. Book: Gulliksen, H.. Theory of mental tests. Wiley. 1950. 10.1037/13240-000.
  15. Cronbach. Lee. 1978. Citation Classics. Current Contents. 13. 8. 2022-10-21. 2022-10-22. https://web.archive.org/web/20221022201253/http://www.garfield.library.upenn.edu/classics1978/A1978EQ39200002.pdf. live.
  16. Novick. M. R.. Lewis. C.. Coefficient alpha and the reliability of composite measurements. Psychometrika. 32. 1. 1–13. 1967. 10.1007/BF02289400. 5232569. 186226312.
  17. Spiliotopoulou. Georgia. 2009. Reliability reconsidered: Cronbach's alpha and paediatric assessment in occupational therapy. Australian Occupational Therapy Journal. en. 56. 3. 150–155. 10.1111/j.1440-1630.2009.00785.x. 20854508. 2022-10-21. 2022-10-21. https://web.archive.org/web/20221021235248/https://onlinelibrary.wiley.com/doi/10.1111/j.1440-1630.2009.00785.x. live.
  18. Cortina. Jose M.. 1993. What is coefficient alpha? An examination of theory and applications.. Journal of Applied Psychology. en. 78. 1. 98–104. 10.1037/0021-9010.78.1.98. 1939-1854. 2022-10-21. 2023-08-13. https://web.archive.org/web/20230813121351/https://psycnet.apa.org/doiLanding?doi=10.1037/0021-9010.78.1.98. live.
  19. Web site: Goforth. Chelsea. November 16, 2015. Using and Interpreting Cronbach's Alpha - University of Virginia Library Research Data Services + Sciences. 2022-09-06. University of Virginia Library. 2022-08-09. https://web.archive.org/web/20220809031644/https://data.library.virginia.edu/using-and-interpreting-cronbachs-alpha/. live.
  20. Cronbach's Alpha (Simply explained). October 27, 2021. 2023-08-01. DATAtab. YouTube. 4:08.
  21. L. J.. Cronbach. On estimates of test reliability. Journal of Educational Psychology. 34. 8. 485–494. 1943. 10.1037/h0058608.
  22. Waller . Niels . Revelle . William . 2023-05-25 . What are the mathematical bounds for coefficient α? . Psychological Methods . en . 10.1037/met0000583 . 1939-1463.
  23. Web site: APA Dictionary of Psychology. 2023-02-20. dictionary.apa.org. en. 2019-07-31. https://web.archive.org/web/20190731124940/http://dictionary.apa.org/. live.
  24. J. M.. Cortina. What is coefficient alpha? An examination of theory and applications. Journal of Applied Psychology. 78. 1. 98–104. 1993. 10.1037/0021-9010.78.1.98.
  25. Green. S. B.. Lissitz. R. W.. Mulaik. S. A.. Limitations of coefficient alpha as an Index of test unidimensionality. Educational and Psychological Measurement. 37. 4. 827–838. 1977. 10.1177/001316447703700403. 122986180.
  26. R. P.. McDonald. The dimensionality of tests and items. The British Journal of Mathematical and Statistical Psychology. 34. 1. 100–117. 1981. 10.1111/j.2044-8317.1981.tb00621.x.
  27. N.. Schmitt. Uses and abuses of coefficient alpha. Psychological Assessment. 8. 4. 350–3. 1996. 10.1037/1040-3590.8.4.350.
  28. Ten Berge. J. M. F.. Sočan. G.. The greatest lower bound to the reliability of a test and the hypothesis of unidimensionality. Psychometrika. 69. 4. 613–625. 2004. 10.1007/BF02289858. 122674001.
  29. Kopalle. P. K.. Lehmann. D. R.. Alpha inflation? The impact of eliminating scale items on Cronbach's alpha. Organizational Behavior and Human Decision Processes. 70. 3. 189–197. 1997. 10.1006/obhd.1997.2702. free.
  30. T.. Raykov. Reliability if deleted, not 'alpha if deleted': Evaluation of scale reliability following component deletion. The British Journal of Mathematical and Statistical Psychology. 60. 2. 201–216. 2007. 10.1348/000711006X115954. 17971267.
  31. Book: Nunnally, J. C.. Psychometric theory. McGraw-Hill. 1967. 0-07-047465-6. 926852171.
  32. Book: Nunnally. J. C.. Bernstein. I. H.. Psychometric theory. McGraw-Hill. 3rd. 1994. 0-07-047849-X. 28221417.
  33. Lance. C. E.. Butts. M. M.. Michels. L. C.. What did they really say?. Organizational Research Methods. 9. 2. 202–220. 2006. 10.1177/1094428105284919. 144195175.
  34. E.. Cho. A comprehensive review of so-called Cronbach's alpha. Journal of Product Research. 38. 1. 9–20. 2020.
  35. J.. Loevinger. The attenuation paradox in test theory. Psychological Bulletin. 51. 5. 493–504. 1954. 10.1002/j.2333-8504.1954.tb00485.x. 13204488.
  36. L.. Humphreys. The normal curve and the attenuation paradox in test theory. Psychological Bulletin. 53. 6. 472–6. 1956. 10.1037/h0041091. 13370692.
  37. G. J.. Boyle. Does item homogeneity indicate internal consistency or item redundancy in psychometric scales?. Personality and Individual Differences. 12. 3. 291–4. 1991. 10.1016/0191-8869(91)90115-R.
  38. D. L.. Streiner. Starting at the beginning: An introduction to coefficient alpha and internal consistency. Journal of Personality Assessment. 80. 1. 99–103. 2003. 10.1207/S15327752JPA8001_18. 12584072. 3679277.
  39. Beatty. P.. Herrmann. D.. Puskar. C.. Kerwin. J.. July 1998. "Don't know" responses in surveys: is what I know what you want to know and do I want you to know it?. Memory (Hove, England). 6. 4. 407–426. 10.1080/741942605. 0965-8211. 9829099. 2023-02-20. 2023-02-20. https://web.archive.org/web/20230220140847/https://pubmed.ncbi.nlm.nih.gov/9829099/. live.
  40. Lee, H. (2017). Research Methodology (2nd ed.), Hakhyunsa.
  41. Peterson. R. A.. Kim. Y.. On the relationship between coefficient alpha and composite reliability. Journal of Applied Psychology. 98. 1. 194–8. 2013. 10.1037/a0030767. 23127213.
  42. Kamata, A., Turhan, A., & Darandari, E. (2003). Estimating reliability for multidimensional composite scale scores. Annual Meeting of American Educational Research Association, Chicago, April 2003, April, 1–27.
  43. H. G.. Osburn. Coefficient alpha and related internal consistency reliability coefficients. Psychological Methods. 5. 3. 343–355. 2000. 10.1037/1082-989X.5.3.343. 11004872.
  44. Tang, W., & Cui, Y. (2012). A simulation study for comparing three lower bounds to reliability. Paper Presented on April 17, 2012 at the AERA Division D: Measurement and Research Methodology, Section 1: Educational Measurement, Psychometrics, and Assessment, 1–25.
  45. van der Ark. L. A.. van der Palm. D. W.. Sijtsma. K.. A latent class approach to estimating test-score reliability. Applied Psychological Measurement. 35. 5. 380–392. 2011. 10.1177/0146621610392911. 41739445. 2023-06-04. 2023-08-13. https://web.archive.org/web/20230813121350/https://research.tilburguniversity.edu/en/publications/a-latent-class-approach-to-estimating-test-score-reliability. live.
  46. Dunn. T. J.. Baguley. T.. Brunsden. V.. From alpha to omega: A practical solution to the pervasive problem of internal consistency estimation. British Journal of Psychology. 105. 3. 399–412. 2014. 10.1111/bjop.12046. 24844115. 2023-06-04. 2023-03-24. https://web.archive.org/web/20230324013930/http://irep.ntu.ac.uk/id/eprint/4853/1/215051_Dunn.pdf. live.
  47. G. Y.. Peters. The alpha and the omega of scale reliability and validity comprehensive assessment of scale quality. The European Health Psychologist. 1. 2. 56–69. 2014.
  48. Yang, Y., & Green, S. B.Coefficient alpha: A reliability coefficient for the 21st century?. Journal of Psychoeducational Assessment. 29. 4. 377–392. 2011. 10.1177/0734282911406668. Yanyun Yang. Green. Samuel B.. 119926199.
  49. W.. Revelle. Hierarchical cluster analysis and the internal structure of tests. Multivariate Behavioral Research. 14. 1. 57–74. 1979. 10.1207/s15327906mbr1401_4. 26766619.
  50. Web site: An overview of the psych package. Revelle. William. 7 January 2017. 23 April 2020. 27 August 2020. https://web.archive.org/web/20200827020016/http://personality-project.org/r/overview.pdf. live.
  51. Web site: Multivariate Software, Inc.. www.mvsoft.com. dead. https://web.archive.org/web/20010521070751/http://www.mvsoft.com/eqs60.htm. 2001-05-21.