Reference class forecasting explained

Reference class forecasting or comparison class forecasting is a method of predicting the future by looking at similar past situations and their outcomes. The theories behind reference class forecasting were developed by Daniel Kahneman and Amos Tversky. The theoretical work helped Kahneman win the Nobel Prize in Economics.

Reference class forecasting is so named as it predicts the outcome of a planned action based on actual outcomes in a reference class of similar actions to that being forecast.

Discussion of which reference class to use when forecasting a given situation is known as the reference class problem.

Overview

Kahneman and Tversky^[1] ^[2] found that human judgment is generally optimistic due to overconfidence and insufficient consideration of distributional information about outcomes.

People tend to underestimate the costs, completion times, and risks of planned actions, whereas they tend to overestimate the benefits of those same actions. Such error is caused by actors taking an "inside view", where focus is on the constituents of the specific planned action instead of on the actual outcomes of similar ventures that have already been completed.

Kahneman and Tversky concluded that disregard of distributional information, i.e. risk, is perhaps the major source of error in forecasting. On that basis they recommended that forecasters "should therefore make every effort to frame the forecasting problem so as to facilitate utilizing all the distributional information that is available". Using distributional information from previous ventures similar to the one being forecast is called taking an "outside view". Reference class forecasting is a method for taking an outside view on planned actions.

Reference class forecasting for a specific project involves the following three steps:

Identify a reference class of past, similar projects.
Establish a probability distribution for the selected reference class for the parameter that is being forecast.
Compare the specific project with the reference class distribution, in order to establish the most likely outcome for the specific project.

Reference class tennis

The reference class problem, also known as reference class tennis, is the discussion of which reference class to use when forecasting a given situation.

Suppose someone were trying to predict how long it would take to write a psychology textbook. Reference class tennis would involve debating whether we should take the average of all books (closest to an outside view), just all textbooks, or just all psychology textbooks (closest to an inside view).^[3] ^[4]

Practical use in policy and planning

Whereas Kahneman and Tversky developed the theories of reference class forecasting, Flyvbjerg and COWI (2004) developed the method for its practical use in policy and planning, which was published as an official Guidance Document in June 2004 by the UK Department for Transport.^[5]

The first instance of reference class forecasting in practice is described in Flyvbjerg (2006).^[6] This forecast was part of a review of the Edinburgh Tram Line 2 business case, which was carried out in October 2004 by Ove Arup and Partners Scotland. At the time, the project was forecast to cost a total of £320 million, of which £64 million – or 25% – was allocated for contingency. Using the newly implemented reference class forecasting guidelines, Ove Arup and Partners Scotland calculated the 80th percentile value (i.e., 80% likelihood of staying within budget) for total capital costs to be £400 million, which equaled 57% contingency. Similarly, they calculated the 50th percentile value (i.e., 50% likelihood of staying within budget) to be £357 million, which equaled 40% contingency. The review further acknowledged that the reference class forecasts were likely to be too low because the guidelines recommended that the uplifts should be applied at the time of decision to build, which the project had not yet reached, and that the risks therefore would be substantially higher at this early business case stage. On this basis, the review concluded that the forecasted costs could have been underestimated. The Edinburgh Tram Line 2 opened three years late in May 2014 with a final outturn cost of £776 million, which equals £628 million in 2004-prices.^[7]

Since the Edinburgh forecast, reference class forecasting has been applied to numerous other projects in the UK, including the £15 (US$29) billion Crossrail project in London. After 2004, The Netherlands, Denmark, and Switzerland have also implemented various types of reference class forecasting.

Before this, in 2001 (updated in 2011), AACE International (the Association for the Advancement of Cost Engineering) included Estimate Validation as a distinct step in the recommended practice of Cost Estimating (Estimate Validation is equivalent to Reference class forecasting in that it calls for separate empirical-based evaluations to benchmark the base estimate):

The estimate should be benchmarked or validated against or compared to historical experience and/or past estimates of the enterprise and of competitive enterprises to check its appropriateness, competitiveness, and to identify improvement opportunities...Validation examines the estimate from a different perspective and using different metrics than are used in estimate preparation.^[8]

In the process industries (e.g., oil and gas, chemicals, mining, energy, etc. which tend to dominate AACE's membership), benchmarking (i.e., "outside view") of project cost estimates against the historical costs of completed projects of similar types, including probabilistic information, has a long history.^[9] A method combining reference class forecasting and competitive crowdsourcing, Human Forest, has also been used in the life sciences, to estimate the likelihood that vaccines and treatments will successfully progress through clinical trial phases.^[10] ^[11]

Bibliography

12858711 . 2003 . Lovallo . D . Delusions of success. How optimism undermines executives' decisions . Harvard Business Review . 81 . 7 . 56–63 . Kahneman . D .
Book: https://books.google.com/books?id=tenILJ-MowQC&pg=PA133. Decision-making on Mega-Projects. 9781848440173. Priemus. Hugo. Flyvbjerg. Bent. van Wee. Bert. 2008. 10.4337/9781848440173.00014. Public Planning of Mega-Projects: Overestimation of Demand and Underestimation of Costs. Flyvbjerg. Bent.
Book: 10.1093/oxfordhb/9780199563142.003.0014. 2011. Flyvbjerg. Bent. Over Budget, Over Time, Over and Over Again: Managing Major Projects. https://books.google.com/books?id=FI7l8O1tlkkC&pg=PT321. The Oxford Handbook of Project Management. 9780199563142. Morris. Peter W. G. Pinto. Jeffrey K. Söderlund. Jonas. Jonas Söderlund.

Notes and References

Kahneman. Daniel. Tversky. Amos. Prospect Theory: An Analysis of Decision under Risk. Econometrica. 1979. 47. 2. 263–291. 10.2307/1914185. 1914185. 10.1.1.407.1910.
Intuitive prediction: Biases and corrective procedures. https://web.archive.org/web/20130908065829/http://www.dtic.mil/dtic/tr/fulltext/u2/a047747.pdf. live. September 8, 2013. 1977. Kahneman . Daniel. Tversky. Amos. Decision Research Technical Report PTR-1042-77-6. InBook: Kahneman. Daniel. Slovic. Paul. Tversky. Amos. 10.1017/CBO9780511809477.031. Judgment Under Uncertainty: Heuristics and Biases. 414–421. 1982. 9780511809477. Intuitive prediction: Biases and corrective procedures. Kahneman. Daniel. Tversky. Amos.
News: Week 10: Reference Class Forecasting. Conceptually. 20 April 2017.
News: Outside view. LessWrong Wiki. 20 April 2017.
News: June 2004. Procedures for Dealing with Optimism Bias in Transport Planning. The British Department for Transport. 23 July 2021.
Bent. Flyvbjerg. 2006. From Nobel Prize to Project Management: Getting Risks Right. Project Management Journal. 37. 3. 5–15. 1302.3642. 2238013. 2013arXiv1302.3642F. 10.1177/875697280603700302. 13203075.
News: February 2018. Report for the Edinburgh Tram Inquiry. 23 July 2021.
https://web.archive.org/web/20160530211731/http://www.aacei.org/tcmfree/tcmframework_webedition.pdf AACE International, "Total Cost Management Framework, Section 7.3, Cost Estimating and Budgeting", 2011. p. 147
Merrow, E. and Yarossi, M., "Assessing Project Cost and Schedule Risk", 1990 AACE Transactions. pp H.6.1-7
Atanasov . Pavel D. . Joseph . Regina . Feijoo . Felipe . Marshall . Max . Conway . Amanda . Siddiqui . Sauleh . 2022-07-27 . Human Forest vs. Random Forest in Time-Sensitive COVID-19 Clinical Trial Prediction . en . Rochester, NY. 10.2139/ssrn.3981732 . 3981732 . 245174381 .
Web site: NSF Award Search: Award # 1919333 - Human Forests versus Random Forest Models in Prediction . 2022-09-20 . www.nsf.gov.