Data valuation explained

Data valuation is a discipline in the fields of accounting and information economics.[1] It is concerned with methods to calculate the value of data collected, stored, analyzed and traded by organizations. This valuation depends on the type, reliability and field of data.

History

In the 21st century, exponential increases in computing power and data storage capabilities (in line with Moore's law) have led to a proliferation of big data, machine learning and other data analysis techniques. Businesses increasingly adapt these techniques and technologies to pursue data-driven strategies to create new business models. Traditional accounting techniques used to value organizations were developed in an era before high-volume data capture and analysis became widespread and focused on tangible assets (machinery, equipment, capital, property, materials etc.), ignoring data assets. As a result, accounting calculations often ignore data and leave its value off organizations' balance sheets.[2] Notably, in the wake of the 9/11 attacks on the World Trade Center in 2001, a number of businesses lost significant amounts of data. They filed claims with their insurance companies for the value of information that was destroyed, but the insurance companies denied the claims, arguing that information did not count as property and therefore was not covered by their policies.[3]

A number of organizations and individuals began noticing this and then publishing on the topic of data valuation. Doug Laney, vice president and analyst at Gartner, conducted research on Wall Street valued companies, which found that companies that had become information-centric, treating data as an asset, often had market-to-book values two to three times higher than the norm.[3] [4] On the topic, Laney commented: "Even as we are in the midst of the Information Age, information simply is not valued by those in the valuation business. However, we believe that, over the next several years, those in the business of valuing corporate investments, including equity analysts, will be compelled to consider a company's wealth of information in properly valuing the company itself."[2] In the latter part of the 2010s, the list of most valuable firms in the world (a list traditionally dominated by oil and energy companies) was dominated by data firms – Microsoft, Alphabet, Apple, Amazon and Facebook.[5] [6]

Characteristics of data as an asset

A 2020 study by the Nuffield Institute at Cambridge University, UK divided the characteristics of data into two categories, economic characteristics and informational characteristics.[7]

Economic characteristics

Informational characteristics

Data value drivers

A number of drivers affect the extent to which future economic benefits can be derived from data. Some drivers relate to data quality, while others may either render the data valueless or create unique and valuable competitive advantages for data owners.[8]

The process of realizing value from data can be subdivided into a number of key stages: data assessment, where the current states and uses of data are mapped; data valuation, where data value is measured; data investment, where capital is spent to improve processes, governance and technologies underlying data; data utilization, where data is used in business initiatives; and data reflection, where the previous stages are reviewed and new ideas and improvements are suggested.[9]

Methods for valuing data

Due to the wide range of potential datasets and use cases, as well as the relative infancy of data valuation, there are no simple or universally agreed upon methods. High option value and externalities mean data value may fluctuate unpredictably, and seemingly worthless data may suddenly become extremely valuable at an unspecified future date.[7] Nonetheless, a number of methods have been proposed for calculating or estimating data value.

Information-theoretic characterization

Information theory provides quantitative mechanisms for data valuation. For instance, secure data sharing requires careful protection of individual privacy or organization intellectual property. Information-theoretic approaches and data obfuscation can be applied to sanitize data prior to its dissemination.[10] [11]

Information-theoretic measures, such as entropy, information gain, and information cost, are useful for anomaly and outlier detection.[12] In data-driven analytics, a common problem is quantifying whether larger data sizes and/or more complex data elements actually enhance, degrade, or alter the data information content and utility. The data value metric (DVM) quantifies the useful information content of large and heterogeneous datasets in terms of the tradeoffs between the size, utility, value, and energy of the data.[13] Such methods can be used to determine if appending, expanding, or augmenting an existent dataset may improve the modeling or understanding of the underlying phenomenon.

Infonomics valuation models

Doug Laney identifies six approaches for valuing data, dividing these into two categories: foundational models and financial models. Foundational models assign a relative, informational value to data, where financial models assign an absolute, economic value.[14]

Foundational models

Financial models

Bennett institute valuations

Research by the Bennett Institute divides approaches for estimating the value of data into market-based valuations and non-market-based valuations.[7]

Market based valuations

Non-market based valuations

Other approaches

Companies performing Data Valuations

Data Valuation as a Service provides:

References

  1. Information as an Economic Commodity. 2006582. Allen. Beth. The American Economic Review. 1990. 80. 2. 268–273.
  2. Web site: Gartner Says Within Five Years, Organizations Will Be Valued on Their Information Portfolios.
  3. Web site: How Do You Value Information?. 15 September 2016 .
  4. Web site: Applied Infonomics: Why and How to Measure the Value of Your Information Assets.
  5. Web site: The Value of Data. 22 September 2017 .
  6. Web site: Most Valuable Companies in the World – 2020.
  7. Web site: The Value of Data Summary Report.
  8. Web site: Putting a value on data .
  9. Web site: Data Valuation – What is Your Data Worth and How do You Value it?. 13 September 2019 .
  10. Book: Association for Computing Machinery . Askari, M . Safavi-Naini, R . Barker, K . Proceedings of the second ACM conference on Data and Application Security and Privacy . An information theoretic privacy and utility measure for data sanitization mechanisms . 2012 . 10.1145/2133601.2133637 . 283–294 . 9781450310918 . 18338542 . https://doi.org/10.1145/2133601.2133637.
  11. Zhou, N. Wu, Q . Wu, Z . Marino, S . Dinov, ID . 2022 . DataSifterText: Partially Synthetic Text Generation for Sensitive Clinical Notes . Journal of Medical Systems . 46 . 96 . 96 . 10.1007/s10916-022-01880-6 . 36380246 . 10111580 .
  12. Book: IEEE. Lee, W . Xiang, D . Proceedings 2001 IEEE Symposium on Security and Privacy. S&P 2001 . Information-theoretic measures for anomaly detection . 2001 . 10.1109/SECPRI.2001.924294 . 130–143 . 0-7695-1046-9 . 6014214 . https://doi.org/10.1109/SECPRI.2001.924294.
  13. Springer. Noshad, M . Choi, J . Sun, Y . Hero, A . Dinov, ID . 2021 . An information theoretic privacy and utility measure for data sanitization mechanisms . J Big Data . 10.1186/s40537-021-00446-6 . 8 . 82 . 82 . 34777945 . 8550565 . free .
  14. Web site: Why and How to Measure the Value of your Information Assets.
  15. Web site: Measuring the Value of Information: An Asset Valuation Approach.
  16. Web site: The Valuation of Data as an Asset.
  17. Web site: Consumption-Based Method. 4 December 2018 .
  18. Web site: Keeping Research Data Safe Method. 4 December 2018 .
  19. Web site: Why you should be treating data as an asset. 2 March 2020 .
  20. Web site: Data Valuation – Valuing the World's Greatest Asset.