Data collaboratives explained

Data collaboratives (sometimes called “corporate data philanthropy”)[1] are a form of collaboration in which participants from different sectors—including private companies, research institutions, and government agencies—can exchange data and data expertise to help solve public problems.[2] [3]

Types

Data collaboratives can take many forms. They can be organized as:[4]

Reasons for data collaboratives

The big data boom has demonstrated the power of data to inform and design public projects in an accountable and iterative manner.[8] However, unequal access to certain data across sectors limits the ability of groups to find, access, or be made aware of valuable information, hindering social innovation.[9] Data collaboratives create networks that bridge access and knowledge gaps by bringing different sectors together to share data to address social challenges.[6]

The GovLab argues data collaboratives wherein a private sector data holder shares data with other groups tend to be motivated by a desire for:[4]

Data collaboratives can help respond to service delivery and emergency preparedness and disaster response problems. Robert Kirkpatrick, Director of UN Global Pulse noted that “the lack of innovation [in these sectors have] resulted in a failure to protect the public from what turns out to be preventable harms.”[10]

Incentives for private sector participation

According to The GovLab, data collaboratives can provide five main benefits for public problems:[4]

Examples

From 2017 to 2019, the percentage of companies entering data-related partnerships rose from 21% to 40%.[16] A growing share of business competitors are also deciding to connect their data—jumping from 7% to 17%.[6] In a 2019 report, the World Economic Forum and McKinsey estimated that connecting data across institutional and geographic boundaries could create roughly $3 trillion annually in economic value by 2020.[6]

The following is an illustrative (but not exhaustive) list of some data collaboratives:

Risks, challenges, and ethical considerations

Data collaboratives have significant challenges related to data security, data privacy, commercial risk, reputational concerns and regulatory uncertainty.[29] In addition, there exist concerns about the lack of trust among individuals, institutions and governments.[6]

Risks

Mitigating privacy protection issues

Privacy preserving computation (PPC) presents data in forms that can be shared, analyzed, and operated on by multiple stakeholders without the raw information. To do so, PPC seeks to control the environment within which the data is operated on (Trusted Execution Environment) and strips the data of identifying traits (Differential Privacy).[30] Protecting the data via Homomorphic Encryption techniques, PPC allows users to execute operations and see their outcomes without exposing the source data.[6] Through secure Multi-Party Computation, different groups can combine data to work in a decentralized and collaborative manner.[6]

PPC techniques are already being leveraged by governments and large corporations. In 2015, the Estonian government worked with the private firm, Sharemind, to analyze tax and education records through Multi-Party Computation for the Private Statistics Project. An external audit by the European Commission PRACTICE project found that the Private Statistics Project did not expose any personal data.[6]

In 2019, Google released its Private Join and Compute protocol to open-source, allowing users to use Homomorphic Encryption and Multi-Party Computation.[6] In the same year, ten pharmaceutical companies formed the Melloddy consortium to use blockchain technology to train a drug discovery algorithm via shared data.[6]

Mitigating power asymmetries

Power imbalances can occur when stronger parties manipulate, exclude, or pressure weaker members of the data collaborative. From a classical viewpoint, power refers to the influence a person or group has over another.[31] Examining collaborative governance, Dave Egan, Evan E. Hjerpe, and Jesse Abrams suggest a three-phased approach to power: power over refers to the ability to control the behavior of others, power for looks at the ability to authorize the participation of stakeholders, and power to considers the ability to measure another entity’s ability to realize its goals.[32]

Power imbalances can arise from disparities in authority, resources, legitimacy or trust between parties.[6] The more actors in the data collaborative or more incentives of data use, the increased likelihood for conflicting interests. Oftentimes, data is viewed as an organizational asset, and opening it up to new uses by others means relinquishing control over the data and ceding this autonomy to the collaborative, resulting in the “control and generativity challenge.”[33] Data stewards can help reduce the power imbalances by reducing bias influences, follow operating procedures, and provide issue resolution and remediation.[34]

See also

Notes and References

  1. Taddeo . Mariarosaria . Data philanthropy and individual rights . Minds and Machines . 2017 . 27 . 1 . 1–5 . 10.1007/s11023-017-9429-2. 38297827 . free .
  2. Web site: Verhulst . Stefaan . Sangokoya . David . Data Collaboratives: Exchanging Data to Improve People's Lives . The Governance Lab . 22 April 2015.
  3. Book: Young . Andrew . Verhulst . Stefaan . Harris . Phil . Bitonti . Alberto . Fleisher . Craig S. . Skorkjær Binderkrantz . Anne . The Palgrave Encyclopedia of Interest Groups, Lobbying and Public Affairs . 2020 . Palgrave Macmillan, Cham . Data Collaboratives.
  4. Web site: Verhulst . Stefaan . Young . Andrew . Srinivasan . Prianka . An Introduction to Data Collaboratives: Creating Public Value By Exchanging Data .
  5. Web site: Verhulst . Stefaan G. . Young . Andrew . Winowatan . Michelle . Zahuranec . Andrew J. . Leveraging Data for Public Good: A Descriptive Analysis and Typology of Existing Practices . The Governance Lab . October 2019.
  6. Ibid.
  7. Web site: Economic Graph Research . LinkedIn .
  8. Book: OECD . The Path to Becoming a Data-Driven Public Sector . 2019 . OECD Digital Government Studies, OECD Publishing . Paris.
  9. Susha . Iryna . Janssen . Marijn . Verhulst . Stefaan . Data collaboratives as "bazaars"? A review of coordination problems and mechanisms to match demand for data with supply . Transforming Government: People, Process and Policy . 2017 . 11 . 1 . 157–172 . 10.1108/TG-01-2017-0007. 195968470 .
  10. Web site: Kirkpatrick . Robert . Unpacking the Issue of Missed Use and Misuse of Data . UN Global Pulse. 18 March 2019 .
  11. Web site: Goldman . Hunter . Big Data Offers New Opportunities for Community Resilience . The Rockefeller Foundation . 30 December 2014.
  12. Web site: Young . Andrew . Verhulst . Stefaan . Battling Ebola in Sierra Leone: Data Sharing to Improve Crisis Response . ODI Impact . January 2016.
  13. Web site: All of Us . All of US.
  14. Adler . Natalia . Cattuto . Ciro . Kalimeri . Kyriaki . Paolotti . Daniela . Tizzoni . Michele . Verhulst . Stefaan . Yom-Tov . Elad . Young . Andrew . How Search Engine Data Enhance the Understanding of Determinants of Suicide in India and Inform Prevention: Observational Study . Journal of Medical Internet Research . 2019 . 21 . 1 . e10179 . 10.2196/10179 . 30609976 . 6682304 . free .
  15. Web site: Thrombotic events and death in inpatient-identified COVID-19 . Sentinel . 14 December 2020.
  16. Web site: Hoffman . William . Bick . Raphael . Boral . Austin . Henke . Nicolaus . Olukoya . Didunoluwa . Rifai . Khaled . Roth . Marcus . Youldon . Tom . Collaborating for the common good: Navigating public-private data partnerships . McKinsey & Company . 30 May 2019.
  17. Web site: AI4BetterHearts - A Cardiovascular Data Collaborative . Novartis.
  18. Web site: Chicago Data Collaborative. Chicago Data Collaborative.
  19. Web site: Counter-Trafficking Data Collaborative (CTDC). www.ctdatacollaborative.org.
  20. Web site: Offline Intelligence & Measurement - Increase Return on Ad Spend. Cuebiq.
  21. Web site: Data for Good . CubeIQ.
  22. Web site: Data Collaborative for Justice. Data Collaborative for Justice.
  23. Web site: Data for health and sustainable development. Health Data Collaborative.
  24. Web site: What is INDIGO? . Governance Outcomes Lab.
  25. Web site: Reclaim your data destiny | InfoSum. www.infosum.com.
  26. Web site: Mobility Data Collaborative .
  27. Web site: Mobility Data Collaborative Publishes Best Practices for Data Terminology and Governance . SAE International . 5 May 2020.
  28. Web site: Home. Water Data Collaborative.
  29. Web site: Data Collaboration for the Common Good: Enabling Trust and Innovation Through Public-Private Partnerships . World Economic Forum . April 2019.
  30. Web site: Maximize collaboration through secure data sharing . Accenture . 2019.
  31. Book: Weber . Max . Henderson . A.M. . Parsons . T. . The Theory of Social and Economic Organization . 1947 . Oxford University Press . New York.
  32. Orth . Patricia B. . Cheng . Antony S. . Who's in Charge? The Role of Power in Collaborative Governance and Forest Management . Humboldt Journal of Social Relations . 2018 . 1 . 40 . 191–210. 10.55671/0160-4341.1068 . 55822373 . free .
  33. Klievink . Bram . van der Voort . Haiko . Veeneman . Wijnand . Creating value through data collaboratives: Balancing innovation and control . Information Polity . 2018 . 23 . 4 . 379–397 . 10.3233/IP-180070 . 58005713 .
  34. Downing . Kathy . Importance of Data Stewards in Information Governance . Journal of AHIMA Website . 26 May 2016 .