Data Colada Explained

Data Colada is a blog dedicated to investigative analysis and replication of academic research, focusing in particular on the validity of findings in the social sciences.^[1]

It is known for its advocacy against problematic research practices such as p-hacking, and for publishing evidence of data manipulation and research misconduct in several prominent cases, including those involving the celebrity professors Dan Ariely and Francesca Gino. Data Colada was established in 2013 by three behavioral science researchers: Uri Simonsohn, a professor at ESADE Business School in Barcelona, Spain (as of 2023); Leif Nelson, a professor at the University of California, Berkeley; and Joe Simmons, a professor at the University of Pennsylvania.

History

Around 2011, Simmons, Nelson and Simonsohn "bonded over the false, ridiculous, and flashy findings that the field [of behavioral sciences] was capable of producing", such as a paper by Cornell psychologist Daryl Bem that had supposedly found evidence for clairvoyance.^[2] They reacted by publishing an influential 2011 paper about false positive results in psychology, illustrating the problem with a parody research finding that supposedly showed that listening to the Beatles song "When I’m Sixty-Four" made experimental subjects one and a half years younger.

The "Data Colada" blog was launched two years later, in 2013, carrying the tagline "Thinking about evidence, and vice versa", becoming what the New York Times described as "a hub for nerdy discussions of statistical methods — and, before long, various research crimes and misdemeanors".

In particular, the three researchers objected to the then-widespread practice of cherry-picking data and to attempts to make statistically non-significant results appear credible, especially through an approach for which they coined the term p-hacking in a 2014 paper.^[3] ^[4]

Notable findings

Apart from calling out faulty but presumably well-intentioned research practices, Data Colada has also published evidence of data manipulation and research misconduct. The cases include studies about the concept of the moral high ground by psychologist Lawrence Sanna and research by Flemish psychologist Dirk Smeesters. According to The New Yorker, after Data Colada published their work, the careers of Sanna and Smeesters "came to an unceremonious end".

In 2021, Data Colada discovered fabricated data in a 2012 field study published in PNAS^[5] by Lisa L. Shu, Nina Mazar, Francesca Gino, Dan Ariely, and Max H. Bazerman.^[6] ^[7] All of the study's authors agreed with Data Colada's assessment, and the paper was retracted. The authors also agreed that Ariely was the only author who had access to the data before it was transmitted in its fraudulent form to Mazar, the analyst. Ariely denied manipulating the data,^[8] but Excel metadata showed that he created the spreadsheet and was the last to edit it. Shortly after he initially sent Mazar the data, he also admitted in an e-mail to her that he had mislabeled all of the values in an entire column.^[9] Ariely has stated that someone at the insurance agency that provided the data must have fabricated it.^[10] ^[11]

Reception

Data Colada's work is credited with raising awareness of the replication crisis, the idea that many research results in the social sciences are difficult or impossible to reproduce. Data Colada is also recognized for helping to establish better research practices, such as the sharing of replication data.

The Nobel Prize-winning psychologist Daniel Kahneman described the Data Colada team in 2023 as "heroes of mine" and expressed his regret about previously endorsing research findings that the blog later showed were faulty. Brian Nosek of the Center for Open Science applauded Data Colada for having "done an amazing job of developing new methodologies to interrogate the credibility of research."

On the other hand, as summarized by The New Yorker, "Data Colada's harshest critics saw the young men as jealous upstarts who didn’t understand the soft artistry of the social sciences". Psychologist Norbert Schwarz accused Data Colada and other reformers of engaging in a "witch hunt," while psychologist Daniel Gilbert denounced what he called the "replication police" as "shameless little bullies".

Francesca Gino lawsuit

In 2021, researcher Zoé Ziani and a collaborator alerted Data Colada to problems replicating work by Harvard behavioral scientist Francesca Gino. Later that year, the Data Colada team contacted Harvard University about anomalies in four papers by Gino. Harvard subsequently conducted its own internal investigation with the help of an outside firm, which discovered additional data alterations besides the cases raised by Data Colada. In June 2023, Harvard Business School placed Gino on unpaid administrative leave after the internal investigation determined that she had falsified data in her research.^[12] ^[13] ^[14] Around the same time, Data Colada published four blog posts detailing evidence that the four papers (all of which had been retracted or were set to be retracted at that point), and possibly others by Gino, "contain fake data."

Gino subsequently filed a $25 million defamation suit against Harvard, Harvard Business School Dean Srikant Datar, and the three members of Data Colada, alleging that they had conspired to damage her reputation with false accusations, and that the penalties against her amounted to gender-based discrimination under Title IX.^[14] Gino accused Harvard and the Data Colada team of having "worked together to destroy my career and reputation despite admitting they have no evidence proving their allegations."^[15] The lawsuit raised concerns that it could have a chilling effect on the scrutiny of published research. Open science proponent Simine Vazire raised over $370,000 to help cover the legal fees of Data Colada.^[16] ^[17]

On September 11, 2024, the judge dismissed all of Gino's claims against the Data Colada defendants (defamation and other claims), and dismissed Gino's defamation and certain other claims (such as violation of privacy) against the Harvard University defendants, while allowing some breach of contract claims against Harvard to continue.^[18] ^[19]

External links

Official site

Notes and References

Simmons, Joseph P.; Nelson, Leif D.; Simonsohn, Uri (2011-10-17). "False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant". Sage Journals.
Lewis-Kraus, Gideon (2023-09-30). "They Studied Dishonesty. Was Their Work a Lie?" The New Yorker. ISSN 0028-792X. Archived from the original on 2023-10-01: https://web.archive.org/web/20231001005942/https://www.newyorker.com/magazine/2023/10/09/they-studied-dishonesty-was-their-work-a-lie . Retrieved 2023-10-01.
Subbaraman, Nidhi (2023-09-24). "The Band of Debunkers Busting Bad Scientists". Archived from the original on 2023-09-24: https://archive.today/20230924094046/https://www.wsj.com/science/data-colada-debunk-stanford-president-research-14664f3 . Retrieved 2023-10-08.
"APA PsycNet". psycnet.apa.org. Archived from the original on 2023-11-02: https://web.archive.org/web/20231102173913/https://psycnet.apa.org/record/2013-25331-001 . Retrieved 2023-10-08.
"A study on dishonesty was based on fraudulent data" (August 20, 2021). ISSN 0013-0613. Archived from the original on August 22, 2021: https://web.archive.org/web/20210822230902/https://www.economist.com/graphic-detail/2021/08/20/a-study-on-dishonesty-was-based-on-fraudulent-data . Retrieved August 23, 2021.
"[98] Evidence of Fraud in an Influential Field Experiment About Dishonesty" (August 17, 2021). Data Colada. Archived from the original on June 23, 2023: https://web.archive.org/web/20230623041108/https://datacolada.org/98 . Retrieved August 18, 2021.
Lee, Stephanie M. (August 20, 2021). "A Famous Honesty Researcher Is Retracting A Study Over Fake Data". BuzzFeed News. Archived from the original on August 25, 2021: https://web.archive.org/web/20210825013035/https://www.buzzfeednews.com/article/stephaniemlee/dan-ariely-honesty-study-retraction . Retrieved August 23, 2021.
Ariely, Dan (August 16, 2021). "Dan Blog Comment". datacolada.org. Retrieved 29 January 2023.
Charlton, Aaron (2021-08-17). "Conflicts between Dan Ariely's statement and Footnote #14 (DataColada #98)". OpenMKT.org. Archived from the original on 2023-01-30: https://web.archive.org/web/20230130062439/https://openmkt.org/blog/2021/conflicts-between-dan-arielys-statement-and-footnote-14-datacolada-98/ . Retrieved 2023-01-30.
Charlton, Aaron (2022-08-21). "Dan Ariely claims authorship order shields him from blame; speculates that a low-level envelope stuffer committed the fraud". OpenMKT.org. Archived from the original on 2023-01-30: https://web.archive.org/web/20230130062439/https://openmkt.org/blog/2022/dan-ariely-claims-authorship-order-shields-him-from-blame-speculates-that-a-low-level-envelope-stuffer-committed-the-fraud/ . Retrieved 2023-01-30.
"Dan Ariely: 'People shout at me in the street, call me a murderer and a psychopath'" (in Hebrew). Haaretz. Archived from the original on 2023-01-30: https://web.archive.org/web/20230130062441/https://www.haaretz.co.il/gallery/galleryfriday/2022-06-09/ty-article-magazine/.highlight/00000181-3e90-d207-a795-7ef0418c0000 . Retrieved 2023-01-30.
"Francesca Gino - Faculty & Research - Harvard Business School". www.hbs.edu. Archived from the original on 2023-08-07: https://web.archive.org/web/20230807080833/https://www.hbs.edu/faculty/Pages/profile.aspx?facId=271812 . Retrieved 2023-08-07.
Quinn, Ryan. "Harvard Dishonesty Researcher Now on Administrative Leave". Inside Higher Ed. Archived from the original on 2023-07-24: https://web.archive.org/web/20230724050911/https://www.insidehighered.com/news/quick-takes/2023/06/21/harvard-dishonesty-researcher-now-administrative-leave . Retrieved 2023-07-24.
Hamid, Rahem D.; Yuan, Claire (2023-08-03). "Embattled by Data Fraud Allegations, Business School Professor Francesca Gino Files Defamation Suit Against Harvard". Archived from the original on 2023-09-25: https://web.archive.org/web/20230925083827/https://www.thecrimson.com/article/2023/8/3/hbs-prof-lawsuit-data-fraud-defamation/ . Retrieved 2023-10-31.
Svrluga, Susan (2023-08-03). "Professor accused of faking data in studies on dishonesty sues Harvard". Washington Post. ISSN 0190-8286. Archived from the original on 2023-09-30: https://web.archive.org/web/20230930052928/https://www.washingtonpost.com/education/2023/08/03/harvard-honesty-lawsuit-research-misconduct/ . Retrieved 2023-11-04.
Piper, Kelsey (2023-08-23). "A disgraced Harvard professor sued them for millions. Their recourse: GoFundMe." Vox. Archived from the original on 2023-10-31: https://web.archive.org/web/20231031230901/https://www.vox.com/future-perfect/23841742/francesca-gino-data-colada-lawsuit-gofundme-science-culture-transparency-academic-fraud-dishonesty . Retrieved 2023-10-31.
O'Grady, Cathleen (2023-10-13). "How the reform-minded new editor of psychology's flagship journal will shake things up". science.org. Archived from the original on 2023-10-29: https://web.archive.org/web/20231029025437/https://www.science.org/content/article/how-reform-minded-new-editor-psychology-s-flagship-journal-will-shake-things . Retrieved 2023-10-31.
Lee, Stephanie M. (2024-09-11). "She Sued the Sleuths Who Found Fraud in Her Data. A Judge Just Ruled Against Her." The Chronicle of Higher Education. Retrieved 2024-09-26.
"Memorandum of Decision" (September 11, 2024). Court Listener. Free Law Project. Retrieved 26 September 2024. Full text of decision on defendants' motions to dismiss.