Artificial intelligence detection software aims to determine whether some content (text, image, video or audio) was generated using artificial intelligence (AI).
However, the reliability of such software is a topic of debate,[1] and there are concerns about the potential misapplication of AI detection software by educators.
Multiple AI detection tools have been shown to be unreliable at accurately and comprehensively detecting AI-generated text. In a 2023 study, Weber-Wulff et al. evaluated 14 detection tools, including Turnitin and GPTZero, and found that "all scored below 80% of accuracy and only 5 over 70%."[2]
For text, this is usually done to prevent alleged plagiarism, often by detecting telltale signs such as repetitive wording or AI hallucinations that suggest a text was AI-generated. These tools are often used by teachers marking their students' work, usually on an ad hoc basis. Following the release of ChatGPT and similar generative AI text tools, many educational establishments have issued policies against the use of AI by students.[3] AI text detection software is also used by those assessing job applicants, as well as by online search engines.[4]
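The repetition heuristic can be made concrete with a toy example. The sketch below is purely illustrative and does not correspond to any real product's method: it flags text in which three-word sequences (trigrams) recur unusually often, and both the function name and the 0.2 threshold are arbitrary choices made for the illustration.

```python
import re
from collections import Counter

def repeated_trigram_ratio(text: str) -> float:
    """Fraction of word trigrams that occur more than once.

    A toy heuristic only: highly repetitive phrasing *may* hint at
    machine-generated text, but it is far too crude to be reliable.
    """
    words = re.findall(r"[a-z']+", text.lower())
    trigrams = [tuple(words[i:i + 3]) for i in range(len(words) - 2)]
    if not trigrams:
        return 0.0
    counts = Counter(trigrams)
    repeated = sum(c for c in counts.values() if c > 1)
    return repeated / len(trigrams)

if __name__ == "__main__":
    sample = (
        "The model is a powerful tool. The model is a powerful tool "
        "for many tasks, and the model is a powerful tool overall."
    )
    score = repeated_trigram_ratio(sample)
    print(f"repeated-trigram ratio: {score:.2f}")
    # The 0.2 cut-off is arbitrary, chosen only for this illustration.
    print("flagged as possibly AI-generated" if score > 0.2 else "not flagged")
```

Real detectors such as GPTZero rely on model-based statistics like perplexity (how predictable a text is to a language model) rather than on surface repetition alone, which is one reason paraphrasing tools can evade them.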
Current detectors can be unreliable and have incorrectly flagged work by humans as originating from AI[5][6][7] while failing to detect AI-generated work in other instances.[8] MIT Technology Review said that the technology "struggled to pick up ChatGPT-generated text that had been slightly rearranged by humans and obfuscated by a paraphrasing tool".[9] AI text detection software has also been shown to discriminate against non-native speakers of English.[4]
Two students from the University of California, Davis, nearly faced expulsion after their professors scanned their essays with Turnitin's AI text detection tool, which flagged the essays as having been generated by AI. However, following media coverage[10] and a thorough investigation, the students were cleared of any wrongdoing.[11][12]
In April 2023, Cambridge University and other members of the Russell Group of universities in the United Kingdom opted out of Turnitin's AI text detection tool, after expressing concerns it was unreliable.[13] The University of Texas at Austin opted out of the system six months later.[14]
In May 2023, a professor at Texas A&M University–Commerce asked ChatGPT whether it had written his students' essays, and ChatGPT claimed that it had. On that basis he threatened to fail the class, even though ChatGPT is not able to reliably detect AI-generated writing.[15] No students were ultimately prevented from graduating over the issue, and all but one student (who admitted to using the software) were cleared of the accusation of having used ChatGPT in their work.[16]
An article by Thomas Germain, published on Gizmodo in June 2024, reported job losses among freelance writers and journalists due to AI text detection software mistakenly classifying their work as AI-generated.[17]
Software designed to bypass AI text detection is also available.[18]
A study published in August 2023 analyzed 20 abstracts from papers published in the journal Eye, which were paraphrased using GPT-4. The AI-paraphrased abstracts were examined for plagiarism using Quetext and for AI-generated content using Originality.AI. The texts were then re-processed through adversarial software called Undetectable.ai in order to reduce the AI-detection scores. The study found that Originality.AI identified text paraphrased by GPT-4 with a mean accuracy of 91.3%; after reprocessing by Undetectable.ai, its mean detection accuracy dropped to 27.8%.[19][20]
Some experts also believe that techniques like digital watermarking are ineffective, because watermarks can be removed, or added to human-made content to trigger false positives.[21]
Several purported AI image detection tools exist to identify AI-generated images (for example, those produced by Midjourney or DALL-E). They are not completely reliable.[22][23]
Other tools claim to identify video and audio deepfakes, but this technology is not yet fully reliable either.[24]
Despite debate around the efficacy of watermarking, Google DeepMind is actively developing detection software called SynthID, which works by embedding a digital watermark, invisible to the human eye, into the pixels of an image.[25][26]
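SynthID's exact algorithm has not been published, and it is designed to remain detectable after common edits. As a much simpler, purely illustrative sketch of the general idea of hiding an imperceptible machine-readable mark in pixel values, the toy example below embeds watermark bits in the least significant bit of each pixel's red channel. The function names and payload are hypothetical, and unlike SynthID this naive scheme is destroyed by resizing or re-encoding.

```python
import numpy as np

def embed_watermark(image: np.ndarray, bits: np.ndarray) -> np.ndarray:
    """Hide one watermark bit in the least significant bit (LSB) of
    each red-channel value. Changing a pixel by at most 1/255 is
    imperceptible, but the mark is fragile to any re-encoding."""
    marked = image.copy()
    flat = marked.reshape(-1, 3)            # view onto the copy
    n = min(len(bits), flat.shape[0])
    flat[:n, 0] = (flat[:n, 0] & 0xFE) | bits[:n]  # overwrite LSBs
    return marked

def extract_watermark(image: np.ndarray, n: int) -> np.ndarray:
    """Read back the first n hidden bits from the red channel."""
    return image.reshape(-1, 3)[:n, 0] & 1

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
    payload = rng.integers(0, 2, size=128, dtype=np.uint8)
    marked = embed_watermark(img, payload)
    assert np.array_equal(extract_watermark(marked, 128), payload)
    # Largest per-channel change is 1, invisible to the human eye.
    print("max pixel delta:", int(np.abs(marked.astype(int) - img.astype(int)).max()))
```

A scheme like this illustrates why watermark robustness matters: because only the lowest bit of each value carries the mark, any operation that perturbs pixel values, such as JPEG compression, erases it, which is the weakness production systems like SynthID aim to overcome.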