Sparrow (chatbot) explained

Sparrow is a chatbot developed by the artificial intelligence research lab DeepMind, a subsidiary of Alphabet Inc. It is designed to answer users' questions correctly, while reducing the risk of unsafe and inappropriate answers.^[1] One motivation behind Sparrow is to address the problem of language models producing incorrect, biased or potentially harmful outputs.^[2] ^[3] Sparrow is trained using human judgements, in order to be more “Helpful, Correct and Harmless” compared to baseline pre-trained language models.^[4] The development of Sparrow involved asking paid study participants to interact with Sparrow, and collecting their preferences to train a model of how useful an answer is.^[5]

To improve accuracy and help avoid the problem of hallucinating incorrect answers, Sparrow has the ability to search the Internet using Google Search^[6] ^[7] ^[8] in order to find and cite evidence for any factual claims it makes.

To make the model safer, its behaviour is constrained by a set of rules, for example "don't make threatening statements" and "don't make hateful or insulting comments", as well as rules about possibly harmful advice and about not claiming to be a person.^[9] During development, study participants were asked to converse with the system and try to trick it into breaking these rules.^[10] A 'rule model' was trained on these participants' judgements and was then used for further training.

Sparrow was introduced in a paper in September 2022, titled "Improving alignment of dialogue agents via targeted human judgements";^[11] however, the bot was not released publicly.^[12] ^[13] DeepMind CEO Demis Hassabis said DeepMind is considering releasing Sparrow for a "private beta" some time in 2023.^[14] ^[15] ^[16]

Training

Sparrow is a deep neural network based on the transformer machine learning model architecture. It is fine-tuned from DeepMind's Chinchilla AI pre-trained large language model (LLM),^[17] which has 70 billion parameters.^[18]

Sparrow is trained using reinforcement learning from human feedback (RLHF),^[19] ^[20] although some supervised fine-tuning techniques are also used. The RLHF training utilizes two reward models to capture human judgements: a “preference model” that predicts what a human study participant would prefer and a “rule model” that predicts if the model has broken one of the rules.^[21]
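The interplay of the two reward models can be illustrated with a minimal Python sketch. It is not DeepMind's implementation: the function combined_reward, the rule_weight parameter, the toy scoring functions and the choice to subtract the worst rule-violation probability from the preference score are all assumptions made for illustration. The two rules are taken from the examples quoted earlier in this article.

```python
from typing import Callable, Sequence

# Hypothetical stand-ins for the two learned reward models (assumptions).
# preference model: (dialogue, response) -> preference score
PreferenceModel = Callable[[str, str], float]
# rule model: (dialogue, response, rule) -> estimated probability the rule is broken
RuleModel = Callable[[str, str, str], float]

RULES: Sequence[str] = (
    "don't make threatening statements",
    "don't make hateful or insulting comments",
)


def combined_reward(
    dialogue: str,
    response: str,
    preference_model: PreferenceModel,
    rule_model: RuleModel,
    rules: Sequence[str] = RULES,
    rule_weight: float = 1.0,
) -> float:
    """Combine both signals into one scalar reward for RL training.

    Assumed formula: preference score minus a weighted penalty for the
    rule most likely to have been broken.
    """
    preference = preference_model(dialogue, response)
    worst_violation = max(rule_model(dialogue, response, rule) for rule in rules)
    return preference - rule_weight * worst_violation


# Toy models so the sketch runs end to end; real models would be neural networks.
def toy_preference_model(dialogue: str, response: str) -> float:
    # Pretend that longer answers (up to 10 words) are preferred.
    return min(len(response.split()), 10) / 10


def toy_rule_model(dialogue: str, response: str, rule: str) -> float:
    # Flag an insult only under the "insulting comments" rule.
    if "insulting" in rule and "idiot" in response.lower():
        return 1.0
    return 0.0


if __name__ == "__main__":
    question = "What is the capital of France?"
    for answer in ("Paris is the capital of France.", "You idiot, it is Paris."):
        reward = combined_reward(question, answer, toy_preference_model, toy_rule_model)
        print(f"{answer!r}: reward = {reward:.2f}")
```

In this sketch a response that the rule model flags receives a lower reward than a comparable harmless response, which is the pressure that pushes the policy towards answers that are both preferred and rule-abiding.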

Limitations

Sparrow's training corpus is mainly in English, so it performs worse in other languages.

When adversarially probed by study participants, Sparrow still breaks the rules 8% of the time;^[22] this is, however, three times less often than the baseline prompted pre-trained model (Chinchilla).

Notes and References

Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Deepmind Introduces 'Sparrow,' An Artificial Intelligence-Powered Chatbot Developed To Build Safer Machine Learning Systems . Khushboo . Gupta . September 28, 2022 . MarkTechPost . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Deepmind Introduces 'Sparrow,' An Artificial Intelligence-Powered Chatbot Developed To Build Safer Machine Learning Systems . Khushboo . Gupta . September 28, 2022 . MarkTechPost . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Deepmind Introduces 'Sparrow,' An Artificial Intelligence-Powered Chatbot Developed To Build Safer Machine Learning Systems . Khushboo . Gupta . September 28, 2022 . MarkTechPost . February 6, 2023.
Web site: Why DeepMind isn't deploying its new AI chatbot — and what it means for responsible AI . Sharon . Goldman . January 23, 2023 . Venture Beat . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Deepmind Introduces 'Sparrow,' An Artificial Intelligence-Powered Chatbot Developed To Build Safer Machine Learning Systems . Khushboo . Gupta . September 28, 2022 . MarkTechPost . February 6, 2023.
Web site: DeepMind's AI chatbot can do things that ChatGPT cannot, CEO claims . Anthony . Cuthbertson . January 16, 2023 . The Independent . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Why DeepMind isn't deploying its new AI chatbot — and what it means for responsible AI . Sharon . Goldman . January 23, 2023 . Venture Beat . February 6, 2023.
Web site: DeepMind's AI chatbot can do things that ChatGPT cannot, CEO claims . Anthony . Cuthbertson . January 16, 2023 . The Independent . February 6, 2023.
Web site: DeepMind's CEO Helped Take AI Mainstream. Now He's Urging Caution . Billy . Perrigo . January 12, 2023 . TIME . February 6, 2023.
Web site: Google's DeepMind says it'll launch a more grown-up ChatGPT rival soon . Mark . Wilson . January 16, 2023 . Tech Radar . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: An empirical analysis of compute-optimal large language model training . Jordan . Hoffmann . April 12, 2022 . DeepMind . February 6, 2023.
Web site: The secret to Sparrow, DeepMind's latest Q&A chatbot: Human feedback . Katyanna . Quach . January 23, 2023 . The Register . February 6, 2023.
Web site: Why DeepMind isn't deploying its new AI chatbot — and what it means for responsible AI . Sharon . Goldman . January 23, 2023 . Venture Beat . February 6, 2023.
Web site: Why DeepMind isn't deploying its new AI chatbot — and what it means for responsible AI . Sharon . Goldman . January 23, 2023 . Venture Beat . February 6, 2023.
Web site: Deepmind Introduces 'Sparrow,' An Artificial Intelligence-Powered Chatbot Developed To Build Safer Machine Learning Systems . Khushboo . Gupta . September 28, 2022 . MarkTechPost . February 6, 2023.