Technology reporter
Four major artificial intelligence (AI) chatbots are inaccurately summarising news stories, according to research carried out by the BBC.
The BBC gave OpenAI’s ChatGPT, Microsoft’s Copilot, Google’s Gemini and Perplexity AI content from the BBC website then asked them questions about the news.
It said the resulting answers contained “significant inaccuracies” and distortions.
In a blog, Deborah Turness, the CEO of BBC News and Current Affairs, said AI brought “endless opportunities” but the companies developing the tools were “playing with fire.”
“We live in troubled times, and how long will it be before an AI-distorted headline causes significant real world harm?”, she asked.
The tech companies which own the chatbots have been approached for comment.
‘Pull back’
In the study, the BBC asked ChatGPT, Copilot, Gemini and Perplexity to summarise 100 news stories and rated each answer.
It got journalists who were relevant experts in the subject of the article to rate the quality of answers from the AI assistants.
It found 51% of all AI answers to questions about the news were judged to have significant issues of some form.
Additionally, 19% of AI answers which cited BBC content introduced factual errors, such as incorrect factual statements, numbers and dates.
In her blog, Ms Turness said the BBC was seeking to “open up a new conversation with AI tech providers” so we can “work together in partnership to find solutions.”
She called on the tech companies to “pull back” their AI news summaries, as Apple did after complaints from the BBC that Apple Intelligence was misrepresenting news stories.
Some examples of inaccuracies found by the BBC included:
- Gemini incorrectly said the NHS did not recommend vaping as an aid to quit smoking
- ChatGPT and Copilot said Rishi Sunak and Nicola Sturgeon were still in office even after they had left
- Perplexity misquoted BBC News in a story about the Middle East, saying Iran initially showed “restraint” and described Israel’s actions as “aggressive”
In general, Microsoft’s Copilot and Google’s Gemini had more significant issues than OpenAI’s ChatGPT and Perplexity, which counts Jeff Bezos as one of its investors.
Normally, the BBC blocks its content from AI chatbots, but it opened its website up for the duration of the tests in December 2024.
The report said that as well as containing factual inaccuracies, the chatbots “struggled to differentiate between opinion and fact, editorialised, and often failed to include essential context.”
The BBC’s Programme Director for Generative AI, Pete Archer, said publishers “should have control over whether and how their content is used and AI companies should show how assistants process news along with the scale and scope of errors and inaccuracies they produce.”