Tech /
AI

Google’s AI chatbot Bard makes factual error in first demo

The mistake highlights the biggest problem of using AI chatbots to replace search engines — they make stuff up.

The mistake highlights the biggest problem of using AI chatbots to replace search engines — they make stuff up.

by James Vincent

Feb 8, 2023, 3:26 PM UTC

If you buy something from a Verge link, Vox Media may earn a commission. See our ethics statement.

A screenshot of Bard’s interface, saying “Introducing Bard, an experimental conversational AI service powered by LaMDA.”

A screenshot of Bard’s interface, saying “Introducing Bard, an experimental conversational AI service powered by LaMDA.”

Google has been scrambling to launch a competitor to ChatGPT — but perhaps rushing a little too hard.

Image: Google

James Vincent is a senior reporter who has covered AI, robotics, and more for eight years at The Verge.

On Monday, Google announced its AI chatbot Bard — a rival to OpenAI’s ChatGPT that’s due to become “more widely available to the public in the coming weeks.” But the bot isn’t off to a great start, with experts noting that Bard made a factual error in its very first demo.

A GIF shared by Google shows Bard answering the question: “What new discoveries from the James Webb Space Telescope can I tell my 9 year old about?” Bard offers three bullet points in return, including one that states that the telescope “took the very first pictures of a planet outside of our own solar system.”

However, a number of astronomers on Twitter pointed out that this is incorrect and that the first image of an exoplanet was taken in 2004 — as stated here on NASA’s website.

“Not to be a ~well, actually~ jerk, and I’m sure Bard will be impressive, but for the record: JWST did not take ‘the very first image of a planet outside our solar system,’” tweeted astrophysicist Grant Tremblay.

Bruce Macintosh, director of University of California Observatories at UC Santa Cruz, also pointed out the mistake. “Speaking as someone who imaged an exoplanet 14 years before JWST was launched, it feels like you should find a better example?” he tweeted.

(The mistake and the experts’ objections were first spotted by Reuters and New Scientist.)

A screenshot of an interaction with Bard. The question says: “What new discoveries from the James Space Webb Telescope can I tell my 9 year old about?” The answers include the bullet point: “JWST took the very first pictures of a planet outside of our own solar system. These distant worlds are called “exoplanets”. Exo means “from outside”.”

Bard’s very first answer contained a factual flub.

Image: Google

In a follow-up tweet, Tremblay added: “I do love and appreciate that one of the most powerful companies on the planet is using a JWST search to advertise their LLM. Awesome! But ChatGPT etc., while spooky impressive, are often *very confidently* wrong. Will be interesting to see a future where LLMs self error check.”

As Tremblay notes, a major problem for AI chatbots like ChatGPT and Bard is their tendency to confidently state incorrect information as fact. The systems frequently “hallucinate” — that is, make up information — because they are essentially autocomplete systems.

Rather than querying a database of proven facts to answer questions, they are trained on huge corpora of text and analyze patterns to determine which word follows the next in any given sentence. In other words, they are probabilistic, not deterministic — a trait that has led one prominent AI professor to label them “bullshit generators.”

Of course, the internet is already full of false and misleading information, but the issue is compounded by Microsoft and Google’s desire to use these tools as search engines. There, the chatbots’ answers take on the authority of a would-be all-knowing machine.

Microsoft, which demoed its new AI-powered Bing search engine yesterday, has tried to preempt these issues by placing liability on the user. “Bing is powered by AI, so surprises and mistakes are possible,” says the company’s disclaimer. “Make sure to check the facts, and share feedback so we can learn and improve!”

A spokesperson for Google, Jane Park, gave The Verge this statement: “This highlights the importance of a rigorous testing process, something that we’re kicking off this week with our Trusted Tester program. We’ll combine external feedback with our own internal testing to make sure Bard’s responses meet a high bar for quality, safety and groundedness in real-world information.”

Related:

Update, Wednesday February 8th, 11:03AM ET: The story has been updated with comment from Google.

See More:

Tina NguyenTwo hours ago

Google’s says its new AI agent can find new solutions in computing and math.

Emma RothMay 14

Pope Leo XIV names AI one of the reasons for his papal name

Wes DavisMay 10

Most Popular

OpenAI’s flagship GPT-4.1 model is now in ChatGPT

OpenAI’s flagship GPT-4.1 model is now in ChatGPT

SoundCloud changes its TOS again after an AI uproar

SoundCloud changes its TOS again after an AI uproar

Microsoft starts testing ‘Hey, Copilot!’ in Windows

Microsoft starts testing ‘Hey, Copilot!’ in Windows

Grok really wanted people to know that claims of white genocide in South Africa are highly contentious

Grok really wanted people to know that claims of white genocide in South Africa are highly contentious

Elon Musk’s apparent power play at the Copyright Office completely backfired

Elon Musk’s apparent power play at the Copyright Office completely backfired

Nvidia’s flattery of Trump wins reversal of AI chip limits and a Huawei clampdown

Nvidia’s flattery of Trump wins reversal of AI chip limits and a Huawei clampdown

OpenAI’s flagship GPT-4.1 model is now in ChatGPT

OpenAI’s flagship GPT-4.1 model is now in ChatGPT

OpenAI’s flagship GPT-4.1 model is now in ChatGPT

Jess Weatherbed11:47 AM UTC

SoundCloud changes its TOS again after an AI uproar

SoundCloud changes its TOS again after an AI uproar

SoundCloud changes its TOS again after an AI uproar

Wes DavisMay 14

Microsoft starts testing ‘Hey, Copilot!’ in Windows

Microsoft starts testing ‘Hey, Copilot!’ in Windows

Microsoft starts testing ‘Hey, Copilot!’ in Windows

Richard LawlerMay 14

Grok really wanted people to know that claims of white genocide in South Africa are highly contentious

Grok really wanted people to know that claims of white genocide in South Africa are highly contentious

Grok really wanted people to know that claims of white genocide in South Africa are highly contentious

Jay PetersMay 14

Elon Musk’s apparent power play at the Copyright Office completely backfired

Elon Musk’s apparent power play at the Copyright Office completely backfired

Elon Musk’s apparent power play at the Copyright Office completely backfired

Tina NguyenMay 14

Nvidia’s flattery of Trump wins reversal of AI chip limits and a Huawei clampdown

Nvidia’s flattery of Trump wins reversal of AI chip limits and a Huawei clampdown

Nvidia’s flattery of Trump wins reversal of AI chip limits and a Huawei clampdown

Jess WeatherbedMay 14

4:00 PM UTC

Sony WH-1000XM6 hands-on: back to the fold

9 minutes ago

Thanks, Trump tariffs, now I gotta replace my phone battery

4:00 PM UTC

Microsoft’s simplified Surface lineup puts another device on the chopping block

4:23 PM UTC

You can see right through Audio-Technica’s new transparent turntable

12:24 PM UTC

Apple’s CarPlay Ultra is finally here, if you have a new Aston Martin

3:00 PM UTC

America’s immigration system was a landmine, and Trump set it off