Doctors Are Using AI to Transcribe Conversations With Patients. But Researchers Say the Tool Is Hallucinating 'Entire' Sentences. The tool malfunctioned 312 times in one study, leading to concerns about bias and misdiagnoses.

By Sherin Shibu Edited by Melissa Malamut

Key Takeaways

  • OpenAI’s transcription tool Whisper has been used by AI healthcare company Nabla to transcribe seven million medical conversations between patents and doctors, per The Verge.
  • New research shows that Whisper has been adding inaccuracies to transcripts.
  • In one study of 13,140 audio clips, 312 contained hallucinations.

ChatGPT-maker OpenAI introduced Whisper two years ago as an AI tool that transcribes speech to text. Now, the tool is used by AI healthcare company Nabla and its 45,000 clinicians to help transcribe medical conversations across over 85 organizations, like the University of Iowa Health Care.

However, new research shows that Whisper has been "hallucinating," or adding statements that no one has said, into transcripts of conversations, raising the question of how quickly medical facilities should adopt AI if it yields errors.

According to the Associated Press, a University of Michigan researcher found hallucinations in 80% of Whisper transcriptions. An unnamed developer found hallucinations in half of more than 100 hours of transcriptions. Another engineer found inaccuracies in almost all of the 26,000 transcripts they generated with Whisper.

Faulty transcriptions of conversations between doctors and patients could have "really grave consequences," Alondra Nelson, professor at the Institute for Advanced Study in Princeton, NJ, told AP.

"Nobody wants a misdiagnosis," Nelson stated.

Related: AI Isn't 'Revolutionary Change,' and Its Benefits Are 'Exaggerated,' Says MIT Economist

Earlier this year, researchers at Cornell University, New York University, the University of Washington, and the University of Virginia published a study that tracked how many times OpenAI's Whisper speech-to-text service hallucinated when it had to transcribe 13,140 audio segments with an average length of 10 seconds. The audio was sourced from TalkBank's AphasiaBank, a database featuring the voices of people with aphasia, a language disorder that makes it difficult to communicate.

The researchers found 312 instances of "entire hallucinated phrases or sentences, which did not exist in any form in the underlying audio" when they ran the experiment in the spring of 2023.

Related: Google's New AI Search Results Are Already Hallucinating — Telling Users to Eat Rocks and Make Pizza Sauce With Glue

Among the hallucinated transcripts, 38% contained harmful language, like violence or stereotypes, that did not match the context of the conversation.

"Our work demonstrates that there are serious concerns regarding Whisper's inaccuracy due to unpredictable hallucinations," the researchers wrote.

The researchers say that the study could also mean a hallucination bias in Whisper, or a tendency for it to insert inaccuracies more often for a particular group — and not just for people with aphasia.

"Based on our findings, we suggest that this kind of hallucination bias could also arise for any demographic group with speech impairments yielding more disfluencies (such as speakers with other speech impairments like dysphonia [disorders of the voice], the very elderly, or non-native language speakers)," the researchers stated.

Related: OpenAI Reportedly Used More Than a Million Hours of YouTube Videos to Train Its Latest AI Model

Whisper has transcribed seven million medical conversations through Nabla, per The Verge.

Sherin Shibu

BIZ Experiences Staff

News Reporter

Sherin Shibu is a business news reporter at BIZ Experiences.com. She previously worked for PCMag, Business Insider, The Messenger, and ZDNET as a reporter and copyeditor. Her areas of coverage encompass tech, business, strategy, finance, and even space. She is a Columbia University graduate.

Want to be an BIZ Experiences Leadership Network contributor? Apply now to join.

Business Ideas

70 Small Business Ideas to Start in 2025

We put together a list of the best, most profitable small business ideas for BIZ Experiencess to pursue in 2025.

Science & Technology

OpenAI's Latest Move Is a Game Changer — Here's How Smart Solopreneurs Are Turning It Into Profit

OpenAI's latest AI tool acts like a full-time assistant, helping solopreneurs save time, find leads and grow their business without hiring.

Social Media

How To Start a Youtube Channel: Step-by-Step Guide

YouTube can be a valuable way to grow your audience. If you're ready to create content, read more about starting a business YouTube Channel.

Money & Finance

These Are the Expected Retirement Ages By Generation, From Gen Z to Boomers — and the Average Savings Anticipated. How Do Yours Compare?

Many Americans say inflation prevents them from saving enough and fear they won't reach their financial goals.

Starting a Business

I Built a $20 Million Company by Age 22 While Still in College. Here's How I Did It and What I Learned Along the Way.

Wealth-building in your early twenties isn't about playing it safe; it's about exploiting the one time in life when having nothing to lose gives you everything to gain.

Science & Technology

AI Isn't Plug-and-Play — You Need a Strategy. Here's Your Guide to Building One.

Don't just "add AI" — build a strategy. This guide helps founders avoid common pitfalls and create a step-by-step roadmap to harness real value from AI.