Tech News

OpenAI’s Transcription Tool Hallucinates. Hospitals Are Using It Anyway

October 30, 2024

On Saturday, an Related Press investigation revealed that OpenAI’s Whisper transcription software creates fabricated textual content in medical and enterprise settings regardless of warnings in opposition to such use. The AP interviewed greater than 12 software program engineers, builders, and researchers who discovered the mannequin repeatedly invents textual content that audio system by no means mentioned, a phenomenon usually referred to as a “confabulation” or “hallucination” within the AI subject.

Upon its release in 2022, OpenAI claimed that Whisper approached “human stage robustness” in audio transcription accuracy. Nevertheless, a College of Michigan researcher advised the AP that Whisper created false textual content in 80 p.c of public assembly transcripts examined. One other developer, unnamed within the AP report, claimed to have discovered invented content material in virtually all of his 26,000 check transcriptions.

The fabrications pose explicit dangers in well being care settings. Regardless of OpenAI’s warnings in opposition to utilizing Whisper for “high-risk domains,” over 30,000 medical employees now use Whisper-based instruments to transcribe affected person visits, in line with the AP report. The Mankato Clinic in Minnesota and Kids’s Hospital Los Angeles are amongst 40 well being programs utilizing a Whisper-powered AI copilot service from medical tech firm Nabla that’s fine-tuned on medical terminology.

Nabla acknowledges that Whisper can confabulate, but it surely additionally reportedly erases authentic audio recordings “for knowledge security causes.” This might trigger further points, since docs can not confirm accuracy in opposition to the supply materials. And deaf sufferers could also be extremely impacted by mistaken transcripts since they might don’t have any strategy to know if medical transcript audio is correct or not.

The potential issues with Whisper prolong past well being care. Researchers from Cornell College and the College of Virginia studied hundreds of audio samples and located Whisper including nonexistent violent content material and racial commentary to impartial speech. They discovered that 1 p.c of samples included “complete hallucinated phrases or sentences which didn’t exist in any type within the underlying audio” and that 38 p.c of these included “specific harms equivalent to perpetuating violence, making up inaccurate associations, or implying false authority.”

In a single case from the research cited by AP, when a speaker described “two different women and one girl,” Whisper added fictional textual content specifying that they “have been Black.” In one other, the audio mentioned, “He, the boy, was going to, I’m unsure precisely, take the umbrella.” Whisper transcribed it to, “He took a giant piece of a cross, a teeny, small piece … I’m certain he didn’t have a terror knife so he killed quite a lot of folks.”

An OpenAI spokesperson advised the AP that the corporate appreciates the researchers’ findings and that it actively research the right way to cut back fabrications and incorporates suggestions in updates to the mannequin.

Why Whisper Confabulates

The important thing to Whisper’s unsuitability in high-risk domains comes from its propensity to typically confabulate, or plausibly make up, inaccurate outputs. The AP report says, “Researchers aren’t sure why Whisper and comparable instruments hallucinate,” however that is not true. We all know precisely why Transformer-based AI fashions like Whisper behave this fashion.

Whisper relies on expertise that’s designed to foretell the subsequent almost definitely token (chunk of knowledge) that ought to seem after a sequence of tokens offered by a person. Within the case of ChatGPT, the enter tokens come within the type of a textual content immediate. Within the case of Whisper, the enter is tokenized audio knowledge.

Source link

OpenAI’s Transcription Tool Hallucinates. Hospitals Are Using It Anyway

Why Whisper Confabulates

Recent Posts

Tackling local crime key to ‘levelling up’, says think-tank

Redis’ license change and forking are a mess that everybody can feel bad about

Jim Cramer Advises Against Using Binance — Says Crypto Exchange Is ‘Way Too Sketchy’...

Bitcoin Proponents Accuse the New York Times of Publishing One-Sided ‘Hit Piece’ on Bitcoin...

Interest rate rises boost Warren Buffett’s Berkshire Hathaway results

Robert Kiyosaki Expects Bitcoin’s Price to Keep Rising — Says ‘I’m Betting on It’...

Hype grows over “autonomous” AI agents that loop GPT-4 outputs

Chinese Tech Giants Tencent and Bytedance Plan Cuts in Their Metaverse Divisions – Metaverse...

SOL Rebounds, TRX up for Fifth Straight Session – Market Updates Bitcoin News

Biden Vows to Veto House Republicans’ ‘Fair Tax Act’ Proposing Elimination of IRS –...

POPULAR POSTS

29 of the Best SEO Tools for Auditing & Monitoring Your...

The bad news in the good news

ZOGI Token Launches on Cronos, BNB Chain and Ethereum With Revolutionary...

POPULAR CATEGORY