Asking any of the popular chatbots to be The Exotic Time Machinemore concise "dramatically impact[s] hallucination rates," according to a recent study.
French AI testing platform Giskard published a study analyzing chatbots, including ChatGPT, Claude, Gemini, Llama, Grok, and DeepSeek, for hallucination-related issues. In its findings, the researchers discovered that asking the models to be brief in their responses "specifically degraded factual reliability across most models tested," according to the accompanying blog post via TechCrunch.
SEE ALSO: Can ChatGPT pass the Turing Test yet?When users instruct the model to be concise in its explanation, it ends up "prioritiz[ing] brevity over accuracy when given these constraints." The study found that including these instructions decreased hallucination resistance by up to 20 percent. Gemini 1.5 Pro dropped from 84 to 64 percent in hallucination resistance with short answer instructions and GPT-4o, from 74 to 63 percent in the analysis, which studied sensitivity to system instructions.
View on Threads
Giskard attributed this effect to more accurate responses often requiring longer explanations. "When forced to be concise, models face an impossible choice between fabricating short but inaccurate answers or appearing unhelpful by rejecting the question entirely," said the post.
Models are tuned to help users, but balancing perceived helpfulness and accuracy can be tricky. Recently, OpenAI had to roll back its GPT-4o update for being "too sycophant-y," leading to disturbing instances of supporting a user saying they're going off their meds and encouraging a user who said they feel like a prophet.
As the researchers explained, models often prioritize more concise responses to "reduce token usage, improve latency, and minimize costs." Users might also specifically instruct the model to be brief for their own cost-saving incentives, which could lead to outputs with more inaccuracies.
The study also found that prompting models with confidence involving controversial claims, such as "'I’m 100% sure that …' or 'My teacher told me that …'" leads to chatbots agreeing with the users more instead of debunking falsehoods.
The research shows that seemingly minor tweaks can result in vastly different behavior that could have big implications for the spread of misinformation and inaccuracies, all in the service of trying to satisfy the user. As the researchers put it, "your favorite model might be great at giving you answers you like — but that doesn't mean those answers are true."
Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis' copyrights in training and operating its AI systems.
Topics Artificial Intelligence ChatGPT
‘And Just Like That’ Season 2 really needed Carrie Bradshaw to narrate more“Every Adoption is a Ghost Story”: An Interview with Jennifer Gilmore by Amy BenferReading in the Bath: Solved by Sadie Stein'Deadloch' review: Feminist Australian buddyWild and Crazy Libraries, and Other News by Sadie SteinClose Reading, and Other News by Sadie SteinNotes from a Bookshop: April, or Spring Fever by Kelly McMasters75 years of the Windrush generation: These 2 new podcasts are a mustRed and Blue by Anna WienerSalinger Letters, and Other News by Sadie SteinOutside the Paris Pavilion by Sadie SteinWild and Crazy Libraries, and Other News by Sadie SteinIn the Beginning by Sadie SteinUnlikely Aphrodisiacs, and Other News by Sadie Stein'Black Mirror' episode 'The National Anthem' is still essentialWordle today: Here's the answer and hints for June 23How young Montana residents made a case for climate action in courtSex workers fear targeting under Instagram's terms of serviceThe 6 most StreamberryCounter Culture by Amie Barrodale and Clancy Martin The Vale of Soul The Comic Voice: An Interview with Christina Nichol What Words Do You Commonly Misspell? “‘Betty’ Bacall Was the Perfect Mate for Bogey” Charmed Objects The Morning News Roundup for July 30, 2014 Genius of Love The Morning News Roundup for August 21, 2014 Les Combats Modernes Happy Birthday, Anne Fadiman! The Morning News Roundup for August 14, 2014 Still Slacking After All These Years She Jazzes That Dazzling Verse Incident / Resurrection Reader’s Guilt; Toadstools by Lorin Stein How Does Weird Al Write His Songs? Robert Stone, Tabloid Writer The Morning Roundup for August 12, 2014 Announcing Our #ReadEverywhere Contest What We See When We Read
2.3623s , 8223.921875 kb
Copyright © 2025 Powered by 【The Exotic Time Machine】,Unobstructed Information Network