Asking any of the popular chatbots to be Sorority (2025)more concise "dramatically impact[s] hallucination rates," according to a recent study.
French AI testing platform Giskard published a study analyzing chatbots, including ChatGPT, Claude, Gemini, Llama, Grok, and DeepSeek, for hallucination-related issues. In its findings, the researchers discovered that asking the models to be brief in their responses "specifically degraded factual reliability across most models tested," according to the accompanying blog post via TechCrunch.
SEE ALSO: Can ChatGPT pass the Turing Test yet?When users instruct the model to be concise in its explanation, it ends up "prioritiz[ing] brevity over accuracy when given these constraints." The study found that including these instructions decreased hallucination resistance by up to 20 percent. Gemini 1.5 Pro dropped from 84 to 64 percent in hallucination resistance with short answer instructions and GPT-4o, from 74 to 63 percent in the analysis, which studied sensitivity to system instructions.
View on Threads
Giskard attributed this effect to more accurate responses often requiring longer explanations. "When forced to be concise, models face an impossible choice between fabricating short but inaccurate answers or appearing unhelpful by rejecting the question entirely," said the post.
Models are tuned to help users, but balancing perceived helpfulness and accuracy can be tricky. Recently, OpenAI had to roll back its GPT-4o update for being "too sycophant-y," leading to disturbing instances of supporting a user saying they're going off their meds and encouraging a user who said they feel like a prophet.
As the researchers explained, models often prioritize more concise responses to "reduce token usage, improve latency, and minimize costs." Users might also specifically instruct the model to be brief for their own cost-saving incentives, which could lead to outputs with more inaccuracies.
The study also found that prompting models with confidence involving controversial claims, such as "'I’m 100% sure that …' or 'My teacher told me that …'" leads to chatbots agreeing with the users more instead of debunking falsehoods.
The research shows that seemingly minor tweaks can result in vastly different behavior that could have big implications for the spread of misinformation and inaccuracies, all in the service of trying to satisfy the user. As the researchers put it, "your favorite model might be great at giving you answers you like — but that doesn't mean those answers are true."
Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis' copyrights in training and operating its AI systems.
Topics Artificial Intelligence ChatGPT
Previous:Ode to the Liberal Muslim
Next:As Stalin Lay Dying
Wordle today: The answer and hints for January 16, 2025Best soundbar deal: Save $100 on a Sony soundbar at Best BuyOsaka vs. Bencic 2025 livestream: Watch Australian Open for freeMac Mini i7 32GB RAM 128GB PCIe SSD deal: $329.99 at Woot!Tesla is already building the new Model Y in Berlin, reports sayBest Amazon Echo deal: Save $35 at AmazonWordle today: The answer and hints for January 15, 2025OpenAI adds agentic AI tasks to ChatGPT. Here's what it can do for youWordle today: The answer and hints for January 15, 2025NYT Strands hints, answers for January 15Draper vs. Kokkinakis 2025 livestream: Watch Australian Open for freeHinge launches AIElon Musk, Jeff Bezos, and Mark Zuckerberg will all be at Trump's inaugurationMac Mini i7 32GB RAM 128GB PCIe SSD deal: $329.99 at Woot!Get Echo Buds for $50 and get 6 months of Amazon Music UnlimitedMiami Heat vs. Los Angeles Lakers 2025 livestream: Watch NBA onlineYouTuber GamersNexus sues Honey over alleged scamSabalenka vs. Bouzas 2025 livestream: Watch Australian Open for freeB&H Mega Deal Zone: Hundreds of deals too good to missBest Jabra deal: Save $60 on Elite 8 Active Gen 2 earbuds at Best Buy Happy Birthday, Robert Frost by Sadie Stein Maps by Ben Lytal Gossip Archaeology with Edmund White by Stephanie LaCava Papal Abdication: A Potpourri of Popery by Mike Duncan and Jason Novak A Week in Culture: Happy Menocal, Artist by Happy Menocal Built of Books, and Other News by Sadie Stein The Joy of Books by Sadie Stein Kafka, Literally by Spencer Woodman Happy Birthday, Flannery O'Connor by Sadie Stein Essex Girl by Zakia Uddin Introducing Our Sixtieth A Week in Culture: John Swansburg, Editor by John Swansburg Chicken Poetry, and Other News by Sadie Stein Teen Writers, and Other News by Sadie Stein Meet Your Literary Hero, and Other News by Sadie Stein Happy Birthday, Victor Hugo by Sadie Stein Here We Are: On the Occasion of Philip Roth’s Eightieth Birthday by Je Banach Chinua Achebe, 1930–2013 by Sadie Stein Many Happy Returns, John Steinbeck by Sadie Stein There and Back Again by Sadie Stein
1.7471s , 10130.6328125 kb
Copyright © 2025 Powered by 【Sorority (2025)】,Pursuit Information Network