During its big GPT-5 livestream on Thursday, OpenAI showed off a few charts that made the model seem quite impressive — but if you look closely, some graphs were a little bit off.
In one chart, ironically showing how well GPT-5 does in “deception evals across models,” the scale is all over the place. For “coding deception,” for example, the chart shown onstage gives GPT-5 with thinking a 50.0 percent deception rate, yet OpenAI’s o3, with a lower score of 47.4 percent, somehow has a larger bar. OpenAI appears to have accurate numbers for this chart in its GPT-5 blog post, however, where GPT-5’s deception rate is labeled as 16.5 percent.
In this chart, as shown onstage, one of GPT-5’s scores is lower than o3’s but is drawn with a bigger bar. In the same chart, o3’s and GPT-4o’s scores are different but are shown with equally sized bars. It was bad enough that CEO Sam Altman commented on it, calling it a “mega chart screwup,” though he noted that a correct version is in OpenAI’s blog post.
An OpenAI marketing staffer also apologized, saying, “We fixed the chart in the blog guys, apologies for the unintentional chart crime.”
this screenshot from GPT-5 livestream has to be among the worst chart crimes of the century pic.twitter.com/HXsK2CWCon
OpenAI didn’t immediately respond to a request for comment. And while it’s unclear if OpenAI used GPT-5 to actually make the charts, it’s still not a great look for the company on its big launch day — especially when it is touting the “significant advances in reducing hallucinations” with its new model.