Pivot
  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek
No Result
View All Result
  • Login
  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups
Pivot
  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek
No Result
View All Result
Pivot

OpenAI’s new “reasoning” models make more mistakes

by Gulnoza Sobirova
April 19, 2025
in News
Reading Time: 2 mins read
A A
OpenAI’s new “reasoning” models make more mistakes
Share on FacebookShare on TwitterShare on Telegram

The best, but the most confusing?

OpenAI’s recently released o3 and o4-mini artificial intelligence models have achieved amazing results in calculations and coding. However, there is one big problem — they “hallucinate” more than their predecessors, that is, fabricate data.

Has the previous progress stopped?

Usually, each new model is more accurate than the one before it. But according to OpenAI’s internal tests, the o3 and o4-mini models performed worse than previous reasoning models — o1, o3-mini, and even “traditional” models like GPT-4o.

How? Why?

For example, in OpenAI’s PersonQA test:

  • o1 – 16% hallucinated
  • o3-mini – 14.8%
  • o3 – 33%
  • o4-mini fabricated 48% of the time!

This is because these new models are trying to reason more. More reasoning means more correct thoughts, but also more incorrect conclusions.

Independent research confirms this

A non-governmental laboratory called Transluce found that the o3 model even invented completely false processes, such as “I ran the code on a MacBook Pro.” In fact, the model does not have this ability.

What can be done?

There are some solutions:

  • Models like GPT-4o give much more accurate results when working with web searches — 90% accuracy in the SimpleQA test!
  • But this approach does not always work, especially in areas where privacy is important.
  • Intelligence has increased, accuracy has decreased

OpenAI has acknowledged this problem and said that “more research is needed.” But this problem threatens the use of artificial intelligence, especially in law, medicine, or other fields that require high accuracy.

So as AI’s capabilities expand, the question of its reliability has become more pressing.

Previous Post

Burger King enters the Uzbek market

Next Post

Meta’s Secret AI Experiments

Gulnoza Sobirova

Related Posts

The President of Kyrgyzstan has granted 5-year tax exemptions to IT companies and startups

The President of Kyrgyzstan has granted 5-year tax exemptions to IT companies and startups

June 15, 2026
Elon Musk dropped from Forbes Billionaires list after becoming world’s first trillionaire

Elon Musk dropped from Forbes Billionaires list after becoming world’s first trillionaire

June 13, 2026
Apple Re-enters the AI Race: Next-Gen Siri Unveiled at WWDC 2026

Apple Re-enters the AI Race: Next-Gen Siri Unveiled at WWDC 2026

June 12, 2026
Tajikistan breaks ground on IT Hub Dushanbe – tech complex with AI at its core

Tajikistan breaks ground on IT Hub Dushanbe – tech complex with AI at its core

June 10, 2026
Next Post
Meta’s Secret AI Experiments

Meta's Secret AI Experiments

Salt-powered refrigerator: Life-saving invention of Indian teenagers

Salt-powered refrigerator: Life-saving invention of Indian teenagers

Please login to join discussion

false

  • Latest
  • Trending
  • Comments
The President of Kyrgyzstan has granted 5-year tax exemptions to IT companies and startups

The President of Kyrgyzstan has granted 5-year tax exemptions to IT companies and startups

June 15, 2026
From brokerage pain to AI startup: the story of TheCarGo

From brokerage pain to AI startup: the story of TheCarGo

June 13, 2026
Elon Musk dropped from Forbes Billionaires list after becoming world’s first trillionaire

Elon Musk dropped from Forbes Billionaires list after becoming world’s first trillionaire

June 13, 2026
The real capital in the AI age is not technology, but the Founder

The real capital in the AI age is not technology, but the Founder

June 12, 2026

18-year-old high school dropout raises $6.2M from Y Combinator

October 2, 2025
Junior crisis: are IT training centers creating an army of the unemployed?

Junior crisis: are IT training centers creating an army of the unemployed?

January 6, 2026
Airbnb: The $100 Billion Success Story – Its Origins and Transformative Impact on Hospitality!

Airbnb: The $100 Billion Success Story – Its Origins and Transformative Impact on Hospitality!

January 4, 2025
Alipos startup received a $200,000 investment offer on the “Taqdimot” TV show

Alipos startup received a $200,000 investment offer on the “Taqdimot” TV show

November 25, 2025
$1 billion allocated to the “Mahalla Project” program

$1 billion allocated to the “Mahalla Project” program

AloqaVentures: Fueling Innovation in Uzbekistan’s Startup Ecosystem

AloqaVentures: Fueling Innovation in Uzbekistan’s Startup Ecosystem

Musk’s xAI Valuation Surpasses $40 Billion After Funding Round

What changes does Elon Musk want to make with a $6 billion investment?

What changes does Elon Musk want to make with a $6 billion investment?

Pivot

We are the Intelligence Platform for Founders & Investors in Emerging Markets — combining news, data, and community to unlock opportunities across GCC, Central Asia, and frontier ecosystems.

Follow us

Categories

  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups

Pages

  • Market Data & Reports
  • Podcasts
  • Events
  • Premium
  • English
    • Uzbek

Recent Post

  • The President of Kyrgyzstan has granted 5-year tax exemptions to IT companies and startups
  • From brokerage pain to AI startup: the story of TheCarGo
  • Elon Musk dropped from Forbes Billionaires list after becoming world’s first trillionaire
  • Privacy policy

© 2025 Pivot

Welcome Back!

Sign In with Google
Sign In with Linked In
OR

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • News
  • Funding & Deals
  • Startups
  • Venture Capital
  • SaaS & AI
  • Founder Stories
  • Uzbek Startups
  • Login
  • Cart
  • uz Uzbek
  • en English

© 2025 Pivot

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?