How intelligent is a model that memorizes the answers before an exam? That's the question facing OpenAI after it unveiled o3 in December and touted the model's impressive benchmark results. At the time, some pundits hailed it as nearly AGI, the level at which artificial intelligence can match human performance on any task a user demands.
But money changes everything—even math tests, apparently.
OpenAI's victory lap over its o3 model's stunning 25.2% score on FrontierMath, a challenging mathematical benchmark developed by Epoch AI, hit a snag when it turned out the company wasn't just acing the test—OpenAI helped write it, too.
“We gratefully acknowledge OpenAI for their support in creating the benchmark,” Epoch AI wrote in an updated footnote on the FrontierMath whitepaper—and this was enough to raise some red flags among enthusiasts.