AI fashions from OpenAI and Google DeepMind completed gold medal ratings within the 2025 World Math Olympiad (IMO), one of the vital international’s oldest and maximum difficult highschool stage math competitions, the corporations independently introduced in contemporary days.
The end result underscores simply how briskly AI methods are advancing, and but, how calmly matched Google and OpenAI appear to be within the AI race. AI firms are competing fiercely for the general public belief of being forward within the AI race: an intangible combat of “vibes” that may have giant implications for securing best AI ability. A large number of AI researchers come from backgrounds in aggressive math, so benchmarks like IMO imply greater than others.
Final 12 months, Google scored a silver medal at IMO the use of a “formal” machine, that means it required people to translate issues right into a mechanical device‑readable structure. This 12 months, each OpenAI and Google entered “casual” methods into the contest, which have been in a position to ingest questions and generate evidence‑based totally solutions in herbal language. Each firms declare their AI fashions scored upper than maximum highschool scholars and Google’s AI fashion from remaining 12 months, with out requiring any human-machine translation.
In interviews with TechCrunch, researchers in the back of OpenAI and Google’s IMO efforts claimed that those gold medal performances constitute breakthroughs round AI reasoning fashions in non-verifiable domain names. Whilst AI reasoning fashions have a tendency to do neatly on questions with simple solutions, similar to simple arithmetic or coding duties, those methods fight on duties with extra ambiguous answers, similar to purchasing an ideal chair or serving to with advanced analysis.
On the other hand, Google is elevating questions round how OpenAI performed and introduced its gold medal IMO efficiency. Finally, in case you’re going to go into AI fashions right into a math contest for prime schoolers, you may as neatly argue like youngsters.
In a while after OpenAI introduced its feat on Saturday morning, Google DeepMind’s CEO and researchers took to social media to slam OpenAI for pronouncing its gold‑medal in advance — in a while after IMO introduced which top schoolers had gained the contest on Friday evening — and for now not having their fashion’s take a look at formally evaluated by way of IMO.
Thang Luong, a Google DeepMind senior researcher and lead for the IMO mission, advised TechCrunch that Google waited to announce its IMO effects to admire the scholars collaborating within the pageant.
Techcrunch match
San Francisco
|
October 27-29, 2025
Luong stated that Google has been operating with IMO’s organizers since remaining 12 months in preparation for the take a look at and sought after to have the IMO president’s blessing and legit grading sooner than pronouncing its legit effects, which it did on Monday morning.
“The IMO organizers have their grading guiding principle,” Luong stated. “So any analysis that’s now not according to that guiding principle may just now not make any declare about gold-medal stage [performance].”
Noam Brown, a senior OpenAI researcher who labored at the IMO fashion, advised TechCrunch that IMO reached out to OpenAI a couple of months in the past about collaborating in a proper math pageant, however the ChatGPT-maker declined as it used to be operating on herbal language methods that it concept have been extra price pursuing. Brown says OpenAI didn’t know IMO used to be undertaking an off-the-cuff take a look at with Google.
OpenAI says it employed third-party evaluators — 3 former IMO medalists who understood the grading machine — to grade its AI fashion’s efficiency. After OpenAI discovered of its gold medal rating, Brown stated the corporate reached out to IMO, which then advised the corporate to attend to announce till after IMO’s Friday evening award rite.
IMO didn’t reply to TechCrunch’s request for remark.
Google isn’t essentially flawed right here — it did undergo a extra legit, rigorous procedure to succeed in its gold medal rating — however the debate might omit the larger image: AI fashions from a number of main AI labs are making improvements to temporarily. Nations from all over the world despatched their brightest scholars to compete at IMO this 12 months, and only a few p.c of them scored in addition to OpenAI and Google’s AI fashions did.
Whilst OpenAI used to have an important lead over the trade, it definitely feels as despite the fact that the race is extra carefully matched than any corporate want to admit. OpenAI is predicted to free up GPT-5 within the coming months, and the corporate definitely hopes to present off the influence that it nonetheless leads the AI trade.