Discover the latest AI news, AI products, and AI projects platform

Daily discover the most amazing AI world - from breakthrough news to innovative products, from cutting-edge projects to tech trends

Categories

Tags

2025

August 15

AI Models Fail New Benchmark: GPT-5 Scores Zero in Doctoral-Level Test

A new AI evaluation benchmark, FormulaOne, has revealed surprising limitations in top models like GPT-5 and Grok4. Developed by research institution AAI, the test features 220 complex dynamic programming problems requiring doctoral-level reasoning. While models performed moderately on simpler questions, all failed completely on the most challenging problems, scoring zero points. The results raise important questions about AI's true reasoning capabilities.

AI Models Fail New Benchmark: GPT-5 Scores Zero in Doctoral-Level Test
DAMN
0