AI Struggles with PhD-Level Physics Tests
News · November 24, 2025
Cutting-edge AI models such as Gemini 3 Pro and GPT-5 scored below 10% accuracy on CritPt, a new benchmark that tests doctoral-level physics skills. The benchmark consists of unpublished research challenges developed by more than 50 physicists worldwide, and the results reveal AI's limitations in scientific creativity and precision. Although the models show promise as research assistants, they still make subtle errors that could mislead human scientists.
#AI Research #Physics Benchmark #Machine Learning