Google's Gemini 3 Deep Think Outsmarts All But Seven Humans
Google's New AI Model Approaches Human-Level Reasoning

The artificial intelligence landscape shifted dramatically today as Google unveiled significant upgrades to its Gemini 3 Deep Think model. This specialized system focuses on complex problem-solving across multiple domains, demonstrating capabilities that rival - and sometimes surpass - human experts.
Programming Prowess That Turns Heads
On Codeforces, a competitive programming platform where coders battle algorithmic challenges, Gemini achieved an Elo rating of 3455. To put this in perspective, only seven humans worldwide currently maintain higher scores. Just twelve months ago, the strongest competing AI model scored nearly 700 points lower at 2727.
"What we're seeing here isn't just incremental improvement," explains Dr. Elena Vasquez, a computer science professor at MIT who reviewed the results. "This represents qualitative advancement in how AI systems approach complex problem decomposition."
Scientific Breakthroughs Beyond Expectations
The model's analytical abilities extend far beyond coding competitions:
- Peer review superpower: It identified subtle logical flaws in a high-level physics paper that had already passed human peer review
- Mathematical mastery: Successfully proved several challenging problems related to the famous Erdős conjecture
- Engineering intuition: Can convert hand-drawn sketches into production-ready 3D model files (like notebook stands) with tenfold efficiency gains
Benchmark Dominance Across Disciplines
The numbers speak volumes about Gemini's broad capabilities:
- Scored 48.4% on the rigorous "Last Human Exam" (HLE)
- Achieved 84.6% accuracy on ARC-AGI-2 benchmark tests
- Maintains strong performance across STEM fields while showing improved creative reasoning
Currently available exclusively to AI Ultra subscribers and select researchers via API access, this upgrade positions Google strongly against competitors' reasoning models.
Key Points:
- Programming: Now competes with top 0.001% of human coders worldwide
- Scientific analysis: Detects errors even expert reviewers miss
- Engineering applications: Revolutionizes prototyping speed
- Availability: Currently limited to premium subscribers and research partners


