Skip to main content

AI Trading Showdown: DeepSeek Outperforms Gemini in Market Test

AI Models Face Off in Real-Market Trading Challenge

Financial research lab nof1 has conducted a groundbreaking experiment called Alpha Arena, pitting six major AI models against each other in live trading scenarios on decentralized exchange Hyperliquid. Each model received $10,000 in real funds and operated under identical conditions to test their financial decision-making capabilities.

The Competitors and Results

The participating models included:

  • GPT-5
  • Gemini 2.5 Pro
  • Grok-4
  • Claude Sonet 4.5
  • DeepSeek V3.1
  • Qwen3Max

Image

The results revealed stark differences in performance:

  • DeepSeek V3.1 and Grok-4 tied for top position with returns exceeding 14%
  • Gemini 2.5 Pro suffered catastrophic losses of 42.57%, the worst performance recorded

The other models delivered mixed results, with none matching the top performers' success.

Beyond Simple Competition

The Alpha Arena project aims to evaluate more than just raw profitability. According to nof1 researchers, the primary objectives include:

  1. Assessing strategy stability under market volatility
  2. Testing risk response mechanisms across different model architectures
  3. Establishing benchmarks for AI-driven quantitative trading systems

The experiment demonstrates how large language models are evolving beyond text processing into complex financial applications.

Implications for Financial AI

The successful performance of certain models suggests promising applications for:

  • Automated portfolio management
  • Real-time trading algorithms
  • Risk assessment systems The dramatic failure of Gemini 2.5 Pro also underscores the importance of robust testing before deploying AI systems with real capital.

The financial sector continues to show strong interest in AI solutions that can process market data faster and more comprehensively than human traders.

Key Points:

  • DeepSeek V3.1 and Grok-4 achieved over 14% returns in live trading test
  • Gemini 2.5 Pro lost nearly half its allocated capital
  • Experiment conducted with $10,000 real funds per model on Hyperliquid exchange The study highlights both the potential and risks of AI-driven financial systems

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

Sakana AI's Tiny Plugin Could Revolutionize How AI Handles Massive Documents
News

Sakana AI's Tiny Plugin Could Revolutionize How AI Handles Massive Documents

Tokyo-based Sakana AI has unveiled groundbreaking technologies that could solve large language models' notorious 'memory anxiety.' Their Text-to-LoRA and Doc-to-LoRA systems enable AI to digest lengthy documents in under a second, shrinking memory requirements from gigabytes to mere megabytes. This breakthrough promises to make customizing AI models dramatically cheaper and more accessible.

February 28, 2026
AI InnovationMachine LearningNatural Language Processing
Chinese AI Models Outpace US Competitors in Global Adoption
News

Chinese AI Models Outpace US Competitors in Global Adoption

In a surprising shift, Chinese AI models have overtaken their US counterparts in global usage for the first time. Platforms like MiniMax and Moonshot AI are leading the charge, with Chinese models accounting for over 5 trillion weekly tokens - nearly double American offerings. This milestone reflects China's growing influence in artificial intelligence development.

February 27, 2026
AI CompetitionChinese TechMachine Learning
Moonshot AI's Kimi K2.5 Achieves Remarkable Profitability Milestone
News

Moonshot AI's Kimi K2.5 Achieves Remarkable Profitability Milestone

Moonshot AI's latest model, Kimi K2.5, has stunned the tech world by generating more revenue in its first 20 days than all of 2025 combined. The breakthrough comes primarily from overseas users and developers embracing its API services, propelling the company's valuation past $10 billion. Founder Yang Zhilin confirms the company is well-funded with no immediate IPO plans.

February 24, 2026
Artificial IntelligenceTech StartupsMachine Learning
News

Chinese AI Models Capture Global Spotlight During Lunar New Year

Chinese artificial intelligence models made waves internationally during the 2026 Spring Festival, capturing over 60% market share on OpenRouter's developer platform. Three domestic models - MiniMax M2.5, Kimi K2.5, and Zhipu GLM-5 - dominated the rankings by offering superior coding and automation capabilities at remarkably low costs. Their success highlights China's growing influence in AI productivity tools.

February 24, 2026
Artificial IntelligenceChinese TechDeveloper Tools
Google's Gemini 3.1 Pro Outshines Competitors With Breakthrough Reasoning Skills
News

Google's Gemini 3.1 Pro Outshines Competitors With Breakthrough Reasoning Skills

Google has unveiled Gemini 3.1 Pro, its most advanced AI model yet, showcasing remarkable improvements in logical reasoning and problem-solving. The new architecture delivers more than double the performance of its predecessor in critical tests, even surpassing GPT-5.2 in some benchmarks. Beyond raw power, Gemini 3.1 Pro introduces innovative multimodal capabilities, handling ultra-long contexts and generating visual representations of complex concepts.

February 24, 2026
AI InnovationGoogle TechMachine Learning
Google's Gemini 3.1 Pro Doubles Down on AI Reasoning Power
News

Google's Gemini 3.1 Pro Doubles Down on AI Reasoning Power

Google has unveiled Gemini 3.1 Pro, its latest AI model that dramatically improves reasoning capabilities. Benchmarks show it outperforms its predecessor by more than double in logical processing tests. The tech giant is making the model widely available through multiple platforms, offering enhanced features for premium subscribers.

February 20, 2026
AI InnovationGoogle TechMachine Learning