Skip to main content

AI Trading Showdown: DeepSeek Outperforms Gemini in Market Test

AI Models Face Off in Real-Market Trading Challenge

Financial research lab nof1 has conducted a groundbreaking experiment called Alpha Arena, pitting six major AI models against each other in live trading scenarios on decentralized exchange Hyperliquid. Each model received $10,000 in real funds and operated under identical conditions to test their financial decision-making capabilities.

The Competitors and Results

The participating models included:

  • GPT-5
  • Gemini 2.5 Pro
  • Grok-4
  • Claude Sonet 4.5
  • DeepSeek V3.1
  • Qwen3Max

Image

The results revealed stark differences in performance:

  • DeepSeek V3.1 and Grok-4 tied for top position with returns exceeding 14%
  • Gemini 2.5 Pro suffered catastrophic losses of 42.57%, the worst performance recorded

The other models delivered mixed results, with none matching the top performers' success.

Beyond Simple Competition

The Alpha Arena project aims to evaluate more than just raw profitability. According to nof1 researchers, the primary objectives include:

  1. Assessing strategy stability under market volatility
  2. Testing risk response mechanisms across different model architectures
  3. Establishing benchmarks for AI-driven quantitative trading systems

The experiment demonstrates how large language models are evolving beyond text processing into complex financial applications.

Implications for Financial AI

The successful performance of certain models suggests promising applications for:

  • Automated portfolio management
  • Real-time trading algorithms
  • Risk assessment systems The dramatic failure of Gemini 2.5 Pro also underscores the importance of robust testing before deploying AI systems with real capital.

The financial sector continues to show strong interest in AI solutions that can process market data faster and more comprehensively than human traders.

Key Points:

  • DeepSeek V3.1 and Grok-4 achieved over 14% returns in live trading test
  • Gemini 2.5 Pro lost nearly half its allocated capital
  • Experiment conducted with $10,000 real funds per model on Hyperliquid exchange The study highlights both the potential and risks of AI-driven financial systems

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

OpenAI's New Toolkit Makes AI Assistants Safer for Businesses
News

OpenAI's New Toolkit Makes AI Assistants Safer for Businesses

OpenAI has rolled out significant upgrades to its Agents SDK, giving developers better tools to create secure AI assistants. The standout feature is a sandbox environment that prevents unpredictable AI behavior from causing system-wide issues. Businesses can now test AI agents more safely while leveraging OpenAI's models. The update also introduces an integrated framework for smoother development, with Python support available now and TypeScript coming soon.

April 16, 2026
OpenAIAI DevelopmentEnterprise Technology
News

Xiaohongshu Shakes Up AI World by Open-Sourcing Its Relax Training Engine

In a surprising move, lifestyle platform Xiaohongshu has open-sourced its AI training engine called Relax, designed for multi-modal scenarios. This sophisticated tool handles text, images, audio and video through innovative parallel processing. The unexpected contribution from a non-traditional AI player signals the company's serious ambitions in artificial intelligence development and its desire to build influence in the tech community.

April 15, 2026
AIOpen SourceMachine Learning
HarmonyGNN: A Breakthrough in AI's Understanding of Complex Relationships
News

HarmonyGNN: A Breakthrough in AI's Understanding of Complex Relationships

A new AI training method called HarmonyGNN is revolutionizing how computers understand complex relationships in data. Developed by researchers at North Carolina State University, this technique helps neural networks better distinguish between different types of connections in graph data, achieving accuracy improvements up to 9.6%. The innovation could have significant implications for fields like drug discovery and weather forecasting.

April 14, 2026
Artificial IntelligenceMachine LearningGraph Neural Networks
Xiaomi's AI Model Joins Leading Open-Source Framework with Free Trial
News

Xiaomi's AI Model Joins Leading Open-Source Framework with Free Trial

Xiaomi has integrated its MiMo-V2 AI model series into the Hermes Agent framework, a major player in open-source AI development. Developers can now access Xiaomi's Pro, Omni, and Flash models for free for two weeks. This partnership combines Xiaomi's hardware expertise with Hermes' self-evolving capabilities, offering new possibilities for AI assistants. The move signals a shift in AI competition from conversational quality to execution efficiency.

April 10, 2026
XiaomiAI DevelopmentOpen Source
DeepSeek V4 Arrives Next Month: A Trillion-Parameter Powerhouse Built for China's AI Future
News

DeepSeek V4 Arrives Next Month: A Trillion-Parameter Powerhouse Built for China's AI Future

China's AI landscape is about to get a major upgrade. DeepSeek founder Liang Wenfeng has confirmed their next-generation V4 model will launch in late April 2026, packing trillion-parameter scale and breakthrough compatibility with domestic chips like Huawei's Ascend. This isn't just another model release - it's a strategic move that's already shaking up China's computing market, with tech giants stockpiling AI chips in anticipation. The model's 'Fast' and 'Expert' modes currently in testing hint at its versatile capabilities, from quick searches to complex problem-solving.

April 10, 2026
AI InnovationChina TechDeepSeek
News

DeepSeek V4 Emerges: A Glimpse Into China's Next-Gen AI Powerhouse

The tech world is abuzz as DeepSeek V4 enters intensive testing, revealing three distinct versions tailored for different needs. From lightning-fast responses to advanced visual analysis, this homegrown AI showcases China's push for technological independence. What makes this release particularly exciting is its deep integration with domestic chips, signaling a strategic move away from foreign dependencies. As the AI arms race heats up, could this be the model that redefines what Chinese-developed artificial intelligence can achieve?

April 8, 2026
AI DevelopmentChinese TechMachine Learning