BAAI Unveils See3D: A Breakthrough in 3D Video Learning

The Beijing Academy of Artificial Intelligence (BAAI) has announced See3D, a 3D generation model designed to learn from large-scale unlabeled internet videos. Built around the idea of "See Video, Get 3D," the model represents a significant step forward in 3D learning and generation.

Technical Innovations of See3D

See3D distinguishes itself by not relying on traditional camera parameters. Instead, it uses visual conditioning to generate camera-controllable, geometrically consistent multi-view images based solely on visual cues obtained from videos. This approach removes the need for costly 3D or camera annotations and makes it practical to learn 3D priors from abundant internet video data.

The model supports various forms of generation including:

  • Text-to-3D generation
  • Single view to 3D
  • Sparse views to 3D

Additionally, it is capable of performing 3D editing and Gaussian rendering. BAAI has made the model, code, and a demo available as open-source resources, facilitating broader technical reference and experimentation.

Demonstrations of See3D's capabilities include:

  • Unlocking 3D interactive worlds
  • 3D reconstruction based on sparse images
  • Open-world 3D generation
  • 3D generation from single views

These features highlight the extensive applicability of See3D in various creative 3D applications, enabling users to engage with 3D environments more dynamically.

Motivation Behind the Development

The impetus for developing See3D arises from the challenges associated with traditional 3D data collection methods, which are often time-consuming and expensive. In contrast, videos provide a wealth of multi-view correlations and camera motion information, making them valuable for revealing intricate 3D structures.

The See3D team has constructed a large dataset for this purpose, named WebVi3D, comprising 16 million video clips and 320 million frames. To train on it without camera annotations, the model is conditioned on a purely 2D visual signal, produced by adding time-dependent noise to masked video data. This method supports scalable training of multi-view diffusion models and achieves 3D generation without relying on camera conditions.
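The noising step described above can be illustrated with a small sketch. It is not BAAI's implementation: the function names, the linear noise schedule, and the toy frame format (each frame a flat list of pixel values) are all assumptions made for illustration. It shows only the general idea that masked frames are corrupted with Gaussian noise whose strength depends on the diffusion timestep, so the conditioning signal carries 2D visual information and no camera parameters.

```python
import random


def noise_level(t, num_steps):
    """Hypothetical schedule: noise std grows linearly from 0 (t=0) to 1 (t=num_steps)."""
    return t / num_steps


def make_visual_condition(frames, masks, t, num_steps, rng):
    """Build a toy 2D conditioning signal from masked, noised frames.

    frames: list of frames, each a flat list of pixel values in [0, 1]
    masks:  same shape as frames; 1 keeps a pixel, 0 masks it out
    t:      current diffusion timestep, 0 <= t <= num_steps
    rng:    random.Random instance used to draw the Gaussian noise
    """
    sigma = noise_level(t, num_steps)
    condition = []
    for frame, mask in zip(frames, masks):
        # Zero out masked pixels, then add noise scaled by the timestep.
        noisy = [m * p + sigma * rng.gauss(0.0, 1.0) for p, m in zip(frame, mask)]
        condition.append(noisy)
    return condition


if __name__ == "__main__":
    rng = random.Random(0)
    frames = [[0.2, 0.8, 0.5, 1.0], [0.1, 0.9, 0.4, 0.6]]  # two tiny 4-pixel "frames"
    masks = [[1, 0, 1, 0], [0, 1, 1, 0]]
    for t in (0, 5, 10):
        print(t, make_visual_condition(frames, masks, t, num_steps=10, rng=rng))
```

At t = 0 the sketch returns the masked frames unchanged, and the added noise grows as t increases. A real system would operate on image tensors or latents and use its own noise schedule.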

Key Advantages of See3D

See3D offers several key advantages:

  • Data Scalability: The training data is sourced from a vast pool of internet videos, which substantially increases the scale of the multi-view dataset that can be constructed.
  • Camera Controllability: The model supports scene generation under complex camera trajectories, ensuring geometric consistency across frames.
  • Geometric Consistency: The model maintains geometric integrity when generating multi-view images, which is crucial for realistic 3D representations.

By expanding the scale of available datasets, See3D aims to provide new insights and methodologies for advancing 3D generation technology. The research team hopes this initiative will encourage the 3D research community to pay attention to large-scale data without camera annotations, lowering the cost of 3D data collection and narrowing the gap with existing closed-source 3D solutions.

Project Address: See3D Project

Key Points

  1. See3D learns to generate 3D content from unlabeled video data.
  2. The model eliminates the need for traditional camera parameters.
  3. It supports multiple forms of 3D generation and editing.
  4. The initiative aims to reduce costs in 3D data collection and promote research in the field.
