Ex-Google CEO's Startup Unveils 24B-Parameter AI for Chemistry
In a significant advancement for scientific artificial intelligence, FutureHouse—a startup funded by former Google CEO Eric Schmidt—has open-sourced ether0, a specialized 24-billion-parameter model designed for chemical reasoning. The breakthrough system demonstrates how targeted AI can achieve superior performance with dramatically reduced data requirements compared to conventional approaches.

Unlike general-purpose language models that struggle with domain-specific challenges, ether0 exhibits remarkable chemical intuition without extensive pre-training. The FutureHouse team accomplished this through innovative post-training techniques that focus the model's capabilities on scientific reasoning. "We're moving beyond multiple-choice tests to genuine scientific problem-solving," explains the research team behind the project.
The model's development involved mining chemical experiment data from thousands of academic papers, tracking molecular properties like solubility and odor characteristics. Researchers transformed this raw data into verifiable scientific questions that train ether0 to think like a chemist.
Built on the Mistral-Small-24B architecture and refined through reinforcement learning, ether0 processed over 640,000 chemical problems spanning 18 specialized tasks. These include synthetic feasibility assessments, blood-brain barrier permeability predictions, and odor analysis—challenges where traditional models often falter.
Performance benchmarks tell a compelling story. When pitted against both general AI systems (Claude, o1) and specialized chemistry models (ChemDFM, TxGemma), ether0 consistently achieved higher accuracy in open-answer scenarios. In multiple-choice formats, it remained highly competitive—sometimes doubling competitors' accuracy on specific tasks.
The efficiency gains are equally impressive. Traditional non-reasoning models require 50 times more data to approach ether0's reaction prediction accuracy. While independent validation remains ongoing, early tests show the model can effectively reason about molecular structures absent from its training data—a hallmark of true understanding rather than pattern recognition.
Currently in prototype stage, ether0 already demonstrates three transformative capabilities:
- Natural language question comprehension
- Step-by-step chemical reasoning
- Novel molecular structure generation
These strengths position it as particularly valuable for drug-like molecule design, where rapid iteration and accurate prediction are paramount. As research continues, ether0 may establish a new standard for building general scientific reasoning systems across disciplines.
Key Points
- FutureHouse's ether0 sets new benchmarks for chemical AI with 24 billion parameters
- The model outperforms specialized competitors while using far less training data
- Innovative techniques like reasoning behavior distillation boost performance
- Applications range from drug discovery to materials science innovation
- Open-source availability accelerates scientific AI development worldwide


