Skip to main content

ElevenLabs CEO: AI Voice Models to Commoditize Soon

ElevenLabs CEO Predicts Commoditization of AI Voice Models

At TechCrunch Disrupt 2025, Mati Staniszewski, co-founder and CEO of ElevenLabs, made a bold prediction: AI voice models will become commoditized within the next two to three years. While currently a competitive differentiator, Staniszewski believes performance gaps between models will narrow significantly for mainstream languages and general voice styles.

Image

Image source note: The image is AI-generated, and the licensing service provider is Midjourney

Short-Term Focus on Models, Long-Term on Products

When questioned about investing heavily in R&D for potentially homogeneous future models, Staniszewski explained: "Today, models remain the biggest technical barrier. If AI voice sounds unnatural or unsmooth, user experience suffers." He highlighted ElevenLabs' breakthroughs in model architecture, particularly in emotional expression and multilingual prosody modeling, as key differentiators.

The company is already preparing for the post-model era. "Our long-term strategy isn't just being a model supplier," Staniszewski emphasized. "We're building complete 'AI + product' experiences." Drawing parallels to Apple's hardware-software integration approach with smartphones, ElevenLabs aims to use its proprietary models as engines powering high-value applications.

Multi-Modal Integration Emerges as Next Frontier

Looking ahead 1-2 years, Staniszewski anticipates rapid convergence of single-modal voice systems into multi-modal platforms. "You'll generate audio and video simultaneously," he predicted, "or dynamically link large language models with voice engines during conversations." He cited Google's Veo3 video generation model as evidence that cross-modal collaboration represents the next technological frontier.

To position itself competitively, ElevenLabs is actively pursuing partnerships with third-party models and open-source communities. These collaborations explore embedding ElevenLabs' audio capabilities into broader AI ecosystems—potentially enabling immersive virtual humans, advanced smart customer service systems, or innovative interactive entertainment experiences.

Commoditization Signals Value Shift, Not Decline

Staniszewski rejects notions that model commoditization spells industry decline. Instead, he sees it representing a shift in value creation from underlying technology to application innovation. "Future companies will select models based on specific scenarios," he explained. "Different solutions for customer service versus game voice acting versus educational explanations."

The CEO emphasized that reliability, scalability, and scenario adaptability will surpass raw sound quality as primary decision factors. Accordingly, ElevenLabs is strengthening its API platform, developer toolchain, and industry-specific solutions—ensuring customers can integrate high-quality voices seamlessly into business workflows.

Key Points:

  • Commoditization timeline: AI voice models expected to become standardized commodities within 2-3 years
  • Strategic pivot: ElevenLabs transitioning from pure model development to integrated product solutions
  • Multi-modal future: Convergence of audio with video generation and LLMs emerging as next competitive battleground
  • Value migration: Industry focus shifting from technical superiority to application-specific implementations

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

News

ElevenLabs Hits $11 Billion Valuation in AI Voice Boom

ElevenLabs, the AI voice technology leader, has secured $500 million in Series D funding, catapulting its valuation to $11 billion. The company's enterprise-focused voice solutions are driving impressive growth, with annual recurring revenue surpassing $330 million. Major investors like Sequoia Capital and Andreessen Horowitz are betting big on voice AI's potential to transform customer interactions.

February 5, 2026
AI VoiceStartup FundingEnterprise Tech
Music Legends Team Up With AI for Groundbreaking Album
News

Music Legends Team Up With AI for Groundbreaking Album

Legendary artists like Liza Minnelli and Art Garfunkel are collaborating with ElevenLabs on 'The Eleven Album,' blending human artistry with AI innovation. The project promises full creative control for musicians while exploring new sonic territories across genres from pop to electronic. As the music industry grapples with technology's role, this ambitious venture could redefine creative partnerships.

January 22, 2026
AIinMusicElevenLabsMusicInnovation
NYU Professor's 42-Cent AI Oral Exams Expose Cheating Gap
News

NYU Professor's 42-Cent AI Oral Exams Expose Cheating Gap

An NYU professor found students acing written assignments often couldn't explain basic concepts when quizzed verbally. His solution? AI-powered oral exams costing just 42 cents per student. While stressful for some, 70% agreed these tests better measured real understanding than traditional methods. The experiment reveals both cheating vulnerabilities and AI's potential to transform academic assessment.

January 5, 2026
AI in EducationAcademic IntegrityNYU Innovation
Microsoft's New Open-Source Voice Model Talks Almost as Fast as You Think
News

Microsoft's New Open-Source Voice Model Talks Almost as Fast as You Think

Microsoft has quietly released VibeVoice-Realtime-0.5B, a surprisingly nimble text-to-speech model that responds in just 300 milliseconds - faster than most humans can blink. This lightweight yet powerful tool can handle marathon 90-minute readings without missing a beat, juggle four distinct character voices simultaneously, and even detect emotions in text. While its English performance shines, the Chinese version still needs some polish. Already available on HuggingFace with an MIT license, developers are quickly integrating it into everything from audiobook apps to real-time translation tools.

December 5, 2025
MicrosoftText-to-SpeechAI Voice
ElevenLabs Unleashes All-in-One AI Studio for Creators
News

ElevenLabs Unleashes All-in-One AI Studio for Creators

ElevenLabs has transformed from a voice specialist into a full-fledged multimedia powerhouse. Their new platform lets creators generate images, videos, voiceovers, and music in one seamless workflow - potentially cutting production time from hours to minutes. Marketing teams and content creators can now produce polished commercials entirely within ElevenLabs' ecosystem.

November 18, 2025
AI Content CreationMultimodal AIVideo Production
ByteDance Unveils Four Advanced AI Models with Enhanced Features
News

ByteDance Unveils Four Advanced AI Models with Enhanced Features

ByteDance's Volcano Engine has launched four new or upgraded AI models, including enhanced versions of Doubao Large Model 1.6 and two new voice synthesis models. These innovations offer improved performance, flexibility, and cost-efficiency for enterprise users.

October 16, 2025
Artificial IntelligenceByteDanceVoice Synthesis