Stepfun's New AI Model Packs Speed and Smarts for Digital Assistants

Tech company Stepfun has released Step3.5Flash, an open-source model designed specifically to power intelligent digital assistants. This lightweight model promises both speed and sophistication, offering developers an affordable alternative to closed-source options.

Built for Speed and Substance

The new model breaks ground in several key areas:

  • Blazing-fast responses: Generates up to 350 tokens per second and is particularly efficient on coding tasks
  • Rivals closed-source models: Matches proprietary models in agent applications and mathematical reasoning
  • Extended focus: Handles complex, lengthy tasks with stability, managing contexts up to 256K tokens
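For a rough sense of what the quoted peak rate means in practice, the sketch below converts a token count into generation time. It is a back-of-the-envelope illustration only: the function name is hypothetical, and it assumes the 350 tokens-per-second peak is sustained for the whole response, which real workloads may not achieve.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float = 350.0) -> float:
    """Estimate the time needed to generate `num_tokens` at a fixed rate.

    The default rate is the peak figure quoted in the article; treating it
    as constant is an assumption, so the result is a best-case estimate.
    """
    if num_tokens < 0:
        raise ValueError("num_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    # A 700-token reply at the peak rate takes 2.0 seconds.
    print(generation_seconds(700))
    # A longer, 3,500-token code file takes 10.0 seconds.
    print(generation_seconds(3500))
```

Because the article gives the rate as an upper bound ("up to"), these numbers are lower bounds on response time.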

Under the Hood: Smart Architecture Choices

Step3.5Flash uses a sparse MoE (Mixture of Experts) architecture that activates only about 11 billion of its 196 billion total parameters per token, roughly 5.6% of the model. It also incorporates MTP-3, a multi-token prediction technique that predicts three tokens at a time, effectively doubling efficiency. A combination of sliding-window and global attention helps it locate crucial information in long texts while keeping computational costs manageable.
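Two of these ideas can be illustrated with a little arithmetic. The sketch below computes the share of parameters active per token from the figures quoted above, then compares how many query-key pairs a global causal attention layer and a sliding-window layer have to score. The function names, the toy sequence length, and the window size are all hypothetical; the article does not say how Step3.5Flash actually combines the two attention types, so this only shows why windowing cuts cost.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Share of a sparse MoE model's parameters used for each token."""
    return active_params_b / total_params_b


def causal_mask(seq_len: int, window=None):
    """Build a causal attention mask as a list of boolean rows.

    mask[i][j] is True when query position i may attend to key position j.
    With window=None every earlier token is visible (global attention);
    otherwise only the most recent `window` tokens, including the current
    one, are visible (sliding-window attention).
    """
    return [
        [j <= i and (window is None or i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


def attended_pairs(mask) -> int:
    """Count the query-key pairs the mask allows, a proxy for attention cost."""
    return sum(sum(row) for row in mask)


if __name__ == "__main__":
    # 11B active out of 196B total parameters, as quoted in the article.
    print(round(active_fraction(11, 196), 3))        # 0.056

    # Toy comparison on 8 tokens with a hypothetical window of 3.
    print(attended_pairs(causal_mask(8)))            # 36
    print(attended_pairs(causal_mask(8, window=3)))  # 21
```

The global count grows quadratically with sequence length, while the windowed count grows roughly linearly. That is the usual motivation for mixing the two when contexts get long.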

Real-World Performance That Impresses

In practical demonstrations, the model has shown remarkable versatility:

  • Code generation: Turns text descriptions into functional WebGL 2.0 visualization platforms
  • Number crunching: Solves complex mathematical problems without external tools
  • Cloud-device coordination: Breaks down vague user requests (such as price comparisons) into actionable steps for local devices

The model is currently available on GitHub, HuggingFace, and OpenRouter, with optimizations for personal workstations, including NVIDIA DGX and Apple M4 Max systems. Stepfun has also announced that its next-generation Step4 model is in development and is inviting developer input to shape future agent technologies.
