Miso Labs: The most emotive foundation models for voice

Miso Labs

The most emotive foundation models for voice

Spring 2026

Active

The most emotive foundation models for voice

Miso Labs is building the world’s most emotive foundation models for voice. We believe that the next generation of AI interactions shouldn't just be functional—they should be human. By bringing warmth and lightning-fast speed to the voice layer, we empower developers to build voice agents that users truly love.

Active Founders

Cassidy Dalva

Founder

co-founder @ miso labs | stanford alum

Cassidy Dalva

Founder

co-founder @ miso labs | stanford alum

Aoden Teo

Founder

Math major from Stanford building the future of AI voice.

Aoden Teo

Founder

Math major from Stanford building the future of AI voice.

Company Launches

Miso Labs - emotive voice models

See original launch post

Today, we’re excited to introduce Miso One, the most emotive voice model in the world.

Miso One is an 8-billion-parameter text-to-speech model for highly expressive speech generation. It emotes like a human and responds faster than a human, with just 110 milliseconds of latency.

We’ve open-sourced the model weights, with API access coming soon.

https://www.youtube.com/watch?v=HizlJgDbac8