Play is a Voice AI company that specializes in building conversational voice models capable of cloning any voice or accent and generating speech in real-time.
Building the future of voice technology at Play.ht. I'm super passionate about building and scaling products that add value and make a difference in people's lives. I've been a software engineer and a product person in my past role at OLX where I met Mahmoud to start Play.ht. My experience ranges from bootstrapping products to profitability, SEO, product management, and building and deploying end-to-end systems that deliver high-impact business value within constrained timelines.
We are thrilled to announce the release of the FASTEST Voice LLM to date! Real-time speech streaming from text in 300ms or less. Dive in and test it using our Playground, available SDKs, or these Replit demos for Nodejs and a chatGPT integration.
YC Deal
For all YC companies, get 50% off API Plans for 2 years, check it here.
At PlayHT, our vision is to redefine human interactions with AI agents. Whether it’s for customer support or sales calls, AI tutors, or bringing Gaming NPCs to life, our goal is to revolutionize the way humans communicate with generative AI agents.
Today we announce our latest milestone on the road to fulfilling that vision: the launch of PlayHT Turbo, a new version of our conversational voice model, PlayHT 2.0 that generates speech in under 300ms via network and < 100ms for on-premise solutions.
PlayHT 2.0 Turbo supports input text streaming. This feature seamlessly integrates with LLMs, like chatGPT. Simply feed the output stream of tokens/words from the LLM and the SDK will process the tokens in the best way that can balance both generating expressive contextual speech and reducing the TTFB (time to first byte).
Once Turbo receives text, it starts streaming audio in approximately 70ms. However, due to inevitable network costs, users typically receive the audio stream within a 200ms to 400ms window.
Check out our demo showcasing the integration with chatGPT with both input and output streaming:
https://www.youtube.com/watch?v=hF6IueCacfg
Ready to redefine human-AI communication? Build the next AI Therapist, AI Tutor, Gaming NPCs, or Personal Assistant that actually sounds human? We built this API for you, get started now for free, and join our discord and show us what you are building!
How can you help?