Speech Models - Search News

How Large Scale Speech Models Will Impact Voice AI

Gautam Jha is the Co-Founder & CTO of Kalpa Labs, an SF-based YC backed startup building large scale Foundational speech models. Voice is quickly becoming a primary interface for enterprise software, ...

16d

Google’s Gemini 3.1 Flash TTS model offers unparalleled control over AI voices

Google LLC’s DeepMind artificial intelligence unit today rolled out a new text-to-speech model called Gemini 3.1 Flash TTS.

Geeky Gadgets

OpenAI Launches New Speech-to-Text AI Audio Models API for Developers

OpenAI has today introduced a suite of advanced audio models and tools through its API, designed to empower developers in creating sophisticated, voice-driven applications. These updates include ...

Deepgram Launches Flux Multilingual: The World’s First Multilingual Conversational Speech Recognition Model

Deepgram, the real-time AI infrastructure company underpinning the Voice AI economy, today announced the general availability (GA) of Flux Multilingual, expanding its conversational speech recognition ...

Deepgram expands Flux to 10 languages with mid-call switching for voice agents

Real-time voice artificial intelligence startup Deepgram Inc. today announced the general availability of Flux Multilingual, ...

TechCrunch

AssemblyAI lands $50M to build and serve AI speech models

Companies are betting big on generative AI to gain a competitive edge. But adoption challenges remain. According to a recent survey from EY, a significant portion of businesses looking to embrace ...

9to5Mac

Apple’s latest AI model listens for what makes speech sound ‘off’, here’s why that matters

As part of its fantastic body of work on speech and voice models, Apple has just published a new study that takes a very human-centric approach to a tricky machine learning problem: not just ...

techtimes

Amazon Researchers Train the Largest Ever Text-to-Speech AI Model to Date

Amazon researchers have unveiled the largest text-to-speech AI model to date, which they claimed shows "emergent" qualities that enhance its ability to speak even complex sentences naturally.

Forbes

Why The Speech AI Industry Is Hitting A Wall And What Comes Next

The global speech and voice recognition market is projected to grow from $20 billion in 2023 to over $53 billion by 2030. That number sounds impressive until you look at how the industry is actually ...

The Daily Pennsylvanian

AI platforms display considerable variance in judging hate speech, Annenberg School study finds

Large language models powered by artificial intelligence differ in how they classify and respond to hate speech, according to recent findings by Annenberg School for Communication researchers. In a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results