News

The key factors to evaluate when selecting a speaker diarization API, from accuracy metrics to handling overlapping speech. AI diarization ...
OpenAI has unveiled its latest speech-to-speech artificial intelligence (AI) model, gpt-realtime, designed to generate more ...
Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...
A new study finds that people are increasingly using words that are common in AI chatbot output. Researchers found a measurable shift ...
AI-powered voice assistants are transforming customer interactions across India, handling queries in multiple languages with ...
OpenAI made its Realtime API generally available this week, enabling developers to build voice agents. The API supports remote MCP servers, image inputs, and phone calling through Session Initiation ...