News

Learn how to build an AI voice agent with DeepSeek R1. Step-by-step guide to tools, APIs, and Python integration for real-time interaction.
Hugging Face's new FastRTC library enables Python developers to build real-time voice and video AI applications in just a few lines of code.
Verbi is a modular voice-to-voice AI assistant designed for natural, conversational interactions. It captures speech input, converts it to text, processes the text, and generates a spoken response.
The Realtime API supports multimodal text and speech experiences, including natural speech-to-speech conversations using the preset voices already supported in the API.
OpenAI's Realtime API is now generally available, featuring the new gpt-realtime model for more natural voice agents at a 20% ...
Three all-new proprietary voice models: gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts.
The new API features will help enterprises build autonomous, multimodal voice agents with remote tool access, PBX integration ...
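
The capture, transcribe, process, and speak loop described for Verbi above can be sketched in a few lines of Python. This is a rough illustration only, not Verbi's actual code: the `VoicePipeline` class and the three stub stage functions are hypothetical names invented here, and the stubs treat UTF-8 bytes as "audio" so the example runs without any speech or model dependency. In a real assistant each stage would call a speech-to-text model, a language model, and a text-to-speech model.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Hypothetical modular voice-to-voice pipeline with swappable stages."""
    transcribe: Callable[[bytes], str]   # speech input -> text
    respond: Callable[[str], str]        # text -> reply text
    synthesize: Callable[[str], bytes]   # reply text -> spoken audio

    def handle_turn(self, audio_in: bytes) -> bytes:
        # Run one conversational turn through the three stages in order.
        text = self.transcribe(audio_in)
        reply = self.respond(text)
        return self.synthesize(reply)


# Stub stages (assumptions for illustration): bytes stand in for audio.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(text: str) -> str:
    return f"You said: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_respond, fake_synthesize)
audio_out = pipeline.handle_turn(b"hello")
```

Because each stage is passed in as a plain callable, any one of them can be replaced (for example, a different transcription backend) without touching the other two, which is the sense in which such a design is modular.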