OpenAI has introduced new transcription and voice models that reduce word error rate and improve language recognition compared with its earlier models. These gains make them well suited to applications such as customer service and meeting transcription.
OpenAI has also launched the GPT-4o Mini TTS model, allowing developers to customize voice outputs with precision, adjusting tone, emotion, and speed.
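As a rough illustration of that customization, the sketch below builds a text-to-speech request in which tone, emotion, and pacing are expressed as a natural-language `instructions` string. It assumes the official `openai` Python package and an `OPENAI_API_KEY` in the environment. The helper names `build_speech_request` and `synthesize`, the `coral` voice choice, and the wording of the instruction string are illustrative assumptions, not anything specified in the announcement.

```python
def build_speech_request(text: str, tone: str, emotion: str, pace: str,
                         voice: str = "coral") -> dict:
    """Assemble keyword arguments for a text-to-speech request (hypothetical helper)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    # Assumption: style is steered with a free-form instruction string.
    instructions = f"Tone: {tone}. Emotion: {emotion}. Pacing: {pace}."
    return {
        "model": "gpt-4o-mini-tts",
        "voice": voice,
        "input": text,
        "instructions": instructions,
    }


def synthesize(request: dict, out_path: str) -> None:
    """Send the request and stream the audio to a file (needs network and an API key)."""
    from openai import OpenAI  # imported lazily so the builder above works offline

    client = OpenAI()
    with client.audio.speech.with_streaming_response.create(**request) as response:
        response.stream_to_file(out_path)


if __name__ == "__main__":
    request = build_speech_request(
        "Your order has shipped.", tone="warm", emotion="reassuring", pace="slow"
    )
    print(request["instructions"])
    # synthesize(request, "order_update.mp3")  # uncomment to call the API
```

Keeping request construction separate from the network call makes the style settings easy to inspect or log before any audio is generated.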
The new models are available through OpenAI's API and are integrated with the Agents SDK, which simplifies the development of audio-based applications.
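A minimal sketch of calling a transcription model through the API might look like the following. The summary does not name the transcription models, so the identifier `gpt-4o-transcribe` is an assumption here, as are the helper names `build_transcription_request` and `transcribe` and the file name in the usage comment. The code assumes the official `openai` Python package and an `OPENAI_API_KEY` in the environment.

```python
from pathlib import Path
from typing import Optional

# Audio container formats accepted by the transcription endpoint.
SUPPORTED_SUFFIXES = {".mp3", ".mp4", ".mpeg", ".mpga", ".m4a", ".wav", ".webm"}


def build_transcription_request(path: str, language: Optional[str] = None,
                                model: str = "gpt-4o-transcribe") -> dict:
    """Validate the file type and assemble request parameters (hypothetical helper)."""
    suffix = Path(path).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {suffix or '(none)'}")
    params = {"model": model}  # assumption: model name is not given in the source
    if language:
        params["language"] = language  # optional ISO-639-1 hint, e.g. "en"
    return params


def transcribe(path: str, language: Optional[str] = None) -> str:
    """Upload the audio file and return the transcript text (needs network and an API key)."""
    from openai import OpenAI  # imported lazily so validation works offline

    client = OpenAI()
    params = build_transcription_request(path, language)
    with open(path, "rb") as audio_file:
        result = client.audio.transcriptions.create(file=audio_file, **params)
    return result.text


if __name__ == "__main__":
    print(build_transcription_request("standup.m4a", language="en"))
    # print(transcribe("standup.m4a", language="en"))  # uncomment to call the API
```

Passing a language hint is optional; it is included only to show where such a setting would go in a meeting-transcription workflow.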
OpenAI is actively engaging with the community through events like the 'Deep Research in the OpenAI Forum' virtual session.