OpenAI Adds Real-Time Voice, Translation to API
OpenAI has launched several new voice intelligence features in its API, including GPT-Realtime-2, a voice model powered by GPT-5-class reasoning for handling complex user requests. The update also introduces GPT-Realtime-Translate, supporting over 70 input and 13 output languages, and GPT-Realtime-Whisper, a live speech-to-text transcription tool.
The features target industries like customer service, education, media, and creator platforms. OpenAI says it has built guardrails to prevent misuse, including automatic conversation halting for harmful content violations. Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.
