:

OPENAI LAUNCHES THREE VOICE MODELS FOR API

AI DESK2 MIN READ
THU, MAY 7, 2026

■ AI-SUMMARIZED FROM 3 SOURCES BELOW

OpenAI released three new realtime voice models designed to enable developers to build a new class of voice applications. The models include GPT-Realtime-2 with GPT-5-class reasoning, GPT-Realtime-Whisper for transcription, and GPT-Realtime-Translate for multilingual support.

GPT-Realtime-2 brings advanced reasoning capabilities to voice interactions, matching GPT-5 performance in real-time conversations. The model enables developers to build applications that can understand and respond to complex spoken queries without the traditional latency associated with voice processing. GPT-Realtime-Whisper handles live speech transcription, converting audio input to text in real time. This component allows for immediate processing of spoken input across applications. GPT-Realtime-Translate supports translation across 70+ languages, enabling developers to build multilingual voice applications. The feature allows real-time conversation across language barriers without separate processing steps. OpenAI stated the models will "unlock a new class of voice apps for developers." The release targets the API, making the technology available to third-party developers building commercial and internal applications. In parallel, OpenAI launched Trusted Contact, an optional safety feature for ChatGPT. The feature allows users over 18 to assign an emergency contact for mental health and safety concerns. The capability expands existing teenage safety options to adult users, enabling designated contacts to be notified when the platform detects potential safety issues. The Trusted Contact feature is optional and gives users control over their safety settings. When activated, designated contacts can receive alerts if OpenAI's systems detect concerning behavior patterns. Both announcements reflect OpenAI's focus on expanding voice capabilities while strengthening user safety features. The voice models target developers seeking to integrate advanced audio processing into applications, while the Trusted Contact feature addresses growing concerns about digital safety and mental health support.

■ SOURCES

TechmemeTechmemeThe Decoder

■ SUMMARY WRITTEN BY AI FROM THE LINKS ABOVE

■ MORE FROM THE AI DESK

Low-quality AI-generated content, known as "AI slop," is degrading the value of online forums and communities. The influx of automated posts is drowning out authentic human discussion and expertise.

JUST NOWAI Desk

Microsoft is reviving the shuttered Three Mile Island nuclear plant to meet surging electricity demands from its AI operations. The facility, site of America's worst nuclear disaster in 1979, will generate power exclusively for the tech giant's data centers.

JUST NOWAI Desk

Perplexity's Personal Computer AI assistant is now available to all Mac users, no longer requiring a paid subscription. The move democratizes access to the company's AI agent technology.

1H AGOAI Desk

The Pentagon awarded Meta-backed Scale AI a $500 million contract to process data and support military decision-making through the Chief Digital and AI Office. The deal follows a $100 million agreement inked earlier this year.

1H AGOAI Desk

■ SUBSCRIBE TO THE DAILY BRIEF

ONE EMAIL, 5 STORIES, 06:00 UTC. UNSUBSCRIBE ANYTIME.