Latent SpaceMay 8, 2026, 7:11 AMimportant 85

OpenAI 推出 GPT-Realtime-2、GPT-Translate 與 GPT-Whisper:全新 SOTA 即時語音 API

Original: [AINews] GPT-Realtime-2, -Translate, and -Whisper: new SOTA realtime voice APIs

OpenAI has continued to expand the reach of its GPT-5 technology, officially launching three new voice and audio APIs: GPT-Realtime-2…

OpenAI 推出全新一代即時語音與音訊 API,包含 GPT-Realtime-2、GPT-Translate 以及 GPT-Whisper。這些 API 將 GPT-5 的強大能力導入語音領域,提供全新業界領先(SOTA)的即時語音互動、多語言翻譯與語音識別效能,展現了 OpenAI 將 GPT-5 架構全面鋪設至各類應用場景的野心。

OpenAI has continued to expand the reach of its GPT-5 technology, officially launching three new voice and audio APIs: GPT-Realtime-2, GPT-Translate, and GPT-Whisper. This marks OpenAI setting a new state-of-the-art (SOTA) benchmark in real-time voice interaction, cross-language translation, and speech recognition (Speech-to-Text).

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on Latent Space →

Summaries are AI-generated; the original article is authoritative.