GPT-4o Audio
OpenAI
Realtime audio model for low-latency voice and text interactions over WebRTC or WebSocket
Pricing (per minute)
Input: $0.60
Output: $2.40
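As a rough illustration of how the per-minute rates listed above combine, the sketch below estimates the cost of a session from its input and output audio durations. It assumes billing is strictly linear in minutes, with no rounding, minimums, or separate text-token charges; the page does not confirm any of this. The function name `estimate_cost` is hypothetical and not part of any OpenAI SDK.

```python
from decimal import Decimal

# Per-minute rates in USD, copied from the pricing listed above.
INPUT_RATE_PER_MIN = Decimal("0.60")
OUTPUT_RATE_PER_MIN = Decimal("2.40")


def estimate_cost(input_minutes, output_minutes):
    """Estimate session cost in USD.

    Assumption: cost scales linearly with audio duration. Actual billing
    may round or meter differently.
    """
    inp = Decimal(str(input_minutes))
    out = Decimal(str(output_minutes))
    if inp < 0 or out < 0:
        raise ValueError("durations must be non-negative")
    return inp * INPUT_RATE_PER_MIN + out * OUTPUT_RATE_PER_MIN


# 10 minutes of input audio and 5 minutes of output audio:
# 10 * 0.60 + 5 * 2.40 = 18.00
print(estimate_cost(10, 5))
```

Using `Decimal` rather than floats keeps the currency arithmetic exact. Output audio costs four times as much per minute as input, so sessions dominated by model speech will cost noticeably more than listening-heavy ones.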
Specifications
API Model ID: gpt-4o-audio-preview
Type: Realtime
Modalities: text, audio
Capabilities: streaming, tool-use, vision
Other OpenAI Audio Models
Model     Input / min   Output / min
Whisper   $0.0060       —
TTS       —             $0.01
TTS HD    —             $0.03