GPT-4o Realtime

Input:$60/M

Output:$240/M

The Realtime API allows developers to build low-latency, Multimodal experiences, including speech-to-speech functionality. Text and Audio processed by the Realtime API are priced separately. This model supports a maximum context length of 128,000 tokens.

Commercial Use

Features

Pricing

API

Versions

Features for GPT-4o Realtime

Explore the key features of GPT-4o Realtime, designed to enhance performance and usability. Discover how these capabilities can benefit your projects and improve user experience.

Pricing for GPT-4o Realtime

Explore competitive pricing for GPT-4o Realtime, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how GPT-4o Realtime can enhance your projects while keeping costs manageable.

Comet Price (USD / M Tokens)	Official Price (USD / M Tokens)	Discount
Input:$60/M Output:$240/M	Input:$75/M Output:$300/M	-20%

Versions of GPT-4o Realtime

The reason GPT-4o Realtime has multiple snapshots may include potential factors such as variations in output after updates requiring older snapshots for consistency, providing developers a transition period for adaptation and migration, and different snapshots corresponding to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.

version
gpt-4o-realtime-preview
gpt-4o-realtime-preview-2024-10-01
gpt-4o-realtime-preview-2024-12-17
gpt-4o-realtime-preview-2025-06-03

GPT-4o Realtime

Features for GPT-4o Realtime

Pricing for GPT-4o Realtime

Sample code and API for GPT-4o Realtime

Versions of GPT-4o Realtime

More Models