The Realtime API allows developers to build low-latency, multimodal experiences, including speech-to-speech functionality. Text and audio processed by the Realtime API are priced separately. This model supports a maximum context length of 128,000 tokens.
Commercial Use
Features for GPT-4o Realtime
Explore the key features of GPT-4o Realtime, designed to enhance performance and usability. Discover how these capabilities can benefit your projects and improve user experience.
Pricing for GPT-4o Realtime
Explore competitive pricing for GPT-4o Realtime, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how GPT-4o Realtime can enhance your projects while keeping costs manageable.
Comet Price (USD / M Tokens): Input $60/M, Output $240/M
Official Price (USD / M Tokens): Input $75/M, Output $300/M
Discount: -20%
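To make the per-million-token rates concrete, the following is a minimal sketch of how a request's cost could be estimated from the rates listed above. The function names and the token counts in the usage note are hypothetical and chosen for illustration only. The sketch also assumes a single input rate and a single output rate; the page states that text and audio are priced separately but does not say which modality the listed rates cover.

```python
# Rates copied from the pricing listed above (USD per million tokens).
COMET_RATES = {"input": 60.0, "output": 240.0}
OFFICIAL_RATES = {"input": 75.0, "output": 300.0}
TOKENS_PER_MILLION = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int, rates: dict) -> float:
    """Estimate the cost in USD of one request at the given rates.

    Hypothetical helper: assumes one flat input rate and one flat output rate.
    """
    input_cost = input_tokens * rates["input"] / TOKENS_PER_MILLION
    output_cost = output_tokens * rates["output"] / TOKENS_PER_MILLION
    return input_cost + output_cost


def discount_percent(discounted: float, official: float) -> float:
    """Return the relative price difference as a percentage (negative = cheaper)."""
    return (discounted - official) / official * 100


if __name__ == "__main__":
    # Illustrative token counts, not measurements from a real session.
    comet = estimate_cost(10_000, 2_000, COMET_RATES)
    official = estimate_cost(10_000, 2_000, OFFICIAL_RATES)
    print(f"Comet: ${comet:.2f}  Official: ${official:.2f}")
    print(f"Input discount: {discount_percent(60.0, 75.0):.0f}%")
```

For a request with 10,000 input tokens and 2,000 output tokens, this works out to about $1.08 at the Comet rates and about $1.35 at the official rates, which matches the listed 20% discount.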
Sample code and API for GPT-4o Realtime
Access comprehensive sample code and API resources for GPT-4o Realtime to streamline your integration process. Our detailed documentation provides step-by-step guidance, helping you leverage the full potential of GPT-4o Realtime in your projects.
Versions of GPT-4o Realtime
GPT-4o Realtime may have multiple snapshots for several reasons. Output can vary after an update, so older snapshots are kept for consistency. Keeping earlier snapshots available also gives developers a transition period to adapt and migrate. In addition, different snapshots may correspond to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.