OpenAI: GPT-4o Audio
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.
Undisclosed
Parameters
128K tokens
Context Window
Proprietary
License
Aug 15, 2025
Released
๐ฐ Pricing
Input
$2.50
per 1M tokens
Output
$10.00
per 1M tokens
API Available
This model is accessible via API for integration into your applications.
โญ Related Models
GPT-5
OpenAI
OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.
GPT-4o
OpenAI
Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.
GPT-4o Mini
OpenAI
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.