← Back to all models

🎵

OpenAI: GPT-4o Audio

OpenAI·Audio & Speech

🔥 58trending

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

#text+audio->text+audio#top-provider

🧮

Undisclosed

Parameters

📏

128K tokens

Context Window

🔒

Proprietary

License

📅

Aug 15, 2025

Released

💰 Pricing

Input

$2.50

per 1M tokens

Output

$10.00

per 1M tokens

🔌

API Available

This model is accessible via API for integration into your applications.

⭐ Related Models

GPT-5

OpenAI

OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.

🔒 proprietary📏 256K ctx💰 $10

Dec 1, 2025View details →

GPT-4o

OpenAI

Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.

🔒 proprietary📏 128K ctx💰 $2.50

May 13, 2024View details →

GPT-4o Mini

OpenAI

Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.

🔒 proprietary📏 128K ctx💰 $0.15

Jul 18, 2024View details →