← Back to all models

🎵

OpenAI: GPT Audio

OpenAI·Audio & Speech

🔥 41trending

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

#text+audio->text+audio#top-provider

🧮

Undisclosed

Parameters

📏

128K tokens

Context Window

🔒

Proprietary

License

📅

Jan 19, 2026

Released

💰 Pricing

Input

$2.50

per 1M tokens

Output

$10.00

per 1M tokens

🔌

API Available

This model is accessible via API for integration into your applications.

⭐ Related Models

GPT-5

OpenAI

OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.

🔒 proprietary📏 256K ctx💰 $10

Dec 1, 2025View details →

GPT-4o

OpenAI

Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.

🔒 proprietary📏 128K ctx💰 $2.50

May 13, 2024View details →

GPT-4o Mini

OpenAI

Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.

🔒 proprietary📏 128K ctx💰 $0.15

Jul 18, 2024View details →