OpenAI: GPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder that produces more natural-sounding voices and keeps the voice more consistent. Audio is priced at $32 per million input tokens and $64 per million output tokens.
Parameters: Undisclosed
Context Window: 128K tokens
License: Proprietary
Released: Jan 19, 2026
💰 Pricing
Input: $2.50 per 1M tokens
Output: $10.00 per 1M tokens
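As a rough illustration of how per-million-token pricing turns into a request cost, the sketch below multiplies token counts by the rates quoted on this page. The `Rate` class and `estimate_cost` function are hypothetical helpers written for this example, not part of any OpenAI SDK. It also assumes that the $2.50 / $10.00 figures apply to text tokens and the $32 / $64 figures to audio tokens; the page does not state this split explicitly.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Rate:
    """Price in USD per one million tokens (hypothetical helper type)."""
    input_per_million: float
    output_per_million: float


# Audio rates quoted in the model description on this page.
AUDIO_RATE = Rate(input_per_million=32.00, output_per_million=64.00)

# Rates from the pricing fields; ASSUMED here to cover text tokens.
LISTED_RATE = Rate(input_per_million=2.50, output_per_million=10.00)


def estimate_cost(input_tokens: int, output_tokens: int, rate: Rate) -> float:
    """Return the estimated cost in USD for the given token counts."""
    total = (input_tokens * rate.input_per_million
             + output_tokens * rate.output_per_million)
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 10,000 input tokens and 5,000 output tokens at each rate.
    print(f"audio:  ${estimate_cost(10_000, 5_000, AUDIO_RATE):.4f}")
    print(f"listed: ${estimate_cost(10_000, 5_000, LISTED_RATE):.4f}")
```

With these assumed figures, 10,000 input tokens and 5,000 output tokens come to about $0.64 at the audio rates and about $0.075 at the listed rates. Actual billing depends on how the provider counts audio and text tokens, so treat the result as an estimate only.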
API Available
This model is accessible via API for integration into your applications.
⭐ Related Models
GPT-5
OpenAI
OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.
GPT-4o
OpenAI
Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.
GPT-4o Mini
OpenAI
Compact and affordable. Surprisingly capable for its price point, ideal for high-volume applications.