GPT Audio

OpenAI

What is GPT Audio?

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder that produces more natural-sounding voices and keeps each voice more consistent. Audio is priced...

Specifications

  • Developer: OpenAI
  • Context window: 128K tokens
  • Max output: 16.4K tokens
  • Input modalities: text, audio
  • Output modalities: text, audio
  • Input price: $2.50 per 1M tokens
  • Output price: $10.00 per 1M tokens
  • Knowledge cutoff:
  • Supported parameters: frequency_penalty, logit_bias, logprobs, max_tokens, presence_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_logprobs, top_p
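As a rough illustration of how the listed prices and supported parameters might be used, here is a minimal Python sketch. It assumes the per-token prices above apply uniformly to every token in a request; the page's note on audio pricing is cut off, so audio tokens may well be billed differently. The helper names `estimate_cost` and `unsupported_parameters` are hypothetical and not part of any OpenAI or PromptLayer API.

```python
# Prices copied from the specification list (USD per 1M tokens).
# Assumption: these rates apply to all tokens; audio tokens may be priced differently.
INPUT_PRICE_PER_M = 2.50
OUTPUT_PRICE_PER_M = 10.00

# Parameter names copied from the "Supported parameters" entry.
SUPPORTED_PARAMETERS = {
    "frequency_penalty", "logit_bias", "logprobs", "max_tokens",
    "presence_penalty", "response_format", "seed", "stop",
    "structured_outputs", "temperature", "tool_choice", "tools",
    "top_logprobs", "top_p",
}


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


def unsupported_parameters(params: dict) -> list[str]:
    """Return, in sorted order, the parameter names not in the supported list."""
    return sorted(set(params) - SUPPORTED_PARAMETERS)


if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens and 500 output tokens.
    print(f"Estimated cost: ${estimate_cost(2_000, 500):.4f}")
    print(unsupported_parameters({"temperature": 0.2, "top_k": 40}))
```

A check like `unsupported_parameters` could run before a request is sent, so that a setting the model does not list (here, the made-up `top_k`) is caught early rather than silently ignored or rejected.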

Use GPT Audio with PromptLayer

PromptLayer lets teams manage, evaluate, and observe prompts that run on GPT Audio alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.

Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.

PromptLayer © 2026