POST /v1/audio/transcriptions — transcribe audio to text.
json (default), text, srt, verbose_json (with segments + word timestamps), vtt.
API key in format: Bearer inf_***
Transcription result. Shape depends on response_format: JSON (json, verbose_json) or plain text (text, srt, vtt).
response_format: json (default) or verbose_json