API Endpoints Guide

You can transcribe input audio or video using one of these APIs:

  • Synchronous API — for quick, immediate results.
  • Asynchronous API — for longer jobs with job IDs and polling.
  • OpenAI-Compatible API — for seamless integration with OpenAI clients and SDKs.

Subtitle generation via the API is not supported yet.

Speech-to-Text Models

We currently support these AI models:

  • Formalmodel-01052025
  • Conversationalmodel-23012026

Pick one with the stt_model query parameter on the upload endpoint.

Listen to the sample audio used in the examples below:

Formal model

model-01052025 Released 01 Nov 2025 Default

Best for formal speech and subtitles. In some cases, non-formal speech can be turned into formal.

Transcribe with the Formal model:

curl --location 'https://api.hispeech.ai/api/v1/transcriptions/upload?stt_model=model-01052025' \
--header 'x-auth-token: YOUR_API_KEY' \
--form 'file=@"/path/to/audio_or_video.mp3"' \
--form 'wait_for_result="true"'

Formal is the default model, so you can also omit stt_model and call .../transcriptions/upload directly.

Response Example:

{
    "job_id": "a91f17d0-5d9a-11f1-8d6f-215414e3f380",
    "error": null,
    "success": true,
    "secure_mode": false,
    "return_word_timestamps": false,
    "status": 200,
    "transcription": "Խոսում է, գրում է, պատմում է, ասում է։",
    "audio_duration": 3
}

Conversational model

model-23012026 Released 23 Jan 2026

Best for conversational (non-dialectal) speech. Transcribes exactly what it hears.

Transcribe with the Conversational model:

curl --location 'https://api.hispeech.ai/api/v1/transcriptions/upload?stt_model=model-23012026' \
--header 'x-auth-token: YOUR_API_KEY' \
--form 'file=@"/path/to/audio_or_video.mp3"' \
--form 'wait_for_result="true"'

Response Example:

{
    "job_id": "84379d20-5d9a-11f1-a2d3-8a2446801cc3",
    "error": null,
    "success": true,
    "secure_mode": false,
    "return_word_timestamps": false,
    "status": 200,
    "transcription": "Խոսում ա գրում ա, պատմում ա, ասում ա։",
    "audio_duration": 3
}
Do not forget to replace:
   • YOUR_API_KEY
   • /path/to/audio_or_video.mp3
with your own correct values.