Video-to-Text (Transcription)

Video-to-Text automatically transcribes audio or video files into text. The API produces a written transcript, and each transcription request returns a request ID for tracking processing progress.

POST /api/v1/client/vid2txt

Endpoint for requesting video transcription.

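Authorizations
Bearer token (JWT) sent in the Authorization header, as in the example request.
Header parameters
Accept · string · enum · Required (the example request sends application/json)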
Body
video_url · string · Required

URL of the video to transcribe.

Example: https://www.youtube.com/watch?v=jNQXAC9IVRw

include_ts · boolean · Required

Whether the transcription should include timestamps.

model · string · Optional

The model to use for transcription.

Example: whisper-3-large
Responses
200

ID of the inference request.

Content type: application/json

Example request
POST /api/v1/client/vid2txt HTTP/1.1
Host: api.deapi.ai
Authorization: Bearer JWT
Accept: application/json
Content-Type: application/json
Content-Length: 103

{
  "video_url": "https://www.youtube.com/watch?v=jNQXAC9IVRw",
  "include_ts": true,
  "model": "whisper-3-large"
}

Example response

{
  "data": {
    "request_id": 1
  }
}
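
The request above can also be issued from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_transcription_request` and `submit_transcription` are hypothetical, the `https` scheme is assumed from the host shown in the example, and the response is assumed to have the `data.request_id` shape shown above.

```python
import json
import urllib.request

# Host and path are taken from the example request; the https scheme is an assumption.
API_URL = "https://api.deapi.ai/api/v1/client/vid2txt"


def build_transcription_request(token, video_url, include_ts, model=None):
    """Build (but do not send) the POST request for a transcription job."""
    payload = {"video_url": video_url, "include_ts": include_ts}
    if model is not None:
        # "model" is optional, so it is left out of the body when not provided.
        payload["model"] = model
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Accept": "application/json",
            "Content-Type": "application/json",
        },
    )


def submit_transcription(token, video_url, include_ts, model=None):
    """Send the request and return the request ID from the JSON response."""
    req = build_transcription_request(token, video_url, include_ts, model)
    with urllib.request.urlopen(req) as resp:
        # Assumes the documented response shape: {"data": {"request_id": ...}}
        return json.load(resp)["data"]["request_id"]
```

With a valid token, `submit_transcription(token, "https://www.youtube.com/watch?v=jNQXAC9IVRw", True, "whisper-3-large")` would send the same body as the example request and return the value of `request_id`. Building the request separately from sending it keeps the payload easy to inspect without making a network call.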
POST /api/v1/client/vid2txt/price-calculation

Endpoint for calculating the price of a video-to-text inference.

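Authorizations
Bearer token (JWT) sent in the Authorization header, as in the example request.
Header parameters
Accept · string · enum · Required (the example request sends application/json)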
Body
video_url · string · Required

URL of the video to transcribe.

Example: https://www.youtube.com/watch?v=jNQXAC9IVRw

include_ts · boolean · Required

Whether the transcription should include timestamps.

model · string · Optional

The model to use for transcription.

Example: whisper-3-large
Responses
200

Calculated price for video to text inference.

Content type: application/json

Example request
POST /api/v1/client/vid2txt/price-calculation HTTP/1.1
Host: api.deapi.ai
Authorization: Bearer JWT
Accept: application/json
Content-Type: application/json
Content-Length: 103

{
  "video_url": "https://www.youtube.com/watch?v=jNQXAC9IVRw",
  "include_ts": true,
  "model": "whisper-3-large"
}

Example response

{
  "data": {
    "price": 0.25
  }
}
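
Because the price endpoint accepts the same body as the transcription endpoint, a client can check the cost before submitting a job. The sketch below is one possible way to do that in Python with the standard library. The names `build_price_request`, `parse_price`, and `fetch_price` are hypothetical, the `https` scheme is assumed, and the unit or currency of `price` is not stated in this reference, so the value is returned as-is.

```python
import json
import urllib.request

# Host and path are taken from the example request; the https scheme is an assumption.
PRICE_URL = "https://api.deapi.ai/api/v1/client/vid2txt/price-calculation"


def build_price_request(token, video_url, include_ts, model=None):
    """Build (but do not send) the POST request for a price calculation."""
    payload = {"video_url": video_url, "include_ts": include_ts}
    if model is not None:
        # "model" is optional, so it is left out of the body when not provided.
        payload["model"] = model
    return urllib.request.Request(
        PRICE_URL,
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Accept": "application/json",
            "Content-Type": "application/json",
        },
    )


def parse_price(response_body):
    """Extract the price from a response body shaped like {"data": {"price": ...}}."""
    return json.loads(response_body)["data"]["price"]


def fetch_price(token, video_url, include_ts, model=None):
    """Send the price request and return the calculated price."""
    req = build_price_request(token, video_url, include_ts, model)
    with urllib.request.urlopen(req) as resp:
        return parse_price(resp.read())
```

Applied to the example response above, `parse_price` returns `0.25`. A caller could compare the result of `fetch_price` against a budget threshold before sending the actual transcription request.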
