Image-to-Text (OCR)

Image-to-text models translate images into written output. The most common uses are image captioning and OCR, which describe scenes and extract text, respectively.

post

Endpoint for requesting image2text (OCR) inference

Authorizations

Header parameters

Acceptstring · enumRequiredPossible values:

Body

imagestring · binaryRequired

Image file to extract text from

modelstringRequired

The OCR model to use for text extraction

Example: Nanonets_Ocr_S_F16

languagestring | nullableOptional

Language code for OCR processing (optional)

Example: en

formatstring · enum | nullableOptional

Output format for extracted text

Example: textPossible values:

Responses

200

ID of the inference request.

application/json

401

Unauthorized user.

application/json

404

Unauthorized user.

application/json

post

/api/v1/client/img2txt

POST /api/v1/client/img2txt HTTP/1.1
Host: api.deapi.ai
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: application/json
Content-Type: multipart/form-data
Content-Length: 79

{
  "image": "binary",
  "model": "Nanonets_Ocr_S_F16",
  "language": "en",
  "format": "text"
}

{
  "data": {
    "request_id": 1
  }
}

post

Endpoint for calculating price for image2text (OCR) inference

Authorizations

Header parameters

Acceptstring · enumRequiredPossible values:

Body

imagestring · binaryOptional

Image file to extract text from

modelstringRequired

The OCR model to use for text extraction. Available models can be retrieved via the GET /api/v1/client/models endpoint.

Example: Nanonets_Ocr_S_F16

languagestring | nullableOptional

Language code for OCR processing (optional)

Example: en

formatstring · enum | nullableOptional

Output format for extracted text

Example: textPossible values:

Responses

200

Calculated price for img2txt inference.

application/json

401

Unauthorized user.

application/json

404

Unauthorized user.

application/json

post

/api/v1/client/img2txt/price-calculation

POST /api/v1/client/img2txt/price-calculation HTTP/1.1
Host: api.deapi.ai
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: application/json
Content-Type: application/json
Content-Length: 79

{
  "image": "binary",
  "model": "Nanonets_Ocr_S_F16",
  "language": "en",
  "format": "text"
}

{
  "data": {
    "price": 0.15
  }
}

PreviousText-to-Speech (TTS)NextVideo-to-Text (Transcription)

Last updated 1 month ago