Models
This list will be updated as the set of available models changes.
Available Models
Nanonets_Ocr_S_F16 (img2txt) – Lightweight OCR model by Nanonets for fast and accurate image-to-text extraction.
WhisperLargeV3 (audio2text) – Advanced speech recognition model by OpenAI for high-accuracy audio transcription.
Kokoro (txt2audio) – Expressive text-to-speech model designed for natural, emotional voice synthesis.
Flux1schnell (txt2img) – High-speed text-to-image generation model optimized for quick visual outputs.
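Each entry above pairs a model identifier with a task type in parentheses. As a minimal sketch of how a client might pick a model by task, the lookup below mirrors that list. The names `MODELS_BY_TASK` and `model_for_task` are hypothetical and not part of any documented API; only the model identifiers and task types come from the list itself.

```python
# Hypothetical lookup table (not an official API): task type -> model identifier,
# copied from the "Available Models" list.
MODELS_BY_TASK = {
    "img2txt": "Nanonets_Ocr_S_F16",
    "audio2text": "WhisperLargeV3",
    "txt2audio": "Kokoro",
    "txt2img": "Flux1schnell",
}


def model_for_task(task: str) -> str:
    """Return the model identifier listed for the given task type."""
    try:
        return MODELS_BY_TASK[task]
    except KeyError:
        # Report the supported task types so a typo is easy to spot.
        supported = ", ".join(sorted(MODELS_BY_TASK))
        raise ValueError(
            f"Unknown task {task!r}; expected one of: {supported}"
        ) from None


if __name__ == "__main__":
    print(model_for_task("txt2img"))
```

Because the list is expected to change, keeping the mapping in one place like this means only the table needs editing when models are added or replaced.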