Transcription: Generate video captions from the original audio. Our AI models transcribe the speech of one or more speakers, even if they use different languages.
Translation: Translate captions into more than 100 languages to make your video accessible to multilingual audiences.
How it works
We use Whisper ASR from OpenAI along with a range of other specialized AI models. These models run on Gcore infrastructure, so no files are transferred to external services. After processing, all original files are deleted from the AI system’s local storage.
Key benefits
Ease of use. You can generate ready-to-use subtitles in the Customer Portal or with a few API requests.
Supported languages. Our AI models can recognize and transcribe audio in over 100 languages. If the language you need isn't listed, please contact the Gcore support team, and we'll consider adding it in the future.
Multi-platform support. Generate video captions for any MP4 video that’s uploaded to Gcore Video Hosting or stored externally, for instance, on AWS.
Multi-language support. We can recognize multiple spoken languages in a single video (the “code-switching” feature). This handles participants who switch between languages as they speak.
Generate and translate AI captions for your video content
You can generate and translate subtitles in two ways: via the API or in the Customer Portal. For step-by-step instructions, check out the relevant guide: