Paste a video URL or upload a file. Dokitscript transcribes it, translates it into your target language, and generates a natural AI voice MP3 you can download — powered by ElevenLabs.
TikTok · Instagram · YouTube · Facebook · X · LinkedIn · Last updated June 2026
Try video translation free →How do I translate a video and get the audio in another language? Paste the video URL (or upload a file) into Dokitscript, wait for the transcript, use the AI Translation feature to select your target language, then click Listen. Dokitscript uses ElevenLabs' eleven_multilingual_v2 model to generate a natural AI voice reading the translated text, and produces a downloadable 128 kbps MP3. Transcription runs on OpenAI Whisper and auto-detects the source language (90+ languages supported); translated audio is available in approximately 29 languages and requires the Starter plan or higher.
How it works
No software to install. Works entirely in your browser.
Paste a TikTok, Instagram, YouTube, Facebook, X, or LinkedIn video URL — or upload a local audio or video file up to 50 MB.
Dokitscript transcribes the video in 90+ languages. The source language is detected automatically — no need to specify it.
Click AI Translation and choose your target language — French, Spanish, Japanese, German, Arabic, and more. The full transcript is translated in seconds.
ElevenLabs reads the translated text in a natural AI voice. Download the result as a 128 kbps MP3 file in your chosen language.
Features
Everything from foreign-language video to translated MP3, in one tool.
No need to know what language the video is in. OpenAI Whisper identifies it automatically and transcribes it accurately across 90+ languages.
The translation step is powered by Claude AI and produces natural, fluent translated text before it is converted to spoken audio.
The translated text is read by ElevenLabs' eleven_multilingual_v2 model — one of the most natural-sounding multilingual AI voices available.
The output is a standard MP3 file in your target language, ready to play on any device or import into a video editor, podcast tool, or learning app.
Paste a URL from TikTok, Instagram Reels, YouTube Shorts, YouTube, Facebook, X (Twitter) or LinkedIn. File upload also works for local recordings.
You always get the full written transcript in both the source and target language alongside the MP3. Export as TXT or SRT any time.
Languages
Transcription and audio generation cover different language sets — here is the honest breakdown.
Dokitscript can transcribe speech in over 90 languages, including English, French, Spanish, Arabic, Chinese, Hindi, Japanese, Korean, Portuguese, German, Italian, Turkish, Russian, and many more. The source language is detected automatically.
The translated MP3 voice output is powered by ElevenLabs and currently supports approximately 29 target languages:
Note: transcription supports 90+ source languages; translated audio output supports ~29 target languages. If your target language is not in the audio list, you will still get the fully translated text transcript.
Use cases
Anywhere a foreign-language video needs to be heard in a different language.
Found a TikTok or YouTube video in a language you don't speak? Translate it and listen to the audio in your language instead of reading subtitles.
Translate a video into your target language and listen to it as an MP3. Training your ear with real-world content is more effective than textbook exercises.
Turn a video you created in English into an audio version in French, Spanish, or Japanese. Share the MP3 as a companion track for international followers.
Translate an episode transcript and generate a voiceover in the target language. Publish it as a bonus episode for your audience in another country.
Translate recorded training videos into the local language of each team. Distribute the audio files without re-recording sessions from scratch.
Convert a foreign-language video into an audio file in the listener's own language — useful for accessibility tools, commutes, and low-bandwidth environments.
Plans
Transcription and AI translation are available on every plan. Translated audio download requires Starter or higher.
| Plan | Price | Transcriptions | Max video length | Translated audio (MP3) |
|---|---|---|---|---|
| Free | $0 | 5 / month | 3 minutes | Not available |
| Starter | $4.99 / mo | 200 / month | 8 minutes | 6 min / month |
| Pro | $14.99 / mo | Unlimited | 45 minutes | 60 min / month |
| Business | $79.99 / mo | Unlimited | 5 hours | 240 min / month |
Audio minutes are counted per generated MP3. Unused minutes do not roll over. See full pricing →
FAQ
Related tools
Free to start. Translated audio download from $4.99/month. No software needed.
Get started free →