Paste a video URL or upload a file. Dokitscript transcribes it, translates it, and creates a natural AI voice-over MP3 you can download — powered by ElevenLabs.
TikTok · Instagram · YouTube · Facebook · X · LinkedIn · Last updated June 2026
Try AI voiceover free →How does the AI voiceover from video work? Dokitscript creates an AI voiceover from your video's transcript or translation — not from an arbitrary script you type. Paste the video URL (or upload a file) into Dokitscript, wait for the transcript generated by OpenAI Whisper (90+ languages), use the AI Translation feature to translate the text into your target language, then click Listen. ElevenLabs' eleven_multilingual_v2 model reads the translated text aloud and produces a downloadable 128 kbps MP3 voice-over. Audio generation supports approximately 29 languages and requires the Starter plan or higher.
How it works
No software to install. Works entirely in your browser.
Paste a TikTok, Instagram, YouTube, Facebook, X, or LinkedIn video URL — or upload an audio or video file up to 50 MB.
Dokitscript transcribes the video in 90+ languages. The spoken language is auto-detected, or you can select it manually.
Use the AI Translation feature to translate the transcript into French, Spanish, Japanese, German, or any of the supported languages. This is the script the voice-over will read.
ElevenLabs generates a natural AI voice reading the translated text. Download the result as a 128 kbps MP3 file ready to use.
Features
From video URL to ready-to-use voice-over MP3, in one tool.
Voice-overs are generated with ElevenLabs' eleven_multilingual_v2 model — one of the most natural-sounding multilingual AI voices available today.
OpenAI Whisper handles the speech-to-text step. It auto-detects the spoken language and supports over 90 languages for transcription.
The translation step runs on Claude AI and produces natural-sounding text in the target language — that text becomes the voice-over script.
The voice-over output is a standard MP3 file you can use in video editors, podcasts, language learning apps, social media repurposing, or accessibility tools.
Paste a URL from TikTok, Instagram Reels, YouTube Shorts, YouTube, Facebook, X (Twitter) or LinkedIn. File upload also works for local recordings.
You always receive the complete written transcript and the translated text alongside the MP3. Export as TXT or SRT at any time.
Languages
Transcription and voice-over generation cover different language sets — here is the honest breakdown.
Dokitscript can transcribe speech in over 90 languages, including English, French, Spanish, Arabic, Chinese, Hindi, Japanese, Korean, Portuguese, German, Italian, and many more. The spoken language is detected automatically.
The MP3 voice-over is powered by ElevenLabs and currently supports approximately 29 languages:
Note: transcription supports 90+ languages; voice-over audio generation supports ~29. If your target language is not in the audio list, you will still get the translated text transcript.
Use cases
Anywhere creators and teams need a voice-over in another language, fast.
Record your video once in English, then generate a Spanish, French, or Japanese voice-over to publish on each market's channel — without re-recording.
Turn a TikTok or Instagram Reel into a voice-over track in another language. Pair it with a translated caption to reach new audiences without extra recording sessions.
Convert video content into a standalone audio file so users with visual impairments — or people who prefer to listen — can access it in their language.
Translate an episode into a second language and generate an AI voice-over track. Publish it as a bonus episode for your international listeners.
Convert recorded training videos into voice-over MP3s in multiple languages for distributed teams. No studio time required.
Get an AI-voiced MP3 as a draft reference before bringing in a voice actor. Useful for client approvals and timing checks in early production.
Plans
Transcription and translation are available on every plan. Voice-over MP3 generation requires Starter or higher.
| Plan | Price | Transcriptions | Max video length | Voice-over audio (MP3) |
|---|---|---|---|---|
| Free | $0 | 5 / month | 3 minutes | Not available |
| Starter | $4.99 / mo | 200 / month | 8 minutes | 6 min / month |
| Pro | $14.99 / mo | Unlimited | 45 minutes | 60 min / month |
| Business | $79.99 / mo | Unlimited | 5 hours | 240 min / month |
Voice-over minutes are counted per generated MP3. Unused minutes do not roll over. See full pricing →
FAQ
Related tools
Free to start. Voice-over generation from $4.99/month. No software needed.
Get started free →