Pro feature

Dub Video (AI video translation)

Upload a video in English and get it back in German. AI transcription, translation, and voice synthesis run in a single pipeline.

[Screenshot: Dub Video UI with source/target language selectors and progress stages]

What happens under the hood

  1. Speech Studio uploads your video to your Azure Blob Storage (the videos container you set up).
  2. It calls the Azure Video Translation API with the source and target languages.
  3. Azure transcribes the original audio, translates it, generates target-language voices, and re-syncs them to the video.
  4. Speech Studio downloads the result.
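
The four steps above can be sketched as a small pipeline. This is an illustration only, not Speech Studio's actual code: the DubJob class, the run_dub_pipeline function, and the four callables passed into it are hypothetical names, and the stub lambdas stand in for the real Azure Blob Storage and Video Translation calls. Matching each step to one of the progress labels shown in the app is also an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class DubJob:
    """State carried through the four steps (all field names are illustrative)."""
    source_path: str
    source_language: str
    target_language: str
    blob_url: str = ""
    translation_id: str = ""
    result_path: str = ""
    completed: List[str] = field(default_factory=list)


def run_dub_pipeline(
    job: DubJob,
    upload: Callable[[str], str],
    start_translation: Callable[[str, str, str], str],
    wait_for_result: Callable[[str], str],
    download: Callable[[str], str],
) -> DubJob:
    """Run the four steps in order, recording each one as it finishes."""
    # Step 1: put the source video in the storage container.
    job.blob_url = upload(job.source_path)
    job.completed.append("Uploading")

    # Step 2: ask the translation service to start a job for this language pair.
    job.translation_id = start_translation(
        job.blob_url, job.source_language, job.target_language
    )
    job.completed.append("Translating")

    # Step 3: the service transcribes, translates, synthesizes, and re-syncs;
    # the client only waits until a result location is available.
    result_url = wait_for_result(job.translation_id)
    job.completed.append("Generating")

    # Step 4: fetch the dubbed video.
    job.result_path = download(result_url)
    job.completed.append("Downloading")
    return job


if __name__ == "__main__":
    # Stub callables: placeholders for real network calls.
    finished = run_dub_pipeline(
        DubJob("talk.mp4", "en-US", "de-DE"),
        upload=lambda path: "blob://videos/" + path,
        start_translation=lambda url, src, dst: "job-" + src + "-" + dst,
        wait_for_result=lambda job_id: "blob://videos/talk.dubbed.mp4",
        download=lambda url: "talk.dubbed.mp4",
    )
    print(finished.completed)
```

Passing the steps in as callables keeps the ordering logic separate from the service calls, so the sequence can be exercised without an Azure account.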

Prerequisites

Dubbing flow

  1. Open Dub Video.
  2. Click Browse and pick your source video (MP4 and MOV are the most common formats).
  3. Pick Source language (auto-detect works, but specifying the language is faster).
  4. Pick Dub into language.
  5. (Optional) Tick Add subtitles to get an SRT track in the target language.
  6. Click Start Dubbing.
  7. Watch the progress stages: Uploading → Translating → Generating → Downloading.
  8. When done, the dubbed video opens automatically and is saved next to the source.

Processing time

Azure Video Translation runs in batch mode. Expect processing to take roughly 2–4× the video length (for example, a 10-minute video takes 20–40 minutes to fully dub). The progress panel updates throughout.
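
As a rough planning aid, the 2–4× rule of thumb can be written as a small helper. The function name estimate_dub_minutes and its parameters are hypothetical and are not part of Speech Studio; the factors come straight from the estimate above, and actual times may vary.

```python
from typing import Tuple


def estimate_dub_minutes(
    video_minutes: float,
    low_factor: float = 2.0,
    high_factor: float = 4.0,
) -> Tuple[float, float]:
    """Return a (shortest, longest) processing-time estimate in minutes."""
    if video_minutes < 0:
        raise ValueError("video_minutes must not be negative")
    # Apply the rule-of-thumb multipliers to the video length.
    return video_minutes * low_factor, video_minutes * high_factor


if __name__ == "__main__":
    low, high = estimate_dub_minutes(10)
    print(f"Expect roughly {low:.0f} to {high:.0f} minutes")
```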

Cost

Azure charges per minute of video dubbed. Check Microsoft's Video Translation pricing page for current rates. Speech Studio shows the estimated cost before you click Start Dubbing.
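
Per-minute billing is simple to approximate by hand. The sketch below is a minimal illustration, not how Speech Studio computes its estimate: the function name estimate_dub_cost is hypothetical, the rate in the example is a made-up placeholder rather than a real Azure price, and it assumes partial minutes are billed pro rata, which the pricing page may contradict.

```python
from decimal import Decimal, ROUND_HALF_UP


def estimate_dub_cost(video_seconds: int, rate_per_minute: Decimal) -> Decimal:
    """Estimate the charge for a video, rounded to two decimal places."""
    if video_seconds < 0:
        raise ValueError("video_seconds must not be negative")
    # Assumption: partial minutes are charged proportionally.
    minutes = Decimal(video_seconds) / Decimal(60)
    return (minutes * rate_per_minute).quantize(
        Decimal("0.01"), rounding=ROUND_HALF_UP
    )


if __name__ == "__main__":
    # Placeholder rate for illustration only; look up the current rate.
    placeholder_rate = Decimal("1.50")
    print(estimate_dub_cost(600, placeholder_rate))
```

Decimal is used instead of float so that currency amounts round predictably.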

Tips

  • Clean source audio = better translation. Background music or noise degrades accuracy.
  • Short videos (<10 min) cost less and finish sooner, so you can iterate on the result.
  • Quality varies by language pair: English → major European languages works best, while pairs of less widely spoken languages are weaker.