August 13th, 2024

Open-source tool translates and dubs videos into other languages using AI

pyvideotrans is a GitHub tool for video translation and dubbing, supporting multiple languages. It offers features like subtitle generation, audio extraction, and batch processing, with installation guides for various operating systems.

Read original articleLink Icon
FrustrationCuriositySkepticism
Open-source tool translates and dubs videos into other languages using AI

The GitHub repository for **pyvideotrans** is a tool designed for video translation and dubbing, enabling users to translate videos into different languages while automatically generating subtitles and voiceovers. It supports a wide range of languages, including Chinese, English, Korean, Japanese, Russian, and many others. Key functionalities include translating audio from videos, converting audio to subtitles, creating voiceovers from SRT files, and merging audio, video, and subtitles into a single file. The tool also allows for batch processing of subtitles and the extraction of audio from video files. Users can download videos from YouTube as well. Installation options include a pre-packaged version for Windows and source deployment instructions for MacOS, Linux, and Windows. Additional resources such as documentation, voice recognition models, and video tutorials are available to assist users. The repository also connects to related projects focused on voice synthesis and offline translation.

- pyvideotrans translates videos and generates subtitles and voiceovers in multiple languages.

- It supports batch processing for subtitle creation and translation.

- Users can extract audio from videos and merge audio, video, and subtitles.

- Installation is available for various operating systems with detailed guides.

- The repository includes links to related projects for enhanced functionalities.

AI: What people are saying
The comments on the article about pyvideotrans reflect a range of opinions and concerns regarding AI-driven video translation and dubbing.
  • Some users express skepticism about the effectiveness and ethical implications of AI dubbing, particularly regarding voice cloning and job displacement.
  • There is interest in the potential for AI to enhance accessibility, especially for live TV and news broadcasts.
  • Several commenters inquire about the quality of translations and the technology used, including the need for open-source solutions.
  • Concerns are raised about the nuances lost in translation and the potential for misinformation in political contexts.
  • Users share alternative tools and services they have encountered, indicating a competitive landscape in video translation technology.
Link Icon 20 comments
By @netsharc - 8 months
Huh, if it's good enough, movies/TV shows dubbed with AI-clone of the original voice would be great (if we can ignore the ethics of using the actor's voice and the loss of work for the dubbing companies and actors).

For example here's how weird Friends is in German: https://www.youtube.com/watch?v=nCoNSZV--z0 . Or Italian: https://www.youtube.com/watch?v=wO5qTzvyQ1s

Can AI detect the emotional tone of sentences yet, and recreate it in the target language?

By @cubbic - 8 months
Oh I made something similar but for Netflix!

https://github.com/cubbK/dubbing_ai_netflix_client

I want to learn swedish and because there are so few dubbed movies in Swedish I take the subtitles(Netflix is good at having subtitles in different languages) and text-to-speech it :)

By @vouaobrasil - 8 months
Unlike other methods of automation, AI is replacing human beings too fast. And before you say, "new jobs will be created" -- look at history. After the computer, new jobs have been created, but what kind of jobs? Every year, we are becoming more entwined in wage slavery as the wealth accumulates at the top and jobs become more meaningless.

So, no, new jobs will not be created, except the kind of jobs that crush the human spirit into oblivion so that the rich tech oligarchs can play God.

By @skummetmaelk - 8 months
Localization and dubbing is a sad endeavour. By trying to accomodate everyone's individual preference for information transmission we accomplish nothing more than reducing our ability to understand each other in the long run.

Having a Babelfish is all well and good. Until it stops working, and you realise no one can understand each other any more.

Ironically localization is often pushed by well meaning Americans who only speak one language. "Oh, you're in a French speaking region. You MUST want French language. Let me force it down your throat while I prance around virtue signalling about how inclusive we are"

By @LeoPanthera - 8 months
For years I've wanted this for live TV. Even just subtitles would be amazing. I've always wanted to be able to watch news TV from other countries.
By @codedokode - 8 months
As I understand, it first extracts text from original video into subtitles, translates them using external LLM, and then converts text to speech. All of this is done using thrid-party solutions, and the project seems to be just a GUI app that allows to integrate them.

You obviously cannot use this to translate songs or movies because this method loses important information like voice, intonation, etc.

So it is still better to use subtitles.

By @mrtksn - 8 months
Back in high school, when I got my first PC a plumber came over to fix some stuff and when he saw the computer he got excited and asked some questions and one of the questions was “how do you translate the VCD with this, I have a movie to watch but hate subtitles”.

I was like “silly dude doesn’t know how computers work” but maybe I was the silly one who can’t dare to imagine how something like that can work.

By @gagabity - 8 months
Cool what languages can it do?

Yandex browser does the most impressive version of this and for free but only to Russian I believe, its quite amazing it does appropriate different voices and follows the correct intonation for everyone, just takes a few seconds for a YT video.

By @nsonha - 8 months
This could be useful in combating fake news. In many videos especially in political news, foreign languages are dub over with sometimes nuanced translation that can skew audiences to (mis)understand the content in certain ways.
By @cyberax - 8 months
I would pay a lot for a tool that removes the freaking laugh track from videos.

I just physically can't watch them. I wanted to watch the Blackadder series, but I couldn't even get through one episode.

By @lossolo - 8 months
Based on english doc it seems it's not dubbing but voice over.
By @alphabetatheta - 8 months
Some of my friends have been using: https://app.vivalabs.ai/ as a more managed/paid service recently
By @underdeserver - 8 months
https://deepdub.ai/ have been doing this commercially for a while.
By @hulitu - 8 months
> Open-source tool translates and dubs videos into other languages using AI

Is there any assessment about how good the translation is ?

By @CyberDildonics - 8 months
Is this using an open source text to speech model or is it going out to some other internet service?
By @paulkon - 8 months
Is there an open source speech-to-speech model which retains intonation, cadence and delivery?
By @randomgiy3142 - 8 months
Because translations are copyrighted so it is complex to get legal rights for them.
By @ewuhic - 8 months
This one does dubbing, but is there an equivalent tool for subs?
By @cyanydeez - 8 months
So uh, what's with AI products throwing out the gold standard in testing these claims.
By @ranger_danger - 8 months
An English README would have been nice