August 13th, 2024

Open-source tool translates and dubs videos into other languages using AI

pyvideotrans is a GitHub tool for video translation and dubbing, supporting multiple languages. It offers features like subtitle generation, audio extraction, and batch processing, with installation guides for various operating systems.

Read original article

FrustrationCuriositySkepticism

Open-source tool translates and dubs videos into other languages using AI

The GitHub repository for **pyvideotrans** is a tool designed for video translation and dubbing, enabling users to translate videos into different languages while automatically generating subtitles and voiceovers. It supports a wide range of languages, including Chinese, English, Korean, Japanese, Russian, and many others. Key functionalities include translating audio from videos, converting audio to subtitles, creating voiceovers from SRT files, and merging audio, video, and subtitles into a single file. The tool also allows for batch processing of subtitles and the extraction of audio from video files. Users can download videos from YouTube as well. Installation options include a pre-packaged version for Windows and source deployment instructions for MacOS, Linux, and Windows. Additional resources such as documentation, voice recognition models, and video tutorials are available to assist users. The repository also connects to related projects focused on voice synthesis and offline translation.

- pyvideotrans translates videos and generates subtitles and voiceovers in multiple languages.

- It supports batch processing for subtitle creation and translation.

- Users can extract audio from videos and merge audio, video, and subtitles.

- Installation is available for various operating systems with detailed guides.

- The repository includes links to related projects for enhanced functionalities.

Show HN: A simple app for downloading YouTube transcripts in bulk

The GitHub repository offers a tool to download YouTube Shorts transcripts via a Streamlit app. It includes a guide notebook and instructions for local app usage on HuggingFace Space.

Transcribro: On-device Accurate Speech-to-text

The GitHub repository for "Transcribro" offers project details, downloads, community links, contribution guidelines, donations, branding guidelines, and keyboard UI screenshots. Contact for project-specific support or inquiries.

Audapolis: Edit audio files by word, not waveform

The Audapolis project on GitHub offers a tailored editor for spoken-word media with audio-to-text transcription. It supports various media types, works on Windows, Linux, and macOS, and stores data locally. Funding sources include governmental and foundation support.

Show HN: An app for downloading YouTube videos, transcripts, and thumbnails

ytdatakit is a versatile application for downloading YouTube videos, transcripts, thumbnails, and channel data, executable via Python or Docker, designed for efficient multimedia data retrieval.

OTranscribe: A free and open tool for transcribing audio interviews

oTranscribe is a free web application for transcribing audio recordings, allowing seamless navigation, automatic saving, and local file storage. It supports multiple export formats and is open source under the MIT license.

AI: What people are saying

The comments on the article about pyvideotrans reflect a range of opinions and concerns regarding AI-driven video translation and dubbing.

Some users express skepticism about the effectiveness and ethical implications of AI dubbing, particularly regarding voice cloning and job displacement.
There is interest in the potential for AI to enhance accessibility, especially for live TV and news broadcasts.
Several commenters inquire about the quality of translations and the technology used, including the need for open-source solutions.
Concerns are raised about the nuances lost in translation and the potential for misinformation in political contexts.
Users share alternative tools and services they have encountered, indicating a competitive landscape in video translation technology.

20 comments

By @netsharc - 8 months

Huh, if it's good enough, movies/TV shows dubbed with AI-clone of the original voice would be great (if we can ignore the ethics of using the actor's voice and the loss of work for the dubbing companies and actors).

For example here's how weird Friends is in German: https://www.youtube.com/watch?v=nCoNSZV--z0 . Or Italian: https://www.youtube.com/watch?v=wO5qTzvyQ1s

Can AI detect the emotional tone of sentences yet, and recreate it in the target language?

By @cubbic - 8 months

Oh I made something similar but for Netflix!

https://github.com/cubbK/dubbing_ai_netflix_client

I want to learn swedish and because there are so few dubbed movies in Swedish I take the subtitles(Netflix is good at having subtitles in different languages) and text-to-speech it :)

By @vouaobrasil - 8 months

Unlike other methods of automation, AI is replacing human beings too fast. And before you say, "new jobs will be created" -- look at history. After the computer, new jobs have been created, but what kind of jobs? Every year, we are becoming more entwined in wage slavery as the wealth accumulates at the top and jobs become more meaningless.

So, no, new jobs will not be created, except the kind of jobs that crush the human spirit into oblivion so that the rich tech oligarchs can play God.

By @skummetmaelk - 8 months

Localization and dubbing is a sad endeavour. By trying to accomodate everyone's individual preference for information transmission we accomplish nothing more than reducing our ability to understand each other in the long run.

Having a Babelfish is all well and good. Until it stops working, and you realise no one can understand each other any more.

Ironically localization is often pushed by well meaning Americans who only speak one language. "Oh, you're in a French speaking region. You MUST want French language. Let me force it down your throat while I prance around virtue signalling about how inclusive we are"

By @LeoPanthera - 8 months

For years I've wanted this for live TV. Even just subtitles would be amazing. I've always wanted to be able to watch news TV from other countries.

By @codedokode - 8 months

As I understand, it first extracts text from original video into subtitles, translates them using external LLM, and then converts text to speech. All of this is done using thrid-party solutions, and the project seems to be just a GUI app that allows to integrate them.

You obviously cannot use this to translate songs or movies because this method loses important information like voice, intonation, etc.

So it is still better to use subtitles.

By @mrtksn - 8 months

Back in high school, when I got my first PC a plumber came over to fix some stuff and when he saw the computer he got excited and asked some questions and one of the questions was “how do you translate the VCD with this, I have a movie to watch but hate subtitles”.

I was like “silly dude doesn’t know how computers work” but maybe I was the silly one who can’t dare to imagine how something like that can work.

By @gagabity - 8 months

Cool what languages can it do?

Yandex browser does the most impressive version of this and for free but only to Russian I believe, its quite amazing it does appropriate different voices and follows the correct intonation for everyone, just takes a few seconds for a YT video.

By @nsonha - 8 months

This could be useful in combating fake news. In many videos especially in political news, foreign languages are dub over with sometimes nuanced translation that can skew audiences to (mis)understand the content in certain ways.

By @cyberax - 8 months

I would pay a lot for a tool that removes the freaking laugh track from videos.

I just physically can't watch them. I wanted to watch the Blackadder series, but I couldn't even get through one episode.

By @lossolo - 8 months

Based on english doc it seems it's not dubbing but voice over.

By @alphabetatheta - 8 months

Some of my friends have been using: https://app.vivalabs.ai/ as a more managed/paid service recently

By @underdeserver - 8 months

https://deepdub.ai/ have been doing this commercially for a while.

By @hulitu - 8 months

> Open-source tool translates and dubs videos into other languages using AI

Is there any assessment about how good the translation is ?

By @CyberDildonics - 8 months

Is this using an open source text to speech model or is it going out to some other internet service?

By @paulkon - 8 months

Is there an open source speech-to-speech model which retains intonation, cadence and delivery?

By @randomgiy3142 - 8 months

Because translations are copyrighted so it is complex to get legal rights for them.

By @ewuhic - 8 months

This one does dubbing, but is there an equivalent tool for subs?

By @cyanydeez - 8 months

So uh, what's with AI products throwing out the gold standard in testing these claims.

By @ranger_danger - 8 months

An English README would have been nice

Show HN: A simple app for downloading YouTube transcripts in bulk

The GitHub repository offers a tool to download YouTube Shorts transcripts via a Streamlit app. It includes a guide notebook and instructions for local app usage on HuggingFace Space.

Transcribro: On-device Accurate Speech-to-text

Audapolis: Edit audio files by word, not waveform

Show HN: An app for downloading YouTube videos, transcripts, and thumbnails

ytdatakit is a versatile application for downloading YouTube videos, transcripts, thumbnails, and channel data, executable via Python or Docker, designed for efficient multimedia data retrieval.

Open-source tool translates and dubs videos into other languages using AI

Related

Show HN: A simple app for downloading YouTube transcripts in bulk

Transcribro: On-device Accurate Speech-to-text

Audapolis: Edit audio files by word, not waveform

Show HN: An app for downloading YouTube videos, transcripts, and thumbnails

OTranscribe: A free and open tool for transcribing audio interviews

Related

Show HN: A simple app for downloading YouTube transcripts in bulk

Transcribro: On-device Accurate Speech-to-text

Audapolis: Edit audio files by word, not waveform

Show HN: An app for downloading YouTube videos, transcripts, and thumbnails

OTranscribe: A free and open tool for transcribing audio interviews