Logo Logo
deutsch
directory schraeg
Knowledge
Codecs
Hardware
Camcorders· Cinema-Kamera· Computers· GPU· Video-DSLR· accessories
Software
Compositing· Color correction· DV Editing
DV-Movies

HowTo
Shooting· Sound
Misc
Reviews
/// News
AI synchronizes lip movements with audio in real time

AI synchronizes lip movements with audio in real time

[14:44 wed,23.September 2020   by ]    

The new DeepLearning Algorithm "Wav2Lip" of an Indian research team can match the lip movements of a speaker to the words from any audio recording. It beautifully demonstrates the continuous progress that machine learning technology is making, as the new method delivers significantly better results than older projects. Not only does it work in real time, but - and this is the real progress - it is also more universal, because it can handle any face, any language and any voice.

The usefulness of such an algorithm for working with video is obvious - as already shown in the demo video, it can be used to adapt the lip movement of a speaking person in a video to a synchronous version created in another language in order to eliminate the asynchronicity of mouth movements and words, which is otherwise disturbing for many viewers. This is practical for post-synchronized film versions as well as for lip-syncing lectures, press conferences or figures from animated films into other languages.



And last but not least, this technology could also help in principle to make it easier to use the voices in the post by overdub instead of the original sound in scenic productions. Even minor speech errors (which would otherwise render a scene unusable) could be easily corrected by briefly "tracking" the lips automatically.

Using deep learning algorithms, it would also be conceivable to automatically offer different language versions of any clips, for example on YouTube. YouTube already provides an automatic transcription, and the next steps are already possible using algorithms: the translation of the transcribed text into another language, speech synthesis with the voice of the original voice, and then lip-syncing the video with the new audio.

Of course, the technology can also be misused to generate clips in which people seem to be saying things they never said - the new audio can also be generated via neural network to mimic the real voice.



How good the Wav2Lip algorithm is, anyone can try it out for themselves on the project&s demo website and upload a short (maximum 20 seconds) video clip of a person speaking plus a speech audio clip to get an output of the newly lip-synced clip. For those who want to try more, please visit GitHub to find the appropriate program code. (Thanks to our forum member Ruessel for the news)


Bild zur Newsmeldung:
Wav2Lip-Schema

Link more infos at bei bhaasha.iiit.ac.in

deutsche Version dieser Seite: KI synchronisiert Lippenbewegungen mit Audio in Echtzeit

  

  Vorige News lesen Nächste News lesen 
bildZwei neue Flexscan Monitore mit USB-C von Eizo: EV2795 und EV2495 bildCanon Cinema EOS C70 - S35 RF-Mount mit optionalem EF-Speedbooster


related news:1E0Adobe CC Update: AI audio transcription and new GPU acceleration for Adobe Premiere Pro 20.October 2020
Apple iPhone 12 Pro with 5G, Dolby Vision video recording and LiDAR 13.October 2020
Sennheiser MKE 200: New compact mini microphone for DSLRs and mirrorless cameras 1.September 2020
miniDSP ambiMIK-1: Ambisonic USB microphone for 3D 360° audio 16.August 2020
Zoom-PodTrak-Podcasting-Recorder P4 and ZDM-1 Podcast Mic Pack 14.August 2020
New: Magix Vegas Pro 18 with Sound Forge Pro integration -- cloud tools for teams to follow 4.August 2020
Zoom H8 Handy Recorder offers up to 10 XLR microphone-inputs 3.July 2020
alle Newsmeldungen zum Thema Sound


[nach oben]
















Archiv Newsmeldungen

2020

October - September - August - July - June - May - April - March - February - January

2019
December - November - October - September - August - July - June - May - April - March - February - January

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000






































deutsche Version dieser Seite: KI synchronisiert Lippenbewegungen mit Audio in Echtzeit



last update : 28.Oktober 2020 - 20:57 - slashCAM is a project by channelunit GmbH- mail : slashcam@--antispam:7465--slashcam.de - deutsche Version