Microsoft VASA-1 generates realistic video portraits from an audio file

[10:50 Thu, 18 April 2024, by Rudi Schmidts]

A research group at Microsoft has unveiled a new AI framework called VASA-1 that generates lifelike talking faces with strikingly convincing visual expressiveness. The framework requires only a single static image and a speech audio clip as input.

Unlike other models, VASA-1 goes beyond simple lip movements and generates a wide range of facial nuances and natural head movements. Under the hood, VASA-1 uses a holistic model for generating facial dynamics and head movements, based on an explicitly trained latent space for faces.



Microsoft VASA-1 generates animated, realistic video portraits from an audio file



The generated videos show a new level of realism in facial and head movements and can be generated online at a resolution of 512x512 pixels and up to 40 frames per second, with extremely low startup latency. The options for controlling gaze direction, framing and emotions (!!) are also remarkable.

The researchers stress that all generated portrait images are virtual and do not depict real people. They also state that they are aware of their responsibility in using AI and want to highlight the positive potential of their technology for education, accessibility and therapeutic support. They also clearly point to its considerable potential for future interactive, lifelike avatars.

The following demonstration shows how VASA-1 could theoretically even be used in real-time video conferencing:



To prevent misuse, however, there are currently no plans to release demos, APIs or products until it is certain that the technology can be used responsibly and in compliance with regulations.

In fact, in many of the examples shown, only a very close look reveals that these are artificially generated avatars and not real people.



More information at www.microsoft.com

Last update: 26 July 2024, 18:02. slashCAM is a project by channelunit GmbH.