Microsoft’s VASA-1 generates video from a photo and audio

February 5, 2025 / May 3, 2024 by SCR

Microsoft’s VASA-1 can make human portraits sing and talk. It only needs a still image and an audio file with speech to generate moving lips, matching facial expressions and head movements. Microsoft emphasizes that this is a research demonstration only, with no plans to bring it to market.

About the author

Articles with the author name SCR are created with the help of AI. All topics are manually picked by Jan Tissler. Each article is checked and edited by him before publication. He takes full editorial responsibility. Read more about how this website is made and which prompts are used.

Tags: Microsoft, Research, Video
