Eerily realistic: Microsoft’s new AI model makes images talk, sing

  • 📰 IntEngineering


VASA is a framework for generating lifelike talking faces with appealing visual affective skills.

VASA-1 is a new AI model that converts an image of a person’s face and an audio clip into a video with proper lip-syncing, facial expressions, and head movements. It was developed by a team of AI researchers at Microsoft Research Asia.

VASA, short for Visual Affective Skills Animator, can transform any static image, whether photographed, painted, or drawn, into “exquisitely synchronized” animations. The team used the publicly available VoxCeleb2 dataset, which contains video clips of over 6,000 real-life celebrities. After discarding low-quality clips and clips with multiple individuals, the team trained the model on the processed dataset. The model offers control over gaze, distance, and emotions in the generated video.

“We are exploring visual affective skill generation for virtual, interactive characters, NOT impersonating any person in the real world,” they wrote. The research team maintains that the model will be used for education and to provide companionship. They have also refused to release the code that powers the model.

Source: Tech Daily Report (techdailyreport.net)

 

