Nvidia-backed AI startup unveiled its next generation of AI avatars, which can convey human emotions like joy, sorrow, and enthusiasm and comprehend context from text inputs.
The London-based company Synthesia claimed that a model trained “to understand the intricate relationship between what we say and how we say it” powers its fourth generation of “Expressive Avatars.” According to the business, the avatars can lip sync, use appropriate body language, and follow scripts just like actual actors.
Synthesia stated in a blog post, “With these new avatars, we’re not just creating digital renders; we’re introducing digital actors.” “This technology blurs the boundary between the virtual and the real by giving digital avatars a level of sophistication and realism.” Synthesia’s EXPRESS-1 model powers the avatars; it is trained to anticipate facial expressions and movements (such blinking) to match the avatar’s tone, resulting in interactions that are more like those of a human.
A group of AI experts and businesspeople launched the company in 2017, and its main focus is on producing videos without the need for studios, cameras, or microphones. Businesses have utilized Synthesia’s avatars to create training films and presentations, the company said.
According to Synthesia, some material categories would be blocked to prevent exploitation by dishonest parties, particularly in light of the impending US presidential election. Synthesia made note of how AI has been used to disseminate false information in another blog post, as well as how its avatars in particular may be used to mimic the voice and appearance of a real person.
According to the company, more than 200,000 users have created more than 18 million video presentations in more than 130 languages using its 225 available avatars. Additionally, Synthesia provides services to more than 55,000 businesses worldwide, including half of the Fortune 100.
Chip giant Nvidia participated in a $90 million Series C investment round that Synthesia disclosed in June of last year. Venture capital firm Accel led the round, which helped the startup reach a $1 billion value. Accel had previously invested in Facebook and Spotify.
“Productivity can be improved because you are reducing the cost of producing the video to that of making a PowerPoint,” Philippe Botteri, an investor at Accel, told CNBC in June. “Video is a much better way to communicate knowledge. When we think about the potential of the company and the valuation, we think about what it can return, [and] in the case of Synthesia, we’re just scratching the surface.”