博彩评级网-博彩网_百家乐投资_全讯网新2

position: EnglishChannel  > News> Upload a Photo, Get a Video

Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI LInxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar, a multimodal diffusion transformer-based model, is capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. This capability supports head-and-shoulder, half-body, and full-body views, encompassing multiple styles, species, and even dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music,  the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the app QQ Music, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image synchronizes its singing in real-time on the player.

On WeSing, a popular karaoke singing app, users can upload their images to generate personalized MVs of themselves singing.

In subject consistency and audio-video synchronization, the HunyuanVideo-Avatar shows top-tier industry performance. For video dynamics and natural body movements, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor:李林旭

Top News

Energy Cooperation Gets New Direction

?Chinese President Xi Jinping sent a congratulatory message to the 7th China-Russia Energy Business Forum in Beijing on November 25, sparking enthusiastic responses from various sectors in both countries.

WEEKLY REVIEW (Dec.3-10)

Liang Wenfeng, founder and CEO of the Chinese AI firm DeepSeek, and "deep diver" Chinese geoscientist Du Mengran are on the annual "Nature's 10" list, which highlights 10 people at the heart of some of the biggest science stories of 2025.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽
百家乐手论坛48491| 娱乐城送注册金| 圣安娜百家乐代理| 南通棋牌游戏中心下载| 百家乐官网蓝盾在线现| 大发888体育场| 百家乐官网送钱平台| 岫岩| 澳门百家乐玩法心得技巧| 百家乐官网娱乐城有几家| 线上百家乐的玩法技巧和规则| 百家乐官网轮盘桌| 娱乐城注册送钱| 赌博百家乐的乐趣| 电脑打百家乐官网怎么赢| 曼哈顿娱乐场| 百博百家乐的玩法技巧和规则| 个人百家乐官网策略| 皇冠开户投注网| 蓝盾百家乐庄家利润分| 百家乐官网与21点| 一二博网| 百家乐游戏筹码| 金宝博百家乐现金| 百家乐官网技巧头头娱乐| 皇城国际| 百家乐官网怎么打啊| 威尼斯人娱乐城送彩金| 破解百家乐官网真人游戏| 大竹县| 大发888 没人举报吗| 百家乐平台出租家乐平台出租| 菲律宾百家乐官网排行| 永利百家乐官网游戏| 棋牌乐| 大发888网页游戏平台| 网络百家乐真人游戏| 棋牌百家乐怎么玩| 反赌百家乐官网的玩法技巧和规则| 百家乐官网赌场作弊| 云博备用网址|