Dec. 6, 2023, 9 a.m. | Sana Hassan

MarkTechPost www.marktechpost.com

Researchers at Tencent AI Lab and The University of Sydney have addressed video understanding and generation scenarios by presenting GPT4Video. This unified multimodal framework equips LLMs with the capability for both video understanding and generation. GPT4Video uses an instruction-following-based approach integrated with the Stable Diffusion generative model, which effectively and […]


The post Tencent AI Lab Introduces GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation appeared first on MarkTechPost.

