all AI news
Video as the New Language for Real-World Decision Making
Feb. 28, 2024, 5:46 a.m. | Sherry Yang, Jacob Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, Andre Barreto, Pieter Abbeel, Dale Schuurmans
cs.CV updates on arXiv.org arxiv.org
Abstract: Both text and video data are abundant on the internet and support large-scale self-supervised learning through next token or frame prediction. However, they have not been equally leveraged: language models have had significant real-world impact, whereas video generation has remained largely limited to media entertainment. Yet video data captures important information about the physical world that is difficult to express in language. To address this gap, we discuss an under-appreciated opportunity to extend video generation …
abstract arxiv cs.ai cs.cv data decision decision making entertainment impact internet language language models making media next prediction scale self-supervised learning supervised learning support text through token type video video data video generation world
More from arxiv.org / cs.CV updates on arXiv.org
TransRUPNet for Improved Polyp Segmentation
54 minutes ago |
arxiv.org
Learning to Complement with Multiple Humans
54 minutes ago |
arxiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Sr. VBI Developer II
@ Atos | Texas, US, 75093
Wealth Management - Data Analytics Intern/Co-op Fall 2024
@ Scotiabank | Toronto, ON, CA