[P] Video-to-Text model descriptive style (not subtitles) | allainews.com

Aug. 1, 2023, 1:02 a.m. | /u/Yip37

Machine Learning www.reddit.com

I was wondering if there's already something like CLIP (the model that looks at an image and describes it), but for videos. So you show a video of, say, a dog jumping and grabbing a tennis ball and it outputs "dog grabbing a tennis ball", something like that.

My first thought was object detection, and input that interaction of the objects (tennis ball, dog) to the model with the target being "dog grabbing tennis ball". My ultimate goal being real-time …

clip image machinelearning show something subtitles tennis text thought video videos

More from www.reddit.com / Machine Learning

[D] The "it" in AI models is really just the dataset? 2 hours ago | www.reddit.com

ai models dataset machinelearning

[P] Open Source / Projects Based Machine Learning Community? 7 hours ago | www.reddit.com

building collaborations community devs +16

[R] DDPM for Timeseries Generation 9 hours ago | www.reddit.com

column data data generation dataset +13

[P] [D] Examples of client projects that you have delivered 10 hours ago | www.reddit.com

client consulting examples freelance +6

[D] is any traditional industry employee here can share if they are using gen ai … 10 hours ago | www.reddit.com

ai at work banking employee enterprises +6

[N] AI engineers report burnout and rushed rollouts as ‘rat race’ to stay competitive hits … 20 hours ago | www.reddit.com

ai tools article artificial artificial intelligence +17

[D] software to design figures 22 hours ago | www.reddit.com

algorithms alphatensor alphazero create +11

[D] How to train a text detection model that will detect it's orientation (rotation) ranging … 22 hours ago | www.reddit.com

case convention detection image +6

[R] HGRN2: Gated Linear RNNs with State Expansion 1 day, 3 hours ago | www.reddit.com

abstract attention expansion however +15

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net