Jan. 15, 2022, 1:57 a.m. | /u/xiikjuy

Deep Learning www.reddit.com

For CV tasks like pose estimation or object/face detection, the demo can be straightforward: point a camera at the speaker or audience and project the real-time results on a monitor. I wonder how demos are done for a video captioning or VQA model. Do you show results on some pre-downloaded videos, or run it interactively? I think the latter might be risky. Thanks.


advice captioning deeplearning video
