June 11, 2024, 6:10 p.m. | Rachel Gordon | MIT CSAIL

MIT News - Machine learning | news.mit.edu

DenseAV, developed at MIT, learns to parse and understand the meaning of language just by watching videos of people talking, with potential applications in multimedia search, language learning, and robotics.

