Web: http://arxiv.org/abs/2205.05072

May 11, 2022, 1:10 a.m. | Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao

cs.CV updates on arXiv.org

From the patter of rain to the crunch of snow, the sounds we hear often
convey the visual textures that appear within a scene. In this paper, we
present a method for learning visual styles from unlabeled audio-visual data.
Our model learns to manipulate the texture of a scene to match a sound, a
problem we term audio-driven image stylization. Given a dataset of paired
audio-visual data, we learn to modify input images such that, after
manipulation, they are more …

