Web: http://arxiv.org/abs/2209.07474

Sept. 16, 2022, 1:12 a.m. | Farrukh Rahman, Ömer Mubarek, Zsolt Kira

cs.LG updates on arXiv.org arxiv.org

Recently vision transformers have been shown to be competitive with
convolution-based methods (CNNs) broadly across multiple vision tasks. The less
restrictive inductive bias of transformers endows greater representational
capacity in comparison with CNNs. However, in the image classification setting
this flexibility comes with a trade-off with respect to sample efficiency,
where transformers require ImageNet-scale training. This notion has carried
over to video where transformers have not yet been explored for video
classification in the low-labeled or semi-supervised settings. Our work …

arxiv transformers video

More from arxiv.org / cs.LG updates on arXiv.org

Postdoctoral Fellow: ML for autonomous materials discovery

@ Lawrence Berkeley National Lab | Berkeley, CA

Research Scientists

@ ODU Research Foundation | Norfolk, Virginia

Embedded Systems Engineer (Robotics)

@ Neo Cybernetica | Bedford, New Hampshire

2023 Luis J. Alvarez and Admiral Grace M. Hopper Postdoc Fellowship in Computing Sciences

@ Lawrence Berkeley National Lab | San Francisco, CA

Senior Manager Data Scientist

@ NAV | Remote, US

Senior AI Research Scientist

@ Earth Species Project | Remote anywhere

Research Fellow- Center for Security and Emerging Technology (Multiple Opportunities)

@ University of California Davis | Washington, DC

Staff Fellow - Data Scientist

@ U.S. FDA/Center for Devices and Radiological Health | Silver Spring, Maryland

Staff Fellow - Senior Data Engineer

@ U.S. FDA/Center for Devices and Radiological Health | Silver Spring, Maryland

Research Engineer - VFX, Neural Compositing

@ Flawless | Los Angeles, California, United States

[Job-TB] Senior Data Engineer

@ CI&T | Brazil

Data Analytics Engineer

@ The Fork | Paris, France