all AI news
Dual Swin-Transformer based Mutual Interactive Network for RGB-D Salient Object Detection. (arXiv:2206.03105v1 [cs.CV])
June 8, 2022, 1:12 a.m. | Chao Zeng, Sam Kwong
cs.CV updates on arXiv.org arxiv.org
Salient Object Detection is the task of predicting the human attended region
in a given scene. Fusing depth information has been proven effective in this
task. The main challenge of this problem is how to aggregate the complementary
information from RGB modality and depth modality. However, conventional deep
models heavily rely on CNN feature extractors, and the long-range contextual
dependencies are usually ignored. In this work, we propose Dual
Swin-Transformer based Mutual Interactive Network. We adopt Swin-Transformer as
the feature …
More from arxiv.org / cs.CV updates on arXiv.org
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
1 day, 13 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Global Data Architect, AVP - State Street Global Advisors
@ State Street | Boston, Massachusetts
Data Engineer
@ NTT DATA | Pune, MH, IN