Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals. (arXiv:1901.02662v3 [cs.CV] UPDATED) | allainews.com

Jan. 6, 2022, 2:10 a.m. | Lu Jin, Zechao Li, Jinhui Tang

cs.CV updates on arXiv.org arxiv.org

Hashing has been widely applied to multimodal retrieval on large-scale
multimedia data due to its efficiency in computation and storage. In this
article, we propose a novel deep semantic multimodal hashing network (DSMHN)
for scalable image-text and video-text retrieval. The proposed deep hashing
framework leverages 2-D convolutional neural networks (CNN) as the backbone
network to capture the spatial information for image-text retrieval, while the
3-D CNN as the backbone network to capture the spatial and temporal information
for video-text retrieval. …

arxiv cv multimodal network semantic text video

More from arxiv.org / cs.CV updates on arXiv.org

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM 17 hours ago | arxiv.org

arxiv benchmark cs.cv eess.iv +5

SE(3)-Equivariant and Noise-Invariant 3D Rigid Motion Tracking in Brain MRI 17 hours ago | arxiv.org

arxiv brain cs.cv eess.iv +4

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation 17 hours ago | arxiv.org

arxiv box creative cs.ai +10

Spiking Structured State Space Model for Monaural Speech Enhancement 17 hours ago | arxiv.org

abstract arxiv challenges computational +17

Improved cryo-EM Pose Estimation and 3D Classification through Latent-Space Disentanglement 17 hours ago | arxiv.org

abstract arxiv challenges classification +18

Multilevel Geometric Optimization for Regularised Constrained Linear Inverse Problems 17 hours ago | arxiv.org

abstract arxiv box compute +7

Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models 17 hours ago | arxiv.org

abstract arxiv capability consistent +18

A Concise but High-performing Network for Image Guided Depth Completion in Autonomous Driving 17 hours ago | arxiv.org

arxiv autonomous autonomous driving cs.cv +4

Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces 17 hours ago | arxiv.org

abstract arxiv cs.cr cs.cv +10

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Lead Data Engineer

@ JPMorgan Chase & Co. | Jersey City, NJ, United States

View on ai-jobs.net

Senior Machine Learning Engineer

@ TELUS | Vancouver, BC, CA

View on ai-jobs.net

CT Technologist - Ambulatory Imaging - PRN

@ Duke University | Morriville, NC, US, 27560

View on ai-jobs.net

BH Data Analyst

@ City of Philadelphia | Philadelphia, PA, United States

View on ai-jobs.net