UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All | allainews.com

March 20, 2024, 4:45 a.m. | Yuanhuiyi Lyu, Xu Zheng, Jiazhou Zhou, Lin Wang

cs.CV updates on arXiv.org arxiv.org

arXiv:2403.12532v1 Announce Type: new
Abstract: We present UniBind, a flexible and efficient approach that learns a unified representation space for seven diverse modalities -- images, text, audio, point cloud, thermal, video, and event data. Existing works, eg., ImageBind, treat the image as the central modality and build an image-centered representation space; however, the space may be sub-optimal as it leads to an unbalanced representation space among all modalities. Moreover, the category names are directly used to extract text embeddings for …

abstract arxiv audio build cloud cs.cv data diverse event image imagebind images llm representation space text them type video

More from arxiv.org / cs.CV updates on arXiv.org

One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts 21 hours ago | arxiv.org

abstract arxiv building construction +18

Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation 21 hours ago | arxiv.org

abstract applications arxiv automation +15

Morphing Tokens Draw Strong Masked Image Models 21 hours ago | arxiv.org

arxiv cs.cv image tokens +1

Compact 3D Scene Representation via Self-Organizing Gaussian Grids 21 hours ago | arxiv.org

arxiv compact cs.cv representation +2

Fingerprint Matching with Localized Deep Representation 21 hours ago | arxiv.org

abstract accuracy acquisition arxiv +8

A Survey on Transferability of Adversarial Examples across Deep Neural Networks 21 hours ago | arxiv.org

abstract adversarial adversarial examples arxiv +27

Content Bias in Deep Learning Image Age Approximation: A new Approach Towards better Explainability 21 hours ago | arxiv.org

abstract age approximation arxiv +15

Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling 21 hours ago | arxiv.org

arxiv assessment consistent continual +6

DA-RAW: Domain Adaptive Object Detection for Real-World Adverse Weather Conditions 21 hours ago | arxiv.org

abstract arxiv cs.cv cs.ro +17

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Lead Data Modeler

@ Sherwin-Williams | Cleveland, OH, United States

View on ai-jobs.net