Probing Cross-modal Semantics Alignment Capability from the Textual Perspective. (arXiv:2210.09550v1 [cs.CL]) | allainews.com

Oct. 19, 2022, 1:17 a.m. | Zheng Ma, Shi Zong, Mianzhi Pan, Jianbing Zhang, Shujian Huang, Xinyu Dai, Jiajun Chen

cs.CL updates on arXiv.org arxiv.org

In recent years, vision and language pre-training (VLP) models have advanced
the state-of-the-art results in a variety of cross-modal downstream tasks.
Aligning cross-modal semantics is claimed to be one of the essential
capabilities of VLP models. However, it still remains unclear about the inner
working mechanism of alignment in VLP models. In this paper, we propose a new
probing method that is based on image captioning to first empirically study the
cross-modal semantics alignment of VLP models. Our probing method …

alignment arxiv perspective semantics

More from arxiv.org / cs.CL updates on arXiv.org

A Survey of Graph Meets Large Language Model: Progress and Future Directions 8 hours ago | arxiv.org

arxiv cs.cl cs.lg cs.si +9

Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors 8 hours ago | arxiv.org

abstract architectures arxiv benchmarks +18

LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations 8 hours ago | arxiv.org

abstract arxiv conversational cs.ai +17

DP-NMT: Scalable Differentially-Private Machine Translation 8 hours ago | arxiv.org

abstract arxiv concerns concrete +22

DEFT: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection 8 hours ago | arxiv.org

abstract advances arxiv availability +16

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models 8 hours ago | arxiv.org

abstract art arxiv benchmarking +21

Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench 8 hours ago | arxiv.org

arxiv cs.cl llms type

Noise-Robust De-Duplication at Scale 8 hours ago | arxiv.org

abstract applications articles arxiv +18

ICDM 2020 Knowledge Graph Contest: Consumer Event-Cause Extraction 8 hours ago | arxiv.org

abstract applications arxiv attention +16

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Management Assistant

@ World Vision | Amman Office, Jordan

View on ai-jobs.net

Cloud Data Engineer, Global Services Delivery, Google Cloud

@ Google | Buenos Aires, Argentina

View on ai-jobs.net