Grounding Visual Representations with Texts for Domain Generalization. (arXiv:2207.10285v2 [cs.CV] UPDATED) | allainews.com

Aug. 10, 2022, 1:11 a.m. | Seonwoo Min, Nokyung Park, Siwon Kim, Seunghyun Park, Jinkyu Kim

cs.CL updates on arXiv.org arxiv.org

Reducing the representational discrepancy between source and target domains
is a key component to maximize the model generalization. In this work, we
advocate for leveraging natural language supervision for the domain
generalization task. We introduce two modules to ground visual representations
with texts containing typical reasoning of humans: (1) Visual and Textual Joint
Embedder and (2) Textual Explanation Generator. The former learns the
image-text joint embedding space where we can ground high-level
class-discriminative information into the model. The latter leverages …

More from arxiv.org / cs.CL updates on arXiv.org

LLMs for Science: Usage for Code Generation and Data Analysis 1 day ago | arxiv.org

abstract analysis arxiv become +26

VAL: Interactive Task Learning with GPT Dialog Parsing 1 day ago | arxiv.org

abstract acquisition arxiv box +22

Convergences and Divergences between Automatic Assessment and Human Evaluation: Insights from Comparing ChatGPT-Generated Translation and … 1 day ago | arxiv.org

abstract arxiv assessment automated +23

Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss 1 day ago | arxiv.org

abstract arxiv binary cs.ai +13

DBCopilot: Scaling Natural Language Querying to Massive Databases 1 day ago | arxiv.org

abstract advances arxiv challenges +31

ARN: Analogical Reasoning on Narratives 1 day ago | arxiv.org

abstract analogy arxiv cognitive +17

Applying BioBERT to Extract Germline Gene-Disease Associations for Building a Knowledge Graph from the Biomedical … 1 day ago | arxiv.org

abstract arxiv biomedical building +24

Learning the meanings of function words from grounded language using a visual question answering model 1 day ago | arxiv.org

abstract acquisition arxiv children +17

RETVec: Resilient and Efficient Text Vectorizer 1 day ago | arxiv.org

arxiv cs.ai cs.cl resilient +2

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Engineer

@ Parker | New York City

View on ai-jobs.net

Sr. Data Analyst | Home Solutions

@ Three Ships | Raleigh or Charlotte, NC

View on ai-jobs.net