[N] Vision-Language Pre-training: Basics, Recent Advances, and Future Trends - Microsoft 2022 - 102 Pages! | allainews.com

Nov. 12, 2022, 11:55 p.m. | /u/Singularian2501

Machine Learning www.reddit.com

Paper: [https://arxiv.org/abs/2210.09263](https://arxiv.org/abs/2210.09263)

Abstract:

>This paper surveys vision-language pre-training (VLP) methods for **multimodal intelligence** that have been developed in the last few years. We group these approaches into three categories: (*i*) VLP for image-text tasks, such as image captioning, image-text retrieval, visual question answering, and visual grounding; (*ii*) VLP for core computer vision tasks, such as (open-set) image classification, object detection, and segmentation; and (*iii*) VLP for video-text tasks, such as video captioning, video-text retrieval, and video question answering. For each …

basics future language machinelearning microsoft pre-training training trends vision

More from www.reddit.com / Machine Learning

[D] Product evaluations is one of the most under-discussed topics 2 hours ago | www.reddit.com

ai consultancy cases client consultancy +8

[D] Training model on tabular data resulting in high loss 4 hours ago | www.reddit.com

context data function hello +7

[D] Reproducing and Comparing Models from Research - Best Practices? 6 hours ago | www.reddit.com

analysis apply best practices big +13

[P] Training a VQGAN but GAN loss keeps going up 15 hours ago | www.reddit.com

image imagenet look loss +8

[R] [2404.10667] VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time 16 hours ago | www.reddit.com

audio generated machinelearning

[N] Feds appoint “AI doomer” to run US AI safety institute 18 hours ago | www.reddit.com

ai development article chance development +16

[D] Is there a way to determine if the representations a model learns are spherical … 20 hours ago | www.reddit.com

deep learning embeddings examples feature +4

[R] RuleOpt: Optimization-Based Rule Learning for Classification 22 hours ago | www.reddit.com

algorithm classification ensemble extraction +13

[D] LSTM Time Series Forecasting 22 hours ago | www.reddit.com

data forecast forecasting however +10

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Automated Greenhouse Expert - Phenotyping & Data Analysis (all genders)

@ Bayer | Frankfurt a.M., Hessen, DE

View on ai-jobs.net

Machine Learning Scientist II

@ Expedia Group | India - Bengaluru

View on ai-jobs.net

Data Engineer/Senior Data Engineer, Bioinformatics

@ Flagship Pioneering, Inc. | Cambridge, MA USA

View on ai-jobs.net

Intern (AI lab)

@ UL Solutions | Dublin, Co. Dublin, Ireland

View on ai-jobs.net

Senior Operations Research Analyst / Predictive Modeler

@ LinQuest | Colorado Springs, Colorado, United States

View on ai-jobs.net