A survey on knowledge-enhanced multimodal learning

March 26, 2024, 4:44 a.m. | Maria Lymperaiou, Giorgos Stamou

cs.LG updates on arXiv.org

arXiv:2211.12328v3
Abstract: Multimodal learning has been a field of increasing interest, aiming to combine various modalities in a single joint representation. Especially in the area of visiolinguistic (VL) learning, multiple models and techniques have been developed, targeting a variety of tasks that involve images and text. VL models have reached unprecedented performance by extending the idea of Transformers so that both modalities can learn from each other. Massive pre-training procedures enable VL models to acquire a certain …


