Variational Transformer: A Framework Beyond the Trade-off between Accuracy and Diversity for Image Captioning. (arXiv:2205.14458v2 [cs.CV] UPDATED) | allainews.com

Sept. 22, 2022, 1:12 a.m. | Longzhen Yang, Yihang Liu, Yitao Peng, Lianghua He

cs.LG updates on arXiv.org arxiv.org

Accuracy and Diversity are two essential metrizable manifestations in
generating natural and semantically correct captions. Many efforts have been
made to enhance one of them with another decayed due to the trade-off gap. In
this work, we will show that the inferior standard of accuracy draws from human
annotations (leave-one-out) are not appropriate for machine-generated captions.
To improve diversity with a solid accuracy performance, we exploited a novel
Variational Transformer framework. By introducing the "Invisible Information
Prior" and the "Auto-selectable …

accuracy arxiv captioning diversity framework image trade transformer

More from arxiv.org / cs.LG updates on arXiv.org

Training towards significance with the decorrelated event classifier transformer neural network 19 hours ago | arxiv.org

abstract analysis application arxiv +28

An adaptive standardisation methodology for Day-Ahead electricity price forecasting 19 hours ago | arxiv.org

abstract algorithms arxiv complexity +18

SYNAuG: Exploiting Synthetic Data for Data Imbalance Problems 19 hours ago | arxiv.org

abstract arxiv cs.cv cs.lg +17

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination methods 19 hours ago | arxiv.org

abstract algorithms arxiv augmentation +17

Description-Based Text Similarity 19 hours ago | arxiv.org

abstract arxiv cases cs.cl +14

Improving Gradient Methods via Coordinate Transformations: Applications to Quantum Machine Learning 19 hours ago | arxiv.org

abstract algorithms applications arxiv +13

A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference 19 hours ago | arxiv.org

abstract applications arxiv as-a-service +26

Digital Over-the-Air Federated Learning in Multi-Antenna Systems 19 hours ago | arxiv.org

abstract arxiv communication computation +16

Bagging Provides Assumption-free Stability 19 hours ago | arxiv.org

abstract algorithm arxiv assumptions +15

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Business Intelligence Developer / Analyst

@ Transamerica | Work From Home, USA

View on ai-jobs.net

Data Analyst (All Levels)

@ Noblis | Bethesda, MD, United States

View on ai-jobs.net