Oct. 21, 2022, 1:16 a.m. | Fenglin Liu, Xuewei Ma, Xuancheng Ren, Xian Wu, Wei Fan, Yuexian Zou, Xu Sun

cs.CV updates on arXiv.org

Recently, attention-based models have been used extensively in many
sequence-to-sequence learning systems. In image captioning especially,
attention-based models are expected to ground each generated word in the
correct image regions. However, at each time step of the decoding process,
these models usually use the hidden state of the current input to attend to
the image regions. Under this setting, they suffer from a "deviated focus"
problem: they calculate the attention weights based on previous …

arxiv attention captioning future image prophet
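For context, below is a minimal PyTorch sketch of the conventional per-step attention the abstract critiques, in which the decoder's current hidden state (itself computed from the previously generated word) attends over image region features. This is an illustration of the standard mechanism only, not the authors' proposed model; the `RegionAttention` class, its dimension arguments, and the additive (Bahdanau-style) scoring are assumptions made for this sketch.

```python
import torch
import torch.nn as nn


class RegionAttention(nn.Module):
    """Additive (Bahdanau-style) attention over image region features.

    Hypothetical illustration of the standard captioning attention step
    described in the abstract, not the paper's proposed method.
    """

    def __init__(self, region_dim: int, hidden_dim: int, attn_dim: int):
        super().__init__()
        self.proj_regions = nn.Linear(region_dim, attn_dim)
        self.proj_hidden = nn.Linear(hidden_dim, attn_dim)
        self.score = nn.Linear(attn_dim, 1)

    def forward(self, regions: torch.Tensor, hidden: torch.Tensor):
        # regions: (batch, num_regions, region_dim) -- image region features
        # hidden:  (batch, hidden_dim) -- decoder state at the current step,
        # derived from the *previously* generated word; this mismatch between
        # the attended regions and the word about to be generated is what the
        # abstract calls a "deviated focus".
        scores = self.score(torch.tanh(
            self.proj_regions(regions)
            + self.proj_hidden(hidden).unsqueeze(1)
        )).squeeze(-1)                          # (batch, num_regions)
        weights = torch.softmax(scores, dim=-1)  # attention distribution
        context = (weights.unsqueeze(-1) * regions).sum(dim=1)
        return context, weights


# Example usage with made-up sizes:
attn = RegionAttention(region_dim=2048, hidden_dim=512, attn_dim=256)
regions = torch.randn(4, 36, 2048)  # e.g. 36 detected regions per image
hidden = torch.randn(4, 512)        # decoder state at one time step
context, weights = attn(regions, hidden)
```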
