Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning. (arXiv:2105.04143v2 [cs.CV] UPDATED) | allainews.com

July 27, 2022, 1:12 a.m. | Dandan Guo, Ruiying Lu, Bo Chen, Zequn Zeng, Mingyuan Zhou

cs.CV updates on arXiv.org arxiv.org

Observing a set of images and their corresponding paragraph-captions, a
challenging task is to learn how to produce a semantically coherent paragraph
to describe the visual content of an image. Inspired by recent successes in
integrating semantic topics into this task, this paper develops a plug-and-play
hierarchical-topic-guided image paragraph generation framework, which couples a
visual extractor with a deep topic model to guide the learning of a language
model. To capture the correlations between the image and text at multiple …

arxiv captioning cv features hierarchical image semantic topics

More from arxiv.org / cs.CV updates on arXiv.org

AV-RIR: Audio-Visual Room Impulse Response Estimation 6 hours ago | arxiv.org

arxiv audio cs.cv cs.sd +3

A Hierarchical Architecture for Neural Materials 6 hours ago | arxiv.org

abstract architecture arxiv cs.cv +8

SegFormer3D: an Efficient Transformer for 3D Medical Image Segmentation 6 hours ago | arxiv.org

arxiv cs.cv image medical +3

NU-Class Net: A Novel Deep Learning-based Approach for Video Quality Enhancement 6 hours ago | arxiv.org

abstract arxiv class compression +18

Mosaic-SDF for 3D Generative Models 6 hours ago | arxiv.org

2d image abstract arxiv cs.cv +14

PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection 6 hours ago | arxiv.org

3d object 3d object detection arxiv cs.cv +6

A Multilevel Guidance-Exploration Network and Behavior-Scene Matching Method for Human Behavior Anomaly Detection 6 hours ago | arxiv.org

anomaly anomaly detection arxiv behavior +7

ChatPose: Chatting about 3D Human Pose 6 hours ago | arxiv.org

abstract arxiv cs.cv framework +14

Boosting Audio-visual Zero-shot Learning with Large Language Models 6 hours ago | arxiv.org

arxiv audio boosting cs.cv +7

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Management Assistant

@ World Vision | Amman Office, Jordan

View on ai-jobs.net

Cloud Data Engineer, Global Services Delivery, Google Cloud

@ Google | Buenos Aires, Argentina

View on ai-jobs.net