July 27, 2022, 1:11 a.m. | Dandan Guo, Ruiying Lu, Bo Chen, Zequn Zeng, Mingyuan Zhou

stat.ML updates on arXiv.org

Given a set of images and their corresponding paragraph captions, a
challenging task is to learn how to produce a semantically coherent paragraph
that describes the visual content of an image. Inspired by recent successes in
integrating semantic topics into this task, this paper develops a plug-and-play
hierarchical-topic-guided image paragraph generation framework, which couples a
visual extractor with a deep topic model to guide the learning of a language
model. To capture the correlations between the image and text at multiple …
