April 23, 2024, 4:48 a.m. | Yingxuan Li, Kiyoharu Aizawa, Yusuke Matsui

cs.CV updates on arXiv.org

arXiv:2306.17469v2 Announce Type: replace
Abstract: The expanding market for e-comics has spurred interest in the development of automated methods to analyze comics. For further understanding of comics, an automated approach is needed to link text in comics to characters speaking the words. Comics speaker detection research has practical applications, such as automatic character assignment for audiobooks, automatic translation according to characters' personalities, and inference of character relationships and stories.
To deal with the problem of insufficient speaker-to-text annotations, we created …
