April 23, 2024, 4:48 a.m. | Yingxuan Li, Kiyoharu Aizawa, Yusuke Matsui

cs.CV updates on arXiv.org

arXiv:2306.17469v2 Announce Type: replace
Abstract: The expanding market for e-comics has spurred interest in the development of automated methods to analyze comics. For further understanding of comics, an automated approach is needed to link text in comics to characters speaking the words. Comics speaker detection research has practical applications, such as automatic character assignment for audiobooks, automatic translation according to characters' personalities, and inference of character relationships and stories.
To deal with the problem of insufficient speaker-to-text annotations, we created …

