all AI news
Cross-view Semantic Alignment for Livestreaming Product Recognition. (arXiv:2308.04912v1 [cs.CV])
cs.CV updates on arXiv.org arxiv.org
Live commerce is the act of selling products online through live streaming.
The customer's diverse demands for online products introduce more challenges to
Livestreaming Product Recognition. Previous works have primarily focused on
fashion clothing data or utilize single-modal input, which does not reflect the
real-world scenario where multimodal data from various categories are present.
In this paper, we present LPR4M, a large-scale multimodal dataset that covers
34 categories, comprises 3 modalities (image, video, and text), and is 50?
larger than …
act alignment arxiv challenges clothing commerce customer data diverse fashion live streaming livestreaming multimodal multimodal data product products recognition selling semantic streaming through world