Nov. 5, 2023, 6:49 a.m. | Xihua Sheng, Li Li, Dong Liu, Houqiang Li

cs.CV updates on arXiv.org arxiv.org

Almost all digital videos are coded into compact representations before being
transmitted. Such compact representations need to be decoded back to pixels
before being displayed to humans and - as usual - before being
enhanced/analyzed by machine vision algorithms. Intuitively, it is more
efficient to enhance/analyze the coded representations directly without
decoding them into pixels. Therefore, we propose a versatile neural video
coding (VNVC) framework, which targets learning compact representations to
support both reconstruction and direct enhancement/analysis, thereby being
versatile …

algorithms analyze arxiv coding digital framework human humans machine machine vision pixels video videos vision

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Director, Clinical Data Science

@ Aura | Remote USA

Research Scientist, AI (PhD)

@ Meta | Menlo Park, CA | New York City