VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision. (arXiv:2306.10681v2 [eess.IV] UPDATED)
cs.CV updates on arXiv.org
Almost all digital videos are coded into compact representations before being
transmitted. Such compact representations need to be decoded back to pixels
before being displayed to humans and, as usual, before being
enhanced/analyzed by machine vision algorithms. Intuitively, it is more
efficient to enhance/analyze the coded representations directly without
decoding them into pixels. Therefore, we propose a versatile neural video
coding (VNVC) framework, which targets learning compact representations to
support both reconstruction and direct enhancement/analysis, thereby being
versatile …