June 1, 2022, 1:12 a.m. | Ce Zheng, Matias Mendieta, Taojiannan Yang, Chen Chen

cs.CV updates on arXiv.org arxiv.org

Recently, vision transformers have shown great success in 2D human pose
estimation (2D HPE), 3D human pose estimation (3D HPE), and human mesh
reconstruction (HMR) tasks. In these tasks, heatmap representations of the
human structural information are often extracted first from the image by a CNN,
and then further processed with a transformer architecture to provide the final
HPE or HMR estimation. However, existing transformer architectures are not able
to process these heatmap inputs directly, forcing an unnatural flattening of …

arxiv cv heatmap human network transformer

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote