all AI news
Translating a Visual LEGO Manual to a Machine-Executable Plan. (arXiv:2207.12572v1 [cs.CV])
July 27, 2022, 1:12 a.m. | Ruocheng Wang, Yunzhi Zhang, Jiayuan Mao, Chin-Yi Cheng, Jiajun Wu
cs.CV updates on arXiv.org arxiv.org
We study the problem of translating an image-based, step-by-step assembly
manual created by human designers into machine-interpretable instructions. We
formulate this problem as a sequential prediction task: at each step, our model
reads the manual, locates the components to be added to the current shape, and
infers their 3D poses. This task poses the challenge of establishing a 2D-3D
correspondence between the manual image and the real 3D object, and 3D pose
estimation for unseen 3D objects, since a new …
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Data Scientist, Senior
@ Pacific Gas and Electric Company | Oakland, CA, US, 94612
AML Reporting Data Specialist
@ Wise | Tallinn, Estonia
Bachelorarbeit im Bereich IT - "Einsatz von Generative AI im Konzernumfeld" (WiSe 24/25)
@ AGCO | Marktoberdorf, DE
Big Data Engineer
@ ACL Technology | Argentina
REF25217Q-Deputy Manager - MIS (Power BI, Dashboard, Excel) - GGN
@ WNS Global Services | Gurgaon, India