May 17, 2024, 4:45 a.m. | Binghui Chen, Chongyang Zhong, Wangmeng Xiang, Yifeng Geng, Xuansong Xie

cs.CV updates on arXiv.org arxiv.org

arXiv:2405.09985v1 Announce Type: new
Abstract: Owing to significant advances in large-scale text-to-image generation with diffusion models (DMs), controllable human image generation has recently attracted much attention. Although existing works such as Controlnet [36], T2I-adapter [20] and HumanSD [10] have demonstrated good abilities in generating human images based on pose conditions, they still fail to meet the requirements of real e-commerce scenarios. These requirements include (1) the interaction between the shown product and the human should be considered, (2) human parts like …

