VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing

May 17, 2024, 4:45 a.m. | Binghui Chen, Chongyang Zhong, Wangmeng Xiang, Yifeng Geng, Xuansong Xie

cs.CV updates on arXiv.org

arXiv:2405.09985v1 Announce Type: new
Abstract: Owing to significant advances in large-scale text-to-image generation with diffusion models (DMs), controllable human image generation has recently attracted much attention. Although existing works such as ControlNet [36], T2I-Adapter [20], and HumanSD [10] have demonstrated good abilities in generating human images from pose conditions, they still fail to meet the requirements of real e-commerce scenarios. These requirements include: (1) the interaction between the displayed product and the human should be considered, (2) human parts like …
