ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions. (arXiv:2205.12231v1 [cs.CV])

May 25, 2022, 1:13 a.m. | Difan Liu, Sandesh Shetty, Tobias Hinz, Matthew Fisher, Richard Zhang, Taesung Park, Evangelos Kalogerakis

cs.CV updates on arXiv.org arxiv.org

We present ASSET, a neural architecture for automatically modifying an input
high-resolution image according to a user's edits on its semantic segmentation
map. Our architecture is based on a transformer with a novel attention
mechanism. Our key idea is to sparsify the transformer's attention matrix at
high resolutions, guided by dense attention extracted at lower image
resolutions. While previous attention mechanisms are either too computationally
expensive to handle high-resolution images or overly constrained to specific
image regions, hampering long-range interactions, …
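
The key idea in the abstract, sparsifying high-resolution attention using dense attention computed at a lower resolution, can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a flattened 1D token sequence, low-resolution features obtained by average pooling, and a simple top-k selection of key blocks per query; the function names (`topk_block_mask`, `guided_sparse_attention`) and parameters (`top_k`, `ratio`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax; entries set to -inf get weight 0.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def topk_block_mask(q_lo, k_lo, top_k):
    """Dense low-res attention, then keep the top_k keys per query (assumed rule)."""
    d = q_lo.shape[-1]
    attn_lo = softmax(q_lo @ k_lo.T / np.sqrt(d))       # (n_lo, n_lo), dense
    idx = np.argsort(-attn_lo, axis=-1)[:, :top_k]      # strongest keys per query
    mask = np.zeros(attn_lo.shape, dtype=bool)
    np.put_along_axis(mask, idx, True, axis=-1)
    return mask

def guided_sparse_attention(q_hi, k_hi, v_hi, mask_lo, ratio):
    """High-res attention restricted to the blocks selected at low resolution."""
    d = q_hi.shape[-1]
    # Each low-res (query, key) cell covers a ratio x ratio block at high res.
    # A practical implementation would gather the selected keys instead of
    # materializing this full mask; it is built here only for clarity.
    mask_hi = np.repeat(np.repeat(mask_lo, ratio, axis=0), ratio, axis=1)
    scores = q_hi @ k_hi.T / np.sqrt(d)
    scores = np.where(mask_hi, scores, -np.inf)         # drop unselected keys
    weights = softmax(scores)
    return weights @ v_hi, weights

# Toy example: 4 low-res tokens, each covering 2 high-res tokens.
rng = np.random.default_rng(0)
n_lo, ratio, d, top_k = 4, 2, 8, 2
n_hi = n_lo * ratio
q_hi = rng.standard_normal((n_hi, d))
k_hi = rng.standard_normal((n_hi, d))
v_hi = rng.standard_normal((n_hi, d))

# Assumption: low-res features are average-pooled high-res features.
q_lo = q_hi.reshape(n_lo, ratio, d).mean(axis=1)
k_lo = k_hi.reshape(n_lo, ratio, d).mean(axis=1)

mask_lo = topk_block_mask(q_lo, k_lo, top_k)
out, weights = guided_sparse_attention(q_hi, k_hi, v_hi, mask_lo, ratio)
print(out.shape, (weights > 0).sum(axis=1))
```

In this toy setting each high-resolution query attends to `top_k * ratio` keys instead of all `n_hi`, while the selected keys can still lie anywhere in the sequence, which is how the sketch preserves long-range interactions.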


