April 8, 2024, 4:42 a.m. | Botao Ren, Botian Xu, Yifan Pu, Jingyi Wang, Zhidong Deng

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.04140v1 Announce Type: cross
Abstract: In many image domains, the spatial distribution of objects in a scene exhibits meaningful patterns governed by their semantic relationships. In most modern detection pipelines, however, the detection proposals are processed independently, overlooking the underlying relationships between objects. In this work, we introduce a transformer-based approach to capture these inter-object relationships to refine classification and regression outcomes for detected objects. Building on two-stage detectors, we tokenize the region of interest (RoI) proposals to be processed …

abstract aerial arxiv cs.cv cs.lg detection distribution domains however image images improving modern object objects patterns pipelines proposals relationships semantic spatial transformer type work

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US