DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | allainews.com

April 2, 2024, 7:49 p.m. | Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy

cs.CV updates on arXiv.org arxiv.org

arXiv:2310.01393v3 Announce Type: replace
Abstract: Open-vocabulary object detection (OVOD) aims to detect the objects beyond the set of classes observed during training. This work introduces a straightforward and efficient strategy that utilizes pre-trained vision-language models (VLM), like CLIP, to identify potential novel classes through zero-shot classification. Previous methods use a class-agnostic region proposal network to detect object proposals and consider the proposals that do not match the ground truth as background. Unlike these methods, our method will select a subset …

arxiv cs.cv detection dynamic object self-training simple training type

More from arxiv.org / cs.CV updates on arXiv.org

Having Second Thoughts? Let's hear it 2 hours ago | arxiv.org

abstract arxiv brain cognitive +20

Towards Imbalanced Motion: Part-Decoupling Network for Video Portrait Segmentation 2 hours ago | arxiv.org

abstract arxiv attention cs.cv +15

Decoupling Dynamic Monocular Videos for Dynamic View Synthesis 2 hours ago | arxiv.org

abstract arxiv challenge cs.cv +13

From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets 2 hours ago | arxiv.org

abstract accuracy arxiv cnns +20

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation 2 hours ago | arxiv.org

arxiv cs.cv cs.ro domain +10

Self-supervised Feature-Gate Coupling for Dynamic Network Pruning 2 hours ago | arxiv.org

abstract arxiv computational cost +16

An Organic Weed Control Prototype using Directed Energy and Deep Learning 2 hours ago | arxiv.org

abstract array arxiv control +15

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet 2 hours ago | arxiv.org

abstract arxiv attention attention mechanisms +20

Generative Adversarial Networks in Ultrasound Imaging: Extending Field of View Beyond Conventional Limits 2 hours ago | arxiv.org

abstract adversarial arxiv beyond +18

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Senior Applied Data Scientist

@ dunnhumby | London

View on ai-jobs.net

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

View on ai-jobs.net