MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection

March 25, 2024, 4:45 a.m. | Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro

cs.CV updates on arXiv.org

arXiv:2403.15209v1
Abstract: Multispectral pedestrian detection is attractive for around-the-clock applications because RGB and thermal modalities carry complementary information. However, current models often fail to detect pedestrians even in obvious cases, largely because of modality bias learned from statistically biased datasets. Given these failures, we hypothesize that understanding the complementary information itself may be difficult for vision-only models. Accordingly, we propose a novel Multispectral Chain-of-Thought Detection (MSCoTDet) framework, which incorporates Large Language Models (LLMs) …
