all AI news
STrajNet: Occupancy Flow Prediction via Multi-modal Swin Transformer. (arXiv:2208.00394v1 [cs.CV])
Aug. 2, 2022, 2:13 a.m. | Haochen Liu, Zhiyu Huang, Chen Lv
cs.CV updates on arXiv.org arxiv.org
Making an accurate prediction of occupancy and flow is essential to enable
better safety and interaction for autonomous vehicles under complex traffic
scenarios. This work proposes STrajNet: a multi-modal Swin Transformerbased
framework for effective scene occupancy and flow predictions. We employ Swin
Transformer to encode the image and interaction-aware motion representations
and propose a cross-attention module to inject motion awareness into grid cells
across different time steps. Flow and occupancy predictions are then decoded
through temporalsharing Pyramid decoders. The proposed …
More from arxiv.org / cs.CV updates on arXiv.org
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
1 day, 23 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
AIML - Sr Machine Learning Engineer, Data and ML Innovation
@ Apple | Seattle, WA, United States
Senior Data Engineer
@ Palta | Palta Cyprus, Palta Warsaw, Palta remote