all AI news
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. (arXiv:2211.09808v1 [cs.CV])
Nov. 18, 2022, 2:14 a.m. | Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai
cs.CV updates on arXiv.org arxiv.org
Despite the remarkable success of foundation models, their task-specific
fine-tuning paradigm makes them inconsistent with the goal of general
perception modeling. The key to eliminating this inconsistency is to use
generalist models for general task modeling. However, existing attempts at
generalist models are inadequate in both versatility and performance. In this
paper, we propose Uni-Perceiver v2, which is the first generalist model capable
of handling major large-scale vision and vision-language tasks with competitive
performance. Specifically, images are encoded as general …
More from arxiv.org / cs.CV updates on arXiv.org
Retrieval-Augmented Egocentric Video Captioning
2 days, 16 hours ago |
arxiv.org
Mirror-Aware Neural Humans
2 days, 16 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US