all AI news
What are some foundational papers in CV that every newcomer should read?
Feb. 25, 2024, 4:21 a.m. | /u/Xfragrance
Computer Vision www.reddit.com
"Attention is All You Need" by Ashish Vaswani et al. (2017): This paper introduced the Transformer architecture, which revolutionized natural language processing and has also impacted CV tasks like image captioning and object detection.
"DETR: End-to-End Object Detection with Transformers" by Nicolas Carion et al. (2020): This paper proposed DETR, a Transformer-based model that achieved state-of-the-art performance in object detection without relying on traditional hand-crafted features.
"Diffusion Models Beat Real-to-Real Image Generation" by Aditya Ramesh et al. (2021): …
architecture ashish vaswani attention attention is all you need captioning computervision detection detr end-to-end object detection every image language language processing natural natural language natural language processing object detection with transformers paper papers processing tasks thoughts transformer transformer architecture transformers
More from www.reddit.com / Computer Vision
YOLOv8 appears overtrained despite minimal training epochs
2 days, 16 hours ago |
www.reddit.com
Tennis 3D Recreation from Monocular Footage.
2 days, 17 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York