all AI news
Topic: captioning
RSCaMa: Remote Sensing Image Change Captioning with State Space Model
1 week, 3 days ago |
arxiv.org
Learning text-to-video retrieval from image captioning
1 week, 4 days ago |
arxiv.org
The Solution for the CVPR2024 NICE Image Captioning Challenge
2 weeks, 4 days ago |
arxiv.org
LangNav: Language as a Perceptual Representation for Navigation
1 month, 1 week ago |
arxiv.org
Streaming Dense Video Captioning
1 month, 1 week ago |
arxiv.org
LocCa: Visual Pretraining with Location-aware Captioners
1 month, 1 week ago |
arxiv.org
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
1 month, 1 week ago |
arxiv.org
Text Data-Centric Image Captioning with Interactive Prompts
1 month, 1 week ago |
arxiv.org
CLAMP: Contrastive LAnguage Model Prompt-tuning
1 month, 1 week ago |
arxiv.org
Image Captioning in news report scenario
1 month, 2 weeks ago |
arxiv.org
Visually-Aware Context Modeling for News Image Captioning
1 month, 2 weeks ago |
arxiv.org
Towards More Unified In-context Visual Understanding
1 month, 3 weeks ago |
arxiv.org
TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
1 month, 3 weeks ago |
arxiv.org
FlexCap: Generating Rich, Localized, and Flexible Captions in Images
1 month, 3 weeks ago |
arxiv.org
Are Vision Language Models Texture or Shape Biased and Can We Steer Them?
1 month, 3 weeks ago |
arxiv.org
How to Understand Named Entities: Using Common Sense for News Captioning
1 month, 4 weeks ago |
arxiv.org
Sieve: Multimodal Dataset Pruning Using Image Captioning Models
1 month, 4 weeks ago |
arxiv.org
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
1 month, 4 weeks ago |
arxiv.org
Nothing found.
Items published with this topic over the last 90 days.
Latest
RSCaMa: Remote Sensing Image Change Captioning with State Space Model
1 week, 3 days ago |
arxiv.org
Learning text-to-video retrieval from image captioning
1 week, 4 days ago |
arxiv.org
The Solution for the CVPR2024 NICE Image Captioning Challenge
2 weeks, 4 days ago |
arxiv.org
LangNav: Language as a Perceptual Representation for Navigation
1 month, 1 week ago |
arxiv.org
Streaming Dense Video Captioning
1 month, 1 week ago |
arxiv.org
LocCa: Visual Pretraining with Location-aware Captioners
1 month, 1 week ago |
arxiv.org
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
1 month, 1 week ago |
arxiv.org
Text Data-Centric Image Captioning with Interactive Prompts
1 month, 1 week ago |
arxiv.org
CLAMP: Contrastive LAnguage Model Prompt-tuning
1 month, 1 week ago |
arxiv.org
Image Captioning in news report scenario
1 month, 2 weeks ago |
arxiv.org
Visually-Aware Context Modeling for News Image Captioning
1 month, 2 weeks ago |
arxiv.org
Towards More Unified In-context Visual Understanding
1 month, 3 weeks ago |
arxiv.org
TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
1 month, 3 weeks ago |
arxiv.org
FlexCap: Generating Rich, Localized, and Flexible Captions in Images
1 month, 3 weeks ago |
arxiv.org
Are Vision Language Models Texture or Shape Biased and Can We Steer Them?
1 month, 3 weeks ago |
arxiv.org
How to Understand Named Entities: Using Common Sense for News Captioning
1 month, 4 weeks ago |
arxiv.org
Sieve: Multimodal Dataset Pruning Using Image Captioning Models
1 month, 4 weeks ago |
arxiv.org
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
1 month, 4 weeks ago |
arxiv.org
Topic trend (last 90 days)
Top (last 7 days)
Nothing found.
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US