April 11, 2024, 2:37 a.m. | /u/pidoyu

Machine Learning www.reddit.com

Hello everyone!

I would like to share our recent CVPR work, hoping to spread our simple ideas.

**\[TL;DR\]** The auto-regression model can predict labels from just an input image, without a predefined query gallery (e.g., CLIP-like models) or predefined class concepts (e.g., VGG/ResNet-like models). The model predicts top-K labels, e.g., **top-100**, from the entire textual space (any label).

For more details, please visit our paper and project: [https://github.com/kaiyuyue/nxtp](https://github.com/kaiyuyue/nxtp).

Your thoughts and feedback are appreciated. Thank you very much!

\----- figure …

auto class clip concepts cvpr hello ideas image labels machinelearning object query recognition regression resnet simple vgg work

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US