[P] I built an open SotA image tagging model to do what CLIP won't | allainews.com

Dec. 21, 2023, 1:34 a.m. | /u/fpgaminer

Machine Learning www.reddit.com

I'm a hobbyist ML researcher and, after a year of work, I have finally built a state-of-the-art machine vision model from scratch. It's based on ViT-B/16, takes 448x448x3 input, has 91M parameters, was trained for 660M samples, and targets multi-label classification over more than 5,000 unique tags.
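To make those numbers concrete, here is a minimal sketch of what a 448x448 input with 16x16 patches implies, and of how a multi-label head differs from a single-label one. Only the image size and patch size come from the post. The helper names, the 0.5 threshold, the example tag names, and the sigmoid plus binary cross-entropy setup are assumptions for illustration; that setup is a common choice for multi-label tagging, not something the post confirms.

```python
import math

IMAGE_SIZE = 448  # from the post: 448x448x3 input
PATCH_SIZE = 16   # from the post: ViT-B/16 uses 16x16 patches


def num_patches(image_size: int = IMAGE_SIZE, patch_size: int = PATCH_SIZE) -> int:
    """Number of non-overlapping square patches a ViT sees per image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // patch_size
    return side * side


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def predict_tags(logits, tag_names, threshold=0.5):
    """Multi-label prediction: every tag gets its own independent probability,
    so any number of tags can fire for one image.
    The threshold is a hypothetical default, not a value from the post."""
    return [name for name, logit in zip(tag_names, logits)
            if sigmoid(logit) >= threshold]


def bce_loss(logits, targets):
    """Mean binary cross-entropy over tags (an assumed loss for this sketch)."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for logit, target in zip(logits, targets):
        p = sigmoid(logit)
        total -= target * math.log(p + eps) + (1 - target) * math.log(1 - p + eps)
    return total / len(logits)


if __name__ == "__main__":
    print(num_patches())  # 28 * 28 patches per image
    # Hypothetical tag names and logits, purely for illustration.
    print(predict_tags([2.0, -3.0, 0.1], ["outdoors", "text", "person"]))
```

Unlike a softmax classifier, nothing here forces the per-tag probabilities to sum to one, which is what lets a single image carry several tags at once.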

All the big foundation vision models today were trained on heavily filtered datasets, greatly limiting the concepts they can represent, in line with arbitrary sets of rules for what is deemed "wholesome" by leading …
