Uncensored auto-captioning libraries that work well for NSFW image datasets | allainews.com

April 24, 2024, 4:40 a.m. | /u/jferments

Computer Vision www.reddit.com

I have a large (>2.5 million files) dataset of NSFW images that I would like to auto-generate detailed (\~100-150 token) captions for, using a visual language model similar to CogVLM or Llava.

I have tried both CogVLM and Llava, and unfortunately both models are far too heavily censored to complete the task. The responses range either from outright refusal to caption the images, or captions that are so heavily filtered for "appropriateness" that they fail to describe the important features …

auto captioning captions computervision dataset datasets files generate image image datasets images language language model libraries llava nsfw token visual visual language model work

More from www.reddit.com / Computer Vision

In bundle adjustment tasks, how are the weights for reprojection errors and GCPs set? 1 day, 1 hour ago | www.reddit.com

computervision control errors gcp +4

How does pose estimation with collision detection work? (ex: shaking hands, punching in the face, … 2 days ago | www.reddit.com

app building code collision +9

Is it possible to calculate the distance of an object using a single camera? 3 days, 8 hours ago | www.reddit.com

cameras computervision feature flair +4

KAN: Kolmogorov–Arnold Networks - For Computer Vision 4 days ago | www.reddit.com

computer computer vision computervision latest +4

Object detection evaluation - FROC analysis 4 days, 8 hours ago | www.reddit.com

analysis coco code computervision +8

Pose Estimation Given CAD Model 4 days, 13 hours ago | www.reddit.com

cad computation computervision current +6

I got asked what my “credentials” are because I suggested compression 4 days, 17 hours ago | www.reddit.com

big client compression computervision +5

Training an Unbeatable Connect 4 Ai 4 days, 19 hours ago | www.reddit.com

computervision training

Introduction to Computer Vision by Hany Farid, UC Berkeley 4 days, 19 hours ago | www.reddit.com

berkeley computer computer vision computervision +3

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Consultant Senior Power BI & Azure - CDI - H/F

@ Talan | Lyon, France

View on ai-jobs.net