Uncensored auto-captioning libraries that work well for NSFW image datasets | allainews.com

April 24, 2024, 4:40 a.m. | /u/jferments

Computer Vision www.reddit.com

I have a large (>2.5 million files) dataset of NSFW images that I would like to auto-generate detailed (\~100-150 token) captions for, using a visual language model similar to CogVLM or Llava.

I have tried both CogVLM and Llava, and unfortunately both models are far too heavily censored to complete the task. The responses range either from outright refusal to caption the images, or captions that are so heavily filtered for "appropriateness" that they fail to describe the important features …

auto captioning captions computervision dataset datasets files generate image image datasets images language language model libraries llava nsfw token visual visual language model work

More from www.reddit.com / Computer Vision

football player detection and tracking + camera calibration 22 hours ago | www.reddit.com

calibration computervision detection football +1

Which according to you'll are the best phd programs for this field? 1 day, 3 hours ago | www.reddit.com

computervision computing europe good +6

What kind of compression or image processing techniques might Apple be using here? This is … 1 day, 8 hours ago | www.reddit.com

apple browser compression computervision +10

YOLOv8 appears overtrained despite minimal training epochs 1 day, 19 hours ago | www.reddit.com

computervision dataset expect image +4

Tennis 3D Recreation from Monocular Footage. 1 day, 21 hours ago | www.reddit.com

computervision context finally project

Is automatic 3D segmentation using machine learning a relatively under-researched topic? 2 days, 11 hours ago | www.reddit.com

computervision machine machine learning paper +3

Choosing HW for home CCTV image classification 3 days, 3 hours ago | www.reddit.com

become cctv classification computervision +5

Modern best practices for image segmentation tasks 3 days, 5 hours ago | www.reddit.com

best practices computervision data however +12

Extremely accurate position finding - camera question 3 days, 12 hours ago | www.reddit.com

application camera+ combo computervision +8

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net