Oct. 16, 2022, 9:34 a.m. | /u/walt74

machinelearningnews www.reddit.com

LAION released a new dataset of synthetic image captions: [Laion coco: 600M synthetic captions from Laion2B-en](https://laion.ai/blog/laion-coco/).

> We present LAION-COCO, the world’s largest dataset of 600M generated high-quality captions for publicly available web-images. Laion5B has five billion natural captions. They provide a lot of information, but could synthetic captions complement them? To answer this question, we use a combination of existing, publicly available models to produce high quality captions for images in the style of MS COCO. We captioned 600M …

laion machinelearningnews

More from www.reddit.com / machinelearningnews

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Data Engineer - Takealot Group (Takealot.com | Superbalist.com | Mr D Food)

@ takealot.com | Cape Town