April 30, 2024, 6:24 a.m. | /u/tamilselvan_eswar

Computer Vision www.reddit.com

All the recent news in CV focuses on new pretraining strategies for LLMs or on anything related to multimodal models. There's minimal research and few new products from Google or Meta on OCR or document analysis and parsing.

Is this because the field is mature with limited room for development, or are companies chasing a larger share of the market rather than working on smaller tasks?

I can still see a lot of potential use cases in document parsing and data retrieval that do not demand an …

