Oct. 23, 2023, 3:30 p.m. | Venelin Valkov

Venelin Valkov www.youtube.com

LLaVA, a Large Multimodal Model (LMM), allows you to have image-based conversations. Similar to GPT-4V but without the price tag, LLaVA is free and open source. In this video, we'll explore the original model and then level up with the newer and improved LLaVA 1.5. We'll set up a Google Colab notebook and put LLaVA to the test by running some prompts for different tasks (OCR, image understanding, Q&A over images, etc). What type of results do we get?

Project …

chat conversations explore free gpt gpt-4v image images llava llava 1.5 lmm multimodal multimodal model open source price video

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US