Sept. 6, 2023, 2 p.m. | Louis Bouchard

Hacker Noon | AI | hackernoon.com

LLaVA is an end-to-end trained large multimodal model that connects a vision encoder with an LLM for general-purpose visual and language understanding. GPT-4 was used to generate a large, high-quality instruction-following dataset for training the model to understand images.
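The key architectural idea is simple: image features from a pretrained vision encoder are projected into the LLM's token-embedding space and prepended to the text tokens. Below is a minimal, hypothetical PyTorch sketch of that wiring; the `vision_encoder` and `llm` arguments stand in for pretrained modules (e.g. a CLIP ViT image encoder and a decoder-only LLM), and the dimensions and call signatures are illustrative assumptions, not the official LLaVA code.

```python
import torch
import torch.nn as nn

class LlavaStyleModel(nn.Module):
    """Sketch of a LLaVA-style model: frozen vision encoder -> projection -> LLM."""

    def __init__(self, vision_encoder, llm, vision_dim=1024, llm_dim=4096):
        super().__init__()
        self.vision_encoder = vision_encoder             # produces per-patch image features
        self.projector = nn.Linear(vision_dim, llm_dim)  # maps them into the LLM embedding space
        self.llm = llm                                   # decoder-only language model

    def forward(self, pixel_values, text_embeds):
        # Encode the image into a sequence of patch features: (B, N, vision_dim).
        patch_feats = self.vision_encoder(pixel_values)
        # Project visual features into the LLM's embedding space: (B, N, llm_dim).
        visual_tokens = self.projector(patch_feats)
        # Prepend visual tokens to the text embeddings and run the language model.
        inputs = torch.cat([visual_tokens, text_embeds], dim=1)
        return self.llm(inputs_embeds=inputs)
```

In the paper, training then proceeds in two stages: first only the projection layer is tuned to align image features with the LLM, and then the projection and LLM are fine-tuned together on the GPT-4-generated instruction-following data.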


