Oct. 16, 2023, 12:57 p.m. | AI Jason

AI Jason www.youtube.com

A step by step tutorial of how to build vision powered AI agent via autogen + llava + stable diffusion AND Break down of 160-page analysis of GPT4V capabilities

🤘 Get 15% off on sceneXplain via my code AIJASON : https://go.jina.ai/scenexplainjason

🔗 Links
- Follow me on twitter: https://twitter.com/jasonzhou1993
- Join my AI email list: https://www.ai-jason.com/
- My discord: https://discord.gg/eZXprSaCDE
- sceneXplain: https://go.jina.ai/scenexplainjason
- Vision-agent Github: https://github.com/JayZeeDesign/vision-agent-with-llava


⏱️ Timestamps
0:00 Intro
1:15 What is multi-modal model
2:12 GPT4V ability break …

agent analysis autogen build capabilities cases diffusion gpt4v intro llava modal multi-modal page prompt stable diffusion tutorial use cases via vision visual

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York