all AI news
Gemini Demo But With GPT-4 Vision API
Dec. 10, 2023, 10:41 a.m. | Unconventional Coding
Unconventional Coding www.youtube.com
In today's video I showcase a Python program I have made using OpenAI's GPT-4 Vision API, Speech-to-text API and Whisper, that attempts to accomplish what the Google Gemini multimodal demo shows.
More information on the project coming up in future videos.
Support: https://buymeacoffee.com/unconv
Consultations: https://www.buymeacoffee.com/unconv/e/146735
Memberships: https://www.buymeacoffee.com/unconv/membership
00:00 Demo
03:25 Bloopers
05:26 Unedited Version
api demo future gemini google google gemini gpt gpt-4 gpt-4 vision information multimodal openai openai's gpt-4 project python shows speech speech-to-text text video videos vision whisper
More from www.youtube.com / Unconventional Coding
Learning GameDev with Raylib in C: 2D Racing Game - Part 1
3 weeks, 5 days ago |
www.youtube.com
Automatic AI YouTube Short Captions w/ Whisper and CV2 (Part 5)
4 weeks, 1 day ago |
www.youtube.com
Adding NPCs to my C Raylib Game | First Game in C Part 3
2 months, 2 weeks ago |
www.youtube.com
Letting ChatGPT Answer My Emails with Python Gmail API
2 months, 3 weeks ago |
www.youtube.com
Reverse Engineering Gmail's Autocomplete Feature
2 months, 4 weeks ago |
www.youtube.com
Python is low-class. Switching to TypeScript
3 months, 1 week ago |
www.youtube.com
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US