Dec. 10, 2023, 10:41 a.m. | Unconventional Coding

Unconventional Coding www.youtube.com

Github: https://github.com/unconv/gpt4v-gemini

In today's video I showcase a Python program I have made using OpenAI's GPT-4 Vision API, Speech-to-text API and Whisper, that attempts to accomplish what the Google Gemini multimodal demo shows.

More information on the project coming up in future videos.

Support: https://buymeacoffee.com/unconv
Consultations: https://www.buymeacoffee.com/unconv/e/146735
Memberships: https://www.buymeacoffee.com/unconv/membership

00:00 Demo
03:25 Bloopers
05:26 Unedited Version

api demo future gemini google google gemini gpt gpt-4 gpt-4 vision information multimodal openai openai's gpt-4 project python shows speech speech-to-text text video videos vision whisper

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US