Sept. 18, 2023, 4:02 p.m. | Yuichi Inoue

Towards Data Science - Medium towardsdatascience.com

Developing LLM-integrated GIT vision language models.

Summary of this article:

  • Explaining GIT, a Vision Language Model developed by Microsoft.
  • Replacing GIT’s language model with large language models (LLMs) using PyTorch and Hugging Face’s Transformers.
  • Introducing how to fine-tune GIT-LLM models using LoRA.
  • Testing and discussing the developed models.
  • Investigating if “Image Embeddings” embedded by the Image Encoder of GIT indicate specific characters in the same space as “Text Embedding”.

Large language models (LLM) are showing their value more and more. …

artificial intelligence data science deep learning large language models vision-and-language

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US