May 29, 2023, 4:24 p.m. | /u/Technical-Vast1314

Machine Learning www.reddit.com


Paper: [https://arxiv.org/pdf/2305.15023.pdf](https://arxiv.org/pdf/2305.15023.pdf)

Project: [https://github.com/luogen1996/LaVIN](https://github.com/luogen1996/LaVIN)


Adapting large language models (LLMs) to multimodal instructions typically requires significant training time. BLIP-2 and MiniGPT-4 both need large sets of paired image-text samples for pretraining, while LLaVA requires fine-tuning the entire LLM. These approaches greatly increase the cost of multimodal adaptation and can degrade the LLM's original textual capabilities.

In this paper, we propose **an efficient multimodal instruction …
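To make the cost contrast above concrete, here is a minimal back-of-the-envelope sketch comparing full fine-tuning against adapter-style (parameter-efficient) tuning, the general family of techniques this line of work builds on. All dimensions are illustrative assumptions (LLaMA-7B-like sizes), not figures from the paper:

```python
# Rough comparison of trainable parameter counts:
# full fine-tuning of every transformer block vs. inserting a small
# bottleneck adapter per block and training only the adapters.
# All sizes below are assumptions for illustration.

HIDDEN = 4096      # transformer hidden size (assumed, LLaMA-7B-like)
LAYERS = 32        # number of transformer blocks (assumed)
BOTTLENECK = 8     # adapter bottleneck dimension (assumed)

# Approximate parameters per block:
# attention projections (~4 * d^2) + MLP with 4x expansion (~8 * d^2)
block_params = 4 * HIDDEN**2 + 8 * HIDDEN**2
full_finetune_params = LAYERS * block_params

# One bottleneck adapter per block: down-projection (d -> r)
# plus up-projection (r -> d), biases ignored for simplicity.
adapter_params_per_block = 2 * HIDDEN * BOTTLENECK
adapter_total = LAYERS * adapter_params_per_block

print(f"full fine-tuning: ~{full_finetune_params / 1e9:.1f}B params updated")
print(f"adapter tuning:   ~{adapter_total / 1e6:.1f}M params updated")
print(f"trainable fraction: {adapter_total / full_finetune_params:.4%}")
```

Under these assumptions, adapter tuning updates a few million parameters instead of billions, which is the core reason parameter-efficient approaches cut both training time and memory cost.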

