April 18, 2024, 4:42 p.m. | /u/flyforlight

Machine Learning www.reddit.com

https://preview.redd.it/fh44g3n4m9vc1.png?width=1383&format=png&auto=webp&s=9b3e499bd51aeb10559f4636eba2a1677d4a08a3

InternVL is a multi-modal foundation model whose paper was accepted as an Oral at CVPR 2024. The latest version, InternVL v1.5, ranks first on the OpenCompass multi-modal model benchmark.

Demo:

[https://internvl.opengvlab.com/](https://internvl.opengvlab.com/)

Model Download:

[https://huggingface.co/collections/OpenGVLab/internvl-65b92d6be81c86166ca0dde4](https://huggingface.co/collections/OpenGVLab/internvl-65b92d6be81c86166ca0dde4)

OpenCompass:

[https://rank.opencompass.org.cn](https://rank.opencompass.org.cn/home)



Some examples:

https://preview.redd.it/rwj7vs9rm9vc1.jpg?width=902&format=pjpg&auto=webp&s=514e14e692db8ea7bd5a66cc36b1ca3f8351102c

https://preview.redd.it/vtwjml3qm9vc1.png?width=2508&format=png&auto=webp&s=e32c044d4bc60ef28baf64dccdcb5fe9b10dfc61



https://preview.redd.it/p51vt3xpn9vc1.png?width=2609&format=png&auto=webp&s=73907e5ffb4d9b9bd4250cbce53e3bd29dedabf1
