all AI news
[D] How Exactly does Fuyu's image to embedding with nn.Linear work? Could you do more with it?
Nov. 8, 2023, 11:04 p.m. | /u/vatsadev
Machine Learning www.reddit.com
- model takes in text the regular way, text -> tokens -> embeddings
- it also takes image -> embeddings
- it has a vanilla decoder, so only text comes out, they add special tokens around images, so i'm assuming the decoder ignores output images
So, from what I know, nn.Linear takes in a tensor and makes embeddings …
break it down embedding embeddings image linear machinelearning text tokens work
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Robotics Technician - 3rd Shift
@ GXO Logistics | Perris, CA, US, 92571