[D] - Given that we can lossily transform text to images and vice versa, multimodality should not be required for AGI or the construction of world-models. Any causal relationship that can be inferred from images/audio/video should be inferable from t | allainews.com

Aug. 30, 2023, 1:42 p.m. | /u/30299578815310

Machine Learning www.reddit.com

Consider video data that captures various interactions between entities, say Person A and Person B. We then apply a video summarization network T(x) to the video, where x is either the video itself or an entity in it. For the sake of argument, assume T(x) produces a description of x so detailed that an arbitrary text-to-video model can decode it back into the original video with little information loss. Now, if we can infer a causal relationship in …
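To make the round-trip assumption concrete, here is a minimal toy sketch in Python. It is an illustration only, not the post's actual setup: the "video" is just a list of frames holding entity positions, `T` is a hand-written stand-in for the summarization network, and `decode`, `max_error`, and the `precision` parameter are hypothetical names introduced for this example. It shows only that a sufficiently detailed text description can be decoded back into the original data with a small, bounded loss.

```python
import re

# Assumption: a toy "video" is a list of frames, each mapping an
# entity name to an (x, y) position. Real video is far richer.

def T(video, precision=1):
    """Toy stand-in for the summarization network T(x): describe every frame as text."""
    lines = []
    for t, frame in enumerate(video):
        parts = [
            f"{name} is at ({x:.{precision}f}, {y:.{precision}f})"
            for name, (x, y) in sorted(frame.items())
        ]
        lines.append(f"Frame {t}: " + "; ".join(parts) + ".")
    return "\n".join(lines)

def decode(description):
    """Toy stand-in for the text-to-video model: parse the text back into frames."""
    pattern = r"([\w ]+?) is at \((-?[\d.]+), (-?[\d.]+)\)"
    video = []
    for line in description.splitlines():
        frame = {}
        for name, x, y in re.findall(pattern, line):
            frame[name.strip()] = (float(x), float(y))
        video.append(frame)
    return video

def max_error(original, restored):
    """Largest absolute coordinate difference between two toy videos."""
    return max(
        abs(p - q)
        for fa, fb in zip(original, restored)
        for name in fa
        for p, q in zip(fa[name], fb[name])
    )

if __name__ == "__main__":
    video = [
        {"Person A": (0.12, 0.0), "Person B": (1.0, 0.47)},
        {"Person A": (0.31, 0.0), "Person B": (1.0, 0.52)},
    ]
    text = T(video)
    print(text)
    print("max reconstruction error:", max_error(video, decode(text)))
```

The loss here comes entirely from rounding the coordinates in the text, so raising `precision` shrinks it. Whether a real summarization network can reach comparable fidelity on actual video is exactly the assumption the argument makes for the sake of discussion.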


