April 11, 2024, 5:35 p.m. | /u/Dyoakom

r/MachineLearning | www.reddit.com

I took a look and didn't see a discussion thread here on this paper, which looks promising.


[https://arxiv.org/abs/2404.07143](https://arxiv.org/abs/2404.07143)


What are your thoughts? Could it be one of the techniques behind Gemini 1.5's reported 10M-token context length?
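
For context, the paper ("Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention") adds a compressive memory to standard attention: each segment runs normal causal attention locally, plus a linear-attention read from a running memory matrix that is updated every segment, with a learned per-head gate mixing the two outputs. Here's a rough PyTorch sketch of how I read the core update; the ELU+1 feature map, the memory/normalizer shapes, and the gating follow the paper, but I've simplified the normalization and skipped the delta-rule memory variant, so treat it as a sketch rather than the authors' implementation:

```python
import torch
import torch.nn.functional as F

def infini_attention_segment(q, k, v, memory, z, beta):
    """One segment of Infini-attention (sketch of arXiv:2404.07143).

    q, k, v: (batch, heads, seg_len, head_dim) projections for the current segment
    memory:  (batch, heads, head_dim, head_dim) compressive memory from past segments
    z:       (batch, heads, head_dim, 1) running normalization term
    beta:    (heads,) learned gating scalar per head
    """
    # Standard causal (local) dot-product attention within the segment
    local = F.scaled_dot_product_attention(q, k, v, is_causal=True)

    # Retrieve from the compressive memory with an ELU+1 feature map (linear attention)
    sigma_q = F.elu(q) + 1.0
    mem_out = (sigma_q @ memory) / (sigma_q @ z + 1e-6)

    # Update memory and normalizer with the current segment
    # (simple linear update; the paper also describes a delta-rule variant)
    sigma_k = F.elu(k) + 1.0
    memory = memory + sigma_k.transpose(-2, -1) @ v
    z = z + sigma_k.sum(dim=-2, keepdim=True).transpose(-2, -1)

    # Learned gate mixes long-term (memory) and local attention per head
    gate = torch.sigmoid(beta).view(1, -1, 1, 1)
    out = gate * mem_out + (1.0 - gate) * local
    return out, memory, z

# Example: process two segments sequentially (hypothetical sizes)
B, H, L, D = 1, 8, 128, 64
memory = torch.zeros(B, H, D, D)
z = torch.zeros(B, H, D, 1)
beta = torch.zeros(H)
for _ in range(2):
    q, k, v = (torch.randn(B, H, L, D) for _ in range(3))
    out, memory, z = infini_attention_segment(q, k, v, memory, z, beta)
```

The point is that the memory stays a fixed-size matrix per head, so compute and memory per segment don't grow with context length, which is what makes the "infinite context" framing plausible.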
