Sept. 22, 2023, 2:20 p.m. | /u/Successful-Western27

Artificial Intelligence www.reddit.com

As AI models get bigger, training them requires more and more computing power. Researchers are looking for ways to train these large AI models without needing Google-scale resources.

A new paper proposes [LongLoRA](https://arxiv.org/pdf/2309.12307.pdf), **a fine-tuning approach that extends LLaMA2 7B to a 100k context length and LLaMA2 70B to a 32k context length on a single 8× A100 machine.**
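To give a flavour of what LoRA-style context extension looks like in code, here's a minimal sketch on the HuggingFace transformers + PEFT stack. The model id, RoPE scaling factor, and LoRA hyperparameters are illustrative assumptions on my part (and the exact `rope_scaling` format depends on your transformers version); the paper's own recipe additionally swaps in a sparse attention pattern during training and, as the authors note, unfreezes embeddings and norms, which is what `modules_to_save` gestures at here.

```python
# Minimal sketch, not the paper's exact recipe. Assumes HF transformers + PEFT;
# hyperparameters and the rope_scaling factor are illustrative.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-2-7b-hf"  # assumed model id

# Stretch RoPE so the model can address positions beyond its 4k
# pre-training window (positional-interpolation-style extension).
model = AutoModelForCausalLM.from_pretrained(
    base,
    torch_dtype=torch.bfloat16,
    rope_scaling={"type": "linear", "factor": 8.0},  # roughly 4k -> 32k positions
)

# Low-rank adapters on the attention projections; embeddings and norms
# are kept trainable as well.
lora = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    modules_to_save=["embed_tokens", "norm"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only a small fraction of weights train
```

The trainable parameter count stays tiny relative to the full model, which is what makes the single-machine claim plausible.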

Here are my highlights from the paper:

The big one, of course: LongLoRA efficiently fine-tunes large language models on much longer contexts

Key points: …
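To make the efficiency idea concrete, here's a rough, purely illustrative PyTorch sketch of the shifted sparse attention (S²-Attn) pattern the paper uses during fine-tuning: each group of tokens attends only within its group, and half the heads work on groups shifted by half a group so information still flows across group boundaries. The shapes, the helper name, and the omission of causal masking are my simplifications, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def shifted_sparse_attention(q, k, v, group_size):
    # q, k, v: (batch, heads, seq_len, head_dim); seq_len must divide evenly
    # into groups. Illustrative only: causal masking is omitted for clarity.
    b, h, n, d = q.shape
    assert n % group_size == 0 and h % 2 == 0
    shift = group_size // 2
    g = n // group_size

    def grouped_attn(q_, k_, v_):
        # each group of `group_size` tokens attends only within itself
        q_ = q_.reshape(b, -1, g, group_size, d)
        k_ = k_.reshape(b, -1, g, group_size, d)
        v_ = v_.reshape(b, -1, g, group_size, d)
        out = F.scaled_dot_product_attention(q_, k_, v_)
        return out.reshape(b, -1, n, d)

    # first half of the heads: plain group-wise attention
    out_plain = grouped_attn(q[:, : h // 2], k[:, : h // 2], v[:, : h // 2])

    # second half: shift tokens by half a group before attending, then
    # shift back, so neighbouring groups exchange information
    q_s = torch.roll(q[:, h // 2 :], shifts=-shift, dims=2)
    k_s = torch.roll(k[:, h // 2 :], shifts=-shift, dims=2)
    v_s = torch.roll(v[:, h // 2 :], shifts=-shift, dims=2)
    out_shift = torch.roll(grouped_attn(q_s, k_s, v_s), shifts=shift, dims=2)

    return torch.cat([out_plain, out_shift], dim=1)


# toy check: one 4096-token sequence, 8 heads of dim 64, 1024-token groups
q = k = v = torch.randn(1, 8, 4096, 64)
out = shifted_sparse_attention(q, k, v, group_size=1024)
print(out.shape)  # torch.Size([1, 8, 4096, 64])
```

If I read the paper right, full attention is still used at inference; the sparsity only cuts the fine-tuning cost.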

