Nov. 29, 2022, 3:19 p.m. | Together

Source: Together blog (www.together.xyz)

With a new decentralized training algorithm, we fine-tuned GPT-J (6B) on
3.53 billion tokens, resulting in GPT-JT (6B), a model that outperforms
many 100B+ parameter models on classification benchmarks.
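A minimal sketch of how GPT-JT could be used for prompted classification with the Hugging Face transformers library. The Hub repository id "togethercomputer/GPT-JT-6B-v1" and the sentiment-labeling prompt are assumptions for illustration, not details taken from the announcement.

```python
# Sketch: load GPT-JT and ask it to label a sentence via a text prompt.
# The Hub id below is an assumption; check the Hugging Face Hub for the
# checkpoint actually published by Together.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "togethercomputer/GPT-JT-6B-v1"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Classification via prompting: the model completes the "Sentiment:" field.
prompt = "Review: The movie was a delight from start to finish.\nSentiment:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=3, do_sample=False)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:]))
```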

