Large Language Models, GPT-2 — Language Models are Unsupervised Multitask Learners | allainews.com

Feb. 10, 2024, 4:42 p.m. | Vyacheslav Efimov

Towards Data Science - Medium towardsdatascience.com

Acing GPT capabilities by turning it into a powerful multitask zero-shot model

Introduction

GPT is a well-known series of models whose latest versions currently dominate a wide range of NLP tasks. The first GPT version was a significant milestone: with about 120M parameters, a large count at the time, it demonstrated state-of-the-art performance on several leading benchmarks. From that starting point, researchers worked to improve on the base version.

In 2019, researchers from OpenAI officially …
