Questions about GPT-1 in Huggingface. | allainews.com

Jan. 28, 2024, 10:48 a.m. | /u/Invariant_apple

Natural Language Processing www.reddit.com

Hi all, currently I'm learning about LLMs and I have a couple of noob questions.

First, let's start with the GPT-1 paper: https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf

**Question 1: What exactly is the shape of the input to the embedding step?**

Let's look at equation (2) in the paper.

According to the paper, the input to the model is named **U**. From what I have gathered so far, this should be the token IDs produced by an initial tokenization step. However, I am a bit confused …
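To make the shape question concrete, here is a minimal NumPy sketch of the embedding step in equation (2), h_0 = U W_e + W_p. It is only an illustration under stated assumptions: the toy sizes, the example token IDs, and the variable names `token_ids`, `h0_matmul`, and `h0_lookup` are made up here and do not come from the paper or from any library. It shows two readings of **U** that give the same result: a matrix of one-hot rows multiplied by W_e, or a vector of integer token IDs used to index rows of W_e.

```python
import numpy as np

# Toy sizes, chosen only for illustration (not the real GPT-1 dimensions).
vocab_size, n_ctx, d_model = 10, 4, 3
rng = np.random.default_rng(0)

W_e = rng.normal(size=(vocab_size, d_model))  # token embedding matrix
W_p = rng.normal(size=(n_ctx, d_model))       # position embedding matrix

# Hypothetical output of a tokenizer: one integer ID per position, shape (n_ctx,).
token_ids = np.array([5, 2, 7, 1])

# Reading 1: U is a stack of one-hot rows, shape (n_ctx, vocab_size),
# so U @ W_e is an ordinary matrix product.
U = np.eye(vocab_size)[token_ids]
h0_matmul = U @ W_e + W_p

# Reading 2: use the integer IDs to pick rows of W_e directly (a table lookup).
h0_lookup = W_e[token_ids] + W_p

# Both readings give the same (n_ctx, d_model) array.
assert h0_matmul.shape == (n_ctx, d_model)
assert np.allclose(h0_matmul, h0_lookup)
```

Under these assumptions, the one-hot product and the row lookup are interchangeable; the lookup form simply avoids building the large, mostly-zero matrix.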

