April 28, 2023, 1:37 p.m. | /u/Tiny-Entertainer-346

Natural Language Processing www.reddit.com

I am trying to train a T5 model. These are my training arguments:

    args = Seq2SeqTrainingArguments(
        model_dir,
        evaluation_strategy="steps",
        eval_steps=100,
        logging_strategy="steps",
        logging_steps=100,
        save_strategy="steps",
        save_steps=200,
        learning_rate=4e-5,
        per_device_train_batch_size=batch_size,
        per_device_eval_batch_size=batch_size,
        weight_decay=0.01,
        save_total_limit=3,
        num_train_epochs=10,
        predict_with_generate=True,
        fp16=True,
        load_best_model_at_end=True,
        metric_for_best_model="rouge1",
        report_to="tensorboard",
    )

My model trained for 7600 steps, but the last checkpoint saved was checkpoint-1800:

[trainer screenshot](https://i.stack.imgur.com/MBoFu.png)

Why is this so?
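For context, my understanding is that `save_total_limit=3` keeps only the most recent checkpoints and deletes older ones as new saves come in. A rough sketch of that rotation (the function name is just for illustration, not the Trainer's actual internals):

    # Hypothetical sketch of save_total_limit rotation (not Trainer internals):
    # a checkpoint is written every save_steps steps, and once more than
    # save_total_limit checkpoints exist, the oldest one is deleted.
    def saved_checkpoints(total_steps, save_steps, save_total_limit):
        """Return the checkpoint step numbers expected to remain on disk."""
        checkpoints = []
        for step in range(save_steps, total_steps + 1, save_steps):
            checkpoints.append(step)
            if len(checkpoints) > save_total_limit:
                checkpoints.pop(0)  # drop the oldest checkpoint
        return checkpoints

    print(saved_checkpoints(7600, 200, 3))  # -> [7200, 7400, 7600]

So if training really ran all 7600 steps, I would have expected the surviving checkpoints to be 7200, 7400, and 7600, not 1800.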

