How to add an additional module to the BERT architecture, then load the original weights and use it
May 18, 2022, 1:40 a.m. | /u/scp-8989
Natural Language Processing www.reddit.com
1. Add an additional module to the BERT architecture (Hugging Face's `transformers`)
2. Load the pretrained BERT weights into the model with the new architecture
3. Then use BERT directly, or continue training it
I'm confused about how to do this, since we usually call `from_pretrained` directly to load the model (both weights and architecture) from Hugging Face.
In more detail, I'm working with both [prajjwal1/bert-tiny](https://huggingface.co/prajjwal1/bert-tiny) and [bert-base-uncased](https://huggingface.co/bert-base-uncased).
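One common pattern for the three steps above is to subclass `BertPreTrainedModel`: then `from_pretrained` loads the matching pretrained weights into the BERT backbone and leaves any new submodule randomly initialized. A minimal sketch (not from the post; the `extra_head` module and its output size are hypothetical placeholders):

```python
import torch
import torch.nn as nn
from transformers import BertModel, BertPreTrainedModel

class BertWithExtraHead(BertPreTrainedModel):
    def __init__(self, config):
        super().__init__(config)
        # Step 1: the original BERT backbone plus an additional module
        self.bert = BertModel(config)
        self.extra_head = nn.Linear(config.hidden_size, 2)  # hypothetical new module
        self.post_init()  # initialize the new weights the standard way

    def forward(self, input_ids, attention_mask=None):
        # Step 3: run BERT, then feed its pooled output through the new module
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        return self.extra_head(outputs.pooler_output)

# Step 2: pretrained weights for `bert` are loaded from the checkpoint;
# `extra_head` has no counterpart there, so it stays freshly initialized
# (transformers logs a warning listing those newly initialized parameters).
model = BertWithExtraHead.from_pretrained("prajjwal1/bert-tiny")
input_ids = torch.randint(0, model.config.vocab_size, (2, 16))
logits = model(input_ids)
print(logits.shape)  # torch.Size([2, 2])
```

After loading, the model can be used for inference as-is or fine-tuned end to end; naming the backbone attribute `bert` matters, because it matches the library's `base_model_prefix` and lets the checkpoint keys line up.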