Questions about BigBird
Jan. 20, 2022, 3:48 p.m. | /u/KushnarevaL
Natural Language Processing www.reddit.com
Hello, people. I still have some questions after reading the Big Bird paper ( https://arxiv.org/pdf/2007.14062v2.pdf ) and would be grateful if any Big Bird specialists could help me understand the model better.
- Is the distribution of random attention (Figure 1 (a)) fixed in advance for all inputs, or can it differ from input to input, even on the same head?
- In BIGBIRD-ETC, do they add any additional global tokens besides [CLS]?
- In BIGBIRD-ITC, how is the …