March 27, 2024, 1:35 p.m. | /u/artificial_intelect

Machine Learning www.reddit.com

[https://x.com/vitaliychiley/status/1772958872891752868?s=20](https://x.com/vitaliychiley/status/1772958872891752868?s=20)

Shill disclaimer: I was the pretraining lead for the project

DBRX deets:

* 16 Experts (12B params per single expert; top\_k=4 routing)
* 36B active params (132B total params)
* trained for 12T tokens
* 32k sequence length training

expert experts machinelearning params per pretraining project routing tokens total training

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior ML Engineer

@ Carousell Group | Ho Chi Minh City, Vietnam

Data and Insight Analyst

@ Cotiviti | Remote, United States