March 13, 2024, 11:40 a.m. | /u/lildaemon

Machine Learning

I'm running exactly the same code on both machines. It's a language model that I'm training from scratch. The cpu trained model creates coherent words(it's a character level model), while the gpu trained model generates gibberish. Both have similar accuracy for next character prediction.

I prompted both models with "The" and looked at the completion.

GPU trained model completion: " on nut\\nik o o nu ko ge ede ed eet eg ed "

CPU trained model completion: " soldiers and …

code cpu gpu language language model machine machinelearning machines running scratch training words

Senior Data Engineer

@ Displate | Warsaw

Principal Software Engineer

@ Microsoft | Prague, Prague, Czech Republic

Sr. Global Reg. Affairs Manager

@ BASF | Research Triangle Park, NC, US, 27709-3528

Senior Robot Software Developer

@ OTTO Motors by Rockwell Automation | Kitchener, Ontario, Canada

Coop - Technical Service Hub Intern

@ Teradyne | Santiago de Queretaro, MX

Coop - Technical - Service Inside Sales Intern

@ Teradyne | Santiago de Queretaro, MX