March 13, 2024, 11:40 a.m. | /u/lildaemon

Machine Learning www.reddit.com

I'm running exactly the same code on both machines. It's a language model that I'm training from scratch. The cpu trained model creates coherent words(it's a character level model), while the gpu trained model generates gibberish. Both have similar accuracy for next character prediction.

I prompted both models with "The" and looked at the completion.

GPU trained model completion: " on nut\\nik o o nu ko ge ede ed eet eg ed "

CPU trained model completion: " soldiers and …

code cpu gpu language language model machine machinelearning machines running scratch training words

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote