April 22, 2023, 5:22 p.m. | /u/DietzscheNostoevsky

Data Science www.reddit.com

I was trying to evaluate different classification models on MNIST dataset.

There are two datasets provided : \`train\` - 42000 images, and \`test\` - 28000 images.

​

I first divided the original training dataset (42000 images) into a (80:20 split ) of \`train\_set\` (33600) and \`test\_set\` (8400) .

I trained several models, from on \`training set\`, \`cross-validated\` them on the \`training\_set\` only, and lastly evaluated the final model on the \`test\_set\` for generalization error.

​

Now that my final model …

classification datascience dataset datasets error images mnist set test training validation

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. BI Analyst

@ AkzoNobel | Pune, IN