Scratch Implementation of Vision Transformer in PyTorch | allainews.com

April 13, 2023, 11:46 p.m. | /u/incrapnito

Computer Vision www.reddit.com

I am sharing my scratch PyTorch implementation of Vision Transformer. It has a detailed step-by-step guide of Self-attention and model specifics for learning Vision Transformers. The network is a small scaled-down version of the original architecture and achieves around 99.4% test Accuracy on MNIST and 92.5% on FashionMNIST.

Hope you find it helpful. Feedbacks appreciated.

GitHub: [https://github.com/s-chh/PyTorch-Vision-Transformer-ViT-MNIST](https://github.com/s-chh/PyTorch-Vision-Transformer-ViT-MNIST)

accuracy architecture attention computervision guide implementation mnist network pytorch self-attention small test transformer transformers vision

More from www.reddit.com / Computer Vision

how to utilize my time? 1 day, 2 hours ago | www.reddit.com

basics computer computer vision computervision +7

3D CV resources/roadmap 1 day, 4 hours ago | www.reddit.com

classification computervision experience good +10

Count the cigarette packs in the image (Stuck since December!!!) 1 day, 7 hours ago | www.reddit.com

computervision count ever face +6

Need to detect small objects in a large Image accurately with smaller model size and … 1 day, 11 hours ago | www.reddit.com

computervision good image inference +6

In the world's first autonomous racing championship, AI racers completed their eight-lap race in one … 1 day, 13 hours ago | www.reddit.com

abu dhabi ai-driven ai-powered autonomous +7

What can i do to improve my model 2 days, 4 hours ago | www.reddit.com

class computervision hello light +3

Measuring and Reducing Malicious Use With Unlearning 2 days, 9 hours ago | www.reddit.com

computervision measuring publication unlearning

Occlusion resilient pose estimation 2 days, 12 hours ago | www.reddit.com

computervision confidence multiple people +2

In bundle adjustment tasks, how are the weights for reprojection errors and GCPs set? 2 days, 19 hours ago | www.reddit.com

computervision control errors gcp +4

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net