Optimizing ViT-B-16 Transformer Training: How to Accelerate Training Time | allainews.com

Feb. 27, 2024, 8:28 a.m. | /u/PhanTrang356

Computer Vision www.reddit.com

Hi, I am attempting to fine-tune the ViT-B-16 transformer on the ImageNet and SUN397 datasets. However, it currently requires at least 2 days to run for 15 epochs, and I also need to fine-tune the hyperparameters. I am utilizing 4 GPUs with 40 GB of memory and a batch size of 512. Is there any potential way to accelerate the training time or identify hyperparameters that won't take as much time?

computervision datasets gpus imagenet least memory training transformer vit

More from www.reddit.com / Computer Vision

My New project . open cv real time face and emotion recognation. drop ur thought … 8 hours ago | www.reddit.com

computervision emotion face project +1

Developing Software vs Off the Shelf 17 hours ago | www.reddit.com

computervision industry manufacturing opencv +5

YOLOv8 TensorRT based on the references provided by Ultralytics 19 hours ago | www.reddit.com

case computervision jetson jetson orin +4

CNN vs. Vision Transformer: A Practitioner's Guide to Selecting the Right Model 23 hours ago | www.reddit.com

architecture blog cnn computervision +12

Processing 80 camera streams on a single rack-mounted server - anyone worked on a similar … 1 day, 15 hours ago | www.reddit.com

application cameras computervision decoding +7

Predicting the real world coordinates (x,y,z) of a ball from 2d image taken from a … 1 day, 18 hours ago | www.reddit.com

2d image box center computervision +7

2024 review of OCR tools extracting text from handwritten forms and documents 1 day, 20 hours ago | www.reddit.com

case computervision documents example +10

Looking for Recent Visual Programming Tools for Computer Vision 1 day, 23 hours ago | www.reddit.com

advance coding computer computer vision +13

Multi box localization 2 days, 1 hour ago | www.reddit.com

box computervision experience extract +10

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net