Feb. 27, 2024, 8:28 a.m. | /u/PhanTrang356

Computer Vision www.reddit.com

Hi, I am attempting to fine-tune the ViT-B-16 transformer on the ImageNet and SUN397 datasets. However, it currently requires at least 2 days to run for 15 epochs, and I also need to fine-tune the hyperparameters. I am utilizing 4 GPUs with 40 GB of memory and a batch size of 512. Is there any potential way to accelerate the training time or identify hyperparameters that won't take as much time?

computervision datasets gpus imagenet least memory training transformer vit

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US