[D] Speed Up in FP32 vs FP16
Jan. 29, 2024, 9:33 p.m. | /u/MaintenanceNo5993
r/MachineLearning · www.reddit.com
- Model: CLIP ViT-B-32
- Dataset: MSCOCO Captions
- Number of Workers: 4
- Batch Size: 240 for FP16, 160 for FP32
For both FP32 and FP16, each epoch takes around 6 minutes.
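For reference, here is a minimal sketch of the kind of mixed-precision (FP16) training step this comparison implies, using PyTorch's `torch.cuda.amp`; the linear layer and random tensors are toy stand-ins for the actual CLIP model and MSCOCO loader, which aren't shown in the post.

```python
import torch
from torch import nn
from torch.cuda.amp import autocast, GradScaler

# Toy stand-ins (hypothetical) for the real CLIP model and MSCOCO batches.
model = nn.Linear(512, 512).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = GradScaler()  # loss scaling guards FP16 gradients against underflow
batches = [(torch.randn(240, 512), torch.randn(240, 512)) for _ in range(10)]

for images, targets in batches:
    images = images.cuda(non_blocking=True)
    targets = targets.cuda(non_blocking=True)
    optimizer.zero_grad(set_to_none=True)
    with autocast():                      # forward pass runs in FP16 where safe
        loss = nn.functional.mse_loss(model(images), targets)
    scaler.scale(loss).backward()         # backward on the scaled loss
    scaler.step(optimizer)                # unscales grads, then steps
    scaler.update()
```

On Tensor Core GPUs, `autocast` usually speeds up the matmul-heavy forward/backward noticeably, so identical epoch times are a hint that something other than GPU math is the bottleneck.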
One reason I'm considering is that *the majority of the time might be going to data movement* rather than GPU processing, since in the FP32 case there's hardly a moment when GPU utilization …
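One way to test that hypothesis is to time how long each iteration waits on the dataloader versus how long the GPU step takes. A minimal sketch, with hypothetical stand-ins for the real loader and train step:

```python
import time
import torch
from torch import nn

# Hypothetical stand-ins: swap in the real MSCOCO DataLoader and CLIP step.
model = nn.Linear(512, 512).cuda()
loader = [torch.randn(160, 512) for _ in range(20)]

load_t = compute_t = 0.0
end = time.perf_counter()
for batch in loader:
    load_t += time.perf_counter() - end      # time spent waiting on data
    start = time.perf_counter()
    model(batch.cuda(non_blocking=True)).sum().backward()  # stand-in step
    torch.cuda.synchronize()                 # let GPU work finish before timing
    compute_t += time.perf_counter() - start
    end = time.perf_counter()

print(f"waiting on data: {load_t:.1f}s | gpu step: {compute_t:.1f}s")
```

If the data-wait time dominates in both runs, identical 6-minute epochs are exactly what you'd expect: the GPU finishes its (faster) FP16 work and then idles until the next batch arrives, so precision never becomes the limiting factor.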