Sept. 19, 2023, 3:55 p.m. | NVIDIA

NVIDIA www.youtube.com

Check out HALP (Hardware-Aware Latency Pruning), a new method for adapting convolutional neural networks (CNNs) and #transformer-based architectures to real-time performance requirements. HALP prunes pre-trained models to maximize compute utilization. In on-road testing with NVIDIA DRIVE Orin™, it consistently outperformed alternative approaches.

00:00:00 - Introducing Hardware-Aware Latency Pruning (HALP)
00:00:29 - Common Model Optimization
00:00:59 - DNN Pruning
00:01:21 - Hardware Aware Latency Pruning
00:01:31 - Classification Tasks
00:01:37 - 3D Object Detection
00:02:04 - HALP with Transformers …
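To give a feel for what latency-aware pruning involves, here is a minimal sketch. HALP's actual algorithm is more sophisticated (see the video for details); this toy version only illustrates the core trade-off it addresses: remove the channels that cost the least importance per millisecond of latency saved, until the model fits a latency budget measured on the target hardware. All class names, function names, and numbers below are invented for illustration and are not NVIDIA's implementation.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Layer:
    name: str
    # Per-channel importance scores (e.g. derived from weight magnitudes);
    # higher means the channel matters more for accuracy.
    channel_importance: List[float]
    # latency_table[k] = profiled latency in ms of this layer when k channels
    # are kept. On real hardware this would come from measurements on the
    # target device (e.g. an NVIDIA DRIVE Orin), not an analytical model.
    latency_table: List[float]


def total_latency(layers: List[Layer], kept: Dict[str, int]) -> float:
    """Sum of per-layer latencies for the current channel counts."""
    return sum(layer.latency_table[kept[layer.name]] for layer in layers)


def prune_to_latency_budget(layers: List[Layer], budget_ms: float) -> Dict[str, int]:
    """Greedily remove channels until the model fits the latency budget."""
    # Sort each layer's channels so the least important remaining channel
    # is always the one considered for removal.
    for layer in layers:
        layer.channel_importance.sort(reverse=True)

    kept = {layer.name: len(layer.channel_importance) for layer in layers}

    while total_latency(layers, kept) > budget_ms:
        best = None  # (importance lost per ms saved, layer)
        for layer in layers:
            k = kept[layer.name]
            if k <= 1:
                continue  # keep at least one channel per layer
            latency_saved = layer.latency_table[k] - layer.latency_table[k - 1]
            importance_lost = layer.channel_importance[k - 1]
            ratio = importance_lost / max(latency_saved, 1e-9)
            if best is None or ratio < best[0]:
                best = (ratio, layer)
        if best is None:
            break  # nothing left to prune; budget cannot be met
        kept[best[1].name] -= 1

    return kept


if __name__ == "__main__":
    layers = [
        Layer("conv1", [0.9, 0.5, 0.4, 0.1], [0.0, 0.3, 0.5, 0.8, 1.0]),
        Layer("conv2", [0.8, 0.7, 0.2, 0.05], [0.0, 0.4, 0.6, 0.7, 1.2]),
    ]
    # Ask for a 1.2 ms budget; the unpruned model would take 2.2 ms.
    print(prune_to_latency_budget(layers, budget_ms=1.2))
    # -> {'conv1': 2, 'conv2': 3}
```

The key point the sketch captures is that pruning decisions are driven by measured latency on the deployment hardware rather than by proxy metrics such as FLOPs or parameter counts.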

