Sept. 10, 2022, 6:24 p.m. | /u/markurtz

Machine Learning www.reddit.com

Building on the oBERT research we published at Neural Magic, plus some further iteration, we've achieved a 175X increase in NLP performance on CPUs while retaining 99% accuracy on the question-answering task in MLPerf. A combination of distillation, layer dropping, quantization, and unstructured pruning with oBERT enabled these large performance gains through the [DeepSparse Engine](https://github.com/neuralmagic/deepsparse). All of our contributions and research are open-sourced or free to use. Read through the [oBERT paper on arXiv](https://arxiv.org/abs/2203.07259) and try out the research in [SparseML](https://github.com/neuralmagic/sparseml). …
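
For anyone who wants to try a sparse-quantized QA model on their own CPU, here is a minimal sketch of inference through the DeepSparse `Pipeline` API (`pip install deepsparse`). The SparseZoo model stub below is illustrative only; check sparsezoo.neuralmagic.com for the exact oBERT stubs.

```python
from deepsparse import Pipeline

# Illustrative SparseZoo stub for a pruned + quantized oBERT QA model;
# substitute the exact identifier from the SparseZoo.
MODEL_STUB = (
    "zoo:nlp/question_answering/obert-base/pytorch/huggingface/"
    "squad/pruned90_quant-none"
)

# Compile the sparse model for CPU inference with the DeepSparse Engine.
qa_pipeline = Pipeline.create(
    task="question-answering",
    model_path=MODEL_STUB,
)

# Run a question-answering request against a short context.
output = qa_pipeline(
    question="What techniques enable the CPU speedup?",
    context=(
        "Distillation, layer dropping, quantization, and unstructured "
        "pruning with oBERT enable large speedups in DeepSparse."
    ),
)
print(output.answer)
```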

machinelearning mlperf nlp performance sparsity
