Nov. 8, 2022, 2:12 a.m. | Kaixin Zhang, Hongzhi Wang, Han Hu, Songling Zou, Jiye Qiu, Tongxin Li, Zhishun Wang

cs.LG updates on arXiv.org arxiv.org

Recently, deep learning has been an area of intense research. However, as a
kind of computing-intensive task, deep learning highly relies on the scale of
GPU memory, which is usually prohibitive and scarce. Although some extensive
works have been proposed for dynamic GPU memory management, they are hard to
apply to systems with multiple dynamic workloads, such as in-database machine
learning systems.


In this paper, we demonstrated TENSILE, a method of managing GPU memory in
tensor granularity to reduce the …

arxiv gpu memory scheduling tensor

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India

Staff Data Engineer (Data Platform)

@ Coupang | Seoul, South Korea

AI/ML Engineering Research Internship

@ Keysight Technologies | Santa Rosa, CA, United States

Sr. Director, Head of Data Management and Reporting Execution

@ Biogen | Cambridge, MA, United States

Manager, Marketing - Audience Intelligence (Senior Data Analyst)

@ Delivery Hero | Singapore, Singapore