Aug. 31, 2023, 12:43 p.m. | /u/ebazarov

Machine Learning www.reddit.com

**TL;DR:**

We've built an asynchronous server using Starlette with Uvicorn and are facing significant memory management issues: after high-load scenarios, the allocated memory is not released back to the system. We have tried switching to other frameworks and applying different techniques, but have not been able to resolve it.
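To make the symptom easier to talk about, here is a minimal sketch of how one might compare Python-level allocations with the resident set size (RSS) the operating system reports. It is not taken from the reproduction repository and does not involve Starlette or Uvicorn. The names `rss_kib` and `simulate_load` are hypothetical, the `bytes(256)` payloads are only a stand-in for request buffers, and the RSS reading assumes a Linux-style `/proc/self/status` (it returns `None` elsewhere).

```python
import gc
import tracemalloc


def rss_kib():
    """Return the current resident set size in KiB, or None if unavailable.

    Assumption: a Linux-style /proc/self/status file; other platforms get None.
    """
    try:
        with open("/proc/self/status") as status:
            for line in status:
                if line.startswith("VmRSS:"):
                    return int(line.split()[1])
    except OSError:
        pass
    return None


def simulate_load(n_items=200_000):
    """Allocate many small objects, drop them, and record memory at each stage."""
    tracemalloc.start()
    rss_before = rss_kib()

    # Hypothetical stand-in for per-request buffers created under high load.
    payloads = [bytes(256) for _ in range(n_items)]
    traced_under_load, _ = tracemalloc.get_traced_memory()
    rss_under_load = rss_kib()

    # Drop every reference and force a collection, as if the load had ended.
    del payloads
    gc.collect()
    traced_after_free, traced_peak = tracemalloc.get_traced_memory()
    rss_after_free = rss_kib()
    tracemalloc.stop()

    return {
        "traced_under_load": traced_under_load,
        "traced_after_free": traced_after_free,
        "traced_peak": traced_peak,
        "rss_before_kib": rss_before,
        "rss_under_load_kib": rss_under_load,
        "rss_after_free_kib": rss_after_free,
    }


if __name__ == "__main__":
    for key, value in simulate_load().items():
        print(f"{key}: {value}")
```

The traced figures show what the Python allocator considers live, while the RSS figures show what the process still holds from the operating system. If the traced memory drops after the load but RSS stays high, the objects were freed at the Python level and the gap lies in how freed memory is returned to the system, which may or may not match what happens in the server described here.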

**There's an** [**open discussion**](https://github.com/encode/uvicorn/discussions/2078) **in the Uvicorn repository, and we prepared a** [**GitHub repo**](https://github.com/Besedo/memory_issue) **to reproduce the issue.**

[Memory consumption over time with Starlette + Gunicorn while waiting for 1 hour after …

