Memory leak in C++ when running module in separate thread #24237
Comments
This is likely to be caused by some thread local state that isn't cleaned up. Could you try running without MKL and see what happens? |
Hi @pietern, thanks for the quick answer. Sure, do you mean in the Python part when tracing the model? I don't think I use MKL in the C++ part, unless it's inside the torch lib. |
I mean on the C++ side. PyTorch compiled with MKL support will transparently use it, I think. |
I'm not sure how to check if it uses it or run without it. Could you walk me through or point to some documentation? |
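One quick way to check this on the C++ side is the build-time flag that ATen exposes; the sketch below (assuming a standard libtorch CMake project) simply prints it. On the Python side, torch.backends.mkl.is_available() reports the same thing.

    // Prints whether this libtorch build was compiled with MKL support.
    #include <iostream>
    #include <torch/torch.h>

    int main() {
      std::cout << "Built with MKL: "
                << (at::hasMKL() ? "yes" : "no") << std::endl;
      return 0;
    }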
I found a way to not use std::thread in my application, so this is not a problem for me anymore. |
While this still might be an issue by itself, I'll close the issue since you found a workaround. |
I am facing a similar problem. |
I'm facing the same problem using the Rust bindings. |
Hi @pietern, I am facing a similar problem. The memory usage keeps going up when I do inference in separate threads, and without the MKL lib the memory usage is stable. I tried setting the env variable MKL_DISABLE_FAST_MM=1, but it did not work out. |
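If the growth really does come from MKL's per-thread buffer cache, another thing worth trying besides the MKL_DISABLE_FAST_MM environment variable is releasing those buffers explicitly before each worker thread exits. The sketch below rests on that assumption; it requires linking MKL directly so that mkl.h is available, and it is not a confirmed fix for this issue.

    // Sketch: run one inference, then release MKL's cached buffers before
    // the thread that did the work goes away. Requires mkl.h / linking MKL.
    #include <mkl.h>
    #include <torch/script.h>

    void run_inference_once(torch::jit::script::Module& module,
                            const torch::Tensor& input) {
      torch::NoGradGuard no_grad;
      at::Tensor output = module.forward({input}).toTensor();
      // ... consume `output` ...
      mkl_free_buffers();  // release memory held by MKL's internal allocator
                           // (mkl_thread_free_buffers() is the per-thread variant)
    }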
I am facing a similar problem. |
Still the same issue with libtorch 1.7. |
I'm having the exact same issue using libtorch called in a thread from Unity.

C++ script:

    torch::NoGradGuard no_grad;
    at::Tensor tensor_image = torch::from_blob(...);
    tensor_image.set_requires_grad(false);
    std::vector<torch::jit::IValue> inputs;
    inputs.push_back(tensor_image);
    at::Tensor output;
    output = model.forward(inputs).toTensor();
    ...

(Unity) C# script calling the libtorch script:

    void Update() {
        ...
        ThreadMl = new Thread(Action);
        ThreadMl.Start();
    }

    private void Action() { // launched in a thread
        ...
        ScriptML(...); // Memory being leaked
        ...
    }

libtorch version: 1.11 (CPU - Windows) |
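Since the usage reported in this thread grows with the number of threads that ever run forward(), one mitigation (not a fix for the underlying leak) is to keep a single long-lived inference thread and feed it work, rather than constructing a new Thread in every Update(). Below is a rough C++ sketch of that pattern; InferenceWorker, Submit, and the result handling are illustrative names, not libtorch API.

    #include <condition_variable>
    #include <mutex>
    #include <queue>
    #include <string>
    #include <thread>
    #include <torch/script.h>

    // One long-lived thread owns the module and serves all requests, so any
    // per-thread allocations happen once instead of once per spawned thread.
    class InferenceWorker {
     public:
      explicit InferenceWorker(const std::string& model_path)
          : module_(torch::jit::load(model_path)),
            worker_([this] { Loop(); }) {}

      ~InferenceWorker() {
        {
          std::lock_guard<std::mutex> lock(mu_);
          stop_ = true;
        }
        cv_.notify_one();
        worker_.join();
      }

      // Queue an input tensor for inference on the worker thread.
      void Submit(torch::Tensor input) {
        {
          std::lock_guard<std::mutex> lock(mu_);
          pending_.push(std::move(input));
        }
        cv_.notify_one();
      }

     private:
      void Loop() {
        torch::NoGradGuard no_grad;  // inference only
        while (true) {
          torch::Tensor input;
          {
            std::unique_lock<std::mutex> lock(mu_);
            cv_.wait(lock, [this] { return stop_ || !pending_.empty(); });
            if (stop_ && pending_.empty()) return;
            input = std::move(pending_.front());
            pending_.pop();
          }
          at::Tensor output = module_.forward({input}).toTensor();
          // ... hand `output` back to the caller (callback, future, etc.) ...
        }
      }

      torch::jit::script::Module module_;
      std::mutex mu_;
      std::condition_variable cv_;
      std::queue<torch::Tensor> pending_;
      bool stop_ = false;
      std::thread worker_;
    };

On the Unity side, Update() would then only enqueue work (the C# analogue of Submit) instead of starting a new Thread each frame.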
This is very much still an active issue and should probably be reopened. |
I seem to be getting this - coming from Rust like @tiberiusferreira. In a multithreaded async environment, inference appears to leak memory. |
Any news on a fix for this issue?
Also having the same issue running a model for inference in a Thread class. |
Why is this closed??? To be clear: we cannot use torch any longer. I don't know how anyone does. Is it just for academics? |
@pietern can we get it reopened please? |
🐛 Bug
When calling the forward function of a Module, some memory is allocated that is not deallocated when the thread ends.
To Reproduce
Steps to reproduce the behavior:
Module scripted from Python as in the tutorial.
Loaded and run in C++ in a separate thread (a minimal sketch of this setup follows below).
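In the sketch (model path, input shape, and iteration count are placeholders), a module exported as in the "Loading a TorchScript Model in C++" tutorial is loaded once, and forward() is then called from a freshly created std::thread in a loop, which is where the reported growth shows up.

    #include <thread>
    #include <vector>
    #include <torch/script.h>

    int main() {
      // "traced_model.pt" is a placeholder for a module traced/scripted in Python.
      torch::jit::script::Module module = torch::jit::load("traced_model.pt");

      for (int i = 0; i < 1000; ++i) {
        std::thread t([&module] {
          torch::NoGradGuard no_grad;
          std::vector<torch::jit::IValue> inputs;
          inputs.push_back(torch::ones({1, 3, 224, 224}));  // placeholder input shape
          at::Tensor output = module.forward(inputs).toTensor();
          // output is discarded; only the allocation behavior matters here
        });
        t.join();  // memory grows with each short-lived thread
      }
      return 0;
    }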
Expected behavior
Inference runs in a separate thread with no increase in memory.
Environment
PyTorch version: 1.2.0
Is debug build: No
CUDA used to build PyTorch: None
OS: Microsoft Windows 10 Home
GCC version: Could not collect
CMake version: version 3.12.2
Python version: 3.6
Is CUDA available: No
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
Versions of relevant libraries:
[pip] numpy==1.16.2
[pip] numpydoc==0.8.0
[pip] torch==1.2.0
[pip] torchvision==0.4.0
[conda] _tflow_1100_select 0.0.3 mkl
[conda] _tflow_select 2.3.0 mkl
[conda] blas 1.0 mkl
[conda] cpuonly 1.0 0 pytorch
[conda] libmklml 2019.0.3 0
[conda] mkl 2019.1 144
[conda] mkl-include 2019.1 144
[conda] mkl-service 1.1.2 py36hb782905_5
[conda] mkl_fft 1.0.10 py36h14836fe_0
[conda] mkl_random 1.0.2 py36h343c172_0
[conda] pytorch 1.2.0 py3.6_cpu_1 [cpuonly] pytorch
[conda] tensorflow-base 1.10.0 mkl_py36h81393da_0
[conda] torchvision 0.4.0 py36_cpu [cpuonly] pytorch
Additional context
When running on the main thread, the memory seems to be allocated once on the first call and then reused.
Python threading doesn't have this problem.