-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Starting container fails with 'System error: read parent: connection reset by peer' #14203
Comments
System error: read parent: connection reset by peer
In the hopes that it'll prevent "System error: read parent: connection reset by peer" errors. See moby/moby#14203
I've included my repro instructions here: kubernetes/kubernetes#9822 (comment) Unfortunately, I don't know how to get docker into the magic state where this is reproducible. |
Please note that the issue we found with kubernetes is with 1.6.2 docker release. |
We see this exact error in several of our containers. Our only workaround is to rename the container. Interested in hearing other workarounds or solutions. |
Any advice for this bug? I suffered it for a long time with docker v1.6.2 in our production environment. |
Seems like an error connecting to sqlite, or rather while reading it. |
Docker 1.7.1 and Ubuntu 14.04. Updated my environment through apt-get and restarted. Nginx container no longer failed to start. |
@jjelev Yep, as stated in the original description, restarting Docker temporarily fixes the problem. Unfortunately, the problem reappears later. |
We're seeing this too, will try and dig up some more information. We're simultaneously executing a lot of the same container, might have something to do with it? Most executions work fine but this has just started happening. |
Wow, as has been seen by @cjcullen , the length of the docker run command seems to have something to do with this. I changed the length of one of the -e vars and my deterministic container launch failure went away. |
I've been having the same issue. Added an extra space to the failing command and it worked. So odd. |
We've hit this bug twice this morning after never seeing it in months of Docker use. Re: string length, we have made tweaks to our command recently. |
I didn't expect the workaround to work, but after adding a few spaces in an environment variable, we now had a full week without the error occurring. Previously, it occurred almost daily. We added this environment variable to all our compose containers:
|
I am having this same problem and I same docker version as OP |
I also had to add
to all my dockerbuild files for this error to go away... |
+1 one for the weird |
Nope, this is weird, I can't imagine what it would be. |
+1 adding a FOO env variable. Docker version 1.8.2 and docker-compose 1.4.0. |
This is a fix for moby/moby#14203
This is a fix for moby/moby#14203 (cherry picked from commit 491e35b)
@meeee @andrecp @AlbertodelaCruz @chrisjhoughton Just to clarify, by adding a FOO env variable do you mean adding a line like the following to the Dockerfile?
I tried this but it doesn't work for me... |
hello , i have also encountered this problem in docker version 1.9.1 . I know that the fix is avilable in docker 1.10 but is there a way to get the fix in docker 1.9.1 ? We may not be able to migrate to docker 1.10 soon . Is there a work around we can try in docker exec command ? Many people reported that docker run -e DOCKER_FIX='' worked for them . |
You could try passing in a longer command line, for example: docker exec the-container sh -c "intended-command args # extra padding junk On Thu, Mar 24, 2016, 1:46 AM rachit1arora notifications@github.com wrote:
|
This works around the docker bug: moby/moby#14203
Automatic merge from submit-queue e2e: add a dummy environment variable in the service tests This works around the docker bug: moby/moby#14203
This works around the docker bug: moby/moby#14203
This works around the docker bug: moby/moby#14203
This works around the docker bug: moby/moby#14203
I get the same error with this
I was doing a long running task using
|
I'm getting the error
on CoreOS. docker -v reports |
@vielmetti looks like possibly the JSON of one of your containers got corrupted, might be worth trying to find which one and either remove that container, or try to fix the JSON. Also keep in mind that CoreOS ships with a modified version of Docker (see coreos@1f8f545 for the commit it's built from), and issues should be reported in their issue tracker first |
Thanks @thaJeztah , I opened a CoreOS issue which appears to be unrelated to this particular issue (the error text is the same but the reproduction is different). |
This works around the docker bug: moby/moby#14203
This works around the docker bug: moby/moby#14203
Which version of this bug can be resolved ???? |
@shawntoxu see the milestone attached to this issue; it's in docker 1.10.0 |
thx. 在2017年06月29日 08:35,Sebastiaan van Stijn 写道: @shawntoxu see the milestone attached to this issue; it's in docker 1.10.0 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.
|
We are still seeing this sporadic error "... [9] System error: read parent: connection reset by peer" with docker 1.10.3. Maybe it is, because we are on CentOS7.2? It seems the fix of this error requires that you are using opencontainers/runc, but I guess we don't use that in our setup. uname -aLinux ilgnext-jenkins.svl.ibm.com 3.10.0-327.el7.x86_64 #1 SMP Thu Nov 19 22:10:57 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux docker -vDocker version 1.10.3, build 9419b24-unsupported` Any chances that this error is fixed in a more recent version for CentOS, e.g. in CentOS7.3? Or else, is there a solid way to test, if on my upgraded system the error does no longer appear. |
@jfdoerre docker 1.10 is no longer maintained, and the version in the CentOS repository is the Red Hat fork of docker, for which the code doesn't live in this repository.
Be aware that CentOS is a rolling release, which means that when 7.3 was released, 7.2 no longer receives updates, so it's indeed recommended to be on the current version If you're still seeing this on the current (17.03 or 17.06) release of the official Docker packages, please open a new issue. |
I'm locking the conversation on this issue, because the original issue was resolved; if you encounter this issue on an up to date version of docker, please open a new issue instead. |
On our CI server, we run tests in Docker containers using docker-compose. We link 2-15 containers during one run. We ensure that test jobs running concurrently have different docker-compose project names.
Since we upgraded to Docker 1.7.0 (from 1.6), docker-compose to 1.3.1 (from 1.2), and started killing containers instead of stopping them (faster, remove them anyway), we twice had containers failing to start with the following message from compose:
/var/log/docker.log
contains the following:The container is created, but doesn't start. Trying to manually start it using
docker start
fails with the same error message. Memory is available and the kernel log doesn't show any message from the OOM killer.Restarting docker temporarily solves the problem, so I assume this is a problem with docker itself, not with docker-compose.
docker version
:docker info
:uname -a
:Environment details (AWS, VirtualBox, physical, etc.): Physical machine
Steps to Reproduce:
docker-compose run ...
,kill
andrm
50-200 (estimated) containers with links using docker-composeActual Results: Starting the container fails.
Expected Results: Container starts.
Additional info:
Kubernetes seemed to have a similar problem, apparently some sort of race. The issue also links to a few occurences where people had a similar problem (mainly on IRC).
The text was updated successfully, but these errors were encountered: