
Conversation

mrjana
Contributor

@mrjana mrjana commented Sep 9, 2016

Swarm-scope network connected containers with autostart enabled had a
dependency problem: the cluster has to be initialized before they can be
autostarted. With the current container restart code running before cluster
init, these containers were not getting autostarted properly. Added a fix to
delay the start of any container that has at least one swarm-scope endpoint
until after the cluster is initialized.

Signed-off-by: Jana Radhakrishnan mrjana@docker.com
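
To make the described behavior concrete, here is a hypothetical, heavily simplified Go sketch (illustrative names only, not the actual moby/moby code): containers attached to at least one swarm-scope network are collected and started only after cluster init, while local-only containers start immediately.

package main

import "fmt"

// Hypothetical, simplified types; not the real daemon structs.
type Container struct {
    ID       string
    Restart  bool              // autostart, e.g. --restart=always
    Networks map[string]string // network name -> scope ("local" or "swarm")
}

// hasSwarmScopeEndpoint reports whether the container is attached to at
// least one swarm-scope network.
func hasSwarmScopeEndpoint(c *Container) bool {
    for _, scope := range c.Networks {
        if scope == "swarm" {
            return true
        }
    }
    return false
}

// restoreContainers starts local-only containers right away and returns the
// swarm-attached ones so the caller can start them after cluster init.
func restoreContainers(all []*Container) (deferred []*Container) {
    for _, c := range all {
        if !c.Restart {
            continue
        }
        if hasSwarmScopeEndpoint(c) {
            deferred = append(deferred, c)
            continue
        }
        fmt.Println("starting", c.ID)
    }
    return deferred
}

func main() {
    containers := []*Container{
        {ID: "web", Restart: true, Networks: map[string]string{"mynet": "swarm"}},
        {ID: "db", Restart: true, Networks: map[string]string{"bridge": "local"}},
    }
    deferred := restoreContainers(containers)
    // ...cluster init happens here...
    for _, c := range deferred {
        fmt.Println("starting after cluster init:", c.ID)
    }
}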

@mavenugo
Contributor

@mrjana I think there is a timing issue between the overlay cleanup and container restart. When I have multiple containers with --restart=always, they fail to come up with the error: ERRO[0019] subnet sandbox join failed for "10.0.0.0/24": overlay subnet 10.0.0.0/24 has conflicts in the host while running in host mode

@mavenugo
Contributor

@mrjana I stand corrected; the above issue was not due to this PR. I had some stale bridges in my setup that resulted in this conflicting subnet on kernels < 3.16 (host mode). With those stale bridges removed and using moby/libnetwork#1442, this works fine.

@mavenugo
Contributor

LGTM

@vikstrous
Contributor

LGTM

@cpuguy83
Member

Side note: I noticed that when you disconnect a container from a swarm network you get an error in the daemon logs: ERRO[0099] task unavailable method=(*Dispatcher).processUpdates module=dispatcher task.id=b6fv33i39ahpjjex083752804

@cpuguy83
Member

Can you add a test case for swarm daemon restart w/ attached container + autorestart?

@mrjana
Contributor Author

mrjana commented Sep 13, 2016

Side note: I noticed that when you disconnect a container from a swarm network you get an error in the daemon logs: ERRO[0099] task unavailable method=(*Dispatcher).processUpdates module=dispatcher task.id=b6fv33i39ahpjjex083752804

That's because the task is removed in the manager before the dispatcher processes the update from the agent reporting that the task has completed.

@mrjana
Contributor Author

mrjana commented Sep 13, 2016

Will add a test case

Swarm-scope network connected containers with autostart enabled had a
dependency problem: the cluster has to be initialized before they can be
autostarted. With the current container restart code running before cluster
init, these containers were not getting autostarted properly. Added a fix to
delay the start of any container that has at least one swarm-scope endpoint
until after the cluster is initialized.

Signed-off-by: Jana Radhakrishnan <mrjana@docker.com>
@mrjana
Contributor Author

mrjana commented Sep 13, 2016

@cpuguy83 Added a test case


out, err = d.Cmd("ps", "-q")
c.Assert(err, checker.IsNil)
c.Assert(strings.TrimSpace(out), checker.Not(checker.Equals), "")
Contributor


I understand it doesn't make a big difference, but it would be great if you could get the container ID prior to restart and then compare it here; that makes it more correct.
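
For illustration, a minimal sketch of that comparison, reusing the d.Cmd and checker helpers from the snippet above; idBefore/idAfter are made-up names and the daemon restart in between is only indicated by a comment.

idBefore, err := d.Cmd("ps", "-q")
c.Assert(err, checker.IsNil)
c.Assert(strings.TrimSpace(idBefore), checker.Not(checker.Equals), "")

// ...restart the daemon here...

idAfter, err := d.Cmd("ps", "-q")
c.Assert(err, checker.IsNil)
c.Assert(strings.TrimSpace(idAfter), checker.Equals, strings.TrimSpace(idBefore))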

Contributor Author


There is only one container running on this daemon. So I don't think that is necessary.

Contributor


ok

@cpuguy83
Member

LGTM

@mavenugo
Contributor

Re-LGTM.

win2lin failure is unrelated. merging.

@mavenugo mavenugo merged commit 1d76ab4 into moby:master Sep 14, 2016
@thaJeztah thaJeztah added this to the 1.13.0 milestone Sep 20, 2016