@cidlik:matrix.org | Hello! | 08:40 |
---|---|---|
It seems I've found a Zuul bug: https://storyboard.openstack.org/#!/story/2011221 | ||
If a running job is aborted because a new patchset is uploaded, the processes it created continue to run. This means that processes on the same node start to interfere with each other. | ||
Has anyone else run into the same problem? | ||
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: | 09:04 | |
- [zuul/zuul] 928842: Use a consistent zk timeout in tests https://review.opendev.org/c/zuul/zuul/+/928842 | ||
- [zuul/zuul] 928843: Increase zk connection timeout in tests https://review.opendev.org/c/zuul/zuul/+/928843 | ||
@cidlik:matrix.org | * Hello! | 11:39 |
It seems I've found a Zuul bug: https://storyboard.openstack.org/#!/story/2011221 | ||
The executor kills ansible in such a way that ansible leaves running processes on remote nodes. If ansible runs a shell task with a simple `sleep <LARGE_TIMEOUT>` and the job is terminated because a new patchset is uploaded or the job times out, then the sleep keeps running on the node. I see this behavior on static nodes. | ||
Has anyone else run into the same problem? | ||
-@gerrit:opendev.org- Simon Westphahl proposed: [zuul/zuul] 928785: Store multiple provider node launch attempts https://review.opendev.org/c/zuul/zuul/+/928785 | 11:52 | |
@mnasiadka:matrix.org | Hello - does Zuul support running "ad-hoc" jobs? Like passing some arguments to a job and running it? | 12:59 |
@fungicide:matrix.org | > <@cidlik:matrix.org> Hello! | 13:25 |
> It seems I've found a Zuul bug: https://storyboard.openstack.org/#!/story/2011221 | ||
> The executor kills ansible in such a way that ansible leaves running processes on remote nodes. If ansible runs a shell task with a simple `sleep <LARGE_TIMEOUT>` and the job is terminated because a new patchset is uploaded or the job times out, then the sleep keeps running on the node. I see this behavior on static nodes. | ||
> Has anyone else run into the same problem? | ||
the static driver probably doesn't get as much attention as drivers for ephemeral resources. one of the reasons we stopped using static job workers in opendev many years ago is that finding and terminating every possible process a job might leave behind is close to impossible. is there even a way to tell ansible to stop all processes it has forked or shelled out? in the cases i'm aware of where people have to use the static node driver in production, they design their jobs so that a pre-run playbook looks for any still-running processes or other junk a previous run might have left behind and cleans them up, though you could probably have some luck doing that with a post-run cleanup playbook too. point is, the jobs have to know what should be terminated/removed; not every kind of job zuul might run can rely on features like, say, explicit process groups | ||
@fungicide:matrix.org | > <@mnasiadka:matrix.org> Hello - does Zuul support running "ad-hoc" jobs? Like passing some arguments to a job and running it? | 13:27 |
no, zuul doesn't operate on single jobs, the closest you can come is triggering a pipeline with specific criteria | ||
@fungicide:matrix.org | e.g. enqueuing a particular change/pr/branch state or firing a timer | 13:28 |
@fungicide:matrix.org | you could of course create separate pipelines for each individual job, but that isn't going to scale well at all | 13:31 |
-@gerrit:opendev.org- Simon Westphahl proposed: [zuul/zuul] 928913: Retry node launches on failure https://review.opendev.org/c/zuul/zuul/+/928913 | 13:53 | |
@noonedeadpunk:matrix.org | > <@mnasiadka:matrix.org> Hello - does Zuul support running "ad-hoc" jobs? Like passing some arguments to a job and running it? | 14:07 |
we have a "debug" pipeline where you need to set +W and CR+1 and type `debug` to trigger the job | ||
@fungicide:matrix.org | yeah, comment triggers are fairly common. some of opendev's tenants have an "experimental" pipeline where you can comment `check experimental` on a change and get it enqueued. that's not the same as running a specific job (though if that project's experimental pipeline jobs are all configured in the project, and the project is untrusted, the change could simply remove all other jobs from the experimental pipeline except the one you want run) | 14:32 |
-@gerrit:opendev.org- Zuul merged on behalf of James E. Blair https://matrix.to/#/@jim:acmegating.com: [zuul/zuul] 928877: Fix override control release note https://review.opendev.org/c/zuul/zuul/+/928877 | 16:26 | |
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: | 16:35 | |
- [zuul/zuul] 928874: Add aws snapshot image upload test https://review.opendev.org/c/zuul/zuul/+/928874 | ||
- [zuul/zuul] 928875: Add testing for AWS image import method https://review.opendev.org/c/zuul/zuul/+/928875 | ||
- [zuul/zuul] 928876: Test AWS ebs-direct image upload https://review.opendev.org/c/zuul/zuul/+/928876 | ||
- [zuul/zuul] 928950: Add some image upload debugging https://review.opendev.org/c/zuul/zuul/+/928950 | ||
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed on behalf of Simon Westphahl: | 16:35 | |
- [zuul/zuul] 928785: Store multiple provider node launch attempts https://review.opendev.org/c/zuul/zuul/+/928785 | ||
- [zuul/zuul] 928913: Retry node launches on failure https://review.opendev.org/c/zuul/zuul/+/928913 | ||
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 928959: Inherit some attributes from provider configs https://review.opendev.org/c/zuul/zuul/+/928959 | 17:37 | |
-@gerrit:opendev.org- Benedikt Löffler proposed: [zuul/nodepool] 928962: Make AWS create_fleet more robust https://review.opendev.org/c/zuul/nodepool/+/928962 | 18:47 | |
-@gerrit:opendev.org- Tristan Cacqueray https://matrix.to/#/@tristanc_:matrix.org proposed: [zuul/zuul] 928963: Use function syntax for declarations https://review.opendev.org/c/zuul/zuul/+/928963 | 19:24 | |
-@gerrit:opendev.org- Tristan Cacqueray https://matrix.to/#/@tristanc_:matrix.org proposed: [zuul/zuul] 928963: Use function syntax for declarations https://review.opendev.org/c/zuul/zuul/+/928963 | 20:27 | |
@tristanc_:matrix.org | felixedel: I just went through the fe-status-view-2.0 changes, it looks amazing, thanks! I just left a minor comment on 926722, and added 928963. | 20:29 |
-@gerrit:opendev.org- James E. Blair https://matrix.to/#/@jim:acmegating.com proposed: [zuul/zuul] 928971: Add missing AWS image configuration options https://review.opendev.org/c/zuul/zuul/+/928971 | 21:19 |
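
Editor's sketch, relating to the 13:25 remark about explicit process groups: the snippet below illustrates, on a single local machine, how a task started as the leader of its own process group can be torn down together with everything it forked, so a backgrounded `sleep` does not outlive it. This is not how Zuul or Ansible terminate tasks, and it does not by itself reach processes started on a remote node over SSH. The helper names `run_in_group` and `kill_group` are hypothetical, and a POSIX system is assumed.

```python
import os
import signal
import subprocess


def run_in_group(cmd):
    """Start cmd through the shell as the leader of a new session.

    start_new_session=True makes the child call setsid(), so the shell and
    everything it forks share a process group separate from ours.
    """
    return subprocess.Popen(cmd, shell=True, start_new_session=True)


def kill_group(proc, grace=2.0):
    """Send SIGTERM to the whole group, then SIGKILL if the leader lingers."""
    pgid = os.getpgid(proc.pid)
    os.killpg(pgid, signal.SIGTERM)
    try:
        proc.wait(timeout=grace)
    except subprocess.TimeoutExpired:
        # The leader ignored SIGTERM; force the whole group down.
        os.killpg(pgid, signal.SIGKILL)
        proc.wait()


if __name__ == "__main__":
    # The shell backgrounds a long sleep and waits on it, similar to the
    # `sleep <LARGE_TIMEOUT>` case described in the log.
    task = run_in_group("sleep 300 & wait")
    kill_group(task)
    print("task exit status:", task.returncode)
```

A job on a static node could only rely on something like this if every process it starts stays in the same group, which is the limitation raised in the discussion: anything that daemonizes or calls `setsid()` itself escapes the group and still needs an explicit cleanup step.
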