Saturday, 2023-02-11

clarkbgmann: yes its a bug in zuul. Until zuul gets restarted (which happens automatically in a few hours) it will continue to happen00:02
gmannclarkb: ok00:03
clarkbnormally we might try and restart things more quickly but we're trying to land the bugfix for the other issue that caused us to delay things this morning so that it can be picked up by the restarts too00:06
gmannk00:12
opendevreviewGhanshyam proposed openstack/tempest master: Separate the extra tests to run in a separate job  https://review.opendev.org/c/openstack/tempest/+/87305500:12
clarkbyou can track the automated restarts here: https://zuul.opendev.org/components its the scheduler components (which go last after we do all the executors) that we need updated00:13
clarkbyou'll see ze01 is paused. That is beacuse we've asked it to stop running any new jobs, complete running jobs, then exit gracefully.00:13
clarkbIt should show Running once it has done that and been restarted. In this case very likely with a newer version number as well00:17
clarkbthen ze02 will go through the process and so on00:18
opendevreviewGhanshyam proposed openstack/tempest master: Prepare tempest-slow-parallel job and run periodically  https://review.opendev.org/c/openstack/tempest/+/87344100:26
opendevreviewGhanshyam proposed openstack/tempest master: Revert "Mark tempest-multinode-full-py3 as n-v"  https://review.opendev.org/c/openstack/tempest/+/87328300:28
opendevreviewGhanshyam proposed openstack/tempest master: Revert "Mark tempest-multinode-full-py3 as n-v"  https://review.opendev.org/c/openstack/tempest/+/87328300:29
opendevreviewGhanshyam proposed openstack/tempest master: Minimize the tests footprint in multinode job  https://review.opendev.org/c/openstack/tempest/+/87344201:39
opendevreviewGhanshyam proposed openstack/tempest master: Separate the extra tests to run in a separate job  https://review.opendev.org/c/openstack/tempest/+/87305501:58
opendevreviewGhanshyam proposed openstack/tempest master: Revert "Mark tempest-multinode-full-py3 as n-v"  https://review.opendev.org/c/openstack/tempest/+/87328301:58
gmannkopecmartin: dansmith: this is the series to try to decrease the timeout failure, moving some jobs to periodic to make space for the  new job and spiting the tester footprints for a few jobs https://review.opendev.org/q/topic:bug%252F2004780 05:44
opendevreviewMerged openstack/tempest master: Mark test_minimum_basic_instance_hard_reboot_after_vol_snap_deletion as slow test  https://review.opendev.org/c/openstack/tempest/+/87269110:07
dansmithclarkb: still seeing the config error as of midnight last night.. I see not all the zuul workers are on the same version, so I assume we're still stuck14:52
dansmithglance and nova (and maybe other) gates have been 100% blocked for days because we're trying to land a fix, and that fix has hit the config error several times, among other things14:53
opendevreviewGhanshyam proposed openstack/tempest master: Move a few jobs to periodic  https://review.opendev.org/c/openstack/tempest/+/87316319:05
opendevreviewGhanshyam proposed openstack/tempest master: Prepare tempest-slow-parallel job and run periodically  https://review.opendev.org/c/openstack/tempest/+/87344119:05
opendevreviewGhanshyam proposed openstack/tempest master: Minimize the tests footprint in multinode job  https://review.opendev.org/c/openstack/tempest/+/87344219:05
opendevreviewGhanshyam proposed openstack/tempest master: Separate the extra tests to run in a separate job  https://review.opendev.org/c/openstack/tempest/+/87305519:05
opendevreviewGhanshyam proposed openstack/tempest master: Revert "Mark tempest-multinode-full-py3 as n-v"  https://review.opendev.org/c/openstack/tempest/+/87328319:06
opendevreviewGhanshyam proposed openstack/tempest master: Revert "Mark tempest-multinode-full-py3 as n-v"  https://review.opendev.org/c/openstack/tempest/+/87328319:15
opendevreviewGhanshyam proposed openstack/tempest master: DNM: Test RBAC Nova service role  https://review.opendev.org/c/openstack/tempest/+/86459519:50
opendevreviewGhanshyam proposed openstack/tempest master: Add test for assisted volume snapshot  https://review.opendev.org/c/openstack/tempest/+/86483919:56
opendevreviewGhanshyam proposed openstack/tempest master: DNM: Test RBAC Nova service role  https://review.opendev.org/c/openstack/tempest/+/86459519:57
clarkbdansmith: it was the schedulers which happen last that needed to be upgraded which happend around 9am pacific today20:04
dansmithclarkb: okay saw it several times at 9:10am today, so maybe that was just before20:05
dansmithhttps://review.opendev.org/c/openstack/tempest/+/873300 for reference20:05
clarkbgmann' change above https://review.opendev.org/c/openstack/tempest/+/873163 seems happy now20:05
dansmithit's in the gate now (again, for the many-eth time) so I'm hoping...20:05
gmannyeah, config error is gone now20:06
clarkbI've just rechecked rosmaita's change with the same problem and it is in check now20:06
dansmithgmann: well, it came and went several times for me, but I hope it's gone for good20:08
dansmithI'd recheck something, config error, immediate recheck, run check, fail in gate with config error, recheck, config error, recheck, run, etc20:08
dansmithpresumably because of which worker I was hitting each time or something20:08
gmannohk. I think I checked ~10ish20:09
clarkbdansmith: I think it was which scheduler. We had separately done some docker upgrade work and I think one of the schedulers may have ended up with the bug fix deployed. But ya everything according to the components list should now be new enough to include the fix20:10
dansmithack20:10
clarkband we also deployed the fix for the issue that caused us to have the pause yesterday mornin in processing thing20:10
clarkbwe would've deployed the other fix more quickly if we weren't also trying to get that one landed and deployed20:10
*** dmellado_ is now known as dmellado21:09
*** dmellado_ is now known as dmellado23:47

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!