Saturday, 2025-10-04

Clark[m]Looks like it passed so that is working00:25
tkajinamhttps://review.opendev.org/c/openstack/zaqar/+/963027?tab=change-view-tab-header-zuul-results-summary02:55
tkajinamI wonder if some maintenance is on-going ?02:55
tonybtkajinam: Not that I'm aware of03:36
tonyblet me poke in the logs03:36
tkajinamtonyb, thx !04:26
tonybtkajinam: Sorry The reason for the failure is beyond my zuul knowledge.04:28
tonybIt looks like ultimately it comes down to:04:28
tonyb2025-10-04 02:56:22,582 ERROR zuul.Launcher: [e: 6c2aa68a3b4f4952a2035cfc6f8795a5] [req: 3adb2de70cb64298896a80d3d070bf0f] Exception loading ZKObject at /zuul/nodeset/requests/3adb2de70cb64298896a80d3d070bf0f/revision04:28
tkajinamit seems the frequent failure started at 2025-10-04 02:13:2404:28
tkajinamhmm wait probably even earlier04:28
tkajinamtonyb, np !04:28
tkajinam2025-10-04 00:41:1204:29
tonybbut I really don't know what would cause ZK to fail to have those objects04:29
tkajinamhttps://zuul.opendev.org/t/openstack/builds?skip=1650 no node failure here04:29
tkajinamhttps://zuul.opendev.org/t/openstack/builds?skip=1600 it started here04:29
tonybWow just skip a cool 1650 builds ;P04:30
tkajinamwe may have to skip further a few hours later :-P04:30
tkajinammaybe zookeeper cluster is mulfunctioning but that's not what I'm familiar with04:31
tkajinamthat timestamp would help identifying the problem later04:31
tonybI'll keep poking04:31
tkajinamhttps://zuul.opendev.org/t/openstack/build/90bb41a0fa1540bb964ee0e1b539c088 is the "first one", for records04:31
tonybhttps://grafana.openstack.org/d/21a6e53ea4/zuul-status?orgId=1&from=2025-10-04T00:20:00.000Z&to=2025-10-04T01:20:00.000Z&timezone=utc is also interesting04:35
tkajinamyeah04:42
tonybI'm out of ideas, the zk cluster seems healthy, graphite says very little helpful.  We'll need to wait for another infra-root for help05:20
fricklerit looks like there may have been some temporary incompatibility between executors and launchers/schedulers. seems the issue resolved itself when the latter were upgraded 2h ago. I reenqued the reqs periodic-weekly jobs to verify11:00
tonybhmm I avoided restarting things as I didn't want to confuse any debugging.12:10
fungiyeah, when things crop up with zuul in the early hours of a saturday utc, suspect something related to our automated zuul upgrades since that's when they kick off14:54
fungii don't think zuul upstream does any testing with mismatched versions of components, so i guess not too surprising to occasionally have a change merge that assumes it is applied simultaneously to multiple components14:55
Clark[m]There are specific upgrade tests that test compatibility between mismatched versions. But that requires expecting problems and having test cases in advance15:32
corvuswe do perform testing with mismatched components, but not so much for niz15:33
fungiwhich makes sense as it's still basically in beta16:50
corvusi'd call it alpha :)17:01
fungiwfm17:24
fungialeph17:25

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!