Friday, 2026-05-22

-@gerrit:opendev.org- Michal Nasiadka proposed: [opendev/system-config] 989567: Add backup03 https://review.opendev.org/c/opendev/system-config/+/98956705:15
@mnasiadka:matrix.orgClark: since follow: true is default in ansible.builtin.file: Ansible will follow the link and when the target is directory, it should be fine05:21
-@gerrit:opendev.org- Martin Zobel proposed: [openstack/diskimage-builder] 989657: Add DIB_USE_RSYNC option to copy image contents with rsync https://review.opendev.org/c/openstack/diskimage-builder/+/98965711:34
@clarkb:matrix.orgmnasiadka: got it. Is that documented somewhere?14:44
@clarkb:matrix.orgprobably not expliitly just have to interpret what follow true will do14:45
@mnasiadka:matrix.orgClark: Not really explicitly - there's https://docs.ansible.com/projects/ansible/latest/collections/ansible/builtin/file_module.html#parameter-follow14:45
@clarkb:matrix.orgthanks14:45
@clarkb:matrix.orgThe changes lgtm now and I did some checks on the server too. Not sure if Friday is the best day to add a third backup target (I'm also trying to take advantage of some good weather today to get out on the bike if possible). But I'll let other reviewers decide if they want to proceed 14:47
@clarkb:matrix.orgmoving the opendev system-config-run job debugging here15:03
@clarkb:matrix.orghttps://zuul.opendev.org/t/openstack/build/c70cb69401c943068b43223f9c607032 failed to lookup opendev.org to clone the git repos for dns zones15:04
@clarkb:matrix.org I can currently lookup opendev.org against both ns03 and ns04 over ipv4 (what the job env would have used) using dig15:04
@clarkb:matrix.orgso whatever the issue is is either intermittent or maybe environment/route/network specific?15:04
@fungicide:matrix.orgClark: i'm about three problems out on other stuff at the moment so haven't paged that change back into brain context, but suspect that it could be something to do with what addresses the resolver is listening on. if memory serves we do some gymnastics to make bind and unbound coexist on adns* by binding to different addresses15:05
@clarkb:matrix.organd the problem could be between our test nodes and cloudflare/google resolvers or between cloudflare/google resolvers and ns03/ns0415:06
@clarkb:matrix.orgfungi: oh you think that maybe the local deployment of dns tooling is breaking our unbound resolver that should service things? I could see that being the case15:06
@clarkb:matrix.orgmaybe we should put a hold in place and see if we catch one and try ot debug from there? I can do that15:06
@fungicide:matrix.orggiven that the change itself is altering what addresses we're using in which vars15:07
@clarkb:matrix.orgya though its doing that at an ansible level whcih I'm not sure if that affects unbound or bind/nsd15:07
@fungicide:matrix.orgit may affect the configuration templates we deploy with ansible15:07
@fungicide:matrix.orgbut this is entirely conjecture on my part15:08
@clarkb:matrix.orgyes that is possible. I'm skimming those now. THe bind template seems to use ansibel gathered facts so not the values in inventory15:08
@fungicide:matrix.organd yeah, seems like it must be cloud-specific behavior differences since it's passing in check, though also suspicious that we keep consistently passing in check and failing in gate, that part is probably mere coincidence15:09
@fungicide:matrix.orgin theory we don't run the job any differently between the two pipelines15:10
@clarkb:matrix.orgok our normal test nodes use unbound in a forwarding setup. But then because these are system-config-run jobs we apply the prod setup which recurses? I wonder if we are racing unbound config updates and reloading/restarting and catching unbound at a time where it isn't functional15:10
@clarkb:matrix.orgno looks like while we update the unbound config we don't force it to restart so it should continue running with the old config. Unless it automatically reloads its config later and we're racing that?15:13
@clarkb:matrix.orgso maybe before holding a node we can shim in some dns lookups and unbound state debugging to better characterize (and maybe trip) this15:14
@clarkb:matrix.orgI'll work on a change that does that15:14
-@gerrit:opendev.org- Jeremy Stanley https://matrix.to/#/@fungicide:matrix.org proposed: [openstack/project-config] 989781: Reset PuppetForge password https://review.opendev.org/c/openstack/project-config/+/98978115:21
@clarkb:matrix.orgIn other news etherpad survived the night. Upstream confirmed the bug and has a fix. We will probably have to upgrade to 3.1.1 or newer to get it though15:31
-@gerrit:opendev.org- Clark Boylan proposed: [opendev/system-config] 989784: Add some dns lookup debugging to adns test server https://review.opendev.org/c/opendev/system-config/+/98978415:33
@clarkb:matrix.orgsomething like that maybe15:33
-@gerrit:opendev.org- Zuul merged on behalf of Jeremy Stanley https://matrix.to/#/@fungicide:matrix.org: [openstack/project-config] 989781: Reset PuppetForge password https://review.opendev.org/c/openstack/project-config/+/98978115:35
@clarkb:matrix.orgfeel free to suggest additional debugging items. I think my main questions are "is unbound running" and if so with what config. Then what do some explicit lookups look like15:35
@fungicide:matrix.orgClark: upstream claude came up with a fix? ;)15:35
@clarkb:matrix.organd take it from there15:36
@clarkb:matrix.orgfungi: I haven't read the commit messages to see but I woudl expect so :)15:36
@fungicide:matrix.org`Co-Authored-By: Claude Opus 4.7 ...` yep!15:40
@clarkb:matrix.orgthe gitea 1.26.2 screenshots here: https://e427a35962f8c8fc85a5-28f35d2f5f941083fd95522e9f6bd028.ssl.cf1.rackcdn.com/openstack/890938b8e5014de38c98eff46c2d1af5/bridge99.opendev.org/screenshots/ look good to me. The change passed our testing checks too15:51
-@gerrit:opendev.org- Sabbir Ahmed proposed: [openstack/project-config] 989446: Add starlingx/app-machine-operator project https://review.opendev.org/c/openstack/project-config/+/98944616:34
@clarkb:matrix.orgok I'm going to pop out for a bit now. There is a hill to climb before it gets too hot17:00
@fungicide:matrix.orggood luck!17:09
@clarkb:matrix.orghttps://cdf674af691ebd2f77e5-7909900ed724dc9ef77c09a94566cfc0.ssl.cf1.rackcdn.com/openstack/446e5621d04a445185fd4b8d7965e251/bridge99.opendev.org/ara-report/results/373.html my debug dns change passed and the output in the debug script looks good (expected since there wasn't a failure). I guess I'll recheck20:21
@clarkb:matrix.orgbut that seems to confirm we are not restarting unbound. It is possible that we may wish to in order to ensure the config we expect is in use. But I think the test node config should be fine too20:22

Generated by irclog2html.py 4.1.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!