Monday, 2025-02-17

fungicorvus: this might be related to my observation on friday, if so mnaser is aware that something recently broke their hosted cloud profile for vexxhost13:10
corvusfungi: hrm.  well, it appears it is not fixed yet.  i suppose we either need to completely remove vexxhost, or update our clouds.yaml to hold the content previously found in the hosted profile.  does anyone know what that content is supposed to be?15:01
corvusarchive.org as of 2021 says: https://paste.opendev.org/show/b4iuUmOz9aBRYxguMRtO/15:04
corvusi'm working on that15:11
tkajinamI wonder if anyone is aware of the consistent node failure in arm64 jobs ?15:14
corvustkajinam: yes, one of opendev's public clouds is currently unavailable15:16
fungitkajinam: we only have one provider for those (osusol) and it's not working at the momenty15:16
fungitkajinam: but if you know of any other openstack cloud providers interested in donating arm server instance quota, please do get them in touch with us15:16
fungiwe used to have two providers, but linaro decided they couldn't keep providing it15:17
opendevreviewJames E. Blair proposed opendev/system-config master: Use local profile for vexxhost  https://review.opendev.org/c/opendev/system-config/+/94193315:17
corvusi manually made those changes on zl01 and it looks like it's happy to me, so i think that has all the settings we need15:18
fungii adjusted my personal cloud.yaml similarly and it's working for my account too15:26
fungicorvus: is there a reason requires_floating_ip: false is only set in the nodepool clouds.yaml, not for bridge?15:27
fungiand i guess the nodepool builders don't look at the image_format from the profile to decide what to upload15:28
corvusit looked like we set a lot fewer things in that file so i tried to keep it small; do you think we should add that one too?15:28
funginah, just curious about the divergence15:28
corvusfungi: i think the nodepool builders do look at that, but 'raw' is already in the file  (i wonder why)15:29
corvusthat=image format15:29
fungiaha, got it15:29
fungii totally missed that we were overriding it there15:29
fungiprobably we had it in there before it was added to the remote profile and could clean it up once it comes back15:30
corvussounds reasonable15:32
tkajinamcorvus fungi, ah, ok. thanks. got it.15:33
tkajinamfungi, I know some people working for arm things so I'll discuss it if I get a chance to talk with them.15:34
fungithanks!!!15:34
Clark[m]fungi: tkajanim: clarification the linaro cloud was hosted on "Works on ARM" hardware hosted by equinix. That. Hardware was pulled by works on arm and wasn't a linaro decision.15:54
Clark[m]Then sometime later equinix announced they are shutting all their hardware hosting down. Not sure if related15:55
fungiyeah, linaro wasn't able to find other resources to continue providing it15:59
clarkbany reason to not approve https://review.opendev.org/c/opendev/system-config/+/941679 and https://review.opendev.org/c/opendev/zone-opendev.org/+/941168 ? I guess we may want to see 941933 make things happy with clouds first?16:05
fungiboth lgtm16:13
corvusi +3d them.  if we bork the clouds.yaml on bridge -- well, deleting is a manual process anyway, so should be easy to detect and fix.16:38
fungifair enough16:38
clarkbwfm16:38
opendevreviewMerged opendev/zone-opendev.org master: Cleanup zuul-lb01 and reset zuul ttl  https://review.opendev.org/c/opendev/zone-opendev.org/+/94116816:41
opendevreviewClark Boylan proposed openstack/diskimage-builder master: Change grub variables for style and timeout  https://review.opendev.org/c/openstack/diskimage-builder/+/93768416:47
opendevreviewClark Boylan proposed openstack/diskimage-builder master: Change grub variables for style and timeout  https://review.opendev.org/c/openstack/diskimage-builder/+/93768416:49
opendevreviewMerged opendev/system-config master: Use local profile for vexxhost  https://review.opendev.org/c/opendev/system-config/+/94193317:19
opendevreviewMerged zuul/zuul-jobs master: [remove-registry-tag] Allow using in a loop  https://review.opendev.org/c/zuul/zuul-jobs/+/94151617:21
fungifix for the missing vexxhost profile deployed successfully17:29
fungiopenstackclient on bridge is working with vexxhost again17:29
fungis/fix/workaround/17:31
clarkbI've just realized that the buildkit image used by docker buildx commands may be somewhat hardcoded into docker?17:53
clarkbthat didn't even occur to me as a potential issue when setting up a mirror for that image17:53
fungihow so?17:53
corvusoh i think there may be a way to start a builder with a specific image, then docker will use an already running builder17:56
clarkbwell I mirrored buildkit:buildx-stable-1 into quay.io so that we can use it in zuul-jobs17:56
corvus1 sec17:56
clarkbbut I don't see a docker buildx create flag to use a specific image17:56
clarkbmaybe the trick is to pull the image firstthen rename it then buildx won't pull?17:56
fungiaha, got it. the tool is looking only to dockerhub17:57
fungion my way out to run a quick errand, hopefully back in half an hour17:57
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Use registry:2 image mirrored to quay.io  https://review.opendev.org/c/zuul/zuul-jobs/+/94197017:58
clarkbthat is one half of the update i was going to make17:59
corvusclarkb: https://docs.docker.com/build/builders/drivers/docker-container/17:59
corvusclarkb: i think maybe if we start the builder ourselves with the quay image then i think docker builds should use that builder automatically and not try to start a new one18:00
clarkbcorvus: gotcha so docker buildx create --driver-put=image=quay.io/opendevmirror/buildkit:buildx-stable-1 ?18:00
clarkbI'll push a change up to exercise ^18:00
corvuss/put/opt/ but yeah something like that18:01
clarkboh we need to set the docker-container driver as the driver as well since that is different than the default docker driver I guess18:01
corvusyeah18:01
clarkbthanks I'll look into that now. 941970 is a related change (registry is the other image we mirrored)18:01
corvusi haven't tried this exact thing, so, some experimentation may be necessary :)18:02
corvusi did include a step to start the docker builder in https://review.opendev.org/923084 -- but i constructed it to start the default builder with the default options18:03
corvus(that was to address a race with podman starting multiple builders)18:03
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Use mirrored buildkit:buildx-stable-1 image  https://review.opendev.org/c/zuul/zuul-jobs/+/94199218:08
clarkbfirst experimentation step18:08
opendevreviewMerged opendev/system-config master: Remove codesearch01  https://review.opendev.org/c/opendev/system-config/+/94167918:24
opendevreviewClark Boylan proposed opendev/system-config master: Adjust LE role file matchers on system-config-run-* jobs  https://review.opendev.org/c/opendev/system-config/+/94199718:25
clarkbI think we can dequeue 940219,5 from the opendev promote pipeline now. That was stuck there due to the empty nodeset with zuul launcher bug iirc18:33
clarkbany objections to me doing so now?18:33
fungino objection from me18:41
clarkbdone18:48
clarkbspeaking of bindep do we want to merge some bindep changes?18:52
clarkbthe buildx change failed on docker rate limits pulling the multiarch image before it got to buildx19:00
clarkbI'll recheck in a bit to see if that even works at all. But also I'll get a change up for multiarch mirroring I guess19:00
fungiyeah, i think we can merge whichever bindep changes folks arre comfortable with and then work on porting the same ideas to, say, git-review or something next19:11
clarkbI think we can do https://review.opendev.org/c/opendev/bindep/+/938568/9 and parents at least19:13
opendevreviewClark Boylan proposed opendev/system-config master: Mirror multiarch/qemu-user-static  https://review.opendev.org/c/opendev/system-config/+/94200219:24
clarkbcorvus: that pointer seems to have done it. The multiarch jobs pass after a recheck and grepping the logs shows it fetching from quay.io with no logs indicating it also pulled from docker19:47
clarkblast year we did a tour at dino lab in victoria bc and at the end they sit you down with some rocks containing fossils and the various air tools to slowly chip away at the rock surrounding the fossils. This container image stuff feels a lot like that. Slowly removing what we don't want and eventually we'll be where we want to be19:49
corvusnice!20:04
corvusi'm going to restart the remaining zuul components since the vexxhost fix has merged (and the zuul-web that was stuck has healed itself)20:05
corvusit's just the components on zuul02 that are the old version now20:06
opendevreviewMerged opendev/system-config master: Mirror multiarch/qemu-user-static  https://review.opendev.org/c/opendev/system-config/+/94200220:06
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Replace debian:testing with quay.io/opendevmirror/httpd:alpine  https://review.opendev.org/c/zuul/zuul-jobs/+/94200820:08
clarkbcorvus: ack thanks20:08
clarkb942008 is another replace docker hub hosted image with something roughly equivalent that we can fetch from quay20:08
clarkbhopefully with this set of updates the zuul-jobs role testing and the roles themselves will be a bit more reliable for us20:09
opendevreviewMerged openstack/diskimage-builder master: Change grub variables for style and timeout  https://review.opendev.org/c/openstack/diskimage-builder/+/93768421:12
clarkbI'm going to approve the dns change to remove codesearch01 now that the inventory cleanup is done21:41
clarkbI need to get a meeting agenda together21:42
clarkbanything need to be edited in? The service coordinator nomination period ends tomorrow so I'll call that out21:42
opendevreviewMerged opendev/zone-opendev.org master: Remove codesearch01 from DNS  https://review.opendev.org/c/opendev/zone-opendev.org/+/94168121:48
corvusall zuul components are running the same (latest) code now21:50
corvusalso, nl03 has recovered and we're launching nodes in osuosl again21:51
clarkbthanks!21:51
clarkbmy initial meeting agenda edits are in. Let me know if I need to add or remove things22:06
fungiinfra-root: do we have consensus for changes 816741, 938520 and 938568 in bindep? (the pyproject.toml series up through dropping support for python 3.6 and associated cleanup/simplifications)22:32
fungimaybe we can approve those tomorrow if there are no objections22:32
clarkbno objections from me22:32
fungimainly because in the coming weeks i'd like to start applying a similar pattern to some of our other tools too22:33
clarkb++22:34
clarkbdeleting zuul-lb01 and codesearch01 is approaching on my todo list. That can be another one for tomorrow if there are no objcetions before then22:35
clarkbfungi: I'm noticing that we may want to set up logrotate rules for the logs in /var/lib/mailman/core/var/logs23:04
clarkbspecifically on lists.opendev.org23:04
opendevreviewJames E. Blair proposed opendev/system-config master: Use a dedicated zuul launcher temp dir on /opt  https://review.opendev.org/c/opendev/system-config/+/94201823:16
corvuswe ran out of tmp space downloading images ^23:16
clarkb+2 from me23:17
clarkbthat may even auto restart things due to the docker compose config update23:18
corvusi manually removed some orphaned image file (i just proposed a change to do that automatically) and the launcher seems to have recovered after that without further intervention23:27
corvus2025-02-17 23:21:46,734 INFO zuul.Launcher: Starting upload <ImageUpload 99fba2c3128743f0b0599414ca914d51 state: uploading endpoint: raxflex/raxflex-SJC3 artifact: 2276952861474de8aae5689a9999fdcf validated: True external_id: None>23:28

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!