Wednesday, 2024-06-19

<opendevreview> Tony Breeds proposed opendev/system-config master: DNM: Initial dump or mediawiki role and config  https://review.opendev.org/c/opendev/system-config/+/921322 [00:39]
<opendevreview> Tony Breeds proposed opendev/system-config master: Add golang based docker compose tool.  https://review.opendev.org/c/opendev/system-config/+/920760 [01:28]
<opendevreview> Tony Breeds proposed opendev/system-config master: Add an opendev specific build of mediawiki  https://review.opendev.org/c/opendev/system-config/+/921321 [01:28]
<opendevreview> Tony Breeds proposed opendev/system-config master: DNM: Initial dump or mediawiki role and config  https://review.opendev.org/c/opendev/system-config/+/921322 [01:28]
<tonyb> frickler: If you're still around can you take another look at: https://review.opendev.org/c/opendev/system-config/+/920760 [01:29]
<opendevreview> Tony Breeds proposed opendev/system-config master: DNM: Initial dump or mediawiki role and config  https://review.opendev.org/c/opendev/system-config/+/921322 [03:37]
*** mordred1 is now known as Guest10115 [04:52]
<opendevreview> Aurelio Jargas proposed zuul/zuul-jobs master: Add ensure-poetry role  https://review.opendev.org/c/zuul/zuul-jobs/+/922286 [07:15]
<opendevreview> Aurelio Jargas proposed zuul/zuul-jobs master: Add ensure-poetry role  https://review.opendev.org/c/zuul/zuul-jobs/+/922286 [08:38]
<opendevreview> Jens Harbott proposed openstack/project-config master: nodepool: revert dedicated diskimages list for rax-iad  https://review.opendev.org/c/openstack/project-config/+/922295 [09:30]
<opendevreview> Jens Harbott proposed openstack/project-config master: nodepool: reorder providers to list rax-dfw last  https://review.opendev.org/c/openstack/project-config/+/922296 [09:30]
<opendevreview> Merged openstack/project-config master: nodepool: revert dedicated diskimages list for rax-iad  https://review.opendev.org/c/openstack/project-config/+/922295 [10:05]
<opendevreview> Merged openstack/project-config master: nodepool: reorder providers to list rax-dfw last  https://review.opendev.org/c/openstack/project-config/+/922296 [10:06]
<frickler> hmm, no immediate improvement with ^^, just more errors from rax. waiting now to see what happens when the next image rebuild finishes [10:38]
<opendevreview> Merged opendev/system-config master: Report Zuul events for WanderTracks repos  https://review.opendev.org/c/opendev/system-config/+/921776 [11:05]
<opendevreview> Jens Harbott proposed openstack/project-config master: Drop wheel publishing for centos-8-stream  https://review.opendev.org/c/openstack/project-config/+/922313 [12:24]
<opendevreview> Merged opendev/system-config master: Add golang based docker compose tool.  https://review.opendev.org/c/opendev/system-config/+/920760 [13:08]
<frickler> so ubuntu-jammy is now available on openmetal-iad3, time to boot some instances? or just add into the normal pool at once and see what happens? [14:04]
<SvenKieske> what do I need to do, to get this merged? https://review.opendev.org/c/openstack/project-config/+/920472 (I'm not even 100% sure this is the correct channel to ask? project-config has no real documentation wrt to code owners etc that I could deduce from the readme) [14:22]
<fungi> frickler: i guess the main concern is that it boots but networking is set up incorrectly such that it can't reach the mirror... though even then builds would fail early enough that they'll get retried i think? [14:24]
<fungi> SvenKieske: the config-core reviewers are the code owners for openstack/project-config [14:24]
<fungi> i'll take a look [14:24]
<SvenKieske> ty, so I guess this is the correct channel to ask for reviews? [14:26]
<fungi> SvenKieske: sure, or #openstack-infra since it's an openstack-specific request (that's the channel for the tact sig, who are openstack's liaisons to the opendev collaboratory) [14:33]
<fungi> ultimately it's mostly the same people wearing different hats [14:34]
* fungi has a vast collection of funny-looking hats [14:34]
<SvenKieske> ah I always forget that channel, put it into autojoin, ty! [14:35]
<fungi> ugh, did a new filelock release just break us again? [14:36]
<JayF> https://github.com/tox-dev/filelock/issues/343 it was just yanked [14:38]
<fungi> Successfully installed cachetools-5.3.3 chardet-5.2.0 colorama-0.4.6 distlib-0.3.8 filelock-3.15.2 packaging-24.1 platformdirs-4.2.2 pluggy-1.5.0 pyproject-api-1.6.1 tomli-2.0.1 tox-4.15.1 virtualenv-20.26.2 [14:39]
<fungi> yeah, looks like that was it. thanks JayF! [14:39]
<fungi> guess you were already looking into it too ;) [14:39]
<fungi> recheck time [14:40]
<opendevreview> Merged openstack/project-config master: restrict enforced backport voting to master branch  https://review.opendev.org/c/openstack/project-config/+/920472 [14:56]
<fungi> SvenKieske: that's ^ successfully deployed to gerrit two minutes ago if you want to test at your convenience [15:02]
<SvenKieske> fungi: ty! [15:11]
*** dtantsur_ is now known as dtantsur [15:12]
<SvenKieske> seems to work :) [15:13]
<fungi> perfect [15:19]
<JayF> fungi: saw your message, knew the github repo from the other day and did a quick look [15:19]
<JayF> happy to get bonus credit but I don't deserve it :D [15:20]
<frickler> fungi: does it make sense to start with max_servers=1 to limit the possible impact? the etherpad simply says to set it to 50 next, which would be the full size of the cloud [16:01]
<fungi> yeah, that seems like a good way of testing the waters without the added effort of manually booting and inspecting a node. almost all the ways i can think of it failing would result in build retries not failures [16:08]
<opendevreview> Jens Harbott proposed openstack/project-config master: Enable openmetal cloud for nodepool  https://review.opendev.org/c/openstack/project-config/+/922338 [16:27]
<opendevreview> Merged openstack/project-config master: Enable openmetal cloud for nodepool  https://review.opendev.org/c/openstack/project-config/+/922338 [16:45]
<frickler> this doesn't look good: 2024-06-19 16:54:06,884 ERROR nodepool.StateMachineNodeLauncher.openmetal-iad3: [e: f237626ad7db4a0cb8323bfbc206d7af] [node_request: 900-0024530346] [node: 0037763929] Error in creating the server. Compute service reports fault: No valid host was found. [16:57]
<frickler> although another node is in use, hmm [16:59]
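
Editor's sketch: when hunting for more launch failures like the one frickler pasted, it can help to pull the node and request IDs out of each log line. The regular expression below is an assumption inferred from that single quoted line, not Nodepool's documented log format, and `parse_launch_error` is a hypothetical helper name.

```python
import re

# Pattern inferred from the one log line quoted in the channel; treat the
# field layout as an assumption rather than a documented Nodepool format.
LOG_RE = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"(?P<level>[A-Z]+) "
    r"(?P<logger>\S+): "
    r"\[e: (?P<event>[0-9a-f]+)\] "
    r"\[node_request: (?P<node_request>[\d-]+)\] "
    r"\[node: (?P<node>\d+)\] "
    r"(?P<message>.*)$"
)


def parse_launch_error(line):
    """Return the named fields of a launcher log line, or None if it does not match."""
    match = LOG_RE.match(line.strip())
    return match.groupdict() if match else None


line = (
    "2024-06-19 16:54:06,884 ERROR "
    "nodepool.StateMachineNodeLauncher.openmetal-iad3: "
    "[e: f237626ad7db4a0cb8323bfbc206d7af] "
    "[node_request: 900-0024530346] [node: 0037763929] "
    "Error in creating the server. "
    "Compute service reports fault: No valid host was found."
)
fields = parse_launch_error(line)
print(fields["node"], fields["node_request"], fields["message"])
```

The node ID extracted this way is the same kind of identifier fungi uses later when matching nodes to running builds.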
<fungi> wonder if we have something set wrong in placement [17:03]
<fungi> related to the availability zones we created? is nodepool using the correct az? [17:04]
<fungi> if no az is specified then nova will just try to boot in one at random, yeah? [17:04]
<fungi> if i remember the discussion correctly we created two host aggregates mapped to different availability zones, and one presumably only allows our control plane project to create server instances in it [17:07]
<frickler> not sure, there is one node running and I can log into it, not sure how to best identify which build it is associated with [17:07]
<fungi> i think it must have already been deleted [17:10]
<fungi> because i'm logged into a node with a sub-minute uptime [17:10]
*** dasm is now known as Guest10177 [17:13]
<frickler> o.k. found one build and it looks fine https://zuul.opendev.org/t/openstack/build/fd247729c365494f9db27b05fa40ffd3 [17:13]
<frickler> also no failing or retrying builds found that would match [17:14]
<fungi> 0037764030 seems to be running a devstack job [17:14]
<fungi> so yeah, current guess is that we're round-robining server create requests between the two availability zones and half of them will die with "no valid host" [17:15]
<frickler> I'd still like to investigate nova logs a bit for the error above, but that'll have to wait until tomorrow unless someone else is faster [17:16]
<fungi> the half that don't fail appear to work [17:16]
<frickler> hmm, that might be possible, yes [17:16]
<fungi> when tonyb wakes up, he may be more familiar with the details, but if that really is the problem then just specifying the right az in nodepool's clouds.yaml ought to solve it [17:16]
<fungi> `openstack availability zone list` returns 4 entries named "nova" and one named "reserved" [17:20]
<fungi> https://docs.openstack.org/openstacksdk/latest/user/config/configuration.html doesn't seem to mention how to specify a default az [17:22]
<fungi> maybe it has to go in nodepool.yaml [17:23]
<frickler> if you add "--long" you see more details, two of the "nova" zones are actually neutron zones and one cinder I think [17:23]
<fungi> aha, yep https://zuul-ci.org/docs/nodepool/latest/openstack.html [17:23]
<frickler> "If this setting is omitted, nodepool will fetch the list of all availability zones from nova." [17:24]
<frickler> so we want to override to just "nova" [17:25]
<fungi> yeah, about to push that up now [17:25]
<frickler> because "reserved" is meant to not be used by nodepool [17:25]
<opendevreview> Jeremy Stanley proposed openstack/project-config master: Use the "nova" AZ for Nodepool in openmetal-iad3  https://review.opendev.org/c/openstack/project-config/+/922343 [17:28]
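
Editor's sketch: the hypothesis above is that, with no zone list configured, launches are spread across both "nova" and "reserved", and the ones landing in "reserved" fail with "No valid host was found". The toy model below illustrates that reasoning only. The uniform random choice and the assumption that "reserved" rejects every request are stand-ins, and `pick_zone` and `count_failures` are hypothetical names, not Nodepool functions.

```python
import random

# Zone names are taken from the discussion. Which zones can actually host
# test nodes is the hypothesis being tested, not a confirmed fact.
REPORTED_ZONES = ["nova", "reserved"]
SCHEDULABLE = {"nova"}


def pick_zone(configured, reported, rng):
    """Pick from the configured zone list if present, else from every reported zone."""
    candidates = configured if configured else reported
    return rng.choice(candidates)


def count_failures(configured, launches, seed=0):
    """Count simulated launches that land in a zone with no usable hosts."""
    rng = random.Random(seed)
    zones = (pick_zone(configured, REPORTED_ZONES, rng) for _ in range(launches))
    return sum(1 for zone in zones if zone not in SCHEDULABLE)


unrestricted = count_failures(None, 1000)
restricted = count_failures(["nova"], 1000)
print("failures without an override:", unrestricted)
print("failures when limited to nova:", restricted)
```

Under these assumptions roughly half of the unrestricted launches fail and none of the restricted ones do, which matches the "half of them will die" guess and the reason for overriding the list to just "nova".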
<corvus> that should work -- though there must be some way of restricting the azs per tenant at the cloud level, right? [17:30]
<corvus> (it's fine if we decide it's not worth our time; but...  it's super weird that we're being told about an az that we can't use :) [17:30]
<fungi> agreed, there probably is. the difference is i know how to configure nodepool but i'm less confident in my ability to configure... nova? keystone? what filters the az list? [17:41]
<fungi> when more people are around who have familiarity with how those availability zones were configured, we can clean that up if there's a way to solve it on the cloud end [17:42]
<corvus> probably involves policy.json  :) [17:53]
<opendevreview> Merged openstack/project-config master: Use the "nova" AZ for Nodepool in openmetal-iad3  https://review.opendev.org/c/openstack/project-config/+/922343 [18:18]
<fungi> deploy of that ^ completed at 18:24:55 utc, so if we continue to see the same errors on nl02 then we'll know the guess was incorrect [18:30]
<fungi> there's a debian-bookworm node in use there as of 3 minutes ago, so probably booted after the config update [18:31]
<fungi> oh, though do launcher containers need a restart to reread their nodepool.yaml? [18:31]
<fungi> i never can remember. guess i'll see if the playbook does a sighup or something [18:32]
<corvus> nope, they reread it themselves [18:35]
<corvus> so within a few seconds of it landing on disk it should take effect for new launches [18:35]
<fungi> okay, cool [18:35]
<fungi> so the currently in-use node 0037764223 got a launcher thread at 18:27:19 utc, ~2.5 minutes after the deploy job reported back successful [18:37]
<fungi> last update time on nodepool.yaml was actually at 18:22:49 utc, so nearly 5 minutes [18:38]
<fungi> most recent "No valid host was found" so far was 18:03:26 utc [18:40]
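
Editor's sketch: the timing argument above can be checked with a little arithmetic. The four UTC timestamps are the ones fungi quotes; the variable names and the `utc` helper are illustrative only.

```python
from datetime import datetime


def utc(hms):
    """Parse an HH:MM:SS time on the day of this log (2024-06-19)."""
    return datetime.strptime("2024-06-19 " + hms, "%Y-%m-%d %H:%M:%S")


last_error = utc("18:03:26")       # most recent "No valid host was found"
config_written = utc("18:22:49")   # last update time on nodepool.yaml
deploy_reported = utc("18:24:55")  # deploy job reported success
launcher_thread = utc("18:27:19")  # node 0037764223 got a launcher thread

since_deploy = (launcher_thread - deploy_reported).total_seconds()
since_config = (launcher_thread - config_written).total_seconds()
error_before_config = (config_written - last_error).total_seconds()

print(since_deploy, since_config, error_before_config)
```

The launch came 144 seconds after the deploy report and 270 seconds after the file changed on disk, consistent with the "~2.5 minutes" and "nearly 5 minutes" estimates, and the last error predates the config change by about 19 minutes.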
<tonyb> the availability zones were created for us by the openmetal folks. so I don't have any real insight [20:50]
<tonyb> I can look into the details more [20:52]
<tonyb> corvus: could you look at the open mediawiki patches.  I'm getting an invalid reference trying to pull the new container. [20:54]
<tonyb> looking at the inventory the mediawiki job knows it needs the artifact from the previous job. [20:55]
<tonyb> the container exists in the insecure-registry [20:55]
<tonyb> I thought, and this may be the root issue, that trying to pull the image with a full reference would "magically" use the buildset registry as a pull through mirror for docker.io [20:57]
<tonyb> saying this now I wonder if switching to compose V2 changes the way that needs to be configured [20:58]
*** iurygregory_ is now known as iurygregory [21:28]
<opendevreview> Curt Moore proposed openstack/diskimage-builder master: Adapt to upstream CentOS Stream mirror changes  https://review.opendev.org/c/openstack/diskimage-builder/+/922352 [21:50]
<corvus> tonyb: do you happen to have a noble node available? [22:17]
<corvus> tonyb: but i agree with your supposition; if we're going to use go-docker-compose then we might need to jump to the end state of also using podman... [22:19]
<opendevreview> Steve Baker proposed openstack/diskimage-builder master: Change DIB_BOOTLOADER_DEFAULT_CMDLINE empty string behaviour  https://review.opendev.org/c/openstack/diskimage-builder/+/884644 [22:25]
<tonyb> corvus: I don't have one handy but I can do. [22:27]
<corvus> i'm still digging through the logs [22:28]
<tonyb> I'll switch to python compose and see if it works to confirm that suspicion [22:28]
<corvus> ++ [22:28]
<corvus> tonyb: yeah, i don't see anything obviously wrong; so my guess is same as yours: the registry mirror config may not work for go-docker-compose.  your test should confirm that.  if it does, we can either work to fix that (involves some trial and error on a noble host) or skip that step and go all the way to using podman+go-docker-compose, or just use python-docker-compose for this and treat go-docker-compose as a next step. [22:36]
<Guest10115> so exciting that docker-compose was re-written [22:37]
<corvus> i've done some preliminary work on go-docker-compose with podman that proves it out; i can provide that if we want to go that direction. [22:37]
<corvus> mordred: yes it's very docker. [22:38]
<Guest10115> corvus: your face is very docker? [22:38]
<corvus> (having said that, i am glad that it works with podman which gives us a route to finally speculatively test quay.io images mixed with docker.io images) [22:39]
<Guest10115> can I just say that it's weird that go-docker-compose works with podman while python-docker-compose doesn't? [22:39]
<corvus> it must have been an accident :) [22:40]
<Guest10115> definitely [22:40]
<tonyb> corvus: thank you for looking.  I'll verify it is a golang compose issue and then make a plan. [22:48]
<opendevreview> Curt Moore proposed openstack/diskimage-builder master: Add support for Fedora 40  https://review.opendev.org/c/openstack/diskimage-builder/+/922353 [22:48]
<tonyb> I admit that, while I'm wary of "do it later", I'm not sure I have the cycles for integrating podman right now.  maybe next quarter. [22:51]
<tonyb> Guest10115, mordred: are you using a matrix-IRC bridge? the name I see on "regular" IRC changes from time to time [23:09]
<Guest10115> Yeah ... I'm only using a matrix client currently. I should probably check and make sure it's tied in properly with nickserv stuff [23:12]
<tonyb> Guest10115: that would be helpful:) [23:18]
