Tuesday, 2025-01-14

clarkbjust a few minutes until our weekly meeting18:52
clarkb#startmeeting infra19:00
opendevmeetMeeting started Tue Jan 14 19:00:28 2025 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:00
opendevmeetThe meeting name has been set to 'infra'19:00
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/3WNI573NTZ5VYPTQ5DBQVKXVJSC2QTGB/ Our Agenda19:00
clarkb#topic Announcements19:00
clarkbThe OpenInfra foundation individual board member election is happening right now. It ends at 1900 UTC Friday19:01
clarkbthis is a good time to check your email inbox for your ballto if you are a foundation member19:01
clarkband if you did get one please vote. There is a great list of candidate19:02
clarkb*candidates19:02
clarkbAnything else to announce?19:02
fungii'm going to be around at odd hours for the next week, just a heads up19:03
fungi(travelling tomorrow through monday)19:03
fungii'll still be back in time for our thing starting on tuesday though19:03
clarkbthanks for the heads up19:04
clarkb#topic Zuul-launcher image builds19:04
clarkb#link https://review.opendev.org/q/hashtag:niz+status:open is next set of work happening in Zuul19:04
clarkbI believe this set of changes is next up. And I believe that was hung up on reliabiltiy of docker images which should be improving now that we're mirroring images19:05
clarkbcorvus: is now a good time to start looking into that or does it still need to settle a bit?19:05
corvusi'm about to refresh the change to switch to using quay images in zuul19:05
corvusso i think things are settled enough now to start merging changes19:05
clarkbgreat!19:06
corvusi'll wait until that stack is fully reviewed before i approve them19:06
corvus(i think that's an all-or-none stack)19:06
clarkbI'll try to take a look at them before too long19:06
corvusthanks, i'd love to merge them this week19:06
corvusstack has +2s on half of it, so i think it's plausible19:07
clarkbanything else on this topic?19:07
corvusnope19:07
clarkb#topic Upgrading old servers19:07
clarkbtonyb isn't around today so don't expect any updates on wiki updates19:07
clarkbgiven that I think we can jump straight into the noble and podman status19:07
clarkb#topic Deploying new Noble Servers19:07
clarkbLast week I discovered that the Noble image tonyb uploaded everywhere didn't actually make it into rax. I suspect that the ansible playbook using the openstack modules isn't capable of uploading to rax19:08
clarkbI manually uploaded the vhd image tonyb created in his bridge homedir using openstacksdk which is capable of uploading to rax19:08
fungidid you upload it to all regions or just dfw?19:09
clarkbDuring testing of those images I discovered that OSC cannot boot instances with network addresses in rax, but again using the sdk directly worked (launch node uses the sdk)19:09
clarkbfungi: all three regions19:09
fungithanks!19:09
clarkbthen yesterday I booted a replacement paste server on top of this image. Deployment of this then got hung up on docker rate limits19:09
clarkb#link https://review.opendev.org/c/opendev/system-config/+/939124 deploy paste02 on noble19:10
clarkbthe parent change of ^ switches paste to pulling mariadb:10.11 from quay instead of docker hub to work around that19:10
clarkbif we're comforatble doing that for production deployments it would be great to get those changes in so we can start on migrating the paste data and get some real world feedback on podman on noble19:11
fungiinterestingly, we also needed to use configdrive with the noble image tony built, but not with the jammy image rackspace supplies19:11
clarkboh right. The image is from ubuntu upstream's qcow2 converted to vhd aiui and the cloud init there needs config drive (I'm guessing rax metadata services isn't compatible?)19:11
clarkbthe other thing we discovered is that podman on noble will only start containers on boot if they are marked restart:always19:12
fungimight be down to a difference in cloud-init version, yeah. it's definitely installed at least19:12
clarkbthis is a behavior change from docker which would restart under other situations as well (though that restarting wasn't super consistent due to spontaneous reboots putting containers in a failed state)19:12
clarkbanyway long story short reviews on the quay.io source for mariadb:10.11 would be great that way we can proceed if comfortable with that19:13
clarkband we'll see if we learn more new things afterwards19:13
corvus+2 from me i say +w at will19:13
fungii'm almost certain we will19:13
clarkbexcellent thanks19:13
clarkb#topic Mirroring Useful Container Images19:13
clarkbwhich takes us to a followup on this. Indications so far are that this will be super helpful. Thank you corvus for pushing this along19:14
funginice segue19:14
corvus\o/ whew!  i hope it works!19:14
fungiyes, thank you!!!19:14
clarkbcurrently the list of tags is somewhat limited so I had to add 10.11 to mariadb yseterday. If you see or think of other tags we should add then adding them early helps keep things moving along19:14
clarkb#link https://review.opendev.org/c/opendev/system-config/+/93916919:15
clarkbthe change to do that is simple for an existing image ^19:15
fungii also mentioned during the openstack tc meeting a few minutes ago that we're making some progress on a reusable pattern for this sort of solution19:15
corvusalso, protip, if adding a new image+job, don't forget to add the new job to the project pipeline config.  ;)19:15
clarkbthe other thing I noticed this morning in checking that the 10.11 was mirrored properly is that we get different manifests due to different arches19:15
clarkbcorvus: I don't know if there is a way to have docker or skopeo etc copy all available arch images in the manifest?19:16
clarkbthat might result in over mirroring but woudl make checking status simpler and avoid problems with arm for example19:16
corvusyes i think there is -- but isn't everything x86 right now?19:16
clarkbcorvus: yes everything but nodepool is x86 in opendev19:16
corvuslike, this should/will be a problem once we do something with arm...19:17
clarkbso I think we're most likely to hit this when building nodepool images that try to fetch teh python base image from quay?19:17
clarkbsince I think we're only mirroring x86 right now19:17
clarkband maybe that is a non issue due to niz?19:17
corvusbut for the moment, we're mirroring on x86 nodes and using on x86 nodes so shouldn't be an immediate problem19:17
corvusokay sounds like we're on the same page19:17
corvusit's certainly a low priority for me due to niz :)19:17
clarkbyup I just wanted to call it out as something I noticed. Its not impactful yet19:17
fungiif we ever end up containering anything on the mirror servers in our cloud providers, i could see it coming up19:18
corvusif someone does want to address that; i would look into existing patterns in zuul-jobs first because i'm pretty sure we have some "all arches" stuff in there somewhere with skopeo or similar.19:18
fungibut yeah, nothing on the horizon19:18
clarkbI was able to confirm the 10.11 images matched between quay and docker hub by fetching them locally. A docker image list showed them with the same id and docker inspect collapses them both into the same json oputput19:18
clarkbjust in case anyone else needs to do that later ^ seems to work19:18
clarkbany other questions concerns or feedback with the mirrored images?19:19
corvusi think --all is the arg to skopeo19:19
fungipipelines... do we run the mirror routine in promote/post in addition to periodic?19:20
clarkbfungi: no currently it is only periodic19:20
fungior i guess, what i meant to ask is should we?19:20
clarkbbut it is also primarily for images we don't produce19:20
clarkbI think the only exception to that is the base python images19:20
clarkbso we could potentailly run the mirroring for those as part of promote/post19:20
corvuswe won't need to once we switch canonical locations to quay19:21
clarkbtrue19:21
corvuswhich depends on the noble upgrade.  so there's a window where this might be useful19:21
fungiwell, more for updates to system-config which add a new image, we might want to merge another change to use it from that location without waiting a day19:21
clarkboh I see19:21
clarkbya I think that could be optimized19:21
fungimainly pondering how we might be able to iterate more quickly on switching image locations or updating versions19:22
corvusi think we could do both things; should look at file matchers for general images in system-config.  can just chain jobs for python-base updates.19:22
fungii suppose it wouldn't just be on adding a new image but also adding different versions when we're pinning specific ones19:22
clarkbcorvus: ++19:22
fungiyeah, that should work19:22
corvusfun trick btw:19:22
corvusadd file matchers that never match anything, then the job only runs on job config updates.19:23
clarkbhacks19:23
fungiheh19:23
corvusthat's where i'd start for the system-config general new image problem19:23
clarkbwill the file matchers get ignored in periodic though?19:23
clarkbor will that stop us from running in periodic?19:23
clarkbmaybe we need a child job for post/promote and that hack19:23
corvusi would only add them in a project-pipeline variant19:24
clarkbaha19:24
fungiyeah, that seems cleaner19:24
corvusso not to the job, just to the invocation of it on system-config19:24
corvusin post or deploy or whatever19:24
clarkbyup makes sense19:24
fungithat way the funky nonsensical overrides are all in one spot and become more obvious/don't get forgotten19:24
clarkbfungi: is that someting you might be interested in pushing up?19:25
fungii want to say yes, but i'm not sure i'd get to it before next week with other stuff currently on my plate19:25
clarkback, its not urgent. Lets see where we end up19:25
fungiso happy to if nobody else beats me to it19:25
clarkb#topic Gerrit 3.10.419:26
clarkbGerrit made new bugfix releases19:26
clarkbwhich seems like a good time to also bundle in the h2 compaction timeout increase change19:26
clarkb#link https://review.opendev.org/c/opendev/system-config/+/939167 New bugfix releases available19:26
clarkb#link https://review.opendev.org/c/opendev/system-config/+/93800019:27
clarkbfungi: ^ maybe if you get a quite time this week during travel we can sit down and aldn those and do a couple of restarts19:27
clarkb*quiet time19:27
fungisure!19:27
clarkbI don't think the bugfixes are super urgent. But they do look like good ones (including a NPE fix)19:28
clarkb#topic Running certcheck on bridge19:28
clarkbthe certcheck tool uses a script that is a bit old on cacti and doesn't support SNI. If we move the check to running on bridge we get two things: the latest version and SNI support and we'd better match how we test this in our CI jobs (since we always have a bridge it runs there I guess)19:29
fungii guess in retrospect, the three of us discussing this yesterday are the only ones in today's meeting, so deferring the decision was sorta pointless ;)19:29
clarkbfungi wanted to bring this up to check if there were any concerns with adding exttra tools like this to the bastion host19:29
clarkbI think the concern is valid but the risk in checking a tls connection once per day to a set of services that we (mostly) run seems low19:30
clarkbI think I would be ok with moving this to bridge19:30
fungiseems like we established yesterday that none of the three of us had concerns, so i guess let's plan to move it19:30
corvus++19:30
clarkbsounds good!19:30
clarkb#topic Service Coordinator Election19:31
fungii have a wip change to install it from a distro package instead of from git, i'll try to find time to amend that soon to move it off cacti and onto bridge19:31
clarkbI haven't seen or heard any objections to the proposal I made last week for election plans19:31
clarkbproposal: Nominations Open From February 4, 2025 to February 18, 2025. Voting February 19, 2025 to February 26, 2025. All times are UTC19:32
clarkbConsidering that I'll plan to make this official via email to the service-discuss list shortly19:32
clarkbyou have until then to lodge your objections :)19:32
fungithanks!19:32
clarkb#topic Beginning of the Year (Virtual) Meetup19:32
clarkb#link https://etherpad.opendev.org/p/opendev-january-2025-meetup19:32
clarkbI have started gathering discussion topics on this etherpad which will also serve as the meetpad location for this event19:33
clarkbthe main thing to decide is timing. Last week I suggested January 21-23 timeframe whcih is next week. No objections to that and frickler indicated that we could proceed with planning and not worry too much about their abiltiy to attend19:34
fungiyeah, i'll be getting home the afternoon before that, so wfm19:34
clarkbI'm thinking spreading it out over multiple days is a good idea so that we can do shorter gatherings rather than one long one. The 21st is also the day of our normal meeting19:34
clarkbmaybe on the 21st we do something like 2100-2300 UTC. Then 22 and 23 we do 1800-2000 if frickler can attend and 2100-2300?19:35
clarkbthen if we decide we don't need all of that time we can skip the 23rd too or something19:36
fungii've got a couple of unrelated conference calls on tuesday and wednesday, but will be available most of the time19:36
clarkb(basically I'm trying to be flexible and accomodate timezones and the massive block of meetings on Tuesday everyone seems to have)19:36
clarkbcorvus: fungi since you're here are those blocks of time particularly problematic?19:37
corvus1 sec19:37
fungibefore 19z on monday and 15-16z on tuesday are my other calls, so your proposed times look fine for me19:38
corvusi don't currently have any blockers there19:38
fungier, i mean before 19z on tuesday and 15-16z on wednesday are my other calls19:39
fungianyway, wfm19:39
clarkbexcellent I'll get that dded to the etherpad and we can be flexible if necessary (like dropping the time on the 23rd if we don't need it)19:40
clarkb#topic Open Discussion19:40
clarkbI failed to put the Gitea 1.23.1 upgrade on the agenda19:40
clarkbbut there is also a proposed Gitea upgrade19:40
clarkbI guess we could try landing that and see if we hit the mariadb problems and switch over images there too?19:41
clarkbnot sure if anyone else wants to inspect the held node19:41
clarkbI guess see where we get with paste02 and if that makes progress look to gitea next19:43
clarkbAnything else that wasn't in the agenda that we should cover before calling it a meeting?19:43
fungithere's a bunch of changes for bindep that could use another pair of eyes (many of which are mine)19:44
fungitesting fixes, support updates, packaging modernization19:44
clarkbfungi: did we ever end up being able to explain why the old setup stopped working?19:44
fungiwhich old setup?19:45
clarkbThats probably not critical and most likely due to changes in setuptools since the original change was first proposed19:45
clarkbfungi: the original change I pushed to have bindep use pyproject.toml worked but then you had to rework it to get it working today19:45
clarkbbut ya I'll put reviewing that on the todo list19:45
fungii think you're talking about an earlier iteration of the pyproject.toml changes, i incorporated your feedback around that19:45
fungii think it's split up fairly logically now19:46
clarkbright. I'm just curious if we know why that became necessary19:47
clarkbit did work when originally proposed but then that same pyproject.toml stopped working19:47
fungilooking19:47
fungiwe're talking about 816741 i guess19:48
clarkbyes looks like it19:48
clarkbps9 passed then it stopped passing and you had to add the py_module stuff19:49
clarkbI guess setuptools changed somehow to make that mandatory19:49
fungiit's virtually identical to what you had before, but needed a tweak to account for newer setuptools and editable installs, looks like?19:49
fungiand changes to how setuptools did module searching19:49
clarkbya the nox stuff is a sideeffect of setuptools and pip not supporting editable wheels on python3.619:50
fungibut yeah, i think it's all down to setuptools differences19:50
clarkbhttps://review.opendev.org/c/opendev/bindep/+/816741/21/setup.cfg its this difference that I'm looking at I guess and ya setuptools changed seems to be the answer19:50
fungiright19:50
fungione of those worked on 3.6 but then setuptools changed the name of the option19:51
clarkbI think we can alnd everything up to https://review.opendev.org/c/opendev/bindep/+/938522 then maybe make a release? then land the pyproject.toml update?19:51
fungiyeah, also this all started because we wanted a candidate to exercise newer pbr when that gets tagged19:52
clarkbthen we can decide if we do a release with pyproject.toml before dropping python3.619:52
clarkband I'll try to review the later half of that stack soon so that I can have an informed opinion on ^19:53
fungiright, the later changes in that series are more like "we can do this, should we?"19:53
fungiin order to settle on a good example for how we might want to update the packaging for our other tools19:53
fungiand also the drop 3.6 change is as much to confirm all the todo comments are correct about whenwe'll be able to clean bits up19:54
clarkb++19:55
clarkbanything else? last call19:56
clarkbsounds like that is it. Thanks everyone!19:57
clarkbWe'll be back next week19:57
clarkb#endmeeting19:57
opendevmeetMeeting ended Tue Jan 14 19:57:21 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:57
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2025/infra.2025-01-14-19.00.html19:57
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2025/infra.2025-01-14-19.00.txt19:57
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2025/infra.2025-01-14-19.00.log.html19:57
fungithanks clarkb!19:57

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!