Tuesday, 2025-08-12

corvusmatrix comm check18:58
clarkbits working18:59
clarkband we'll get started momentarily18:59
clarkb#startmeeting infra19:00
opendevmeetMeeting started Tue Aug 12 19:00:18 2025 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:00
opendevmeetThe meeting name has been set to 'infra'19:00
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/B2OLLYGACZDURRJWKZSHE34JVQ5HT5RY/ Our Agenda19:00
clarkb#topic Announcements19:00
clarkbI didn't have anything to announce. Did anyone else?19:00
funginot i19:01
clarkbok I'll probably keep things rolling today as we have a lot of agenda items. I believe we should be able to get through them all in our hour but want to do my best to ensure we do19:02
clarkb#topic Gerrit 3.11 Upgrade Planning19:02
clarkbI don't have anything new on this item. Its perpetually on my list of todos but requires I be able to sit down for a good chunk of time and not get distracted and that hasn't happened :/19:02
clarkbWere there any new questions, comments, or concerns on this subject?19:02
fungii didn't have any19:03
clarkb#topic Upgrading old servers19:04
clarkbThis topic does have updates largely due to the efforts of fungi19:04
clarkbfungi: want to fill us in on the refstack and eavesdrop01 changes?19:04
* fungi deletes things19:04
fungieavesdrop was a pretty straightforward server replacement with a cinder volume move19:05
fungiold server has now been cleaned up along with its dns records19:05
fungirefstack server has been imaged for archival and deleted, dns records for that cleaned up as well as our deployment automation and documentation purged19:05
fungithe refstack service sunsetting was announced to the openstack-discuss ml in case anyone asks19:06
fungisince it was an openstack-specific service19:06
fungiany questions? presumably no19:07
clarkbthank you for taking care of that. These updates are important for two reasons. The first is it concludes the upgrades for the "easy" list of servers that need upgrading. And second this means all of our python containers are running on Ubuntu Noble now which means we can potentially make changes to our base container image locations (whcih is a topic for later)19:07
clarkbThe leftovers are AFS servers, kerberos servers, graphite, and backup servers19:07
clarkb#link https://etherpad.opendev.org/p/opendev-server-replacement-sprint19:07
clarkbI updated the lists on ^19:07
clarkbOf those servers I think afs and kerberos servers may want to be done in place rather than with replacements19:08
clarkbdue to dns record management for those services as well as the size of the data its just easier this way I suspect. And we have redundancy that allows us to step through it one at a time19:08
fungii can probably arrange to drive that if it's the plan, i'm rather well-versed at in-place debuntu upgrades19:08
clarkbthen graphite and backup servers would follow our normal new server replacement process19:08
clarkbfungi: that would be great19:08
clarkbI suspect of those in place upgrades the kerberos servers are easiest19:09
corvusi agree that in-place replacement sounds best19:09
clarkbthen the afs db servers then the afs fileservers19:09
fungiafs servers probably won't be that hard either19:09
fungiespecially since we build our own packages19:09
fungiso the openafs versions aren't really changing much, if at all19:10
clarkbI'm happy to help as well19:10
clarkbwe can sync up outside of the meeting and start sketching out some plans (and not even necessarily today just some point soon hopefully)19:10
fungiyep, sgtm19:10
clarkbany other inputs on this subject?19:10
clarkb#topic Matrix for OpenDev comms19:11
clarkb#link https://review.opendev.org/c/opendev/infra-specs/+/954826 Spec outlining the motivation and plan for Matrix trialing19:11
clarkbI pushed a new patchset yseterday to address the reviews I got. Thank you for the feedback. Looks like ianw has some followup feedback I should probably go ahead and address today as well19:12
clarkbthen once that new patchset is up followup reviews would be appreciated19:12
clarkbI think we can keep conversation in the review I just wanted to call out there was an update and there should be another one soon19:12
clarkb#topic Working through our TODO list19:13
clarkb#link https://etherpad.opendev.org/p/opendev-running-todo-list19:13
corvusdid you find out whether ems will run molnir?19:13
clarkb#udno19:13
clarkb#undo19:13
opendevmeetRemoving item from minutes: #link https://etherpad.opendev.org/p/opendev-running-todo-list19:13
clarkb#undo19:13
opendevmeetRemoving item from minutes: #topic Working through our TODO list19:13
clarkbcorvus: I couldn't find anything in their docs indicating they do but didn't login to check the toggles in the dashboard19:13
clarkbcorvus: but reading up on mjolnir it seems very straightforward to run like one of our other bots so that was what I proposed in the spec19:14
corvusok.  we should probably check the dashboard, but ack, agree we can just run it if we need to19:14
clarkbits basically a container with a filesystem bind mount for data storage and a config file configuring its management room (where anyone in that room gets to control the bot)19:14
corvus++19:15
corvus(that's all i had on this topic; thx)19:15
clarkb#topic Working through our TODO list19:15
clarkb#link https://etherpad.opendev.org/p/opendev-running-todo-list19:15
clarkbThis is a reminder that if you get bored and want to pick up something new starting with this list is a good start19:15
clarkbI think I'll drop this off of the meeting agenda after this meeting though as the list has a proper location that should be discoverable at this point so I don't think manual reminders in the meeting are as valuable19:16
clarkbfeel free to update that list. The idea is that its informal and somewhere we can capture the very beginnings of efforts before they become specs or changes etc19:16
clarkb#topic Pre PTG Planning19:17
clarkb#link https://etherpad.opendev.org/p/opendev-preptg-october-2025 Planning happening in this document19:17
clarkbProposed times: Tuesday October 7 1800-2000 UTC, Wednesday October 8 1500-1700 UTC, Thursday October 9 1500-170019:17
clarkbI think fungi and I actually have a conflict on the 7th from 1800-1900 UTC but we should be able to reschedule that conflict. And this way I think we can avoid having multiple meetings that day and just replace our meeting with the pre ptg19:18
fungiright, i can be flexible19:18
clarkbPlease leave feedback on the proposed times as well as the topics (and/or add your own topic ideas)19:18
clarkbOn my own I was able to come up with a good set of ideas which helps validate that this is a useful thing to do19:19
tonybI'm still planning to be in MN for the pre-PTG19:19
clarkbtonyb: those time blocks should be CDT friendly. But let me know if they aren't for some reason19:19
fungiclarkb: the meeting you're thinking of isn't on the 7th from what y calendary claims19:19
clarkbfungi: oh good19:19
tonybThey seem good to me :)19:20
fungithat's the off-week for the meeting19:20
clarkband if we end up not needing all three blocks of time we can cancel and/or stop early19:20
clarkbI just want to have a reasonable amount of time available to us upfront so that we have the opportunity to use it if necessary19:21
clarkband ya please update the topics list with your own ideas19:21
clarkb#topic Service Coordinator Election Planning19:22
clarkbNomination Period open from August 5, 2025 to August 19, 2025. If necessary we will hold an election from August 20, 2025 to August 27, 2025. All date ranges and times will be in UTC.19:22
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/YXRD23ZWJGDPZ3WESBNZNEYO7NBCXFT4/19:22
clarkbThis is your "one week remaining" warning notice19:22
clarkbI'm more than happy to support someone in this role if there is interest. Let me know if there are any questions about what is involved19:22
fungialso similarly happy to help support someone in that role who isn't me ;)19:23
clarkb#topic Loss of upstream Debian bullseye-backports mirror19:24
clarkbthe workaround to have reprepro ignore errors landed so that we'll continue to update debian packages that do exist19:24
clarkbbut now we need to make a plan for the longer term. I think what I'd like to see if have opendev base jobs stop configuring backports by default on debian to match the upstream behavior. Then we can delete the bullseye backports content and the xenial and bionic-arm64 conent19:25
clarkbI think the process for all three of those is basically identical so bundling them all up and oing it at once helps avoid unncessary context switching19:25
fungithe only question i have is whether it makes sense to announce and adjust in zuul/zuul-jobs instead19:25
fungibecause if opendev has this problem, other users of that role also may19:26
clarkbthat is a good point. I think if we do this for bullseye alone we can avoid big announcements as upstream has forced our hand. But if we do it for all of debian then we should announce19:26
fungibut as corvus pointed out last week, that role is also on the road to deprecation if someone gets time to replace it with the new design19:26
clarkb(and we should announce whether we do it for opendev or zuul or both)19:26
clarkbmy hunch here is that relatively little stuff is relying on backports since it is a non default opt in choice19:27
fungiyeah, i have no problem announcing whatever we do wherever we do it, just wondering if diverging from the zuul default makes sense in this case19:27
clarkbso blast radius should be small and we should rip the bandaid off19:27
fungior if freezing the zuul stdlib implementation in order to reduce churn for anyone working on a replacement is better19:28
clarkbya probably comes down to corvus' preference for the zuul community19:28
clarkbif corvus thinks we should fix this at the root in zuul-jobs we'll do that otherwise we'll modify things in opendev only19:28
corvusi don't think anyone's working on a replacement... and i don't have strong feelings about it...19:29
corvusi'm not 100% caught up on the issue19:29
fungithere's also not a ton of urgency from our perspective, the workaround is functional but we'd like to be able to free up the space19:29
corvusso i'd say: if it makes sense for everyone to change, then go ahead and do it in zuul jobs... otherwise meh19:29
clarkbI think the main reason it applies broadly is it helps mimic upstream debian behavior by default better19:30
clarkbwhich leads to fewer surprises as people are testing software19:30
corvussounds reasonable :)19:30
fungicorvus: essentially upstream debian deleted the backports suite for bullseye a few weeks ago. the configure-mirrors role defaults to enabling the backports suite on debian nodes, so if we delete it from our mirrors then any bullseye jobs are going to fail unless they're overridden to disable backports in that role19:30
clarkbsounds like we should make the change in zuul-jobs and announce it then go from there?19:31
fungiand yeah, disabling backports by default instead of enabling would be an easy switch in configure-mirrors, and would be closer to normal debian installs these days, but is a clear behavior change19:31
fungiclarkb: probably the reverse. announce and *then* switch19:32
clarkber yes19:32
clarkbwe want to announce it before we break things19:32
fungiagreed19:32
fungii ca ntake care of that19:32
clarkbthanks!19:32
clarkb#topic Adding Ansible 11 to Zuul19:32
clarkbNext up in potentially breaking changes for jobs run by zuul :)19:33
corvusha, but hopefully not!19:33
clarkbwe switched the zuul and opendev tenants to ansible 11 last week and as far as I can tell things seem to be working19:33
clarkbzuul in particular has been running a good number of jobs19:33
corvusi haven't heard of any issues.  should we switch the rest of the tenants?  if so when?19:33
clarkbI think we should switch the other tenants. I think we can send an announcement ~today for a switch early next week? That way we can communicate to people the change is happening and they can override the version back to 9 if things break19:34
clarkbits also probably just early enough in the openstack release cycle that we can work through problems19:34
fungiyeah, much longer and we're getting close to the rush/freeze19:34
clarkbif we didn't have the fallback to 9 in job overrides I would worry more. but we do have that fallback so I think we should proceed19:35
corvussounds good19:35
clarkbcorvus: do you want to send that announcement or should I?19:35
corvusclarkb: if you don't mind, that'd be great19:36
corvushttps://paste.opendev.org/show/bu8VuxDAKYEzai5Caik7/19:37
clarkback /me scribbles a note19:37
corvusi did just find an old copy of a message from you on the subject ^19:37
clarkbI should be able to do that after lunhc today19:37
corvus#link old message https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/message/A7SOKNMQLZXZSTXUI4UOIIGOQHIQSMZ6/19:37
corvus(yay archived-at header)19:37
clarkb#topic Etherpad 2.4.2 Upgrade19:37
clarkb#link https://github.com/ether/etherpad-lite/blob/v2.4.2/CHANGELOG.md19:38
clarkbThere are some etherpad updates. Nothing critical according to the release notes but I've been doing my best to keep us up to date on services like this19:38
clarkbProblem is the new versions break every skin except for colibris19:38
clarkb#link https://github.com/ether/etherpad-lite/issues/7065 Theming updates have broken the no-skin and custom skin themes. We use no-skin.19:38
clarkbsomeone else followed up on the issue I filed to point out it breaks custom skins too19:39
clarkbUnfortunately, no feedback from the maintainers yet19:39
clarkbI think our options are either wait more and see if they fix it. Or consider using colibris. As mentioned this doesn't appear to be urgent but if you have 10 minutes it might be a good idea to check out colibris on the held node19:39
clarkb158.69.64.117 is the ip address to update in /etc/hosts for etherpad.opendev.org to do that19:40
clarkbthe biggest difference I find is that colibris mimics the sheet of paper appraoch used by google docs19:40
clarkbthis narrows the useable space of the pads. Not necessarily always a good or bad thing. Depends on context19:41
fungii personally find it unnecessarily wastes screen space19:41
clarkbif you do check it out and have thoughts on whether or not it would be problematic for us let me know. If we think it is a valid switch then we can proceed that way19:41
clarkbfungi: ya I feel like it may also imply that these are less ether pads and more perma pads19:42
corvusyeah that's kind of silly, but, probably not a showstopper if we have to switch19:42
fungiwhich isn't a show-stopper but feels like a regression19:42
fungiyes, agreed19:42
clarkbprobably the thing to do is wait for now and see if they respond to my issue. Then if there are any important reasons to update sooner (security updates major bug fixes) we can switch to colibris and do our best19:43
corvuswho needs more than 87 columns anyway?  /s19:44
clarkb#topic Cleaning Up the Old voip.ms Account19:44
clarkbyesterday fungi and I were made aware that the voip.ms account is making small infrequent charges19:45
clarkbsince we dropped asterisk and switched to meetpad we haven't been using this service19:45
clarkbbefore we formally shutdown the account/cancel the service I wanted to double check there wasn't a reason to avoid doing so. I think if we decide to tie POTS into meetpad via SIP we can always resurrect the account or create a new one at that point19:46
clarkboh hey etehrpad literally just responded wtih a question. I'll followup after the meeting19:46
corvusi imagine the chargers are for maintaining the DID (phone number); part of shutting it down will be releasing that19:46
clarkbcorvus: yes that was what I thought. fungi thought it could also be spam calls19:47
corvusso as long as we don't feel super attached to that number, then that should be fine.19:47
clarkbI think giving up the number is also fine. Using a new number shouldn't be a big impact19:47
corvusnot sure what the plan was, but i wouldn't expect non-terminated incoming calls to be charged19:47
clarkback19:48
fungiyeah, so probably not minutes getting eroded by random call-ups then19:48
corvuslast i looked it was something like $2/mo to maintain a did19:48
corvus(and last i looked was 4 years ago)19:48
clarkbto summarize we would give up the existing phone number and do whatever the equivalent of shutting down the voip.ms account is (not sure what options they give us there)19:49
clarkband I'm not hearing any objections to that plan. Wont' get to that until at least tomorrow so if there are objections feel free to raise them after the meeting too19:49
corvussgctm19:49
corvussgtm19:50
fungiwfm too19:50
clarkb#topic Moving OpenDev's python-base/python-builder/uwsgi-base Images to Quay19:50
clarkbAs mentioned earlier in the meeting all of our python based container images (that we build ourselves) are running on Ubuntu Noble now19:50
clarkbthat means we should be good to move the python base images to quay.io then update all of the consumers of the images to pull them from there19:51
clarkband we shouldn't lose speculative image testing. However, there are a couple of caveats/concerns with that19:51
clarkbThe first is that zuul and vexxhost also use these images so not sure how much communication we feel is necessary to ensure they don't accidentally use orphaned images. Then these images are primarily used in the image build process not the container deployment and testing process. That means that the existing changes to noble to use podman for deployments may not be19:52
clarkbsufficient to ensure specualtive testing continues to work19:52
clarkbDocker build via buildx/buildkit does support alternative mirrors and is enabled by default in modern docker. But we may need extra configuration to ensure the mirrors are set?19:52
clarkbalternatively we could switch to building with podman/buildah and I think that should just work now after we update all the jobs19:53
clarkbso I guess I'm asking if we think we need a deprecation announcement period before switching and if we should continue building with docker or switch to buildah19:53
corvusi think we should look into continuing to build with docker buildx/buildkit -- so looking into what we need to do to make the mirrors used there, if anything.19:54
fungii suppose uploading to both registries for a transitional period is a no-go (there was almost certainly a reason we didn't do it before now if not)19:54
clarkbfungi: it is possible. I just want to avoid doing that long term if possible since that would continue to make us vulnerable to docker rate limits19:55
clarkbcorvus: ok I can dig into that a bit more to understand that better19:55
corvus(i'm less confident in podman build and buildah; but not opposed if it works)19:55
corvusi think a note to service-discuss should be fine.  consider the zuul community notified already.  :)19:56
clarkback thanks19:56
clarkbso step 0 is become better informed on the docker build behavior and then make an announcement and plan the swithc19:56
clarkbI'll work on that19:56
clarkb#topic Open Discussion19:56
clarkbWe have a few minutes left. Anything we missed that is worth covering?19:56
corvusand test between steps 0 and 1 to make sure the mirror stuff is right19:57
clarkb++19:57
corvusit's not terrible if we have to roll back, but would be nice if we could avoid that and break our streak :)19:57
corvus1 quick thing19:58
corvusthere's an upcoming syntax change for zuul-launcher; i have the change to zuul-providers prepped: https://review.opendev.org/95694619:58
corvusthat's reviewed, and so is the zuul change; when the zuul change merges, i'll handle restarting zuul with it, then merging the zuul-providers change19:59
corvusjust wanted to let folks know that's upcoming19:59
clarkbwill test node boots break during the time between restarting launcher and updating zuul-providers?20:00
clarkband if so does that imply we'll need to force merge that update?20:00
corvus(during this unstable development period for niz, we might have the occasional backwards-incompat change like that -- basically, it's two commits in zuul so it's just long enough for opendev to make the change without breaking).20:00
corvusclarkb: nope ^ 3 changes total: add new syntax, update zuul-providers, remove old.  so i'll sequence everything so it's never broken for opendev20:01
clarkbgot it20:01
clarkbwe're over tiem. Thank you everyone! We'll be back next week at the same time and location20:01
corvus[eot]20:01
corvusthanks!20:01
clarkb#endmeeting20:02
opendevmeetMeeting ended Tue Aug 12 20:02:00 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)20:02
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-12-19.00.html20:02
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-12-19.00.txt20:02
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-12-19.00.log.html20:02
fungithanks clarkb!20:02

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!