Tuesday, 2025-03-25

clarkbJust about meeting tmie18:57
clarkbmy typing has been terrible today. Apologies in advance18:57
clarkb#startmeeting infra19:00
opendevmeetMeeting started Tue Mar 25 19:00:02 2025 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:00
opendevmeetThe meeting name has been set to 'infra'19:00
clarkb#topic Announcements19:00
clarkbOpenStack is making its 2025.1 Epoxy release next week. And the week after that is the virtual PTG19:00
clarkbThings to be aware of when making changes. I suspect we'll put the meetpad hosts in the emergency file late next week after testing that things work to avoid unexecpted upgrades19:01
clarkbAlso we aren't going to have OpenDev ptg time as we all tend to be busy participating in other groups' ptg blocks19:01
clarkbAnything else to announce?19:02
fungiopeninfra summit date and venue have been announced, volunteers sought for the programming committee with a kickoff call scheduled for thursday. contact Helena Spease <helena@openinfra.dev> if you want to help in any way19:02
fungi#link https://openinfra.org/blog/openinfra-summit-2025 Announcing the OpenInfra Summit Europe 2025!19:02
clarkboh ya that just ahppened19:02
clarkb#topic Zuul-launcher image builds19:04
clarkbcorvus recently added all of our x86_64 regions to zuul-launcher19:04
corvusall the x86 clouds are present now, but i haven't chased down the images builds19:05
clarkbthe cloudsthat have smaller and bigger flavor sizes also support the smaller and bigger labels19:05
corvus(to make sure they all have images built)19:05
clarkbgotcha19:05
corvussounds like that may be bad timing because of the noble thing?  :)19:05
corvusbut the noble kernel thing got me thinking19:06
corvuszuul19:06
corvuser19:06
corvuszuul-launcher does have the ability to validate builds.  we're not exercising that yet.  and i don't think we want to validate too much.  but i think maybe validating things like basic network connectivity and iptables, etc, would be reasonable.19:06
corvusso if we think it's a good idea, and if anyone wants to volunteer to write a validation job that does something like that, i'd be happy to mentor/review.19:07
clarkba "does it boot and have our base configuration" doe seem reasonable19:07
clarkbI agree we don't want to try and catch every possible problem our jobs may have though19:08
clarkb#action someone look into basic image validation with zuul-launcher19:08
clarkbanything else on this subject?19:09
corvusnope19:09
clarkb#topic Container hygiene tasks19:09
clarkb#link https://review.opendev.org/q/topic:%22opendev-python3.12%22+status:open Update images to use python3.1219:09
clarkbwe did end up managing to update our base python container images. The next step is to start rolling out updates to rebuild the containers to use python3.1219:09
clarkbthis ensures we're using the newly built base image content and moves us from python3.11 to 3.1219:09
clarkbthis sin't terribly urgent but  Ithink it is good hygiene to make these updates periodically if people have time to do reviews19:10
clarkbI'm going to pop out this afternoon but I'm ahppy to approve things tomorrow when I can monitor fi reviews happen19:10
clarkbthe early changes to python3.12 have just worked thankfully as well19:10
corvusare we happy with the performance of py 3.12 these days?  i want to say at one point we were not super motivated to bump zuul for performance reasons, but maybe that time has passed?19:11
clarkbcorvus: its been fine for the small utilities we use it for within opendev. I think the zuul unittest jobs are still consistently slower on noble with py312 than they are with jammy/bookwork and py31119:11
clarkbI think zuul is likely to be the only thing where the performance might be noticed by us? since zuul is often quite busy. Maybe nodepool too19:11
clarkbbut I'm also happy to try it and see if we notice the difference isn't massive19:12
corvusdo you happen to know about 3.13?19:12
clarkbI have it installed locally but haven't done much with it. We dont' have base images for 3.13 yet but we can add those now that 3.10 is gone19:12
corvuswonder if maybe we should just wait for that vs going to 12...19:13
clarkbI don't know if there is a good wy to install it for zuul unittests other than pyenv19:13
fungiunless we add debian-trixie images19:13
clarkband unfrotauntely with pyenv you either do an hour long compile to get something production like or a 2 minute compile and its slower. Good for testing compatibility and less for performance19:13
fungimight be getting close to time to think about that, it's already entered soft freeze time19:13
corvusokay.  i think that's good info.  i'm not going to rush out and bump zuul right now; and instead will keep an eye out for a 3.13 opportunity.  and we can bump to 3.12 when it's more pressing.19:14
clarkbworks for me19:14
fungithough the official trixie release is probably still a few months away19:14
clarkbmaybe we add an early pyenv job to check for compaitiblity issues but don't read into performance too much19:14
clarkbunless miraculously the 3.13 compiled without optimizations manages to be faster than the optimized 3.11 and 3.12 builds from the distro19:15
clarkbin that case we acn probably safely assume the optimized builds will be at least as performant19:15
corvusi might try some time trials locally with docker images, but that is not at the top of my list right now19:15
clarkbsounds good19:16
fungiyeah, part of the challenge with optimizing cpython builds is that it needs to compile, then run a battery of tests with profiling on, then build again based on the results19:16
clarkbanything else on this topic?19:16
fungiso it's basically double the compile time plus the minimal make test time19:16
corvusgood here19:16
clarkb#topic Dropping uWSGI19:16
clarkbthis is a related item where I'd like to drop uWSGI. After rebuilding the base container images last week this is a bit less urgent so I've deprioritized it19:17
clarkbbut my goal is still to swithc lodgeit to granian or similar and then we can stop worrying about uwsgi entirely19:17
clarkbthe main gotcha here is the deployment and image have to change so we're potentailly taking a small downtime. I think the best way to approach that is to put the service in the emergency file, land the image update, manually edit the docker-compose.yaml and pull the new image and restart things then alnd a system-config update to reflect that19:18
clarkball of which I'm happy to do when I have time or for someone else to push along if they are interested. Its not like I'm really into wsgi servers I just watned to improve our paste service and container image builds19:18
clarkbin the meantime reviews welcome but I think we had general consensus last week that htis was ok for a service like paste19:19
clarkb#topic Upgrading old servers19:19
clarkbI have continued to upgrade servers. Since last week I'ev upgraded all of the nodepool builders and launchers and the osuosl mirror19:19
clarkbno new problems have been found with podman / docker compose / noble which is nice19:20
clarkbbtu it is worth mentioning that since I completed that work yesterday Noble's kernel updated and now has a bug managing ipv6 firewall rules that I suspect may cause problems for launching new servers until fixed19:20
clarkbthe next server on my todo list is the rax iad mirror and it can serve as a canary for ^19:20
clarkb#link https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2104134 Noble kernel bug19:21
clarkbalso I think the new osuosl builder and mirror may perform better than the old ones19:21
clarkbthe image builds defniitely seemed quicker19:21
clarkbanyone else have updates for replacing servers?19:22
funginot i19:23
clarkbthen I'll end this topic with a reminder that every little bit helps. There is a fairly large backlog of servers to replace and I appreciate all the help I can get doing so19:23
clarkb#topic Running certcheck on bridge19:24
clarkbI don't have any updates on this. Too many more urgent things keep popping up. But I would still like to explore ianw's suggestion on this19:24
clarkbparticularly since it doesn't seem to be urgent at the moment to move to bridge19:24
clarkbrelated to that LetsEncrypt is going to stop sending email reminders taht your certs expire soon. We've never relied on their emails so not a big deal19:24
fungiyeah, i'm happy to abandon my change in gerrit19:25
fungiit started out as a tiny thing, single ansible task, and snowballed19:25
clarkbI think if it were broken right now we'd hurry up and fix it one way or another19:26
clarkbbut it was just a github blip iirc so we're limping along otherwise19:26
clarkb#topic Working through our TODO list19:26
clarkb#link https://etherpad.opendev.org/p/opendev-january-2025-meetup19:26
clarkbjust a friendly reminder we haev this list if you need to find something impactful to do19:27
clarkbthis applies to existing opendev contributors and new contributors alike19:27
clarkband feel free to reach out if there are questions about a topic you'd like to get involved in19:27
clarkb#topic Upgrading to Gitea 1.23.619:28
clarkb#link https://review.opendev.org/c/opendev/system-config/+/94541419:28
clarkbgitea just made a new release. I pushed a change up to upgrade our installation with links to the changelog. It looks fairly straightforward to me but double checking is always great. Particularly with the openstack release next week19:29
fungithanks, i meant to review that yesterday but time got away from me19:29
clarkbgood news is the memcached and firewall changes seem to be working well. I want to say gitea performance has been very consistent for me since we implemented those two changes19:29
fungiyeah, awesome work!19:29
clarkband again a reminder taht I probably won't be able to monitor today but happy to do so tomorrow if people review and want to defer approvals19:29
fungiyeah, i'm going over the changelog now, but will refrain from approving tonight19:30
clarkb#topic Rotating mailman 3 logs19:30
fungioh, right, we still need... a logrotate config?19:30
clarkbthis is mostly a reminder that we're not rotating mm3 logs and that is both an oversight in our config management but one that worked out for us sort of because upstream doesn't support rotating logs?19:31
clarkbI don't want us to forget and think we should start trying something and see what breaks. Maybe even push up a logrotate config then hold a node?19:31
fungiyeah, i remember i looked into it, now i don't recall the details. something about not gracefully following the inode change19:31
clarkbwe don't need to test in prod necessarily, but I would like to see us fix this before we have a 20Gb log file that fills a disk and we're scrambling to fix it19:31
clarkbfungi: ya it keeps writing to the old file until you restart it or something19:32
clarkband maybe our logrotate config rotates the file and restarts the service?19:32
fungiseems like if we use teh copy-truncate method in logrotate it will probably work? at least those were the workarounds i saw mentioned19:32
clarkbnot ideal but doable19:32
clarkbya copy truncate might work but I think someone reported it had problems too19:32
fungimmm19:32
clarkbanyway this is on here as a reminder I don't have any answers. Happy to help review and reread issues though19:32
fungii'll try to revisit that and get something up for it19:32
clarkbthanks19:33
clarkb#topic Open Discussion19:33
clarkb#link https://etherpad.opendev.org/p/opendev_newsletter19:33
clarkbwe've been asked to help write a blurb for the openinfra newsletter going out soon. I put a draft on that etherpad19:33
clarkbwould be great if you have a moment to read it and check for accuracy and also think about whether or not we want to get involved in the ai web crawler mess19:34
clarkbits possible that calling it out like that might get a target put on our backs and so far it sounds like we've had it relatively easy compared to other open source projects19:34
clarkbI think the foundation newsletter editors want a final draft tomorrow19:35
clarkbso provide feedback soon19:35
clarkboh and while I keep saying tomorrow I can approve things that may end up being weather dependent. The weather pattern that is making it super nice and warm today for a bike ride is going to get smashed into by cooler weather tomorrow and generate thunderstorms with possible tornadoes and large (for us) hail19:36
clarkbI don't expect problems because thunderstorms here tend to be mind, but I thought I'd call that out19:36
clarkb*tend to be mild19:36
clarkbI might clear out the garage tonight though19:37
clarkbanything else?19:38
fungii got nothin'19:38
clarkbif that is everything we can all have ~20 minutes to do something else19:38
clarkbthank you everyone!19:39
clarkbwe'll be back here next week at the saem time and location19:39
clarkb#endmeeting19:39
opendevmeetMeeting ended Tue Mar 25 19:39:11 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:39
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2025/infra.2025-03-25-19.00.html19:39
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2025/infra.2025-03-25-19.00.txt19:39
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2025/infra.2025-03-25-19.00.log.html19:39

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!