Tuesday, 2025-08-19

clarkbjust about meeting time18:58
clarkb#startmeeting infra19:00
opendevmeetMeeting started Tue Aug 19 19:00:08 2025 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:00
opendevmeetThe meeting name has been set to 'infra'19:00
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/JQ73GENJBNTUU4QZD7SY6OO2NZ2H7WGU/ Our Agenda19:00
clarkb#topic Announcements19:00
clarkbI didn't have anything to announce. Did anyone else?19:01
fungii did not19:01
clarkb#topic Gerrit 3.11 Upgrade Planning19:02
clarkbThis continues to be half on the back burner. Except that upstream has made our lives more difficult again by publishing new releases :)19:02
clarkb#link https://review.opendev.org/c/opendev/system-config/+/957555 Upgrade Gerrit images to 3.10.8 and 3.11.519:02
clarkbProbably a good idea to land that and restart gerrit and recycle the holds so that when I do get to testing things I'm testing with up to date gerrit versions19:03
clarkbmaybe tomorrow or thursday for the gerrit restart depending on how zuul stuff goes between now and then?19:03
clarkbthe zuul upgrade and reboot playbook is looking happy so far, it just needs time, and zuul launchers haven't run out of disk yet so they also look happy19:04
clarkbmakes me think I probably will have time for that soon19:04
clarkbany comments or concerns around the gerrit 3.11 upgrade?19:04
clarkb#topic Upgrading old servers19:05
clarkbFungi cleaned up refstack and the old eavesdrop server stuff last week19:06
clarkbNext on the list are kerberos, openafs, graphite, and backup servers19:06
fungii'll try to get to kerberos/openafs later this week19:06
clarkbfungi: that would be great. Feel free to ping me if I can help in any way19:06
fungithose seem to all be on focal at the moment19:06
clarkbThen I wanted to call out a milestone that we appear to have reached: there are no more bionic servers19:06
clarkb#link https://review.opendev.org/c/opendev/system-config/+/957950 cleanup bionic node testing in system-config19:07
fungimy plan is to stick them in emergency disable, update our testing to jammy, in-place upgrade them, one by one to jammy, then maybe increase our testing again to noble and repeat, finally removing them from the disable list19:07
clarkbThis came up because ansible 11 (our new default in zuul) isn't compatible with the python version on bionic. I did a quick workaround yesterday then dug in more this morning and I believe that we don't have any servers running bionic so we can drop testing for that platform and use ansible 1119:07
corvushuzzah!19:08
clarkbfungi: that sounds like a great plan. I like the idea of going step by step and checkpointing just to catch any problems early19:08
fungiat least i hope our system-config jobs will give an early warning of serious problems before attempting to upgrade19:08
clarkbfeel free to double check me on the no more bionic assertion but best I can tell the hosts in our ansible fact cache reporting bionic are no longer in our inventory19:08
clarkbfungi: ya it should catch the more obvious stuff19:09
clarkbits a good check19:09
clarkbI also shut down gitea-lb02.opendev.org this morning as frickler reported its cronspam was not verifying now that its dns records are gone19:09
clarkbwe kept the server around as a debugging aid so I didn't want to delete it. But shutting it down for now seemed fine and I've done so19:09
clarkbany other server upgrade/replacement/deletion questions comments or concerns?19:10
corvusthat reminds me, we have a dns record for review02 but i think we're on review03 now?19:10
fungicacti/storyboard are xenial, looks like, and wiki is older19:10
fungiso i agree no more bionic that i can find19:11
fungiwe have stuff older than bionic but we also can't really test it at this point19:11
clarkbcorvus: correct we're on 03 now. I'll make a note in my todo list to look at that19:11
clarkbya and we removed testing for that older stuff a little while ago19:12
clarkbso ship has sailed...19:12
fungireview02 is also still in the emergency disable list19:12
fungialong with gitea-lb0219:12
clarkbthey aren't in our inventory any more so should be able to be removed from the emergency file. I can check on that too19:13
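The "no more bionic" cross-check described above (fact cache entries reporting bionic versus hosts still in the inventory) can be sketched roughly as follows. This is a minimal sketch, assuming the fact cache is a directory of per-host JSON files keyed by hostname and carrying an `ansible_distribution_release` fact; the function names and the idea of passing the inventory as a plain list of hostnames are hypothetical, not taken from system-config.

```python
import json
from pathlib import Path


def hosts_reporting_release(cache_dir, release):
    """Return hostnames whose cached facts report the given release codename.

    Assumes one JSON file per host, named after the host (an assumption
    about the cache layout, not something confirmed in the meeting).
    """
    hosts = set()
    for path in Path(cache_dir).iterdir():
        if not path.is_file():
            continue
        try:
            facts = json.loads(path.read_text())
        except (ValueError, OSError):
            # Skip unreadable or non-JSON cache entries.
            continue
        if isinstance(facts, dict) and facts.get("ansible_distribution_release") == release:
            hosts.add(path.name)
    return hosts


def split_live_and_stale(cache_dir, inventory_hosts, release="bionic"):
    """Split hosts reporting `release` into those still in inventory and stale cache entries."""
    reporting = hosts_reporting_release(cache_dir, release)
    live = reporting & set(inventory_hosts)
    stale = reporting - live
    return sorted(live), sorted(stale)
```

An empty `live` list would support dropping bionic testing; anything in `stale` is only a leftover cache entry for a host that is no longer managed.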
clarkb#topic Matrix for OpenDev comms19:14
clarkb#link https://review.opendev.org/c/opendev/infra-specs/+/954826 Spec outlining the motivation and plan for Matrix trialing19:14
clarkbI've updated the spec with the feedback that I got. It looks like ianw is happy with it now. Anyone else care to rereview?19:14
clarkbI guess corvus did ask about double checking the possibility of EMS hosted mjolnir which I haven't done19:14
clarkbwe should probably keep most of the discussion on this topic within the spec review19:15
corvusyeah, still seems like a good idea, but probably doesn't radically alter the next steps which are: run mjolnir (either ourselves or via ems)19:15
clarkbso please follow up there.19:16
corvusthough i'm still at "learn how to speel mjolnir" which is step 019:16
clarkbI think it is literally the hardest word to remember the spelling of and type19:16
clarkb#topic Pre PTG Planning19:16
fungisadly i'm enough of a mythology geek that i have no trouble spelling it19:17
clarkb#link https://etherpad.opendev.org/p/opendev-preptg-october-2025 Planning happening in this document19:17
clarkbI think we can consider this proposed schedule pretty well settled at this point as I haven't heard any feedback to the contrary19:17
fungisgtm, thanks!19:18
clarkbplease add agenda items to the etherpad if you have ideas for things to do or change etc19:18
clarkbI'll continue to add items myself as I think of them19:18
clarkb#topic Service Coordinator Election Planning19:19
clarkbThe service coordinator nomination period ends at EOD today on a UTC clock19:19
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/YXRD23ZWJGDPZ3WESBNZNEYO7NBCXFT4/19:19
clarkbyesterday I went ahead and nominated myself19:19
clarkbhttps://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/WNUYDT47NMYC3SC5QA44OG4PWK5ENQEF/19:20
clarkbit seemed like no one else was going for it and I wanted to make sure that I was well ahead of the deadline. If however I misread the room please speak up. I'm happy to step to the side or work together with someone etc19:20
clarkb(or have an election)19:20
clarkbbut I suspect it's a "tag, you're it" again situation for myself. And that is fine too. That said I think some variety would be a good thing and would happily support someone else in the role19:21
fungiyour sacrifice is appreciated19:22
clarkb#topic Loss of upstream Debian bullseye-backports mirror19:22
clarkbI think we had a rough outline of a plan here which included potentially breaking a small number of jobs19:22
fungisounds like we have a way forward for this yes, just haven't had time to start in on it19:22
clarkbany concerns with the plan since we last spoke about it here?19:23
clarkb(the plan is basically to clean up backports for bullseye and force jobs to find an alternative since that most accurately reflects the state with upstream)19:23
fungii just need to remember if we said base-jobs first or straight to zuul-jobs with an advance announcement19:23
clarkbshould be in the meeting logs (I believe it was zuul-jobs but double check)19:24
fungithe latter is less work for greater benefit in the long run, but will take a bit longer19:24
fungiyeah, that's what i thought19:24
clarkb#topic Etherpad 2.4.2 Upgrade19:26
clarkb#link https://github.com/ether/etherpad-lite/issues/7065 Theming updates have broken the no-skin and custom skin themes. We use no-skin.19:26
clarkbI was hopeful that there would be followup after they responded to the issue last week just after our meeting19:26
clarkbbut no. Basically they said no-skin isn't expected to be used, it's only there as an example for other skins, and I responded that no-skin is/was the only skin until colibris was added and we kept it for user familiarity and density of the text19:27
fungiit does seem like there's been some upstream turnover, to the point where the current maintainers aren't aware of what etherpad-lite used to look like19:27
clarkbstill waiting for a response indicating whether or not no-skin is no longer expected to be used19:27
clarkbbut hopefully we have an answer soon on whether or not we have to accept colibris or can continue as is19:27
fungibut also custom skins (presumably based on no-skin) are now broken too19:28
clarkbyes at least one person responded that this is the case I think they have some stuff to fix at least19:28
fungiso no-skin isn't currently useful even for the purpose they thought it was for19:28
fungiand if they fix that, it will likely be usable for us again too19:29
clarkbhere's hoping19:29
clarkb#topic Moving OpenDev's python-base/python-builder/uwsgi-base Images to Quay19:29
clarkb#link https://review.opendev.org/c/opendev/system-config/+/95727719:29
clarkblast week I noted that I had some concern that we may not use speculative images in builds with this switch. Since then I dug into the docker docs, our test job roles, and the change I wrote and I think this is a non issue19:30
clarkbour past selves already addressed these problems19:30
clarkbat this point I think the main concern is that we'll want to rebuild all of the python based images after this lands19:31
fungiyay! /pats selves on back19:31
clarkbprobably don't need to rush to do that but also won't want to delay19:31
clarkbI'm not sure there is ever a good time for something like that so it's mostly calling it out as a todo once that lands so we don't forget and use the old stale images forever19:31
clarkbso ya reviews welcome and feedback on timing too19:31
clarkb#topic Bootstrapping rax-flex iad319:32
clarkbthere is a third rax flex region and cloudnull has given us the go ahead to use it. Yesterday we landed a change to update our clouds.yaml files and set up the cloud launcher to preconfigure things19:33
clarkbcloud launcher failed on auth issues and it seems we still need to login to skyline first to have the new env sync account stuff from the old env19:33
fungi(along with increased quotas in all 3 regions)19:33
clarkbI have done this for both of our accounts in iad3's skyline service and the openstack client can list images in both accounts now19:33
clarkbI think I'll wait for our daily cloud launcher run to happen at ~0200 UTC and then look into bootstrapping a mirror tomorrow19:34
clarkbwe also need to set the network mtu to 1500 which is an extra step post cloud launcher19:34
clarkbthen cloudnull also suggested we reenable the rax classic regions and see if they are happier now. Looks like the first change to do that just merged19:35
clarkbIf we do see new or renewed problems with that we can use the existing email thread I started to followup19:35
clarkb(and if anyone would prefer I do the emailing just let me know)19:36
fungiyeah, the change for the first two regions just merged19:36
clarkbI split them because ord and dfw had a different failure mode to iad19:36
clarkbso want to reenable them separately for ease of reverts / debugging19:36
fungiright, i didn't approve the second for now19:36
clarkb#topic Open Discussion19:37
clarkbAnything else?19:37
fungii'll be gone next week and half the following week19:38
clarkbI mentioned earlier that the zuul upgrades and reboots seem happier now. I am running that playbook in a root screen on bridge out of band due to several consecutive failures the last few weeks19:38
clarkbfungi: enjoy your time of19:38
clarkb*off19:38
fungithanks!19:38
clarkbthe ansible 11 switch has gone really well I think. The main issues we have encountered are Bionic and older nodes not being supported by ansible 11 due to python version incompatibilities19:41
corvus++19:41
clarkbthen we also discovered skyline used a list for playbook vars: https://opendev.org/openstack/skyline-apiserver/src/branch/master/playbooks/devstack/pre.yaml#L8-L9 and ansible 11 doesn't like that19:41
fungiand that one weird skyline job19:41
fungiyeah that19:41
clarkbbut I'm not sure older ansible was even doing the right thing with that config. It should also be a trivial problem to fix19:42
fungiit was likely ignored19:42
corvusi get the idea that was like removing prehistoric syntax19:42
corvusyou think?  i thought it was just a super weird old way to specify vars19:42
fungii should be surprised that turned up in one of the newest openstack projects, but i'm not19:43
fungiyeah maybe it did actually work in 919:43
clarkbya it's possible it just worked until ansible decided it was weird and stopped being backward compatible19:43
clarkbin any case straightforward to fix19:43
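To illustrate the kind of fix discussed above: a minimal sketch, assuming the problem is a play whose `vars` is written as a list of single-key mappings rather than one mapping. It operates on an already-parsed play (a plain dict); the helper name and the sample data are hypothetical and are not copied from the skyline playbook.

```python
def normalize_play_vars(play):
    """Return a copy of `play` whose `vars` is a single mapping.

    If `vars` is a list of mappings, the entries are merged in order,
    with later entries overriding earlier ones on key collisions.
    """
    raw = play.get("vars")
    if raw is None or isinstance(raw, dict):
        # Nothing to do: vars is absent or already a mapping.
        return dict(play)
    if not isinstance(raw, list):
        raise TypeError("vars must be a mapping or a list of mappings")
    merged = {}
    for item in raw:
        if not isinstance(item, dict):
            raise TypeError("each entry in a vars list must be a mapping")
        merged.update(item)
    fixed = dict(play)
    fixed["vars"] = merged
    return fixed


# Hypothetical example data, not the real playbook contents.
old_style = {"hosts": "all", "vars": [{"first_var": "a"}, {"second_var": "b"}]}
new_style = normalize_play_vars(old_style)
```

In an actual playbook the equivalent change is simply rewriting the `vars` list as a mapping by hand; the sketch only shows the shape of the data before and after.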
clarkbif you see any other ansible 11 issues its good to make note of them as this sort of info can go into zuul's changelog for ansible things19:44
clarkbhelps other zuul users19:44
clarkbok last call. Anything else? Otherwise we can end about 15 minutes early today19:44
fungibindep and git-review were testing on older python versions we'll need to make some decisions about19:45
fungipbr too19:45
clarkbI think for bindep and git review we just drop the old stuff and move on. They have old releases that can run with old python19:45
clarkbpbr is trickier and probably worth keeping python2.7 still since swift only just dropped support for that version19:46
fungiseems like we can move pbr's py27 testing to newer platforms, but will probably need to drop 3.5-3.7 testing19:46
clarkb(and that means updating the python2.7 test job to jammy I think)19:46
clarkbya exactly19:46
fungiwell, or focal19:46
clarkbsounds like that may be everything. Thanks everyone!19:48
fungithanks clarkb!19:48
clarkbI'll probably run a meeting next week despite the expected lower attendance. It's good to capture the goings on for people to review if nothing else19:48
clarkb#endmeeting19:48
opendevmeetMeeting ended Tue Aug 19 19:48:39 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:48
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-19-19.00.html19:48
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-19-19.00.txt19:48
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2025/infra.2025-08-19-19.00.log.html19:48
