Tuesday, 2023-09-19

* fungi waves19:00
clarkbhello there19:00
fungiohai19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Sep 19 19:01:25 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/ZBZMOM7RXD4AXORIXJO537T3YDOJPFTW/ Our Agenda19:01
clarkb#topic Announcements19:01
clarkbWe are deep into the OpenStack release period and OpenStack elections end tomorrow19:01
clarkbgo vote if you haven't already19:02
clarkb#topic Mailman 319:03
clarkbfungi: the plan is still for starlingx and openinfra to migrate on thursday at ~15:30 UTC?19:03
fungi#link https://etherpad.opendev.org/p/mm3migration maintenance plan starts at line 19819:03
fungiyes19:03
fungitomorrow i'll prime the rsyncs and remind the relevant community managers/liaisons19:04
fungibut we're on track there19:04
clarkbfungi: probably want to update dns records at the same time tomorrwo too?19:04
clarkb(to reduce TTLs?)19:04
fungii did the dns ttl adjustments early because one of the domains is in cloudflare19:04
fungialready crossed off the list19:04
fungiwanted to make sure i could still get to it19:04
clarkbaha19:05
fungithis time we'll approve the change earlier ahead of the window so we don't end up starting late19:05
fungi#link https://review.opendev.org/895205 Move OpenInfra and StarlingX lists to Mailman 319:05
fungiplease review19:05
fungii plan to approve it by 13:30 utc on thursday19:06
clarkbsounds like a plan. Any other prep work besides reviewing that change we can help with?19:06
fungiit can technically be approved as far ahead as we like, but easier not to need to roll that back if we end up postponing for some reason19:07
clarkband avoids having the two lists of mailing lists diverge19:07
clarkbthough at this point I suspect we'd say please wait for the migration to complete before adding a new list19:07
fungino remaining prep work for this window, though assuming it goes well i'll start planning for the openstack lists (15:30-19:30 utc on thursday 2023-10-12, a week after their release)19:07
fungii've already jotted some preliminary notes for that one, and plan to split the server/config management deprovisioning to a separate change and put the old server in emergency disable to avoid any accidents19:08
fungiyou can find the section for the final maintenance at the very end of the previously mentioned pad19:09
clarkbsounds good, thank you for pushing this along19:09
fungisure, looking forward to being done with it after what's been about a year of off-and-on effort from several of us19:10
clarkb#topic Server Upgrades19:11
clarkbnothing new here...19:11
clarkb#topic Nodepool image upload situation19:11
fungithe timeout increase merged yeah?19:12
clarkbyup yesterday19:12
clarkbour image builds look good too. The thing we lack in the dashboard is a listing of how old images are in each cloud provider19:12
clarkbbut I think in a week we can do a nodepool image-list and check for any that are more than 7 days olver19:13
clarkb*more than 7 days old19:13
fungiin semi-related news, cloudnull is back at rackspace and possibly has leverage/mandate to assign effort for fixing some of the problems we've observed19:13
fricklerthere are 6 and 13d old ones19:13
clarkbfrickler: the 13d images are the fedora ones19:14
clarkbwe'll need to clean up that dashboard to remove them I think19:14
fungi6d and 13d sounds perfect. like we'll be getting 0d and 7d tomorrow19:14
clarkboh you mean in the cloud sorry19:14
clarkball that to say early signs are this is working well enough for us again19:14
clarkbbut lets check back in again in a week and ensure that the complete set of changes is looking good19:15
fricklerack19:15
fungiwe should probably also do another leaked upload cleanup to see if we keep getting more19:15
clarkbfungi: good idea19:16
frickleryes, now would be a good time to see if the 6h timeout helps with that19:16
fungii can try to find time for that later this week19:17
clarkbthanks. Anything else nodepool related?19:18
funginothing i'm aware of19:19
clarkb#topic Zuul PCRE deprecation19:19
clarkb#link https://etherpad.opendev.org/p/3FbfuhNmIWT33fCFK1oK Draft announcement for OpenDev users (particularly OpenStack)19:19
clarkbI think corvus was looking for feedback on this before sending it out. At this point I want to say we have chimed in so corvus  you are probably good to send that when ready?19:19
fricklernote I added some comments in the etherpad, so it shouldn't be sent as is19:20
clarkbthank you everyone for reviewing that19:22
fricklerI also tasked kopecmartin to look at the qa projects and started doing patches for OSC myself, those are the largest batches of warnings I saw19:22
clarkbtripleo-ci repo had a lot of them too last I looked19:22
frickleror some of tha largest, yes19:22
fricklerbut tripleo was to be retired somehow? need to check the timeline for that19:22
fungiyeah, that's one to bring up with the tc19:23
fungii wouldn't sink lots of effort digging into tripleo, we can just ask the tc how they want it handled19:23
fungimaybe it can be retired instead19:24
clarkbI think that repo supports the stable branches they've kept open but in that case they should be able to fix it19:24
clarkbeither way we can come up with a solution19:24
fungiright. it's a question of whether they're keeping those open in light of the new "unmaintained" and opt-in or automatic eol resolution19:25
clarkb#topic Python image updates19:25
clarkb#link https://review.opendev.org/q/(+topic:bookworm-python3.11+OR+hashtag:bookworm+)status:open19:25
clarkbI think I've decided that we should avoid the Gerrit update until after the openstack release. The two drivers for this are that Irealized there are plenty of other less impactful services that we can work on in the meantime (changes for some pushed up under that link) and Gerrit made a 3.7.5 release that we should also update to at the same time19:26
clarkbthat means our gerrit update will be a 3.7.4->3.7.5 upgrade, java 11 -> java 17 upgrade, and bullseye -> bookworm upgrade19:26
clarkbwe can split those up or go at once but either way enough changing that I think we should avoid the openstack release19:27
fricklersounds reasonable to me19:27
clarkbIn the meantime reviews on the other changes I've pushed are appreciated and I think I can approve and monitor those while we wait19:27
clarkbnote I've asked kopecmartin to weigh in on the refstack update too19:28
clarkbI'm hopeful that we'll be able to drop bullseye and python3.9/3.10 image builds soon enoguh. Then we can look at adding 3.12 builds once that release occurs19:28
clarkbbut one step at a time :)19:28
clarkb#topic Redeploying the InMotion/OpenMetal cloud19:29
clarkbI sent email to yuriy to gather some initial information on what this involves. I cc'd fungi and frickler19:29
clarkbYuriy responded, but it wasn't imemdiately clear to me what a redeployment would be based on if we deployed today. Yuriy did talk about how in the new year they would be on 2023.1 or 2023.2 openstack.19:30
clarkbAnyway I asked for further clarification and volunteered to set up time to meet and coordinate if necessary (I think that yuriy in particular finds the more synchronous interaction beneficial)19:30
fricklerso currently they would be able to deploy yoga iiuc19:30
fungiin light of that, it might make sense to delay rebuilding for a few more months19:30
clarkbya I wanted more clarification on the base OS stuff before we decide19:31
clarkbbut if the base OS doesn't move forward much by redeploying today then a delay may be worthwhile19:31
fungiright, that's more the deciding factor than openstack version, since we can in-place upgrade openstack19:32
clarkbOther items of note: we should call the new cloud openmetal not inmotion. They seem interested in continuing to help us successfully consume this hardware and service as well. Thank you openmetal for the help19:32
fricklerbut we wouldn't rename the cloud before the redeploy? other than possibly in grafana?19:33
clarkbfrickler: I think we can if it is important to them. It is a bit of a pain to do as we have to shutdown and startup a new provider in nodepool19:34
fungiyeah, i would say anything that's a significant lift in the renaming department should get wrapped into the redeployment19:34
clarkbdoable but if we can tie it into a redployment that will be easiest19:34
fungiwe did something similar for internap->iweb though19:34
frickleralso they want to rework the networking19:35
fricklerseems currently we have 6 x /28, they'd provide a single /26 instead19:35
fungieasy things to rename we can do straight away because why not? and right if they express urgency then we can accommodate that19:35
clarkbfrickler: yup that should hopefully simplify thigns for us19:35
fungiyes, their platform previously could only allocate networks in /28 prefixes19:36
fungibut they've apparently overcome that design limitation now19:36
fungi(i wonder if they're any closer to ipv6)19:36
clarkbIt also sounded like we may need to do the self signed cert thing19:36
clarkbkolla supports that so hopefuilly just a matter of setting the correct cars and having kolla rerun19:36
clarkboverall though an encouraging first email trade. I'll try to keep the thread alive and drive it towards something we can make a plan off of19:37
fricklerin 2023.2 kolla might support LE19:37
fungithanks!19:37
clarkb#topic Open Discussion19:38
clarkbAnything else?19:38
fungiif we served the dns for the api endpoints in our opendev.org zone, we could probably wrangle our own le with our usual ansible19:38
corvusthanks for the suggestions on the email, i'll update the etherpad later19:39
fungijust missing the bit to inject that into kolla19:39
corvusjust wanted to make sure we were all on board with the tone and the requested actions19:39
fungiyep, still lgtm19:39
fricklerfungi: you'd just place the resulting cert file into the correct location in the kolla config19:39
fungifrickler: right, we're missing however we'd tie that file deployment into our ansible19:40
fungii guess we'd put something in our inventory for wherever that is19:40
clarkbianw had started looking into that19:41
fungiseems like the linaro cloud could benefit from the same19:41
clarkbbasically a lighter weight base role application for systems where we don't want to manage firewalls and email and so on19:41
clarkbyup it was the linaro cloud where he was proving this out19:42
clarkbbasically add our users and potentailly other lightweight stuff. I could see working some of the LE provisioning into that19:42
clarkbspeaking of LE I wonder if we can unfork acme.sh yet19:42
clarkbLast call for any other items otherwise we can probably end a bit early today19:43
fungithanks clarkb!19:43
fricklerack, thx all19:44
clarkb#endmeeting19:45
opendevmeetMeeting ended Tue Sep 19 19:45:06 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:45
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-19-19.01.html19:45
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-19-19.01.txt19:45
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-19-19.01.log.html19:45
clarkbthank you everyone19:45

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!