Tuesday, 2023-10-31

* tonyb plays typical gameshow countdown music18:58
fungii hope it's the music from the gameshow "countdown"18:58
clarkbI'll take OpenStack for $40018:58
fungisounds expensive18:59
tonybclarkb: What will the name of the next OpenStack release be?18:59
fungitoo long?19:00
clarkbwhat is caracal19:00
fungino, wait, unpronounceable!19:00
clarkb#startmeeting infra19:00
opendevmeetMeeting started Tue Oct 31 19:00:32 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:00
opendevmeetThe meeting name has been set to 'infra'19:00
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/IILSVTDAEDTRTCRSZZ3P2UKY4CIOKUEY/ Our Agenda19:00
clarkb#topic Announcements19:00
clarkbI announced the gerrit 3.8 upgrade for November 17 and fungi announced a mm3 upgrade for Thursday19:01
fungishort notice, i mainly just didn't want ui changes and logouts to surprise anyone19:01
clarkbAlso worth noting that November 23 is a big US holiday which may mean people have varying numbers of days off around then19:02
clarkbother than that I didn't have any announcements19:02
clarkb#topic Mailman 319:02
clarkb#link https://review.opendev.org/c/opendev/system-config/+/899300 Upgrade to latest mm3 suite of tools19:02
clarkbthis is the change that corresponds to the mailman3 upgrade fungi announced19:03
fungias of moments ago the remaining cleanup changes merged19:03
clarkbthere were two other cleanup changes but ya they have both merged now19:03
fungiso, yes, and yes, 899300 is the upgrade for thursday19:03
clarkboutside of those changes we still need to snapshot and cleanup the old server at some point19:04
clarkband I've left the question of whether or not we add MX records on the agenda19:04
clarkbI'm somewhat inclined to leave things alone in DNS since this seems to be working and is simpler to manage19:04
tonybyeah they shouldn't be needed.19:04
fungidown the road we can revisit things like spf and dkim signing if they become necessary, but i'd rather avoid them as long as we can get away with19:05
tonyb++19:05
clarkbfungi: both google and yahoo have been making statements about requiring that stuff early next year...19:06
clarkbbut ya lets worry about it when we get more concrete details19:06
clarkbmeanwhile the vast majority of spam I receive comes from gmail addresses19:06
clarkbAnything else mailing list related?19:06
funginope19:07
clarkb#topic Server Upgrades19:07
clarkbtonyb started looking at jammy mirror testing19:07
clarkb#link https://review.opendev.org/c/opendev/system-config/+/89971019:07
tonybit's currently failing due to a change in behaviour in curl19:07
clarkbthis appears to be failing but I think for a test framework reason not necessarily because the jammy deployment is failing. I don't fully understand the failure though19:07
clarkboh is it curl that changed? fun19:07
tonybyeah19:07
tonyba command line that works on focal fails on newer curls19:08
tonybI'm looking at what the right fix is.19:08
clarkbtonyb: I think what the goal is there is to set SNI stuff properly so that we get the correct responses?19:08
tonybI'll propose a testing fix first and then add the jammy server 19:09
clarkbotherwise using localhost we don't align with the apache vhost matching19:09
fungii suppose an alternative could be to fiddle with /etc/hosts on the node, but that feels... dirty19:09
tonybyeah. that makes sense.  we could probably do something in python itself19:10
tonybit should be a quick fix once I get back to my laptop 19:10
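
A minimal sketch of the "do something in python itself" idea from the exchange above, assuming the goal is to connect to the local Apache while presenting the mirror's hostname for SNI and in the Host header, so that vhost matching behaves as it would for a real client. The function names, the default address and port, and the choice to skip certificate verification by default are illustrative assumptions, not what the test change actually does.

```python
import socket
import ssl


def build_request(vhost, path="/"):
    """Build a minimal HTTP/1.1 GET whose Host header names the vhost."""
    return (
        f"GET {path} HTTP/1.1\r\n"
        f"Host: {vhost}\r\n"
        "Connection: close\r\n\r\n"
    ).encode("ascii")


def fetch_from_local(vhost, path="/", address="127.0.0.1", port=443, verify=False):
    """Connect to `address` but send `vhost` as the TLS SNI name and Host header.

    Assumption: the test node's certificate may not validate for the vhost,
    so verification is off unless the caller asks for it.
    """
    context = ssl.create_default_context()
    if not verify:
        # check_hostname must be disabled before verify_mode can be CERT_NONE.
        context.check_hostname = False
        context.verify_mode = ssl.CERT_NONE
    with socket.create_connection((address, port), timeout=10) as raw:
        # server_hostname is what sets SNI, independent of the address dialed.
        with context.wrap_socket(raw, server_hostname=vhost) as tls:
            tls.sendall(build_request(vhost, path))
            chunks = []
            while True:
                data = tls.recv(4096)
                if not data:
                    break
                chunks.append(data)
    return b"".join(chunks)
```

Calling fetch_from_local("mirror.example.org", "/") on the node would return the raw response bytes for that vhost; mirror.example.org is a placeholder hostname. This avoids editing /etc/hosts, at the cost of carrying a small helper in the tests.
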
clarkblooking forward to it19:10
clarkbI am not aware of any other updates but wanted to mention we are also in a good period of time to look at meetpad server replacements since the PTG just ended19:11
tonybat some point we should decide if we need testing for 3 Ubuntu releases 19:11
tonybmeetpad is next 19:11
clarkbtonyb: in general I think we're trying to align with all the things we're deploying. As we replace servers and reduce the total list of mirrors we can reduce the ubuntu flavors19:11
clarkbtonyb: the main concern there is apache version differences (which haven't been a problem in more recent years) and openafs functionality19:12
tonybokay that's pretty much what I thought 19:12
fungiyeah, basically we want to test that the changes we make to stuff continues to work on the platforms we're currently running on, and then once we're not running on those platforms we can stop testing them19:12
tonyb++19:12
fungion a service-by-service basis19:13
clarkbonce upon a time we tried to be more generally compatible with people doing similar to us outside of our env but realized it was too much effort and should focus on our set of things19:13
fungiso basically, if we upgrade the meetpad servers to jammy, we can then switch to only testing that meetpad deployments work on jammy19:13
clarkb#topic Python Container Updates19:14
clarkb#link https://review.opendev.org/q/(+topic:bookworm-python3.11+OR+hashtag:bookworm+)status:open19:14
clarkbthis is very close to the finish line (as much as there is one)19:14
clarkbpython 3.9 is gone and the current TODOs are to update zuul-operator and OSC to python3.1119:14
clarkbOSC should merge its change soon I expect as openstack is voting on python3.11 jobs now which makes switching the image to python3.11 safe19:15
clarkbon the zuul-operator side of things the CI jobs there are all unhappy and I'm not quite sure the scope of the necessary fixes yet19:15
clarkbI was hoping zuul-operator users would get it sorted soon enough but I may need to help out19:15
clarkbonce that is done we can drop python3.10 image builds19:15
tonybyay19:15
clarkbI've also got a change up to add python 3.12 images but that is failing because uwsgi doesn't support python3.12 yet.19:16
clarkbI think we can wait for them to make a release that works (there is upstream effort to support newer python but not yet in a release)19:16
tonyba quick tangent, I think it'd be good to remove old images/tags from the public registry 19:16
tonybleaving buster based 3.7 images feels dangerous?19:17
clarkbmaybe? openshift recently broke zuul's openshift functional testing because they deleted old images19:17
fungisounds like a refcounting challenge19:17
tonybI could generate a list of things we could tag as deprecated and pull later19:17
clarkbthere is definitely a tradeoff. I think if someone is using an image for testing its fine, but you're right you wouldn't want it in production19:17
clarkbmaybe retag as foo-deprecated19:18
tonybfair enough.19:18
fungifoo-dangerous19:18
clarkbthen people have an out but it also makes it more apparent if something should not be used19:18
tonybyeah that's sort of what I was thinking 19:18
clarkbI think that would be my preference over proper deletion19:18
fungifoo-if-it-breaks-you-get-to-keep-the-pieces19:18
tonybthat's all I had19:19
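
A rough sketch of the "generate a list of things we could tag as deprecated" step discussed above, assuming tags are plain strings and that deprecation is decided by some caller-supplied rule. The tag names and the "-deprecated" suffix are hypothetical examples; nothing here talks to a registry.

```python
def deprecated_name(tag, suffix="-deprecated"):
    """Return the tag with the deprecation suffix appended exactly once."""
    return tag if tag.endswith(suffix) else tag + suffix


def retag_plan(tags, is_deprecated):
    """Map each tag judged deprecated to its new name; other tags are left out."""
    return {tag: deprecated_name(tag) for tag in tags if is_deprecated(tag)}


# Hypothetical tag list: treat anything built on buster as deprecated.
plan = retag_plan(["3.7-buster", "3.11-bookworm"], lambda tag: "buster" in tag)
```

The resulting mapping could be reviewed by hand before any retagging happens, which keeps old images available while making their status obvious.
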
clarkb#topic Gitea 1.2119:19
clarkb#link https://review.opendev.org/c/opendev/system-config/+/89767919:19
clarkbstill no proper release and no changelog19:19
clarkbI have tried to keep up with their updates though and it generally works for us other than the ssh key size check thing that I disabled in that change19:20
clarkbI've left this on the agenda under the assumption we'll have to make decisions soon but upstream hasn't made that the case yet19:20
clarkb#topic Gerrit 3.8 Upgrade19:21
clarkbThis is one with a bit more detail19:21
clarkb#link https://etherpad.opendev.org/p/gerrit-upgrade-3.819:21
clarkb#link https://lists.opendev.org/archives/list/service-announce@lists.opendev.org/thread/XT26HFG2FOZL3UHZVLXCCANDZ3TJZM7Q/19:21
clarkbI have announced the plan to upgrade Gerrit to 3.8 on November 17 at 15:30-16:30 UTC19:21
clarkbI've tested the downgrade path on a held CI node and then re upgraded it for the experience of it19:21
clarkbYesterday we merged a config update necessary for 3.8 that we'll want to have in place under 3.7 to ensure it is working there as well. My plan is to restart Gerrit later today19:22
clarkbthis config update shouldn't result in any behavioral differences. It is entirely about maintaining compatibility of acceptable config in gerrit 3.819:22
clarkbfungi: if I want to do that restart around say 22:00 UTC is that a bad time for you?19:23
clarkbfungi: maybe better to ask if there is a good time for you later today?19:23
fungisure, i can help at 22:00 utc19:24
clarkbcool I think that time should work for me19:24
fungigreat19:24
clarkbthe last thing I've noticed is a traceback starting up the plugin manager plugin. Upstream thought they had already fixed it which made me concerned this was a problem with our builds but on closer inspection it seems to be a different problem (tracebacks differ)19:25
clarkbalso we hit it on 3.7 too so shouldn't impact 3.819:25
fungiso basically not a regression19:25
clarkbmore of a thing to be aware of as an expected startup traceback that looks scary but is believed to be fine19:26
fungijust some continued broken for a feature we're not using19:26
clarkbyup19:26
clarkband that was all I had. The etherpad has pointers to the held node if anyone wants to take a look at it19:26
clarkb#topic Etherpad 1.9.419:27
fungiin progress19:27
clarkbin the time it took us to be ready to upgrade to 1.9.3 they released 1.9.4. Fun fact: 1.9.4 fixes the mysql isn't utf8mb4 encoded bug I filed with them years ago19:27
fungii need to finish diffing the upstream container configs19:27
clarkbwe worked around that by manually setting the encoding on the db but before that etherpad hard crashed because it couldn't log in this instance19:27
tonybwas that the poo emoji crash from like Vancouver?19:28
fungisnowman19:28
fungibut yes19:28
tonybokay19:28
clarkbtonyb: no, this was on the db level not the table level. They had fixed the table level thing prior19:28
clarkbits all related though.19:29
clarkbIn this case they wanted to log "warning this is probably a problem" but their logging was broken so the whole thing crashed19:29
clarkbrather than bad bytes causing the crash later19:29
tonyblol19:29
fungiah, right, that problem19:29
clarkbfixing the db level encoding meant it never tried to log and things proceeded :)19:30
fungialso related, update to log4js which invalidates some of the config we're carrying, preventing the service from starting, which is why i need to more deeply diff the configs19:30
clarkbfungi: I guess once we have an updated change we'll hold a node and do another round of testing19:30
fungicorrect19:30
tonybsounds good.19:30
clarkb#topic Open Discussion19:31
clarkbthat was it for the emailed agenda19:31
clarkbworth noting we just updated nodepool to exclude openstacksdk 2.0.0 as it isn't compatible with rax cinder v2 apis19:31
clarkba fix is in progress in openstacksdk which frickler and I mentioned we could help test19:31
clarkbthis effectively took rax offline in nodepool for a few days. It also causes nodepool to not mark nodes as node failures when a cloud is failing like that19:32
clarkbI kinda want to make nodepool fail the request in that cloud when the cloud is throwing errors rather than try forever19:32
tonybthat seems like it'd be more visible 19:33
fungithe openstack vmt would like a private room on the opendev matrix homeserver to use instead of its current restricted irc channel, since some members are joining from the oftc matrix bridge which doesn't handle nickserv identification very well. i doubt there will be any objections, but... objections? otherwise i'll work on adding it19:33
fricklerit failed very early, kind of a similar scenario to the expired cert issue, which also could use better handling19:33
clarkbfrickler: ya I think they have the same underlying failure method for request handling internally with nodepool which is to basically move on and then the request is never completed19:33
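
A minimal sketch of the "fail the request rather than try forever" behaviour floated above, assuming a request handler that retries a launch a bounded number of times and then reports failure. This is not nodepool's code or API: CloudError, handle_request, and the launch callable are hypothetical names used only to illustrate the idea.

```python
class CloudError(Exception):
    """Stand-in for any error raised while talking to a cloud API."""


def handle_request(launch, max_attempts=3):
    """Try to launch a node; mark the request failed after max_attempts errors.

    `launch` is a hypothetical zero-argument callable that returns a node
    or raises CloudError.
    """
    errors = []
    for attempt in range(1, max_attempts + 1):
        try:
            node = launch()
        except CloudError as exc:
            # Record the error and try again until the attempt budget runs out.
            errors.append(f"attempt {attempt}: {exc}")
            continue
        return {"state": "fulfilled", "node": node, "errors": errors}
    # Every attempt failed: surface a failure instead of leaving the request pending.
    return {"state": "failed", "node": None, "errors": errors}
```

Returning an explicit failed state is what would make a broken cloud visible to whoever is waiting on the request, instead of the request silently never completing.
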
clarkbfungi: no objections from me. Worth noting private and encrypted are distinct in matrix so you'll have to decide on those two things separately iirc19:34
fungiyeah, it'll be both in this case19:34
clarkbprivate is basically invite only and then encrypted is whether or not everyone is doing e2e amongst themselves19:35
fricklerafaict even then some things are not encrypted like emojis19:35
fricklerbut not worse than IRC likely, so no objection either19:35
tonybthanks fungi 19:36
fungithe vmt uses its private communication channel only for coordinating things which can't be mentioned in public (and even then it's just things like "i triaged this private bug, please take a look: <url>" so emojis rarely come into it ;)19:36
clarkbre Holidays the 10th is also a holiday here and I'm taking advantage for a long weekend. I won't be around on the 10th and 13th19:37
fungii'll try to be around19:38
fricklerone question regarding the branch deletion I did for kayobe earlier: the github mirror should sync this on the next merged change, right?19:38
fungicorrect19:38
fungithat job only gets triggered by changes merging, so addition/deletion of branches or pushing tags doesn't replicate immediately19:38
fricklerok, so we'll wait for that to happen and then can check again19:39
fungiit could probably be added to additional pipelines, if that becomes a bigger problem19:39
fricklerI don't think it is urgent in this case, just wanted to cross that check off my list19:40
clarkblast call for anything else. Otherwise we can all have a few minutes back for $meal or sleep 19:41
clarkbthank you for your time and help everyone! We'll be back here same time and place next week.19:43
clarkb#endmeeting19:43
opendevmeetMeeting ended Tue Oct 31 19:43:17 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:43
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-10-31-19.00.html19:43
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-10-31-19.00.txt19:43
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-10-31-19.00.log.html19:43
fungithanks clarkb!19:43
tonybthanks all19:43
fricklero/19:43
-opendevstatus- NOTICE: Gerrit on review.opendev.org will be restarted to pick up a configuration change required as part of Gerrit 3.8 upgrade preparations.22:00
