Tuesday, 2022-05-24

clarkbjust about meeting time. We'll get started shortly19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue May 24 19:01:12 2022 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link https://lists.opendev.org/pipermail/service-discuss/2022-May/000337.html Our Agenda19:01
ianwo/19:01
frickler\o19:01
clarkb#topic Announcements19:01
clarkbA reminder that the Summit starts in 2 weeks. Running June 7-919:01
clarkbI think we should go ahead and cancel the meeting on June 7 (I don't anticipate being able to attend19:01
fungii support that19:02
fricklerwho is going there? me isn't19:02
clarkbI think fungi and myself will be there19:02
fungiif others want to hold a meeting that tuesday i'm not opposed, i'll just catch up from the meeting log19:03
clarkbya thats fine too. I won't be able to run it19:03
fricklermaybe we can decide next week?19:04
clarkbsure19:04
clarkbAlso keep in mind the event if making changes to things like etherpad which will be used by the forum19:04
clarkb#topic Actions from last meeting19:05
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2022/infra.2022-05-17-19.01.txt minutes from last meeting19:05
clarkbThis wasn't recorded explicitly as an action but I do recall saying I would get it done this week. wiki.openstack.org has a new cert now. Thank you fungi for installing it19:06
clarkbThe email spam about that should go away now19:06
frickleryay19:06
fungithanks for ordering it19:06
clarkb#topic Improving CD throughput19:07
clarkbWith the summit and everything else going on recently (jammy and afs and pip and so on) this has taken a backseat for me when I meant to work on some related items19:07
clarkbHopefully next week I'll have made progress19:08
clarkbDid anyone else have anything to mention on this topic?19:08
clarkbSounds like no. We can continue19:09
clarkb#topic Container Maintenance19:09
clarkbianw: did end up testing the mariadb upgrade as part of the gerrit 3.5 upgrade19:10
clarkbianw: I left comments on the etherpad but was curious if you tried the container image env var or if you decided not to use it for some reason?19:10
clarkbJust trying to figure out if running the upgrade command by hand is preferable to using the automation built into the container image for some reason19:10
ianwsorry haven't looped back to that yet.  i can check on the env var way of doing things19:11
clarkbeither way the process looked straightforward and while we aren't in a rush to do this due to mariadb support periods it is probably good hygiene19:11
clarkbianw: thanks19:11
clarkb#topic Support For Jammy Jellyfish19:13
clarkbfrickler: has made some more progress on spinning up the wheel mirrors for jammy19:13
clarkb#link https://review.opendev.org/c/openstack/project-config/+/842841 add jobs to publish to openafs19:13
clarkbI think that is the last remaining piece? frickler do we need to create a new openafs volume for that or is that done already?19:14
fricklerI think I did that, let me check19:14
clarkbI guess ti is two volumes. One for each architecture19:15
fricklerseems I created the volumes, but didn't mount them19:15
ianwdoesn't seem mounted @ http://mirror.iad.rax.opendev.org/wheel/19:15
clarkbmaybe we review and +2 the change then you can +A when it is ready on the afs side?19:17
fricklerwill need to check that later19:17
ianwi can push it through today if you like to get it going19:17
clarkbcool I'm sure we'll sort it out. Thank you for getting this together19:18
clarkb#topic Gerrit 3.5 upgrade planning19:19
clarkb#link http://lists.opendev.org/pipermail/service-announce/2022-May/000039.html Scheduled for 20:00 UTC June 19, 202219:19
clarkbthe announcement for the upgrade went out.19:19
clarkb#link https://etherpad.opendev.org/p/gerrit-upgrade-3.519:19
clarkbianw also has notes on that etherpad.19:19
clarkbI've reviewed them and tried to leave notes/questions/comments for stuff but this seems pretty straightforwatd19:19
clarkbthe main thing I think we're missing now is explicit config to enable collision checking. We should be able to land that on 3.4 and gerrt will ignore it until 3.519:20
clarkbianw: ^ is there a change for that yet?19:20
ianwno sorry, there's a couple of those todo items that need changes.  i will do those19:21
ianw#link https://etherpad.opendev.org/p/gerrit-upgrade-3.519:21
ianwis what we're talking aobut19:21
clarkbfeel free to ping me when they're up. Happy to revie wthat stuff and I tend to have just enough gerrit context paged in to do a decent job :)19:22
clarkbAnything else to bring up on the gerrit upgrade?19:23
clarkb#topic Move reqirements propose-updates job to py3.8 + 3.919:24
clarkbfrickler: want to fill us in on this one? I've reviewed at least one change but I think I'm lacking some context19:24
clarkbI believe what is going on is openstack is trying to update requirements handling for newer python19:26
clarkband there were issues in the first attempt because all the needed python interpreters weren't installed19:26
clarkbThat should be fixed now19:26
clarkbIn the process frickler discovered that enqueing periodic jobs manually is not straightfoward and was asking how to do that properly19:27
clarkbI want to say we have to use enqueue-ref of master for periodic jobs but periodic triggers have been weird. corvus is there anything else to know about that?19:27
corvusclarkb: i don't know off hand either way, sorry19:28
clarkbI'll give it a couple more minutes in case frickler is still around to add anything I missed19:29
ianwi have de ja vu on starting periodic jobs19:30
ianwhttp://lists.zuul-ci.org/pipermail/zuul-discuss/2019-May/000909.html19:30
clarkbThinking out loud the script to dump queues is probably a good one to cross reference19:30
ianwi don't think i ever really found the right thing 19:31
clarkbsince it will produce enqueue commands for the periodic pipelines19:31
fricklerthe difference seems to be a "commit 00000" added to the manual trigger19:31
fricklerand then jobs end up in error19:31
ianwyeah, i think, and from re-reading that message, you need to set newrev as the HEAD 19:32
clarkbsounds like this problem deserves more debugging. I wonder if we can use the zuul client testing or a similar setup to get testing going for it and then encode that in docs?19:32
clarkbthis might be a good topic to resurrect on the zuul mailing list if there continues to be confusion19:34
fricklerthe other thing I noticed: the docs talk about adding the --trigger option19:34
fricklerbut our zuul-client doesn't know that option19:34
clarkbthe old built in zuul enqueue commands did know trigger but I think it became unnecessary because each pipeline has a specific trigger already19:35
fungii think that must be cruft, i haven't needed to specify it. in theory zuul should just get it from the config19:35
fricklero.k., so just a docs update needed then19:35
clarkbright so zuul-client just dropped it as unnecessary where it was required with the old zuul command19:35
corvusyeah, trigger is dead.  the other stuff certainly warrants more debugging and should be testable within zuul itself.19:35
fricklerI wanted to test on my local zuul but didn't get to that yet19:36
ianwi'm having some more memories, based on http://lists.zuul-ci.org/pipermail/zuul-discuss/2019-May/000914.html19:37
ianwwhich was all rpc client at the time, so things might have changed a bit19:37
ianwbut there wasn't a way to create an event that looked *exactly* like the timer trigger remotely, as that didn't fill in the oldrev/newrev19:38
ianwbut iirc setting the newrev value to the current HEAD was enough to make it work19:39
clarkbmakes sense. And if we have time to add a etst for that in zuul somewhere that would be great19:39
ianwbut i never became convinced enough this was the real solution to update the docs19:40
fricklerthx for the pointer, I'll test that @home19:40
clarkb#topic Zuul changing default Ansible to v519:41
clarkbZuul supports ansible v5 now. Eventually v5 will become the default ansible version and then sometime after that old EOL ansible versions will stop being supported by zuul19:42
clarkbWhat that means is now is the time to start sorting out Ansiblev5 support so that we can hopefully uplift onto that version in opendev for our tenants.19:42
clarkbwe've found a number of issues doing that uplift so far. In particular the include task si no longer valid. You need to use include_* or import_*19:43
clarkbWe've also had issues where ansible running as zuul wants to become another unprivileged user and this fails because ansible can't setfacl or chown/chmod the temporary files it copies to do work on the remote19:43
clarkbit is looking like ensuring that setfacl is installed may be the fix for this19:43
fungiinstalled on the job nodes19:44
corvusre setfacl: oy ... that's such a weird thing to have changed....19:44
corvusi've been working through some stuff for the zuul tenant, mostly around container jobs at this point.  not quite done yet.19:45
clarkbya reading the code diff it isn't clear to me how old ansible worked and new ansible fails if setfacl isn't installed19:45
corvusit looks like we'll be dropping support for focal in the ensure-podman role, because there is no longer any place (that i know of) to get podman for focal19:45
clarkbbut installing it seems to make new ansible happy so that is probably a reasonable thing to us to add to our base images19:46
corvuswe previously relied on kubic for that, but it's now gone...19:46
corvusthat may be something to keep in mind in the future potential uses of kubic...19:46
ianwyeah, i just saw that they have restored that19:47
ianwhttps://github.com/containers/podman/issues/1433619:47
clarkbthey note that it has known security bugs though19:47
clarkbnot sure we should set users up with that by default19:47
ianwbut an old version.  it's definitely a better idea to switch to jammy for that19:47
corvusoh neat19:47
clarkb(side note, this is why tools like docker end up ebing so popular. Its the solaris problem all over again... access and simplicity)19:47
ianwi guess once we get the wheels sorted we will have a fully operational deathstar^w jammy environment19:47
corvuswell, the current version of my change is just to throw an error if you use ensure-podman on focal... that may still be the best approach....19:47
ianwprobably not a bad idea.  i'm not sure there's a lot of users19:48
clarkbsystem-config-run jobs are happy with new ansible19:49
clarkbI'm wondering if we should start forcing ansible v5 on in places like that and slowly just keep adding to the list. Or is that problematic because if v6 happens we have to go and update all those explicit entries?19:49
corvusi really like keeping it out of job defs because of ^19:50
fungiyeah, that just seems like a way of inflicting pain on our future selves19:51
clarkbI guess the ideal then is to test what we can, set a date to update to v5 by default (we can do that in opendev by tenent before zuul changes) and then anything that breaks after gets fixed19:51
corvusi think in the past, we've gotten like 90% confidence with targeted testing, then switch the default and folks can pin to the old version if something comes up...19:51
clarkbya ok. Maybe end of june is a good target for that?19:51
corvusyeah, pin or fix depending on severity :)19:51
clarkb(thinking we want it to be long enough for people to give testing a go before we switch and with summit going on that means some time, but also early enough we don't majorly impact say the openstack release cycle)19:52
corvussounds good.  i don't know what zuul's schedule will end up being, but probably not before end of june.  but it doesn't really matter because opendev can set the tenant defaults regardless of what zuul does -- just as long as we communicate and know when it's happening19:52
clarkbyup19:52
clarkband for setfacl I think adding that to our base images is reasonable. Its a system utility that helps bootstrap ansible.19:53
corvusagree19:53
clarkbrather than force everyone to go and install the package themselves if they are using bcome19:53
corvus(also, facls are awesome and should just be installed everywhere anyway)19:54
fricklerfyi devstack passed with acl installed https://zuul.opendev.org/t/openstack/build/113f568178fc4012b7cb862713a4813019:54
clarkbyup I suspect if we update our base images this problem largely goes away19:55
clarkbprobably infra-package-needs is the thing to modify19:55
clarkbI can try to collect all the different setfacl package names later today and push a change for that19:56
fricklerexcept maybe for some weird distro ... but we'll see19:56
clarkb#topic Open Discussion19:56
clarkbWe are almost out of time so want to give any other business a quick opportunity19:57
corvus[i plan to be at the summit]19:57
clarkbcool. It will be an interesting experience to do a conference again.19:58
corvusinteresting is what i'm worried about! :)19:59
* fungi hopes it remains safely boring19:59
clarkband we are at time20:00
clarkbthanks everyone20:00
fungithanks clarkb!20:00
clarkb#endmeeting20:00
opendevmeetMeeting ended Tue May 24 20:00:22 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)20:00
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2022/infra.2022-05-24-19.01.html20:00
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2022/infra.2022-05-24-19.01.txt20:00
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2022/infra.2022-05-24-19.01.log.html20:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!