Wednesday, 2025-09-24

*** bauzas6 is now known as bauzas00:43
opendevreviewMerged openstack/governance master: Add TC/PTL results from 2026.1 election  https://review.opendev.org/c/openstack/governance/+/96159800:46
gouthamrtonyb: was waiting on everyone at this point - it currently looks like we don't have a good overlapping time.. the best we could do is: 1600 UTC - 1700 UTC on Wednesdays or, between 1600 UTC and 1800 UTC on Thursdays05:36
gouthamrI don't know if folks considered the DST changes in Oct/Nov.. US/Canada will observe a change on Nov 2, EU at the end of Oct and Australia at the beginning of Oct... 05:36
gouthamrits still god awful early for you in any case... so, i think lets float the idea of alternate weeks, or have a monthly APAC-friendly meeting slot05:39
tonybThat's rough.  Realistically (post DST) 1800UTC is the earliest I can be up.   So yeah I think alternate weeks would be good05:39
tonybI don't recall and options that were late (very small UTC times) for me in the poll05:40
fricklerso if I count correctly, we have 5 x EU, 3 x US and 1 x APAC based TC members. IMO any schedule we come up with should take these weights into consideration. tonyb: what would be your latest reasonable meeting time? like something between 06 and 12 UTC-ish I'd assume?05:51
tonybFor a once a week meeting can more or less do anything other than UTC 1300-185905:54
tonybso yeah for a one hour meeting starting by UTC 1200 will mean we usually finish by UTC130005:54
tonybIf it's going to be a late one I'd appreciate avoiding UTC Tue/Wed as I already have a Tue UTC1900 meeting (opendev)  and late nights+early starts don't mix05:56
fricklerwell I assume anything between 08-13 UTC will be equally bad for US folks, so we likely could choose something in the early range from that, which might then even be feasible for some stay-up-late west-coaster ;) likely we should do another poll for that range06:05
tonybthat works for me07:41
*** jbernard_ is now known as jbernard14:05
*** diablo_rojo_phone_ is now known as diablo_rojo_phone14:19
*** diablo_rojo_phone is now known as Guest2727514:20
*** mnasiadka_ is now known as mnasiadka14:26
*** johnsom_ is now known as johnsom14:26
*** TheJulia_ is now known as TheJulia14:26
*** prometheanfire is now known as Guest2731414:28
*** johnsom_ is now known as johnsom14:39
*** mnaser_ is now known as mnaser14:39
*** tmazur_ is now known as tmazur14:39
*** cardoe_ is now known as cardoe15:32
opendevreviewStephen Finucane proposed openstack/governance master: Retire shade  https://review.opendev.org/c/openstack/governance/+/96219715:56
*** gthiemon1e is now known as gthiemonge16:01
spotz[m]clarkb: fungi - I have an ex-OpenStacker current CentOS Stream person coming to Summit so we can hopefully figure out the issues you're finding and make a plan. Not just open to you both but wasn't sure who all to tag17:29
clarkbspotz[m]: I'm not aware of any major issues at the moment I think they were all largely worked through. We haev noticed the mirror grows perpetually as centos stream doesn't seem to clear out old packages and some packages are quite large (liek those for firefox or java)17:31
clarkbbut tonyb has a tool to verify repo state that he thinks can be used to prune the extra packages17:31
fungiafaik the only significant issue we have with it is that we can't boot it on a significant percentage of our systems due to the x64v3 requirement, and yeah that their mirrors just grow boundlessly and are never pruned of old package versions17:31
clarkbI think the discussion before was more frustration that adding a new debuntu release is trivial but adding centos/rhel/fedora/etc requires massive work each release just to get booting17:32
clarkblike we literally just change the release name and build a new image for debuntu most releases.17:33
fungithat's "we as in opendev" though, "we as in openstack" have some additional challenges like centos stream not accurately reflecting the *current* state of rhel but rather a speculative future state where bugs get tested on the users and not reverted with any urgency17:33
fungi(like that one time ping was broken for... a month?)17:34
clarkbya I guess the most recent issue along those ^ lines was centos stream broke their cloud images which some openstack projects (like ironic and dib) rely on for their own testing17:35
clarkband there was an issue for it that calimed to fix it but it was broken for about 2.5 - 3 weeks (the assertion it was fixed was wrong)17:35
clarkbthe issue was short writes of the images on the storage side so the file was incomplete/corrupt and the hash files were empty17:36
fungimore generally, openstack wants to test on lts server distributions, while centos stream is more of a test-type distribution with different stability expectations17:36
fungiwe were trying to test on centos as an analogue for rhel, because licensing challenges prevented testing on actual rhel, but once traditional centos was abandoned we ended up testing on stream which really isn't the same thing17:37
clarkbya what would probably be useful is for openstack to decide what sort of testing is useful and then if there are conflicts with stream in making that happen we can take that discussion to the summit17:40
clarkbopendev will struggle through adding platforms if there is a desire to add it (and if there are volunteers to add thigns)17:41
spotz[m]Well I thought we had gotten close to resolving the issue of the licenses, but Stream is the next point release of RHEL so yeah there could be some issues but hopefully there shouldn't be. But I think getting everyone together to talk would benefit both projects so we can figure out a path forward to reduce any issues. For me it kind of sucks walking into bashing when I'm involved in both projects and nothing has been raised to the other17:45
spotz[m]project. Will it get something fixed faster but if they don't know they can't do anything17:45
spotz[m]* fixed faster maybe not but if17:46
fungii don't really see it as a bashing, just a mismatch. centos stream isn't designed to meet these particular needs, and i don't think it's intended to17:46
clarkbI think there is also a communication flaw in that half of the things we run into are private and we're not allowed to see issues for17:47
clarkband I believe in some cases even when we file the issues from the public side those images are not visible to others on the public side17:47
fungiwell, yes. not using secret red hat bug trackers would be a huge improvement in transparency17:47
clarkbI want to say this happened recently with an issue that mnasiadka filed17:47
clarkbso yes there aer communication issues, but I feel like the problem is not on our side. The tools we have available to us are hostile to its users17:48
spotz[m]We are working on that issue, some policies auto convert issues. We're trying to get the main one to stay open but open another if needed17:48
clarkb*those issues. Heh too much discussion about images17:48
fungibut i don't want to imply that if centos stream was more transparent with bugs it would suddenly fit the lts server use case17:48
fungii'm happy to tell them that bringing back traditional post-rhel centos would fit what openstack needs, though i doubt that feedback is especially helpful17:50
mnasiadkaI don’t think Stream is ever going to fit the LTS use case, and even in Kolla-land we often find centos stream jobs fail because some new packages introduced breakages, while Rocky 10 remains stable17:50
mnasiadkaAnd those breakages never propagate to RHEL/Rocky next minor release17:51
clarkbI personally think there is value in testing against forward looking distros. I really wish the tumbleweed stuff had been useable enough. The problem with these things is that if you don't have a group dedicated to debugging issues that do arise then its just noise/problems17:51
clarkband unfortunately, I don't think openstack has the bandwidth right now for that more forward looking stuff whcih is part of the issue here17:51
mnasiadkaKolla runs Stream jobs just to see if something in next RHEL minor release is going to break us, but it’s rather not proven to be useful17:52
clarkbbut that is why I think openstack should probably try to define what it cares about from a testing standpoint. If oepnstack sees value there and wants to try and put resources behind it then we should do our best to make it happen. But if openstack doesn't want to then we shouldn't17:52
fungiand it seemed like the abandonment of traditional centos was a political/product decision that red hat didn't want a "free" equivalent of rhel that competed with their license revenue, which i respect after all it's their money to spend and their business model to manage, but as such i strongly doubt a post-rhel centos will ever come back17:53
mnasiadkaSo if OpenStack as a whole wants to invest cycles to test also on EL (in addition to Debuntu) - I think Rocky/Alma is the way to go, unless we can use RHEL (which we can’t)17:53
mnasiadka(And I don’t think there’s a lot of people using RHEL and not using RHOSP - but I might be mistaken)17:54
fungias for the "rhel licensing being worked out" as of a couple years ago red hat said they were going to assign some contributors to work through the logistics and implementation needed to handle the license management, as well as a plan for how devs and users could locally reproduce runs without needing to jump through licensing hoops of their own, but that never actually17:55
fungimaterialized17:55
TheJuliaThere is a definite contingent who just accepts the movement drift and rolls with it with Centos, but its all going to boil down to what folks are comfortable with in the end17:56
clarkbbut yes I agree with fungi. I think a huge part of the problem here is a mismatch in expectations. CentOS communication was that stream should work for you if you were using centos before. Our experience is that isn't the case for openstack testing/gating17:56
clarkband the reasons it doesn't work include problems with communication tools breaking down the ability to report and track issues taht are discovered, udpate flows to packages not making sense to outsiders (the ping issue for example), and generally just unexpected updates (I wonder if there is a schedule we could see for general things like networkmanager, libvirt/qemu, etc)17:57
fungibut also fundamental design choices, "just try harder and have fewer bugs" isn't going to solve that, and i don't want to give the impression it would17:59
spotz[m]Let's get everyone together and see what we can do and what we need to bring back to see what we can do on the other end18:00
fungitraditional centos, while problematic, was closer to what openstack needed and still needs. when that disappeared, other distros like rocky and alma showed up to fill that same space and i agree they make for a more sensible choice in this specific use case. that's not meant as a slight against centos stream18:00
TheJuliaI feel like the discussion then goes in circles18:03
fungibut yeah, if there's going to be a useful discussion it should really be more about openstack's challenges rather than opendev's challenges. solving the needs of openstack is the hard part, opendev will go along with whatever openstack wants in the end18:03
clarkbthe other concern that came up recently was that rdo is going away maybe? so we may need new packages for a few dependencies?18:05
clarkbI want to say it was rabbit, ovs, and smoething else.18:05
clarkbmaybe it was the erlang runtime for rabbitmq18:05
fungieven if that discussion is really just explaining politely to the centos folks that openstack has a use case that stream just fundamentally isn't suited for18:05
TheJuliaI think a huge part of a problem is there is a tendency to let perceptions be the leading driver. OpenStack as a whole is always going to be a complex matrix and overall things need to be data driven.18:06
clarkbTheJulia: conversely we could say that we need data on what value centos stream would provide over the alternatives (I do think there is avlue as I mentioned before with forward looking testing, but it requires dedicated effort to work with and we don't have that today)18:07
fungiyes, and openstack's testing is great at finding bugs in platforms, but if we test openstack on centos stream we're essentially doing upstream testing for rhel to keep bugs out of rhel at the expense of our own limited contributor and systems bandwidth18:07
fungithis is why we also don't test upstream on fedora, ubuntu non-lts versions, debian's testing release, et cetera18:08
fungi(or tumbleweed as clarkb mentioned)18:09
TheJuliaSo, going back to perceptions, the perception at hand is just testing and finding bugs. I'd argue the same exists with every distro on some level. Nothing is perfect. We pay the cost regardless, we just save it up and have to deal with it for major version changes or we pay it as we go.18:12
clarkbright, a big example of that is the eventlet not working with modern python problem that kicked off the whole maybe we should use eventlet less process18:14
TheJuliafrankly, as an overloaded engineer, I prefer smaller hits as I go. Major version changes is pretty much a huge *surprise* moment where I potentially loose weeks at a time.18:14
clarkbthis is why I think an idea testing setup probably does have sanity checks against forward looking systems, but relies on things that are expected to be stable for gating18:14
clarkbbecause our system essentialyl requires gating to pass this is the default for most (probably all) of the opesntack projects and makes it easy to not worry about the forward looking stuff if you can't anymore18:15
clarkbbut imo that means you probably don't want to gate on centos stream. You need to test it separately. but if people only care about the gates (basically the situation we are in now) how do you incentivize the other thing? I don't know18:15
clarkbwe've failed at it multiple times18:15
clarkbnon ubuntu lts releases, opensuse tumbleweed, fedora, and centos stream have all run into this problem18:16
TheJulia... even ubuntu releases have18:17
TheJuliaeven LTS18:17
clarkbwell no we gate on ubuntu LTS (and previousl centos before stream) because it is stable enough for that purpose18:17
mnasiadkaclarkb: Kolla has ditched RDO for rpm distros and replaced that with EPEL and some CentOS SIG repos (Ceph, OVS/OVN) plus use RMQ and MariaDB directly from their respective upstream repos - I don’t see a lot of problems in doing the same in devstack18:17
clarkbI'm saying for the forward looking stuff we've failed in basically every attempt18:17
TheJuliaexample, they incremented the dnsmasq version in response in response to a CVE. That too had fallout and means the gates can't be perfect. Besides, are we focused on perfection or the range of "good to great!"18:18
clarkbbecause people run out of steam and things break enough that if you are not consitently on top of it then you just get 100% failure over time18:18
clarkbTheJulia: I think its about balance18:18
clarkbUbuntu LTS and CentOS pre stream broke infrequently18:18
clarkbat a rate low enough that people were able to address it because 1) rate is low and 2) we basically have no other choice this is the baseline18:19
clarkbwith the more regularly updating distro releases things break far more often and we have just not had the ability to keep up and also do the other parts of our jobs18:19
TheJuliaI think the last centos stream break was the base images the pushed to the mirrors? That impacted tons of communities18:19
clarkbTheJulia: yes the corrupted cloud images are the last breakage I am aware of. Prior to that was the issue mnasiadka filed that I can't even remember what that was now (but also never hda access to the issue because ti was auto privated)18:20
TheJuliaclarkb: but then we all had to pay a huge cost whenever we went to the next version18:20
clarkbTheJulia: that hasn't been the case with debuntu in my experience18:20
clarkbit was an issue with centos18:20
clarkbsince the systemd switch things have been very stable in debuntu with things like network configuration and daemon management18:21
clarkbI think the upstart -> systemd move was the last big lift with debuntu18:21
fungii expect because rhel/centos type distros traditionally haven't suported in-place upgrade workflows between major versions, there's a different approach to backward-compatibility between releases18:21
TheJuliaOh yeah, centos 7->8, 8->9-stream was painful. But then the other question is how much technical debt is a distro maintaining?18:21
clarkbI remember the issue mnasiadka filed was about the centos images not labeling their partitions correctly18:23
clarkbI believe that they probably still aren't labeled correctly but we gave up and just work around it now18:23
mnasiadkahttps://issues.redhat.com/browse/RHEL-9389118:23
TheJuliamnasiadka: thanks, I was about to ask for that18:23
mnasiadkaFound it ;)18:23
clarkbthat issue is still not public18:23
clarkbso ya I was neverable to even engage with it18:23
TheJuliawow, that stinks18:24
TheJuliayeah, restricted to RH Engineering18:24
mnasiadkaclarkb: and still unresolved and at least on my end looks like nothing happened18:24
TheJuliaI suspect its not correctly labled/filed for them to pick it up18:25
TheJuliaFWIW18:25
clarkband I can understand that changing that might be too risky vs the impact, but if I'm not even allowed to be told that via a properly filed issue with the upstream well now you can probably understand why some of us don't like dealing with this18:25
mnasiadkaAnyway, I’m not that interested in Stream, I did the DIB/Glean stuff to get Rocky 10 in OpenDev18:25
clarkband to be clear ubuntu typically doesn't fix a lot of issues like that either. But you usually get a response telling you that you have to wait for the next release or it wont be fixed at all and you can have that conversation18:26
TheJuliamnasiadka: are you able to view/edit the content on that item you linked?18:29
mnasiadkaTheJulia: yes, no movement in that ticket since 12th June, assigned to pm-rhel18:31
mnasiadkaNo comments, no nothing really18:31
TheJuliaYour going to need to point them to the fact that this is the images they are publishing, that they don't conform to BLS spec. They may be okay with that18:31
TheJuliafurthermore, I think you'll get more traction with the component set to centos-stream-release18:32
TheJuliayou may be able to change Security Level, if not I can change it, although the bot changed it outright. No idea why.18:33
spotz[m]There's a lot of bot stuff going on. If you give Julia and I the ticket we can poke folks18:36
TheJuliaRHEL-9389118:36
clarkbprior to that issue I think the main issues were the x86-64-v3 requirement and the removal of backward compatible network config from NetworkManager (so network config has to be native NetworkManager config)18:37
TheJuliaYeah, a lot of stuff like that is because we can't actually buy any hardware which is not v318:37
clarkbsupport for NetworkManager direct configuration was added to glean and we were lucky that all of our nested virt capable providers are also x86-64-v3 capable instances so we just run centos 10 stream on the nested virt labels18:37
TheJuliaSimilarly, we can't buy native legacy bios booting hardware anymore18:38
clarkbhuh I still own plenty :)18:38
TheJuliaso why keep bios boot around when it is also a huge PITA18:38
clarkbgranted most of them are machiens in my closet collecting dust but my fileserver is v2 and not v318:38
TheJuliaAnd you restrict the memory range for PCI interactions18:38
clarkbone reason to keep bios around: no expiring keys that will break your boots on september 11, 202518:39
clarkb(though disabling secure boot has the same effect)18:39
TheJulia... I do love how they shipped updates and people just didn't want to upgrade to a version with the key updates18:39
TheJuliaso similar effect in the end18:40
clarkbI suspect the v2 vs v3 issue is bigger than bios vs uefi. My fileserver is uefi but not v318:41
clarkbhowever when it comes to the issues opendev ran into I suspect they are equiavlent18:41
TheJuliaYeah, bios gear has basically disappeared18:42
TheJulia... although some vendors have special UEFI firmware as well that... yeah18:42
TheJuliajust... ugh18:42
clarkbtumbleweed has mixin packages for v3 support. I'm not sure if opensuse leap does this too or if tumbleweed is the beta test. But I erally like the appraoch continues to support old hardware while speeding up processes on hardware that can take advantage of it18:45
clarkb`zypper search -- -v3` says libboost, libz, python, openssl, some media stuff, and more all have v3 mix in packages18:46
clarkba downside to this approach is you can't do this for the kernel. You'd need to build and ship completely separate kernels I think18:48
clarkbthen have grub updates select the correct kernel based on the current hardware. But if you install the old hardware kernel too then you can potentially move root disks around and override the kernel via grub at boot18:48
TheJuliaAt the same time, I can't in good conscious have a customer rolling out a huge bunch of infra on 12+ year old gear. There is a balance to be had.18:52
clarkbTheJulia: I think the real range is like ~4 years18:54
clarkb(atom didn't get v3 until 2021 and there are many valid reasons to run atom devices in professional settings like embedded devices, factories, etc)18:54
TheJuliaV4 is more like in the last 4 years, fwiw. Started with Intel Skylake and AMD Zen4s. V3 is Haswell/Excavator (okay, 10 years for Excavators)18:56
clarkbatom does not have v4 yet18:56
clarkbthe first atom release with v3 according to wikipedia was gracemont released november 2021. No releases yet with v418:57
TheJuliawow, 3 years for atom18:57
TheJuliathat is... a sign to migrate away from Atom microarch18:57
clarkbI would disagree. Atom fits a particular use case but one that is very useful if you need it18:57
clarkblow power soc's for hardware dedicated to accomplishing some task (which might be in cars, factory machinery, etc)18:58
TheJuliaAt this point, I'd still ask why not Arm :)18:59
TheJuliaanyway, its academic at that point18:59
TheJuliamnasiadka: following up, its now assigned to a human and a team. I've also removed the security. We'll try to get some clarity on fixing process/bots since the original component set was apparently wrong and a bot made a hidden comment but that doesn't help someone who doesn't have elevated rights. We'll try to get it sorted out.19:15
mnasiadkaTheJulia: thanks, we did a workaround in DIB, so that’s not that important - but if all tickets raised by people like me get to purgatory, then it’s not fun :-)19:22
spotz[m]We're working on that issue19:53
TheJuliamnasiadka: more information requested on the bug.20:05
fungii have a bunch of new machines that give me the option to switch from uefi to bios boot, so i don't agree that it's entirely gone away (though i do agree that there's basically nothing new being sold that is incapable of uefi, at least for x86-based systems)21:41
clarkbdoesn't look liek you can previwe the formating of comments in the red hat bug tracker? In any case I posted a response there to add more details23:36
*** iurygregory_ is now known as iurygregory23:41

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!