13:00:14 #startmeeting kolla 13:00:14 Meeting started Wed Jun 19 13:00:14 2024 UTC and is due to finish in 60 minutes. The chair is mnasiadka. Information about MeetBot at http://wiki.debian.org/MeetBot. 13:00:14 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 13:00:14 The meeting name has been set to 'kolla' 13:00:17 #topic rollcall 13:00:18 o/ 13:00:24 o/ 13:00:25 o/ 13:00:33 o/ 13:00:35 o/ 13:03:08 Michal Nasiadka proposed openstack/kayobe master: seed-vm/infra-vms: Add support for EFI and q35 https://review.opendev.org/c/openstack/kayobe/+/921660 13:03:23 #topic agenda 13:03:23 * CI status 13:03:24 * Release tasks 13:03:24 * Regular stable releases (first meeting in a month) 13:03:24 * Current cycle planning 13:03:25 * Additional agenda (from whiteboard) 13:03:25 * Open discussion 13:03:27 #topic CI status 13:03:36 Kolla/Kolla-Ansible seems green 13:03:43 Kayobe SLURP jobs are still failing 13:03:49 anything I missed? 13:04:34 kolla build failure rate seems up, at least according to https://grafana.opendev.org/d/c0d59dad13/kolla-failure-rate?orgId=1 ? 13:04:56 kolla-ansible is broken in grafana (I guess for some time again, I should check more often) 13:06:02 let me check kolla periodics 13:06:43 last failures from 15th of June in Kolla 13:07:04 mhm, maybe just grafana acting up 13:07:08 anyway, let's continue 13:07:13 #topic Release tasks 13:08:46 R-17 has passed, according to Release Management schedule - we should switch to master branches from 2024.1 13:09:03 Something like https://review.opendev.org/c/openstack/kayobe/+/791764 13:09:13 Any volunteers to do this? 13:10:01 I can 13:10:29 ty :) 13:10:30 ok the 13:10:32 n 13:10:34 let's move on 13:10:42 \o 13:10:42 #topic Current cycle planning 13:11:16 I started some work on the Ubuntu Noble LTS support and Ansible bump 13:11:34 Ubuntu nodes do OOM on new Ansible, and some other fantastic things - so having fun 13:12:10 ah we just talked about this internally today (noble). notice afaik noble doesn't support upgrades from 22.04 until 24.04.1 release 13:12:11 Does anybody have any other high priority features that need discussing? 13:12:44 I'm not interested in in-place upgrades now, we just need to get that working and backport this to C 13:13:04 the oom issue should be fixed with 2.17.1, see the comments in your patch 13:13:26 as always ..me :) ..just find some time to review again my patches 13:14:13 Yeah, and hundreds of other patches :) 13:14:23 Let's go to additional agenda 13:14:30 #topic Additional agenda (from whiteboard) 13:14:36 yeah, but i am actively applying changes regarding comments :) 13:14:49 (SvenKieske: please review patch for broken keystone federation backend https://bugs.launchpad.net/kolla-ansible/+bug/2058656 13:14:49 trivial patch at https://review.opendev.org/c/openstack/kolla-ansible/+/913908 needs Cores <--still needs input 13:14:49 https://review.opendev.org/c/openstack/kolla-ansible/+/921381/comment/56b11f74_f98925e8/ (rmq startuptime high, reason found, please share your opinion) 13:15:36 yeah the first two are rather getting old, they really don't need any discussion in the meeting imho, just feedback on the patches itself. 13:16:04 the third one I don't know, I really would like to have some more opinions on it, but could also be discussed on the patch itself 13:16:12 My question is - do we have ANY CI job that at least configures keystone to do federation? 13:16:26 let me check.. 13:17:07 Maksim Malchuk proposed openstack/kolla master: Revert "[release] Use Caracal sources by default" https://review.opendev.org/c/openstack/kolla/+/922318 13:17:07 Because if we don't, we're going to see those issues every now and then 13:17:19 true 13:17:34 at least from quick grepping I don't find any tests 13:17:55 that being said, a bugfix is a bugfix. we have many bugfixes where you don't ask for tests, even if there are none 13:18:19 we should add some ... 13:18:33 and ftr I agree we should maybe focus more on tests, but I would appreciate it if this pressure would be applied equally, at least it doesn't feel to be applied equally, from my pov. 13:18:56 but ymmv :) 13:19:03 Mark Goddard proposed openstack/kolla-ansible master: Add support for customising Neutron physical network names https://review.opendev.org/c/openstack/kolla-ansible/+/922320 13:19:23 Well, I'm not going to set up an AIO with federation enabled to check if that fixes anything 13:20:50 commented on the patch 13:21:38 SvenKieske: what about that RMQ thing? 13:22:24 the path thing? 13:22:57 well the startup time is raised, because it is computed everytime rmq starts, so we should move it either to build time (in the dockerfile in kolla) or directly hardcode it 13:23:43 not sure though how often this changes, I just had a look at the otp repo where imho the ssl libs erlang sources reside, there is already version 11 and they do frequent releases, but we do seem to have version 9.4 somewhere.. 13:24:01 if that changes frequently we might not want to update this string manually in the dockerfile 13:24:15 well, that's erlang version? 13:24:33 ah, ssl lib in erlang 13:24:40 so I think the best compromise would be to let the erl autodiscovery run at build time, so it doesn't run every rmq start, but we don't need to patch for every ssl lib minor version bump? 13:25:35 from my pov that makes sense, but I'd really like to have some opinions of seasoned maintainers there, so we don't need to rewrite the patch in one or two weeks :) 13:25:55 I discussed this briefly with Matus (patch author) in private. 13:26:18 well, adding that to rabbitmq-env.conf - which we template out in kolla-ansible? 13:26:36 this really did affect the build pipeline which sometimes would just timeout 13:27:05 that's another option, but I'm not sure it fixes anything? doesn't this run on every start as well? or do you mean hardcoding it there? 13:27:17 according to RMQ docs - we need to add that to rabbitmq-env.conf 13:27:31 and if we template out the whole file in kolla-ansible 13:27:42 let me look. wait ok if we template it, it's only run once, seems also fine 13:27:43 then adding that in Dockerfile isn't going to get us anywhere 13:27:50 true 13:28:38 I'm really not an expert, this patchset is the first time I've even seen this option, just looked it up myself in upstream docs. 13:28:38 So unless that's a big burden - I would say it should be added in kolla-ansible 13:29:14 probably just spawning a sidecar container with rabbitmq to get that information out of it - and then use the output to be templated out to rabbitmq-env.conf - only when we enable rabbitmq tls 13:29:17 it's fine either way :) it just doesn't make sense to compute it on every startup and make startup slower and this leads to CI pipeline timeouts in some tests :) 13:29:43 Anyway - any volunteer to do this? 13:30:10 I think Matus can implement it in the same patchset if he has some guidance, should be possible with kolla_toolbox, no? 13:30:51 or should this go in a separate patch? afaik the ssl stuff is only needed for this patch? so can be developed together? 13:30:56 Don't we want to get that backported? 13:31:06 We have RabbitMQ TLS working today, don't we? 13:31:23 this is only inter node tls, so the inter cluster traffic 13:31:25 or is that only for internode TLS? 13:31:28 ok 13:31:33 then incorporate that in the current patch 13:31:35 I don't know if it affects other TLS settings actually 13:31:44 well, maybe it's good to check ;-) 13:31:51 yeah, will do 13:31:59 but RMQ docs rather mention inter-node 13:32:12 anyway 13:32:13 let's move on 13:32:23 (r-krcek): please review https://review.opendev.org/c/openstack/kolla-ansible/+/914997 13:33:17 Maksim Malchuk proposed openstack/kayobe master: Revert "[release] Use OpenStack 2024.1 release" https://review.opendev.org/c/openstack/kayobe/+/922322 13:34:06 oh crap, love that kind of changes 13:35:12 Will have a look 13:35:26 #topic Open discussion 13:35:29 Let's move on 13:35:32 Actually I have a topic 13:37:13 Since we only have two core-reviewers that are really active outside of StackHPC - I'd like to propose that if a change has one +2 from SHPC - and a second review has not landed for 2 weeks (after setting RP+1) - I'd like to merge such patches with single company approval. 13:37:38 frickler, kevko: What do you think about that? 13:38:58 please review https://review.opendev.org/c/openstack/kayobe/+/921627 as discussed I've made the followup 13:39:40 i am not sure if i understand 13:39:51 mnasiadka: I guess that's fine for me, even 1 week at most should be fine 13:39:53 and new for Kayobe: https://review.opendev.org/c/openstack/kayobe/+/921628 13:40:07 summer lack of cores? 13:40:14 again? 13:40:23 kevko: currently we have a rule - if the author is from SHPC - we can't merge it with two core reviewers from SHPC 13:40:43 We have a lot of changes, that are stalled due to missing core review from outside of SHPC 13:40:52 mnasiadka lets add cores 13:40:55 Those have usually RP+1 - and it does not help 13:40:58 mmalchuk: shh 13:41:02 well the role is for any company, not specific to SHPC, just to avoid confusion 13:41:07 yes 13:41:38 frickler not only SHPC use Kolla projects 13:41:40 i have the same problem ... what about trade +2 so you will give me +2 for my review and I will find a time for your review :D 13:41:43 so - in order to not let changes to rot - I'm proposing to merge patches with single company approval 2 weeks after the first +2 - if a second +2 from a separate company does not come in 13:42:06 kevko: show me your patches that already have one +2 :) 13:43:03 I was not asked, but maybe require at least one +1 from another company? that should be a pretty low bar, so maybe not worth spelling it out, though. ¯\_(ツ)_/¯ 13:43:40 So, there's another thing that irritates me most - if I can 13:43:52 Adding +1 to a patch after it gets merged - that's to you SvenKieske and mmalchuk ;-) 13:43:59 mnasiadka: well this is another problem ..how can I get some reviewer if you guys in SHPC just giving +2 to each other ... 13:44:12 but no problem ..i am ok with it btw 13:44:42 faster merging ..is always better ..it's still master ..it can be reverted ..it can be fixed by another patch if needed 13:45:26 kevko: sure, everybody has the same problem - it's even hard to find two SHPC cores having time to review a patch ;-) 13:45:48 mnasiadka: well sometimes I review a patch, give +1, then there is last minute change like typo in commit message and immediately +2 +1 workflow and merged and my review is "gone". I think it's at least a decent signal to re-review the minor change then and say "I also did look at it" 13:45:52 But I'll try to have a look in your patches next week - last two weeks have been extremely hectic on my side 13:45:58 if that's annoying I can leave it be I guess 13:46:19 I think it's useful if there's a comment "you missed that", or "please follow up with a fix to this" 13:46:20 Or we should create some queue or something ... 13:46:25 but just +1 is a bit irritating :) 13:46:33 for example here is reaaaallly trivial and important CI fix 13:46:35 https://review.opendev.org/c/openstack/kolla-ansible/+/919939 13:46:39 kevko: we have a queue, just add RP+1 :) 13:47:11 Michal Nasiadka proposed openstack/kolla-ansible master: CI: Use 2024.1 as previous_release https://review.opendev.org/c/openstack/kolla-ansible/+/919939 13:47:53 maybe a good reminder that not only mnasiadka can RP+1; not sure if everybody knows, but mere mortals (non cores) can't RP+1 (currently) 13:48:07 because ...without this patch for example we are testing totally something different as we should ... because now it's tested from 2023.2 to -> current uploaded patch 13:48:15 yup 13:48:19 I even updated that patch :) 13:48:25 mnasiadka: I know 13:48:27 frickler: if you can have a look in ^^ 13:48:41 kevko: as I said, I've been extremely busy - it should be better next week ;) 13:48:42 this would also result in less clicks for backport voting btw, not sure how many reviews it needs, it has gotten quite some attention already: https://review.opendev.org/c/openstack/project-config/+/920472 13:49:03 SvenKieske: that is project-config, I have no power over that :) 13:49:10 SvenKieske: well, I also added some RP +1 ...but somebody just deleted it :D 13:49:12 frickler? ;) 13:49:25 kevko: lol ok, didn't notice, too bad :) 13:49:27 hmm, "branches: none" is a weird idea, I have to think about that 13:49:56 other option is to remove the config at all 13:50:04 and revert in E 13:50:13 but we would need to add some docs to the release management guide 13:50:39 Feel free to comment with a better solution 13:50:50 we could have a CI var "master_is_slurp" 13:50:53 maybe that's better 13:51:11 if Zuul can honor that 13:51:20 that sounds decent, if it works. 13:51:44 the thing is, the job might still run if it is defined somewhere else, like from stable/2024.1 13:51:46 I really would like a handy list which releases are slurp, didn't sink into my brain just yet 13:52:08 SvenKieske: https://releases.openstack.org/ :D ? 13:52:10 *.1 is slurp, *.2 isn't. slurp == odd 13:52:36 frickler: basically I would be happier if we run slurp whole cycle 13:52:39 frickler: ah that's a good shorthand! 13:53:01 and I didn't know it's mentioned on the release page as well, thx! 13:53:48 as in when it's master branch, not just when we do rc1 and find out slurp does not work 13:55:07 but for current master, from where would you upgrade? from 2024.1 it is just normal upgrade jobs, don't need to duplicate those 13:55:22 from 2023.1 is too old, we don't want to support that 13:55:37 from 2023.2 ... weird, might not be supported by services 13:55:44 well, when master is 2025.1 13:55:54 we should be doing slurp from 2024.1 to master (to be 2025.1) 13:55:55 right? 13:56:01 yes 13:56:16 so re-enable as soon as 2024.2 is branched 13:56:39 yeah, was thinking if we can have jobs running when some variable is true 13:56:45 so we would only change that variable 13:57:17 but still setting up slurp per new release is going to take some work 13:57:24 I can check with zuul ppl whether that would be possible 13:57:25 so maybe we just remove jobs from project.yaml 13:57:36 and add them back when master is slurp 13:57:43 and just document that in our Release Management docs 13:57:46 I'm fine with any :) 13:58:00 yes, that would be the usual approach that openstack-zuul-jobs is taking 13:58:10 ok, so let's do that 13:58:21 could even add them all in one project-template and just add/remove that one maybe 13:58:40 Michal Nasiadka proposed openstack/kolla-ansible master: CI: Use 2024.1 as previous_release https://review.opendev.org/c/openstack/kolla-ansible/+/919939 13:58:52 that also makes sense 13:59:07 I'll have a go at writing the docs and trying out project template 13:59:24 thanks 13:59:31 ok, we're out of time :) 13:59:38 Thank you all for coming - see you next week :) 13:59:40 #endmeeting