15:00:00 #startmeeting kolla
15:00:00 Meeting started Wed Mar 9 15:00:00 2022 UTC and is due to finish in 60 minutes. The chair is mnasiadka. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:00 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
15:00:00 The meeting name has been set to 'kolla'
15:00:04 #topic rollcall
15:00:12 o/
15:00:52 #help
15:01:14 \o
15:01:18 o/
15:01:20 \o/
15:01:38 o/
15:01:49 o/
15:01:58 o/
15:03:04 #topic agenda
15:03:04 * Review action items from the last meeting
15:03:04 * CI status
15:03:04 * Release tasks
15:03:04 * Current cycle planning
15:03:05 * Additional agenda (from whiteboard)
15:03:05 * Open discussion
15:03:11 #topic Review action items from the last meeting
15:03:52 mnasiadka to triage security bugs and update them with resolution plan (if needed)
15:03:52 Enable osbpo in Debian APT sources, abandon extrepo command use then?
15:03:52 mnasiadka to update kolla review dashboard with kolla collection
15:04:07 I triaged security bugs, but mgoddard created a new one :)
15:04:11 Dashboard not updated.
15:04:18 the osbpo task was on hrw
15:04:24 does anybody know if he has done it?
15:04:40 I've seen something
15:04:58 abandoned
15:05:11 https://review.opendev.org/c/openstack/kolla/+/831548
15:05:16 shift of approach
15:05:44 so extrepo will have offline config, ok
15:05:50 so then it's still sort of in progress
15:06:00 yup
15:06:34 #action mnasiadka to triage security bugs and update them with resolution plan (if needed)
15:06:41 #action mnasiadka to update kolla review dashboard with kolla collection
15:06:52 #action hrw Enable osbpo in Debian APT sources, abandon extrepo command use then?
15:06:57 #topic CI Status
15:07:00 How's our CI yoctozepto ?
15:07:40 mnasiadka: happy
15:07:48 goodie
15:07:49 no idea about kayobe though
15:08:07 I've seen an occasional OOM on TLS job, but nothing very concerning.
15:08:16 #topic Release tasks
15:08:21 None this week.
15:08:29 #topic Current cycle planning
15:09:28 Most of the leftovers have been postponed to Z, but let's get anything we can in Y.
15:09:58 yoctozepto: some more changes from mgoddard on baremetal role split
15:10:17 mnasiadka: where?
15:10:38 whiteboard L428
15:12:02 oh my
15:12:52 does not seem look like a priority to me, to be honest, add me as a reviewer and I will get to it
15:12:57 ok
15:13:29 I think it's important for Kayobe to reach the last patch on the list to be merged
15:13:46 #topic Additional agenda (from whiteboard)
15:13:58 (yoctozepto) Having both PXE and iPXE working at the same time https://review.opendev.org/c/openstack/kolla-ansible/+/832159/
15:14:58 yes, I think that's something sensible to do
15:15:10 especially for people having existing pxe deployments
15:15:17 and wanting to progressively move to ipxe
15:15:39 the downside is we need to touch the volumes
15:15:54 as long as it works and doesn't break anything
15:15:58 yeah
15:16:11 the question is whether there is some state that we should preserve
15:16:19 in tftpboot and httpboot
15:16:30 to me it looks easy/not-expensive to regenerate
15:16:33 I'm not an Ironic expert
15:16:36 but I would like to know your opinions
15:16:38 yeah
15:16:45 sadly mgoddard is not around
15:16:52 so ideally we would need mgoddard to look into this
15:17:01 that was the point
15:17:33 ok then, I guess he'll try, given the notifications he already did get ;-)
15:18:32 mnasiadka: yeah ;-)
15:18:44 (yarovkon) Migration path for kolla-ansible from docker to podman (https://etherpad.opendev.org/p/Kolla-ansible_migration_plan_docker-podman)
15:18:50 that's next
15:19:27 Wanted to ask if that path looks fine for you
15:20:26 I think we need to automate those first two steps, you don't expect people to do it manually on let's say - 300 servers?
15:20:51 of course that it will be automated in ansible
15:21:06 we even have it already (not in gerrit yet)
15:21:29 ohorecny2: post to gerrit ..i'm curious
15:22:11 yes, post it to gerrit and let's review
15:22:18 but I doubt we will get to podman in Y
15:23:31 uninstalling docker cannot be the first step
15:23:34 as it will down containers
15:23:43 we should be able to migrate container by container
15:23:43 it doesn't matter i think
15:24:02 kevko: how so?
15:24:13 yoctozepto: installing both podman and docker on the same host (at least in CentOS world) is problematic
15:24:17 it was reply to mnasiadka that it will be not in Y
15:24:23 kevko: ack
15:24:25 yes, it doesn't matter we tested it with active VMs (instances) and they were still alive, during whole migration process
15:24:40 mnasiadka: hmm, that's sad - what's the issue?
15:24:53 yoctozepto: containerd mumbo jumbo
15:25:03 haha, kolla will be tested directly from kolla itself :D ...so we will see how HA works :D :D :D
15:25:27 here is some writeup, it's a bit problematic: https://faun.pub/how-to-install-simultaneously-docker-and-podman-on-rhel-8-centos-8-cb67412f321e
15:25:32 ohorecny2: well, vms yes but not all other processes ;-)
15:25:38 mnasiadka: ack, sad
15:25:44 kevko: where?
15:25:47 well, problem will be with rabbitmq, mysql and haproxy
15:26:02 and definitively has to be run with limit to one host
15:26:27 yes, but small outage during migration process is ok, I think
15:27:00 as long as everything works afterwards, with rabbitmq it's not granted ;-)
15:27:10 i'm just trying to say that on the end of process there should be tasks which will check everything is working well
15:27:36 and when i am thinking about it ..same tests should run before these actions are going happen
15:27:50 we need to start with something, post the patch to gerrit and let's work on it collectively
15:28:15 mnasiadka: for example I know that kolla-ansible changed autoheal politics (sorry i don't know the exact keyword in config) to something different
15:28:17 yes, it is ok to do some healthchecks before and after migration
15:28:21 (for rabbitmq)
15:28:36 autoheal? it's not Kubernetes
15:28:42 ah, for rmq
15:28:51 so on deployments with three rabbitmqs (small ones) rabbit will stop working immediately
15:29:05 for rabbitmq
15:30:02 mnasiadka: rabbitmq_cluster_partition_handling: "pause_minority" << this
15:30:08 Merged openstack/kolla stable/train: CentOS Stream 8: drop CentOS RabbitMQ 3.8 repo https://review.opendev.org/c/openstack/kolla/+/832650
15:30:48 in moment where rabbitmq will not see a node ..rabbitmq will be unresponsive ..
15:30:56 kevko: pause_minority is partition handling, there's no partition if you stop one node.
15:31:37 it should only cause havoc on network issues
15:31:50 but we've diverted the topic
15:32:30 yes, there can be some errors, because it will be for some time down, but after migration it is going up again
15:32:59 as long as we have some CI that is testing the move - we can work out the quirks I guess
15:33:21 so, post a patch - and let's discuss in the review (and probably on next meetings)
15:33:54 hmm ok
15:34:04 There's another podman-ish topic
15:34:05 (halomiva,hinermar) structure of podman testing and moving baremetal role to openstack collection
15:34:51 what about this one?
15:34:54 i think we should talk about systemd first
15:35:01 because podman depends on it
15:35:22 yes, systemd is something we should be able to land in Y
15:36:22 so what's the status of systemd implementation?
15:36:39 tests are passing but reviews are not coming so we would really appreciate some reviews
15:36:59 because we implemented everything you asked for
15:37:21 https://review.opendev.org/c/openstack/kolla-ansible/+/816724 - this one?
15:37:37 yes
15:38:23 yoctozepto: have cycles to look there again?
15:40:15 mnasiadka: currently not
15:40:33 Merged openstack/kolla-ansible master: Explicitly unset net.ipv4.ip_forward sysctl https://review.opendev.org/c/openstack/kolla-ansible/+/832087
15:40:57 ok, I'll do some testing and review that until end of the week.
15:41:01 and ask mgoddard to do the same
15:41:09 Thank you
15:41:34 And regarding the testing of podman patch, since the single file for kolla_docker would be too large I have split up the specific test parts and put them in directory /tests/kolla_docker_tests/
15:41:51 I just want check with you if that's alright
15:42:52 sure, who likes big files
15:43:25 Great, thank you
15:43:53 since baremetal role was move to collection and is under refactor, does it make sense to try push our patch to it for installing podman?
15:44:32 or should we keep baremetal role in kolla-ansible for testing purposes?
15:45:00 baremetal role is already out, removed
15:45:22 so you need to add podman installation to kolla Ansible collection
15:45:38 and without that, you won't be able to test the podman patch ;-)
15:45:56 yea but there are some changes coming to that role no?
15:46:42 so should I wait for the changes to come or push it now
15:46:42 yes, look if there are any patches conflicting with your work, and adapt to that direction?
15:47:04 I don't think mgoddard is planning a lot more work with the collection this cycle
15:47:36 https://review.opendev.org/q/project:openstack/ansible-collection-kolla
15:50:27 halomiva: probably best would be to create new role(s) for podman in the collection
15:50:47 ok then
15:51:01 #topic Open discussion
15:51:08 Anybody? Anything?
15:51:17 yes, just ask for review
15:51:30 beside these podman changes to kolla-ansible
15:51:36 we have additional to kolla
15:51:48 for image building by podman
15:52:13 from our point of view they are ready, is there possibility to merge them to Y release?
15:52:21 url?
15:52:33 I think it is quite independent thing
15:52:55 mnasiadka: https://review.opendev.org/q/owner:konstantin.yarovoy%2540tietoevry.com
15:54:15 ok, I see them - will try to review before next meeting
15:54:33 thank you very much
15:55:55 yoctozepto: I'm nearly finished with the fqdn based single external frontend on HAProxy (https://review.opendev.org/c/openstack/kolla-ansible/+/823395) - would be nice if you could look into that
15:56:17 I'll add a CI job as well, guess without that it's going to be hard to merge ;)
15:56:41 path based should be easy to add, probably I'll do that when adding CI job
15:57:48 ok, let's finish for today
15:57:50 thanks for coming!
15:57:53 #endmeeting
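
Editor's note on the docker-to-podman discussion (15:26:02 to 15:28:17): the participants converge on running the migration limited to one host at a time, with health checks before and after. The ordering can be sketched roughly as below. This is only an illustration of that ordering, not the patch discussed in the meeting; `migrate_hosts`, `health_check` and `migrate` are hypothetical names, and the real work would presumably be done by Ansible tasks rather than Python callables.

```python
from typing import Callable, Iterable, List


def migrate_hosts(
    hosts: Iterable[str],
    health_check: Callable[[str], bool],  # hypothetical: True if services on the host are healthy
    migrate: Callable[[str], None],       # hypothetical: performs the docker -> podman switch
) -> List[str]:
    """Migrate hosts strictly one at a time.

    Each host is checked before and after its migration. The loop stops at
    the first failed check, so at most one host is left in an unknown state.
    Returns the list of hosts that were migrated and passed the post-check.
    """
    done: List[str] = []
    for host in hosts:
        if not health_check(host):
            raise RuntimeError(f"pre-migration health check failed on {host}")
        migrate(host)
        if not health_check(host):
            raise RuntimeError(f"post-migration health check failed on {host}")
        done.append(host)
    return done


if __name__ == "__main__":
    # Stub callables standing in for real checks and migration steps.
    migrated = migrate_hosts(
        ["controller1", "controller2", "controller3"],
        health_check=lambda host: True,
        migrate=lambda host: None,
    )
    print(migrated)
```

Stopping at the first failure is a design choice made for this sketch: with clustered services such as rabbitmq or mysql, continuing to the next host after a failed post-check could take down a second cluster member.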