15:00:38 #startmeeting openstack_ansible_meeting
15:00:39 Meeting started Tue Apr 19 15:00:38 2022 UTC and is due to finish in 60 minutes. The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:39 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
15:00:39 The meeting name has been set to 'openstack_ansible_meeting'
15:00:44 #topic rollcall
15:00:51 \o/
15:01:05 hey!
15:01:08 o/
15:01:37 hey ! o/
15:01:42 hello
15:02:24 #topic office hours
15:02:26 and thanks for nominating me! (i just saw the message on ML)
15:03:38 so. seems we have an issue with ceph-ansible version... Will need to take a look at the actual thing first thing tomorrow morning....
15:04:08 but that sounds really weird, I haven't heard a thing about them replacing cinder-volume with anything else...
15:04:19 *ceph-volume
15:04:51 FYI, i'm not sure if it's related, but I've hit this error when deploying AIO+ceph today https://paste.openstack.org/show/bRMsII6P2s1YyjugFuWk/
15:05:00 but haven't looked much into this yet
15:05:22 that's interesting actually.
15:05:42 I can recall we were having apt pinning for ceph somehow....
15:05:43 we should have apt pins to make sure that the ceph version comes from where we expect
15:06:00 which iirc on ubuntu should always be download.ceph.com
15:06:03 but I think we did that only for ceph-client role?
15:06:14 could be, yes
15:08:04 https://github.com/openstack/openstack-ansible-ceph_client/blob/master/defaults/main.yml#L43
15:08:42 huh and we don't really include role anywhere
15:09:01 unless we rely on that? https://opendev.org/openstack/openstack-ansible-ceph_client/src/branch/master/meta/main.yml#L40-L45
15:09:10 yes it comes from meta
15:09:11 but, um....
15:09:32 but anyway failure is during ceph-ansible runtime
15:09:49 (in CI at least)
15:10:25 Also I spent time last week on the unplanned activity of fixing the run of roles in check mode
15:10:35 it was interesting to see how doable that is...
15:11:07 hosts and infra seem quite fine, openstack services are tricky
15:11:49 I also thought it would be easy to have a CI job but then realized that to run in check mode it has plenty of dependencies in terms of missing services and certificates.
15:12:12 Still doable but appeared a bit harder than expected
15:12:33 Will likely continue that after more important things are done:)
15:12:52 BTW we kind of need to make the last release of V and move it to EM
15:14:01 jrosser: do we need wip here? https://review.opendev.org/c/openstack/openstack-ansible-tests/+/836335
15:15:01 no - i think it was just to try to get things merged in order down the stable branches
15:15:22 this has all been quite a big mess with not really any cherry-picks that work
15:15:35 so i had to patch each branch pretty much individually
15:16:32 and really the patch we need is https://review.opendev.org/c/openstack/openstack-ansible-tests/+/837368
15:16:57 there is a bunch of broken things due to that ^
15:18:59 the gluster patches are nearly ready for proper review
15:19:14 i have to make a hack^W fix for rocky
15:20:39 But hopefully NeilHanlon would ping for some packaging changes one day?
15:20:51 hopefully yes
15:21:25 ok great. And then I can pick CentOS 9 deployment on top of gluster :)
15:22:01 Yes, i am working on fixing up those release packages for, e.g., gluster
15:22:47 i just need to decide if we lineinfile/regex the repo file or vendor a copy of it and just copy: it into place
15:22:49 great)
15:24:45 do you think we should be more specific about pinning the ceph repo on ceph hosts?
15:25:16 if we pin the client to ceph.com but not the server, we still have potential conflicts between ubuntu / uca / ceph.com repos
15:26:15 jrosser: Um, can you point to that Rocky hack as I can't find it quickly :)
15:26:29 no i'm still trying to write it :)
15:26:39 ah, ok then :)
15:27:06 but i believe we have to adjust mirrorlist=http://mirrorlist.centos.org/?release=$stream&arch=$basearch&repo=ResilientStorage&infra=$infra
15:27:33 regarding ceph pinning - that kind of makes sense... Though that should be part of ceph-ansible, to be fair...
15:27:47 so that it's always release=8-stream
15:29:23 i need to spend some time trying to figure out why i get this downgrade error, then I'll have a better view on how we want to pin the ceph repo. Planning to look into this during the evening
15:29:33 ceph-ansible already has requirements.yml in place, so technically we can push them a change to handle pinning and add another dependency on our roles :)
15:29:41 NeilHanlon: do you have a bug link i can put in my patch for glusterfs?
15:31:36 don't think they're gonna like it though...
15:32:06 I'm kind of more and more thinking that integration of ceph-ansible should be more for CI purposes only.
15:33:04 while it's good to have playbooks for that and some sort of integration, I'm not really sure if anybody should do that in production :)
15:33:12 (at least as is)
15:34:17 I'm going to use it in production in a few weeks, i can share my thoughts then :D
15:34:43 the questions come when you're trying to upgrade stuff :)
15:35:34 as we pin a specific ceph-ansible version that is tied quite tightly to ceph releases
15:35:37 i have it in production. (he types with shaking hands :D )
15:36:18 as then when you upgrade osa you should either override ceph-ansible or upgrade ceph at the same time, kind of.
15:36:29 i had it with osa for exactly 1 deploy then i moved the ceph-ansible on the side right after
15:37:23 we are thinking about replacing it with ceph-adm, not starting the procedure though
15:37:37 ceph-ansible is fine if you use it on its own
15:37:51 i wouldn't copy the approach from OSA CI into a production deployment
15:38:26 nah, we're not talking about ceph-adm, but more about at what level we should provide ceph deployment
15:38:40 and if we're explicit enough about how to use that properly
15:41:11 tbh I like the idea of having one repo for both but at the end of the day we are using them kinda separately and will override the role-requirement if needed
15:41:55 i just think that ceph integration is a huge advantage of openstack-ansible project (especially for a newcomer who is comparing different deployment tools and trying to choose one)
15:42:14 so maybe we can list our main issues with it and think what we can do about them?
15:42:31 ceph-ansible sometimes has very specific requirements for the ansible version (though this is less important with modern ansible)
15:42:52 and versions of ceph-ansible only support very specific releases of ceph
15:43:19 specific major rel
15:43:36 so you ended up in very difficult places at upgrade time when the openstack, ansible and ceph (and maybe also OS) versions all had to change simultaneously
15:43:42 and that's a total nightmare for upgrades
15:44:37 but You don't have to run `ceph-install.yml` in the middle of openstack upgrade, right? :D
15:45:00 no, if you're not running /run-upgrade.sh :p
15:45:11 (which is what newcomers will do)
15:45:27 and we test exactly "zero" of that approach :)
15:45:48 and you will get newer clients anyway unless explicitly set in user_variables
15:46:37 I'm not talking about dropping support, but more likely about being more explicit in documentation about the possible caveats of doing that
15:46:44 so maybe what we need to do is to show a warning message when running run-upgrade.sh with ceph enabled?
15:46:50 ah yeah, i was just going to say that
15:47:00 that another option is to put some information in docs
15:47:05 well - back to where we started there is really only benefit in pinning to ceph.com
15:47:36 the UCA packages didn't use to have debug symbols and that was a problem for us when things were broken
15:47:38 saying that OSA upgrades may be tricky when it manages ceph as well
15:48:38 I can recall that early on, the bionic distro-provided ceph version was treated as newer than the one that was coming from ceph.com
15:48:50 yes, i remember having to fix that
15:49:01 I guess that's when we added pinning :)
15:49:16 i think it was the local patch that ubuntu applied that made the version number semantically greater
15:49:26 yup
15:49:54 while in fact it wasn't and dependencies were broken
15:50:26 anyway - let's see what the current issues are.
15:51:27 damiandabrowski[m]: you might want to look at this as it addresses similar errors to those you were seeing https://github.com/openstack/openstack-ansible-os_cinder/blob/9f2bf29db8ef921cfad7857dcb7652436d0d887b/tasks/main.yml#L183-L209
15:51:35 ofc we can add pinning in playbook alone
15:51:56 thanks, I'll have a look
15:52:17 i had to split the list of packages into those to install before, and after the ceph_client role is run
15:53:54 btw. we already managed to merge a few of my tempest patches, but many of them still remain unreviewed.
15:53:59 it would be awesome to close this topic soon ;) https://review.opendev.org/q/topic:tempest-damian-2021-12
15:55:36 i'll take a look
15:56:57 damiandabrowski[m]: don't most of them need a rebase?
15:57:36 see tons of `Indirect ancestor` in gerrit
15:57:52 likely they won't merge cleanly but not sure
15:58:32 that's right, but the plan was to get initial reviews -> apply Your suggestions -> rebase -> merge
15:58:49 by doing it like this, we'll avoid running too many unnecessary zuul jobs
15:58:53 but I can rebase them now if You want :D
16:00:09 but main drawback of this plan is that You'll need to leave Your reviews twice on these patches (before and after rebase)
16:01:14 well, rebase in gerrit doesn't remove Code-Review label
16:01:20 it would remove just Workflow
16:01:33 but ok, indeed:)
16:01:34 ahhh okay
16:01:40 #endmeeting
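Note on the mirrorlist adjustment discussed at 15:27:06 and 15:27:47: a minimal sketch of the "regex the repo file" option, assuming the goal is to replace the `$stream` variable in the mirrorlist URL with the literal `8-stream`. The function name `pin_stream_release`, the `[centos-storage]` section name and the sample text are hypothetical; the real patch was still being written during the meeting and may instead use lineinfile or a vendored copy of the file.

```python
import re


def pin_stream_release(repo_text: str, release: str = "8-stream") -> str:
    """Replace release=$stream on mirrorlist lines with a fixed release.

    Hypothetical helper: only lines starting with "mirrorlist=" are touched,
    and only the value of the "release" query parameter is rewritten.
    """
    pattern = re.compile(
        r"^(mirrorlist=.*[?&]release=)\$stream\b", re.MULTILINE
    )
    # A callable replacement avoids any escaping surprises in the new value.
    return pattern.sub(lambda m: m.group(1) + release, repo_text)


# Sample repo file content; the section name is an assumption for illustration.
sample = (
    "[centos-storage]\n"
    "mirrorlist=http://mirrorlist.centos.org/?release=$stream&arch=$basearch"
    "&repo=ResilientStorage&infra=$infra\n"
)

pinned = pin_stream_release(sample)
print(pinned)
```

Other yum variables such as `$basearch` and `$infra` are left untouched, so only the release is pinned.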