14:00:16 #startmeeting cinder 14:00:16 Meeting started Wed Jul 7 14:00:16 2021 UTC and is due to finish in 60 minutes. The chair is rosmaita. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:16 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:16 The meeting name has been set to 'cinder' 14:00:21 #topic roll call 14:00:24 hello 14:00:26 hi 14:00:32 hi! o/ 14:00:32 hi 14:00:33 Hi 14:00:34 hi 14:01:02 #link https://etherpad.opendev.org/p/cinder-xena-meetings 14:01:03 hi 14:01:17 I am double booked. Will watch as I can. 14:01:49 hello everyone 14:01:53 #topic announcements 14:02:01 hi 14:02:05 ok, this is week R-13 14:02:20 xena milestone-2 is next week (!) 14:02:30 key issue is the new driver merge deadline 14:02:43 as far as i am aware, we have one new driver: 14:02:54 NFS driver for Dell EMC PowerStore 14:03:02 #link https://review.opendev.org/c/openstack/cinder/+/797608 14:03:25 it would be good for any NFS-based driver maintainers to take a look, you may have helpful comments 14:03:47 would be good to get some reviews today and tomorrow to allow reasonable time for revisions 14:03:56 good news is that it's not a very large patch 14:04:20 looks like it's just using generic nfs driver for all functionalities 14:04:50 uhm, does it need a separate driver then? 14:04:56 yes, so it will bask in the goodness of our current nfs driver 14:05:00 hmm 14:05:51 lol 14:05:52 tosky, not really but its the same as nfs driver which seems strange to me... 14:06:03 is the intent to CI and test it on their hardware? hard to understand why this exists... 14:06:39 the bp is a bit sparse: https://blueprints.launchpad.net/cinder/+spec/powerstore-nfs-driver 14:07:22 but it does have CI: http://publiclogs.emc.com/08/797608/3/check/DellEMC_PowerStore_NFS/f03b290/DellEMC_PowerStore_NFS/13 14:08:08 in any case, i don't think we have a policy that a driver must be of a certain level of complexity > generic driver in order to be accepted? 14:08:33 mep 14:09:40 i'll have to look deeper but i think the check for reporting multiattach support is wrong 14:10:04 i also felt the same, if not qcow2 then multiattach = true 14:10:29 ok, eharney and whoami-rajat please look more closely and leave comments on the review 14:10:50 but if it's really the same (I haven't seen the code), they could just use the generic one on their CI 14:10:52 * tosky shuts up 14:11:41 well, they will find bugs in the generic one, i guess in either case 14:12:45 but this is a serious question, is the closeness to the generic driver a reason to reject this one? 14:13:24 i think we should at least ask why they want to create one (i.e. is it just to add this flag to support multiattach?) 14:14:40 i dont' think Ivan Pchelintsev is here 14:14:52 walshh_: do you know? 14:15:06 No, I don't have details 14:15:20 But I can reach out to him after the meeting 14:15:45 eharney: it is definitely wrong lol 14:16:00 snapshot of attached volume won't work 14:16:26 also that, which is why it's turned off in the generic nfs driver 14:16:42 rosmaita: I wouldn't reject a driver just because it's basically a wrapper for the generic driver 14:16:57 geguileo: noted 14:17:14 sounds like we already have comments for the review 14:17:15 rosmaita: it may be their initial implementation and they may want to add funcionality later on (eg: replication) 14:17:23 that's what i was thinking 14:17:32 or maybe it's for branding reasons 14:17:34 i don't necessarily think we should reject it, but we should probably see if there are enhancements they want that could just go in the base driver 14:17:36 I'm ok either way 14:17:45 eharney: +1 14:18:17 #action rosmaita reach out to dell (Ivan Pchelintsev) about strategy for powerstore nfs driver 14:18:24 ok, moving on 14:18:38 #topic dropping lower-constraints jobs in master in all repos 14:18:57 the TC has said it's up to individual projects whether to maintain these jobs or not 14:19:20 there isn't anyone who actually uses the lower-constraints.txt as far as we can tell 14:19:42 and, there isn't an ongoing program to maintain the l-c with the dependencies of our dependencies 14:20:05 and, if we update requirements at every M-3 as we have been doing since wallaby 14:20:21 we won't have to worry about stale versions in the l-c 14:20:54 and, the l-c are only tested against our unit tests, so it's not clear that everything in there actually gets a good enough workout to detect problems 14:21:35 so, i think we should drop the jobs and files (since the file is even more useless if we're not testing against it) 14:21:59 i don't think we get any real benefit by trying to keep it, so i agree 14:21:59 IIRC l-c jobs also break gate everytime a migration is done, eg: bionic->focal 14:22:11 there are some patches up for this for some of our repos, i will un-block those and put up any required other patches 14:22:44 i will make sure they have the topic "drop-l-c" 14:22:58 so if anyone has second thoughts, look for a patch and leave a comment 14:23:21 any comments? 14:23:33 i guess i just said leave it on the reviews 14:23:37 ok, moving on 14:23:48 #topic Please land gate fix for wallaby 14:23:55 this is an alert for the stable cores 14:24:03 https://review.opendev.org/c/openstack/cinder/+/798671/ 14:24:43 yep, nothing else to add there really 14:24:53 actually, looks like https://review.opendev.org/c/openstack/cinder/+/798696/ is the one that needs review 14:25:20 ++ 14:25:40 this is slightly off topic, but whoami-rajat has been doing a good job organizing the stable releases and reviewing backports 14:26:06 i think i will propose him as a stable-core 14:26:14 on the ML, so watch for that 14:26:21 :) 14:26:26 ok, next topic 14:26:32 :) 14:26:39 #topic Request for review (blocking glance_store change) 14:26:58 we discussed this last week, and whoami-rajat has made requested revisions 14:27:14 so it is time for people who made those requests to re-review 14:27:22 i believe that i am one of those people :) 14:27:49 we have a light agenda today, so i will use my free time after the meeting 14:28:04 I also have some more changes in glance_store dependent on the base attachment code, just wanted to get things in on time 14:28:29 good point, as a nonclient library, glance_store gets released early (like os-brick) 14:28:49 so, people with interest in cinderclient, please review! 14:28:49 yep 14:29:04 #topic rbd: Add snapshot mirroring support 14:29:18 there's a bp filed for an improvement to the rbd driver 14:29:26 #link https://blueprints.launchpad.net/cinder/+spec/rbd-snapshot-mirroring 14:29:48 apparently, there are two ways of doing replication for ceph 14:29:57 the old-fashioned way, journaling 14:30:09 and the recent improved way, snapshot mirroring 14:30:26 this proposal is to let the operator configure which one the rbd driver will use 14:30:55 my question here is: do we really want this to be a config option? 14:31:09 or should we just use the better way if it's available? 14:31:28 i think this should be transparent, and not put onto the user 14:31:30 i seem to remember that we had this discussion about a different feature, but i couldn't find the patch 14:31:34 config option seems to make sense, but we need to ensure the behavior doesn't change between the two and that someone can actually switch back and forth 14:31:43 so maybe it was a different driver 14:32:01 eharney +1 14:32:34 jbernard: is there a difference in how the RBD pool must be configured? 14:33:15 if the RBD driver can tell which way the pool is configured, then it could do it automatically, right? 14:33:34 from a quick look at the patch, it looks like you just need fast-diff available 14:33:38 the first couple of paragraphs on https://docs.ceph.com/en/latest/rbd/rbd-mirroring/ seem to indicate that the two modes don't have the same semantics 14:34:22 geguileo: no, and anyway the pool configuration is expected to be performed by the installer, the driver asserts that the cluster is configured properly 14:35:00 jbernard: if the driver cannot tell which way the pool is configured then it must be a configuration option, right? 14:35:24 geguileo: we can query from the driver, i dont see that as a problem 14:35:35 jbernard: that's precisely what I asked 14:35:52 geguileo: ok, i see 14:38:31 so, it's not clear to me that the snapshot-based replication is always the better way, from the doc 14:39:06 well, if there would be actual reasons for preferring journaling, then i think we need to keep it as an option 14:39:27 it reads to me like the snapshot-based mode has a longer RPO but hard to tell 14:40:32 I read that too 14:40:55 and if the deployer didn't configure a snapshot schedule replication wouldn't even happen 14:41:23 so do we even want to allow this feature? 14:41:35 I think it's useful 14:41:58 i think it makes sense to allow the admin to choose based on journal (slower overall but better RPO) and snapshot-based (probably better I/O perf, but worse RPO) 14:42:49 ok, we will have to make sure the release note is clear ... and i guess the defaults should be to journaling? 14:43:34 yes 14:45:36 ok, and i guess the same deal for the rbd backup driver? 14:46:26 i don't think this would apply there? 14:47:41 #link https://review.opendev.org/c/openstack/cinder/+/784956 14:49:20 I don't think we want that there (could be wrong though) 14:49:27 for backups we want them to be synchronous 14:49:52 do we support mirroring backups at all currently? 14:51:19 no 14:51:38 people can do pool based mirroring, but the backup driver would not be aware of it 14:52:53 just to add something backup_ceph_image_journals config applies journaling and EXCLUSIVE_LOCK feature 14:56:16 ok, so it sounds like we think: (1) rbd driver option to select journaling vs snapshot mirroring, (2) for rbd backup driver, journaling only when it's available, snapshot mirroring does not seem an appropriate technology for backups 14:57:11 sounds good 14:57:26 ok, the proposer should be fine with (1) because that's what's implemented in the patch 14:57:39 the proposer can come to the next meeting and tell us why we are wrong about (2) 14:57:53 deadline for this is M-3 14:58:04 #topic open discussion 14:58:11 i added two links to the etherpad 14:58:16 all that free time i was talking about evaporated 14:58:19 https://review.opendev.org/q/topic:%22bug%252F1926630%22+(status:open%20OR%20status:merged) 14:58:24 https://review.opendev.org/c/openstack/cinder/+/789602 14:58:29 https://review.opendev.org/c/openstack/cinder/+/789603/10 14:58:31 API fixes for encryption 14:58:39 this stuff has been waiting for more reviews 14:58:42 this is about creating the types? 14:58:49 yes 14:58:52 i mean, the encryption type for a volume type 14:58:55 or whatever we call it 14:59:06 and gracefully handling previously created types that are invalid 14:59:31 ok, i will put those on my list (i think i already had a look) 14:59:43 you +2'd one of them some time ago 14:59:55 even better 14:59:57 these are pretty straightforward, would be nice to close them out 15:00:04 ok, other people, please look them over! 15:00:26 we are out of time 15:00:29 thanks everyone! 15:00:29 eharney: that will not fail on the API, right? 15:00:36 thanks! 15:00:36 geguileo: define "fail" 15:00:44 QA team is OK with the change 15:00:54 decided it does not impact the API 15:01:06 eharney: well, it aborts the creation, so I assume it fails and goes to error state 15:01:34 we need to move this over to #openstack-cinder 15:01:37 #endmeeting