15:01:38 #startmeeting manila 15:01:39 Meeting started Thu Feb 15 15:01:38 2018 UTC and is due to finish in 60 minutes. The chair is bswartz. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:01:40 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:01:41 hi 15:01:42 The meeting name has been set to 'manila' 15:01:43 hello all 15:01:49 o/ 15:01:49 Hello again 15:01:52 \o 15:01:56 Hi 15:02:00 hey 15:02:03 hello 15:02:20 courtesy ping: toabctl markstur vponomaryov cknight 15:02:34 I know gouthamr is still honeymooning, and zhongjun is out for chinese new year 15:03:04 vkmc is here even though it's Carnaval, or hangover from same 15:03:21 hard to get back from a long weekend 15:03:25 >.< 15:03:28 but it's not the middle of the night 15:03:32 Too bad we don't have carnaval in the states 15:03:48 you should have it... it's some sort of mardi gras I think 15:03:53 hi 15:04:01 nawlins is special 15:04:26 vkmc: how often do you have Carnaval? once a year? 15:04:28 #topic announcements 15:04:43 Okay so we still have 2 weeks until release 15:05:01 xyang, yes! 15:05:10 Right now we should be bug hunting, fixing any release-blocking bugs, and fixing up docs for Queens 15:05:26 PTL elections are over, and tbarron is now officially installed 15:05:31 Huzzah! 15:05:33 xyang, 40 days before easter... we go nuts... I think it's a latin american thing 15:05:43 :) 15:05:58 tbarron: congratulations! 15:06:05 tbarron, congrats! 15:06:14 tbarron: congrats! 15:06:31 tbarron: congratulations! 15:06:35 So the main things to cover this week are any critical bugs, and the PTG planning 15:06:47 Let's do PTG planning first 15:06:51 #topic PTG planning 15:06:54 Thanks all, I'm going to need help and support from all of you :) 15:07:03 #link https://etherpad.openstack.org/p/manila-rocky-ptg 15:07:25 tbarron: gladly :) 15:08:22 We should probably take a look at the proposed topics, and start scheduling them for Tuesday or Friday, based on how much time they're likely to consume 15:08:36 tbarron: I think you and I should get together and do that 15:08:51 bswartz: +1 15:08:53 Based on how much stuff we have here, I doesn't feel like we'll use up 2 whole days 15:08:58 Friday might be a short day 15:09:14 Then again, maybe some people have topics not yet added to the etherpad 15:09:25 I probably do... 15:09:38 * bswartz thinks 15:09:50 * tbarron sees smoke in the room 15:10:47 bswartz: has it been oficially confirmed that we will have the Tuesday and Friday? 15:11:01 In any case, since we're going to come up with a schedule soon, consider this your last chance to propose a topic for any time other than friday afternoon 15:11:04 ganso: yes 15:11:12 bswartz: great 15:11:38 ganso: http://ptg.openstack.org/ptg.html 15:12:07 tbarron: nice! we already have the room numbers! 15:12:33 So my other AI related to PTG I still haven't done :-[ 15:12:47 ? 15:12:50 I need to find out about evening events so we can schedule a team dinner 15:13:12 Does anyone know what the schedule of evening events looks like? 15:14:08 Cinder plans Thursday night dinner: 2/28 15:14:48 xyang: Thursday is 3/1 15:15:09 We already eliminated Thursday+Friday 15:15:13 that's on Cinder ptg etherpad:) 15:15:14 It has to be Mon, Tue, or Wed 15:15:35 xyang: lol, I'm going to fix it over there 15:16:06 ganso: might be worth confirming with the cinder PTL that it's thursday not wednesday 15:16:27 jungleboyj: ^ 15:17:03 tbarron: unfortunately there's an all-hands meeting here at netapp after this meeting, but are you available later this afternoon? 15:17:05 jungleboyj: Can you confirm: Cinder Planning to have a dinner outing on Thursday night: 3/1? 15:17:26 bswartz: sure, afternoon works 15:17:34 Yes. That is the plan. Thursday night. 15:17:45 okay cool 15:17:47 jungleboyj: which month? 15:18:05 I believe that's when we start March. 15:18:06 jungleboyj: you wrote down the wrong date initially:) 15:18:09 #topic Let's Go Over New Bugs 15:18:11 * tbarron is not doing well working on good relations with cinder 15:18:52 dustins: I assume you have some bugs from last week worth covering 15:18:52 Do I have February 28th in the etherpad accidentally? 15:18:52 #link https://etherpad.openstack.org/p/manila-bug-triage-pad 15:19:03 but first, are there any new bugs found since the RC1 tag? 15:19:12 jungleboyj: yes:) 15:19:17 jungleboyj: yes 15:19:20 bswartz: You assume correctly, and I believe just one in the last few days 15:19:24 We can start with that one 15:19:39 #link https://bugs.launchpad.net/manila/+bug/1748696 15:19:40 Launchpad bug 1748696 in Cinder "get-pools doesn't reflect capacity change made by consume_from_volume" [Undecided,In progress] - Assigned to TommyLike (hu-husheng) 15:19:54 This one is as yet unconfirmed in Manila, but does effect Cinder 15:19:58 I will fix that. 15:20:07 it's already fixed by ganso 15:20:28 So I would bet that tommylike is on vacation for chinese new year 15:20:48 dustins: the assumption is that we cloned oversubscription code from cinder so we need a similar fix 15:20:50 Is this a release blocker? 15:21:05 if so, we've lived with it for a long time 15:21:06 Oh you mean the fix for manila is already merged? 15:21:24 I don't mean that. 15:21:39 responding to xyang 15:21:48 And I think xyang was talking about the etherpad. 15:21:56 xyang: ? 15:21:59 oh 15:22:07 yes, I was responding to jungleboyj 15:22:17 IRC chaos lol 15:22:22 multiple conversations overlapping 15:22:29 about oversubscription, I'm not sure we have the exact same issue. we handled a little differently here 15:22:38 I suspect we need a careful fix for the oversubscription issue and wouldn't want to rush. 15:22:40 okay so since tommylike isn't around, who understands this bug? 15:23:17 I see the proposed cinder fix: https://review.openstack.org/#/c/543205/ 15:23:49 bswartz: I understand that there was an improvement merged in Cinder in the queens release, and we would like to have this improvement ported over to manila in the future. I am unaware of *BUGS* related to that 15:23:58 Maybe we should look at whether our scheduler code is close enough to cinders that we have the same bug and the same fix would work? 15:24:45 I remember this part is a little different, but I'll have to take a look 15:25:30 The main decisions we need to make today are: should we plan to address this in a RC2 15:25:39 and who wants to own the investigation? 15:25:41 note that tommylike says manila *may* have the same bug 15:25:55 Yeah I mean if it needs addressing at all 15:26:14 Right, it's unconfirmed in Manila right now 15:26:24 xyang: do you have time to look and let us know if the issue exists in manila? 15:26:36 Still if we want to fix it in an RC2, then there's a rush to get the investigation done 15:26:39 I can take a look 15:26:52 Otherwise, we can talk about it at PTG and fix it in Rocky 15:27:18 tbarron: Whatever happened to that glance bug? Did they make an RC2 that fixed it? 15:27:35 xyang: if we have it, let bswartz and me know and we'll see about the rc2 vs rocky thing 15:27:46 bswartz: it's fixed and postgresql tests now pass 15:27:47 tbarron: sure 15:27:56 excellent 15:28:02 bswartz: dustins: https://bugs.launchpad.net/manila/+bug/1749184 15:28:03 Launchpad bug 1749184 in Manila "db migration error with mariadb >= 10.2.8" [Undecided,In progress] - Assigned to Stefan Nica (stefan.nica) 15:28:19 that's another new bug and potential rc2 candidate 15:28:30 https://review.openstack.org/#/c/543927/ addresses it 15:28:51 gate (and our distro) are using using a lower mariadb version but 15:28:57 Do we have a cap on the mariadb version we support? 15:29:04 toabctl: is SUSE affected by this one? 15:29:23 * tbarron notes the bug reporter works for SUSE 15:29:42 bug reporter == bug fixer 15:29:57 Yeah I'm looking at this fix -- I'm not sure why the fix is mysql specific 15:30:13 Does the fix not also work on postgres? 15:30:39 It seems like dropping a primary key before dropping the column itself should be completely harmless 15:31:11 I think it does not, see patch set 3 15:31:13 Oh wait, this drops the whole primary key and recreates it 15:31:55 I wonder if the limitation here comes for sqlalchemy or the underlying databases 15:32:19 It makes me nervous to have different upgrade code for different DBs 15:32:24 in patch set #3 he did the drop and readd unconditionally and the postgresql tests failed 15:33:26 bswartz: agree, but I wonder if anyone actully uses OpenStack with mariadb >= 10.2.8 yet 15:33:27 But if SUSE is shipping a new enough version of MariaDB with they Queens openstack, then this does feel worthy of an RC2 15:33:52 I believe toabctl is on paternity leave (again). 15:34:04 toabctl: do you think you could find this out soon? 15:34:36 He was here 30 minutes ago 15:34:46 bswartz: probably he'll read the backlog but I'll ask in the review as well. 15:35:35 okay well this is important because if we need an RC2, then the sooner we cut it the better 15:36:07 As a reminder, we've branched already so it's okay to merge the fix into master as soon as the fix is ready, and we can take the decision to backport it seaparately 15:36:37 bswartz: remind us of the rules w.r.t. doc fixes, etc. now. 15:36:57 doc fixes are always okay to merge, and those go into master 15:37:10 I don't think we've ever backported docs changes 15:37:44 The main thing that shouldn't be going into master are large/risky changes that could complicate any backports to queens 15:38:30 so if we have missing api doc it won't be in o.o.docs for queens then, right? 15:38:49 Once the release is done, then we open the gates for anything into master 15:39:08 tbarron: IDK id the new docs process requires us to maintain docs in stable branches of manila 15:39:14 s/id/if/ 15:39:42 We can look at how the docs builds are being done 15:39:50 well if it *allows* us to, then we can backport those 15:40:02 we have several api update reviews pending 15:40:05 Yes but it's a huge hassle to maintain stable docs in addition to the master docs 15:40:35 The question would be, is anyone going to read anything other than the master version of the docs? 15:40:45 bswartz: I'm not saying we have to do it indefinitely, just that we have some outstanding and it may be worth doing it right after the release. 15:40:56 bswartz: perhaps, if the person is using an older version of OpenStack, the master version will not make sense 15:41:11 well I'm not sure what oo./*latest*/ points to 15:41:27 ganso: one would hope we don't change things so much that the master docs are nonsensical w.r.t. stable version of the code 15:41:31 * tbarron needs to check, I think we're pointing to pike rather than master atm 15:41:43 * bswartz sighs 15:41:47 IIRC we have /latest/ and previous ones, like /ocata/, /pike/ 15:41:50 Okay perhaps we need to do some docs backports 15:42:08 atm I'm just arguing for having *latest* up to date 15:42:25 The master docs should be as correct as we can make them 15:42:43 updating /latest means less things to worry about in the future =) 15:42:44 If we need to backport some docs changes to stable branches, then so be it 15:43:01 * dustins wishes he had time to be docs guy again 15:43:21 okay dustins, what other bugs do you have? 15:43:27 I believe gouthamr is the current docs guy 15:43:48 gouthamr is the newlywed guy 15:43:55 ganso: Ah, didn't know that! 15:44:12 #link https://bugs.launchpad.net/manila/+bug/1747695 15:44:13 Launchpad bug 1747695 in Manila "neutron network multi segment ip version" [Undecided,New] 15:44:25 This one seems to have been found in Pike 15:44:52 Related to the IPv6 changes? 15:45:03 The multi-segment binding feature is not well covered by tests 15:45:42 I'm not even sure what that is, honestly 15:45:43 ganso: do you know if any tests that NetApp runs could catch this regression (if that's what this is)? 15:46:10 bswartz: I believe tests we run aren't able to catch this 15:46:17 There's a good chance that maurice will propose a fix, he has two in review right now 15:46:40 Since it's pike-targeted, we don't have to consdier it for Queens until after the release 15:47:02 But the bigger concern here is that the regression was able to happen due to poor test coverage 15:47:21 I don't know how we could cover this better in the gate, but 3rd party CI can and should cover this kind of use case 15:47:44 Maurice links to stable/pike code, so this is not something we changed in queens, we actually might have fixed it (unlikely) 15:47:54 I doubt it 15:48:09 does he claim (or do we know) that it's a *regression*? 15:48:31 Presumably he was running Ocata and it was working fine 15:48:33 do we know know it's OK in ocata? 15:48:47 And on pike it stopped working -- that's when the ipv6 stuff merged 15:48:49 k, but I'll ask explicitly in the bug 15:49:01 tbarron: Thanks! 15:50:06 bswartz: but I agree, it would be good to detect this ourselves rather than having large enterprises discover it for us. 15:50:27 tbarron: that could be a PTG topic then -- how to test multisegment binding in the gate 15:50:49 done 15:51:00 bswartz: +1 15:51:10 I'm not optimistic we'll find a solution 15:51:15 But it's worth discussing 15:51:21 dustins: next? 15:51:40 #link https://bugs.launchpad.net/manila/+bug/1746725 15:51:41 Launchpad bug 1746725 in Manila "LVM driver is unable to remove addresses in different IP versions belonging to the same interface properly" [Undecided,New] 15:52:07 I would like to fix this bug with my new export helper 15:52:19 Already a PTG topic 15:53:18 awesome 15:53:58 dustins: anything else? 15:54:09 #link https://bugs.launchpad.net/manila/+bug/1744084 15:54:12 Launchpad bug 1744084 in Manila " DocImpact: Add MapR-FS native driver" [Undecided,New] 15:54:24 This one's just a simple "are the docs there for this" bug 15:54:34 Yes, let's check if there are docs 15:54:54 #link https://github.com/openstack/manila/blob/master/doc/source/configuration/shared-file-systems/drivers/maprfs-native-driver.rst 15:55:24 Close bug? 15:55:33 Looks good to me 15:55:48 next? 15:56:02 #link https://bugs.launchpad.net/manila/+bug/1733286 15:56:02 Launchpad bug 1733286 in Manila "snapshot share data Sync with the source share" [Undecided,New] 15:56:40 An older one that I think we've looked at in the past 15:57:03 Oh 15:57:05 dustins: indeed, looks like nobody volunteered to try to reproduce it 15:57:49 This is generic driver specific 15:58:01 I think there are caching/timing issues with how it does snapshots 15:58:24 We just take a cinder snapshot and assume that the right thing happens 15:58:47 If there's unwritten data in the service VM's cache though, it can get missed by the snapshot 15:59:21 bswartz: I guess it depends on how driven we are to fix this thing with the Generic driver 15:59:28 The correct behavior would probably be to flush the write caches of the service VM before taking the cinder snasphot 16:00:06 time check 16:00:16 Yeah given that it's probably not easy to reproduce this problem, it would be tough to test a fix 16:00:34 But the fix could be as easy as SSH to service VM and invoke "sync" before cinder snapshot 16:00:40 yeah we're out of time 16:00:46 thanks everyone 16:00:53 I have to run to my next meeting 16:00:59 tbarron: i'll be back later this afternoon 16:01:04 #endmeeting