21:00:00 #startmeeting nova_cells 21:00:01 Meeting started Wed May 17 21:00:00 2017 UTC and is due to finish in 60 minutes. The chair is dansmith. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:00:02 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:00:04 The meeting name has been set to 'nova_cells' 21:00:13 o/ 21:00:36 man, crickets 21:00:43 o/ 21:00:46 dtp is on vacation 21:01:12 aight 21:01:19 #topic cells testing / bugs 21:01:30 so, we have a bug, which I think goes back to newton, 21:01:38 although more severe the closer you get to ocata 21:01:43 which is a sort of db connection leak 21:01:49 which melwitt has a patch up for 21:01:52 yes, I opened dis https://bugs.launchpad.net/nova/+bug/1691545 21:01:54 Launchpad bug 1691545 in OpenStack Compute (nova) "Significant increase in DB connections with cells" [High,In progress] - Assigned to melanie witt (melwitt) 21:02:09 since we don't need to worry about purging the list in older releases, I'd vote for making the change as tiny as possible 21:02:21 agreed and/or did you already change it? 21:02:27 already changed yes 21:02:40 cool, I have outed myself as "has not looked yet" 21:02:43 what do you mean by purging the list? 21:03:01 mriedem: the cache invalidation todo 21:03:03 like clearing the cache upon SIGHUP, is what I had before 21:03:16 and yeah, I did summarize in the TODO 21:03:22 why does that not matter in older releases? 21:03:31 because we never dump any of the other lists of cells we have now 21:03:34 like in compute/api 21:03:43 making this one dump will just be confusing and is more to backport 21:04:09 ok 21:04:27 so like with nova.compute.api.CELLS, 21:04:29 so, I'll go look at that when we're done here and hopefully we can get that landed soonly 21:04:34 if you need to refresh it, you have to restart the services 21:04:38 yeah, for now 21:04:51 ok, release note with that? 21:05:01 release note for no change? 21:05:07 or for the bug fix? 21:05:26 release note for the fix plus the fact that if you need to refresh the cache, you have to restart the nova-api service(s) 21:05:39 also, this is probably something to send to the operators list as a heads up, 21:05:43 since they are rolling to newton, 21:05:56 the restart thing is no different from the rest of the code, so calling it out seems confusing to me 21:06:02 but a note for the fix makes sense 21:06:03 and we've had at least two people say they are having perf issues in newton, and zzzeek made it sound like this was pretty bad 21:06:31 I guess we could surface info about the hidden caches in this rel note 21:06:35 I don't think he understood what we were talking about when he said that 21:06:37 we could put the restart thing in a cells FAQs in the devref 21:06:43 which i'd like to start anyway, but havne't yet 21:07:14 sure, documenting the general "new cells get picked up by a restart" in devref makes sense 21:08:07 ok i've got another bug if you're done with this one 21:08:07 anything else on this? 21:08:11 sure 21:08:13 https://review.openstack.org/#/c/464088/ 21:08:29 is the fix for the upgrade thing with special characeters in the db connection url 21:08:38 and simple_cell_setup effing those up for cell0 21:09:06 okay 21:09:55 anything specific on that or just highlighting? 21:10:05 just pointing it out, it's going to need to go back to newton also 21:10:10 okay 21:11:01 on the testing front, I've gotten my devstack multicell patch past the cellsv1 job now and tomorrow will start on the migration job 21:11:12 aside from the migration job and a libvirt crash, I've got a good run on it against the nova tree 21:11:26 noice 21:11:29 so, soonish on that I think 21:11:34 anything else on bugs/testing? 21:12:14 #topic open reviews 21:12:26 melwitt: last I looked at your set, it had a bunch of jenkins -1s on ti 21:12:27 *it 21:12:33 your quotas set that is 21:13:06 that's all spurious gate failures. I do need to incorporate the feedback from the short summit discussion, to change all counts to dicts anyway though 21:13:18 okay 21:13:28 been busy with this db connection thing but next up is refreshing that set 21:13:34 okay 21:13:48 other than that, we landed the discover hosts fasterer thing due to help from mriedem 21:13:50 so that's cool 21:14:02 cool 21:14:14 also my set for cellsv2 target fixes is in need of a booty call 21:14:24 https://review.openstack.org/#/q/status:open+project:openstack/nova+branch:master+topic:multi-cell-testing 21:14:40 https://review.openstack.org/#/c/458634/ scared me 21:15:01 why? 21:15:10 the turducken of context managers in https://review.openstack.org/#/c/458634/10/nova/compute/api.py 21:15:18 it's just a bunch of calling convention changes so the next patch can make it work 21:15:37 all of that gets simpler after the quotas series also 21:15:43 i haven't gone through it in detail yet 21:15:50 right, 21:15:57 it gets easier after quotas, 21:16:01 and after the instance delete cleanup 21:16:06 but it fixes real issues 21:16:11 once you get to the context.py change 21:16:21 k 21:16:35 anyway 21:16:42 anything else in open reviews? 21:16:51 https://review.openstack.org/#/c/461519/ and the one after it have a +2 21:16:56 alex was on them, but he's smashing bugs this week 21:17:12 okay I'll look at that too, sheesh 21:17:17 not a huge rush on those 21:17:24 they are mostly plumbing 21:17:33 what else? 21:17:54 that's all i know about 21:18:02 cool 21:18:14 #topic open discussion 21:18:20 anything here? 21:18:24 yes! 21:18:28 dammit. 21:18:52 lol 21:18:53 at some point here i have to do recaps of the summit sessions i ran, were you going to do that for the cells v2 session? if not, i can 21:19:12 I wasn't, but I can if you want 21:19:22 up to you, i don't think there was a ton to recap, 21:19:29 compromise on passing hosts to cells for retries 21:19:31 only two topics really 21:19:32 yeah 21:19:38 and you add that auto-disable thing 21:20:24 anyway, I'll plan to do that on friday or something 21:20:26 sanity check, the retry behavior is no different than in cells v1 right? retries stay in a cell I thought 21:20:37 melwitt: definitely 21:20:47 k. just making sure since I told someone that today 21:21:01 cool 21:21:02 heh 21:21:04 and if you don't have separated conductors then you can reschedule across cells in v2 21:21:13 we just need to work when we don't 21:21:30 okay, anything else? 21:21:49 nope 21:21:54 we're about seven minutes over time, so if there's nothing else... 21:22:24 #endmeeting with extreme prejudice