17:00:29 <alaski> #startmeeting nova_cells
17:00:30 <openstack> Meeting started Wed Jun 3 17:00:29 2015 UTC and is due to finish in 60 minutes. The chair is alaski. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:00:32 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:00:34 <openstack> The meeting name has been set to 'nova_cells'
17:00:46 <alaski> Anyone around for the cells meeting?
17:00:49 <bauzas> \o
17:00:51 <melwitt> o/
17:00:54 <belmoreira> o/
17:00:54 <dansmith> o/
17:00:55 <vineetmenon_> o/
17:01:13 <alaski> cool
17:01:20 <alaski> #topic Tempest testing
17:01:33 <alaski> I'm going to put melwitt on the spot and defer to her for updates here
17:01:40 <melwitt> :)
17:02:05 <melwitt> okay, the cells job seems in the same state as it has been, link here http://goo.gl/b7R8wq
17:02:50 <melwitt> so it's good there's nothing new happening. I went through the failures from yesterday and they all have to do with tests that delete a server, and I have identified a race where instances can be "undeleted" causing some tests to fail or time out
17:03:20 <melwitt> I have this patch up for the fix https://review.openstack.org/#/c/176518/ that fixes it sort of indirectly, details in the bug it closes
17:04:11 <alaski> nice, I will review that again in a bit
17:04:11 <bauzas> melwitt: seeing a -W, can I review it ?
17:04:20 <melwitt> I will remove the -W today, I was running a test all night that runs one of the failure-prone tests in a loop to make sure it doesn't make any races with it worse
17:04:27 <bauzas> melwitt: ok
17:04:54 <melwitt> I finally got it to where it could run all night without failing, so I'll put up a couple of more patches for those races that came out after undelete became impossible
17:05:00 <bauzas> alaski: which target do you try to reach for having the cells job voting ?
17:05:28 <melwitt> the UnexpectedTaskStateError is I think unrelated, but is a trace we need to clean up
17:05:39 <bauzas> alaski: considering another observation period once melwitt's patch lands, I guess ?
17:05:54 <alaski> melwitt: okay, so there's still a bit more work
17:06:14 <alaski> bauzas: that's a good question
17:06:18 <melwitt> there is also this bug that I *think* doesn't fail tests (I'm not sure) but is a clean up https://bugs.launchpad.net/nova/+bug/1448302
17:06:18 <openstack> Launchpad bug 1448302 in OpenStack Compute (nova) "cells: intermittent KeyError when deleting instance metadata" [Low,Confirmed] - Assigned to melanie witt (melwitt)
17:06:38 <bauzas> alaski: I just wonder about any benefits that we could have
17:06:47 <alaski> bauzas: I would like to see us close to the overall failure rate for non cells, but I don't know that that is right now
17:07:01 <bauzas> alaski: agreed, melwitt's patch is important
17:07:04 <melwitt> it's another race with update/delete server metadata I think but I didn't spend much time on it
17:07:14 <bauzas> alaski: my only wonder is how we can prevent any tunnel effect
17:07:19 <alaski> melwitt: would it help to have someone else look into that?
17:07:20 <bauzas> alaski: ie. fixing all the cells bugs
17:07:48 <melwitt> alaski: yeah, I think bauzas said he might want to look at it, so feel free
17:07:51 <bauzas> alaski: melwitt: my brain certainly needs some recreation, so I would be pleased to help
17:07:54 <belmoreira> bauzas: sorry, tunnel effect?
17:07:57 <alaski> great
17:08:09 <bauzas> melwitt:
17:08:20 <bauzas> cool
17:09:02 <bauzas> belmoreira: sorry, something probably french
17:09:07 <melwitt> so, I guess what I'm thinking is watch the job after the patches land. that I expect to see very few failures after that. but you never know
17:09:15 <bauzas> belmoreira: the idea of trying to boil the ocean I guess
17:09:27 <alaski> realistically I think if we fix the list servers negative test failure and the random test failures from unexpected task state errors I think we'll be where we need to be
17:10:16 <melwitt> yeah, okay. I'll have all the patches ready today, now that I'm done testing it
17:10:26 <bauzas> cool
17:10:43 <melwitt> I think that's all I have about the tempest testing
17:10:52 <alaski> awesome, great work
17:10:56 <bauzas> melwitt: so you don't expect https://bugs.launchpad.net/nova/+bug/1448302 to break tests ?
17:10:56 <openstack> Launchpad bug 1448302 in OpenStack Compute (nova) "cells: intermittent KeyError when deleting instance metadata" [Low,Confirmed] - Assigned to melanie witt (melwitt)
17:11:01 <alaski> thanks for the update
17:11:08 <bauzas> ok, let's move on
17:11:21 <melwitt> bauzas: honestly I don't know, I didn't spend enough time to be sure. often it doesn't fail them
17:11:38 <alaski> #topic Specs
17:12:04 <alaski> The host mapping spec merged during the summit
17:12:21 <alaski> and https://review.openstack.org/#/c/141486/ is ready for review
17:12:41 <alaski> and the followup to that one, though I suspect it will need some work
17:13:19 <alaski> I have a few more to write, but have still been organizing a few things after the break last week
17:13:43 <alaski> but there is some work available if anyone is interested, in implementing the host mapping spec
17:14:22 <alaski> and that's all I have on that, so if no comments we can move on
17:14:34 <belmoreira> alaski: we are looking into a spec to have flavor tables in the api DB
17:14:45 <alaski> belmoreira: ahh, excellenbt
17:14:49 <alaski> bah, excellent
17:15:45 <alaski> please ping me when that's up, or add me to the review
17:15:53 <bauzas> alaski: that spec seems pretty trivial to be done, probably we should propose some new contributors for it ?
17:16:01 <alaski> and add it to the agenda for mention, or I can do that
17:16:11 <bauzas> alaski: thinking about the "mentoring" stuff you know
17:16:26 <alaski> bauzas: I would love for somebody to pick up that work
17:16:38 <bauzas> alaski: we know we have some quickwins that new contributors could help, and that one seems also a good opportunity
17:16:45 <belmoreira> alaski, bauzas: we can do it
17:16:50 <bauzas> belmoreira: \o/
17:16:58 <vineetmenon> alaski, bauza, belmoreira one thing to be confirmed about that spec..
17:17:07 <bauzas> belmoreira: I can help you on the implementation but I don't have the bandwidth for implementing it
17:17:18 <vineetmenon> we assumed that flavor will reside in cell.. i hope it's okay
17:17:25 <bauzas> belmoreira: so you can hassle me on that one
17:17:47 <bauzas> belmoreira: to be clear, I was talking about cell-host-mapping :)
17:18:07 <alaski> vineetmenon: what do you mean? I thought we were discussing moving flavor to the api db
17:18:11 <alaski> so out of the cell
17:18:37 <vineetmenon> alaski, ooh ya.. sorry. flavors in api
17:19:13 <alaski> vineetmenon: yes, that's okay. I think there's agreement that that's where they should be
17:19:20 <bauzas> +1
17:19:35 <belmoreira> does anyone have the spec number... I can't find it now...
17:19:39 <bauzas> belmoreira: still happy with https://review.openstack.org/#/c/182715/2/specs/liberty/approved/cells-host-mapping.rst,cm ?
17:19:48 <alaski> https://review.openstack.org/#/c/182715/
17:20:13 <belmoreira> alaski: thanks
17:20:23 <bauzas> eh :)
17:21:15 <bauzas> belmoreira: so, there is an API model, a migration script to write and a NovaObject to provide
17:21:24 <alaski> belmoreira: to make sure we're on the same page, you're saying that you can help with that spec?
17:21:32 <vineetmenon> +1
17:21:38 <belmoreira> alaski: yes
17:21:42 <alaski> great, thanks
17:21:47 <bauzas> belmoreira: awesome
17:22:03 <alaski> anything else on specs?
17:22:33 <alaski> #topic Open Discussion
17:22:43 <alaski> anything else to discuss generally?
17:23:24 <belmoreira> what is blocking https://review.openstack.org/#/c/136490/ ?
17:24:07 <alaski> belmoreira: I just need to go through it again and refresh it
17:25:10 <alaski> my goal is that by the next meeting I will have refreshed all specs, and written more or at least have a list of specs I think need to be proposed
17:26:00 <alaski> anything else?
17:26:40 <alaski> thanks everyone!
17:26:45 <alaski> #endmeeting