17:00:29 <alaski> #startmeeting nova_cells
17:00:30 <openstack> Meeting started Wed Jun  3 17:00:29 2015 UTC and is due to finish in 60 minutes.  The chair is alaski. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:00:32 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:00:34 <openstack> The meeting name has been set to 'nova_cells'
17:00:46 <alaski> Anyone around for the cells meeting?
17:00:49 <bauzas> \o
17:00:51 <melwitt> o/
17:00:54 <belmoreira> o/
17:00:54 <dansmith> o/
17:00:55 <vineetmenon_> o/
17:01:13 <alaski> cool
17:01:20 <alaski> #topic Tempest testing
17:01:33 <alaski> I'm going to put melwitt on the spot and defer to her for updates here
17:01:40 <melwitt> :)
17:02:05 <melwitt> okay, the cells job seems in the same state as it has been, link here http://goo.gl/b7R8wq
17:02:50 <melwitt> so it's good there's nothing new happening. I went through the failures from yesterday and they all have to do with tests that delete a server, and I have identified a race where instances can be "undeleted" causing some tests to fail or time out
17:03:20 <melwitt> I have this patch up for the fix https://review.openstack.org/#/c/176518/ that fixes it sort of indirectly, details in the bug it closes
17:04:11 <alaski> nice, I will review that again in a bit
17:04:11 <bauzas> melwitt: seeing a -W, can I review it ?
17:04:20 <melwitt> I will remove the -W today, I was running a test all night that runs one of the failure-prone tests in a loop to make sure the patch doesn't make any of the races worse
17:04:27 <bauzas> melwitt: ok
17:04:54 <melwitt> I finally got it to where it could run all night without failing, so I'll put up a couple more patches for the races that came out after undelete became impossible
17:05:00 <bauzas> alaski: what target are you trying to reach before making the cells job voting?
17:05:28 <melwitt> the UnexpectedTaskStateError is I think unrelated, but is a trace we need to clean up
17:05:39 <bauzas> alaski: considering another observation period once melwitt's patch lands, I guess ?
17:05:54 <alaski> melwitt: okay, so there's still a bit more work
17:06:14 <alaski> bauzas: that's a good question
17:06:18 <melwitt> there is also this bug that I *think* doesn't fail tests (I'm not sure) but is a clean up https://bugs.launchpad.net/nova/+bug/1448302
17:06:18 <openstack> Launchpad bug 1448302 in OpenStack Compute (nova) "cells: intermittent KeyError when deleting instance metadata" [Low,Confirmed] - Assigned to melanie witt (melwitt)
17:06:38 <bauzas> alaski: I just wonder about any benefits that we could have
17:06:47 <alaski> bauzas: I would like to see us close to the overall failure rate for non cells, but I don't know what that is right now
17:07:01 <bauzas> alaski: agreed, melwitt's patch is important
17:07:04 <melwitt> it's another race with update/delete server metadata I think but I didn't spend much time on it
17:07:14 <bauzas> alaski: my only concern is how we can prevent any tunnel effect
17:07:19 <alaski> melwitt: would it help to have someone else look into that?
17:07:20 <bauzas> alaski: ie. fixing all the cells bugs
17:07:48 <melwitt> alaski: yeah, I think bauzas said he might want to look at it, so feel free
17:07:51 <bauzas> alaski: melwitt: my brain certainly needs some recreation, so I would be pleased to help
17:07:54 <belmoreira> bauzas: sorry, tunnel effect?
17:07:57 <alaski> great
17:08:09 <bauzas> melwitt: cool
17:09:02 <bauzas> belmoreira: sorry, something probably french
17:09:07 <melwitt> so, I guess what I'm thinking is we watch the job after the patches land. I expect to see very few failures after that, but you never know
17:09:15 <bauzas> belmoreira: the idea of trying to boil the ocean I guess
17:09:27 <alaski> realistically I think if we fix the list servers negative test failure and the random test failures from unexpected task state errors, we'll be where we need to be
17:10:16 <melwitt> yeah, okay. I'll have all the patches ready today, now that I'm done testing it
17:10:26 <bauzas> cool
17:10:43 <melwitt> I think that's all I have about the tempest testing
17:10:52 <alaski> awesome, great work
17:10:56 <bauzas> melwitt: so you don't expect https://bugs.launchpad.net/nova/+bug/1448302 to break tests ?
17:10:56 <openstack> Launchpad bug 1448302 in OpenStack Compute (nova) "cells: intermittent KeyError when deleting instance metadata" [Low,Confirmed] - Assigned to melanie witt (melwitt)
17:11:01 <alaski> thanks for the update
17:11:08 <bauzas> ok, let's move on
17:11:21 <melwitt> bauzas: honestly I don't know, I didn't spend enough time to be sure. often it doesn't fail them
17:11:38 <alaski> #topic Specs
17:12:04 <alaski> The host mapping spec merged during the summit
17:12:21 <alaski> and https://review.openstack.org/#/c/141486/ is ready for review
17:12:41 <alaski> and the followup to that one, though I suspect it will need some work
17:13:19 <alaski> I have a few more to write, but have still been organizing a few things after the break last week
17:13:43 <alaski> but there is some work available if anyone is interested, in implementing the host mapping spec
17:14:22 <alaski> and that's all I have on that, so if no comments we can move on
17:14:34 <belmoreira> alaski: we are looking into a spec to have flavor tables in the api DB
17:14:45 <alaski> belmoreira: ahh, excellent
17:15:45 <alaski> please ping me when that's up, or add me to the review
17:15:53 <bauzas> alaski: that spec seems pretty trivial to implement, maybe we should propose it to some new contributors?
17:16:01 <alaski> and add it to the agenda for mention, or I can do that
17:16:11 <bauzas> alaski: thinking about the "mentoring" stuff you know
17:16:26 <alaski> bauzas: I would love for somebody to pick up that work
17:16:38 <bauzas> alaski: we know we have some quick wins that new contributors could help with, and that one also seems like a good opportunity
17:16:45 <belmoreira> alaski, bauzas: we can do it
17:16:50 <bauzas> belmoreira: \o/
17:16:58 <vineetmenon> alaski, bauzas, belmoreira: one thing to be confirmed about that spec..
17:17:07 <bauzas> belmoreira: I can help you on the implementation but I don't have the bandwidth for implementing it
17:17:18 <vineetmenon> we assumed that flavor will reside in cell.. i hope it's okay
17:17:25 <bauzas> belmoreira: so you can hassle me on that one
17:17:47 <bauzas> belmoreira: to be clear, I was talking about cell-host-mapping :)
17:18:07 <alaski> vineetmenon: what do you mean?  I thought we were discussing moving flavor to the api db
17:18:11 <alaski> so out of the cell
17:18:37 <vineetmenon> alaski, ooh ya.. sorry. flavors in api
17:19:13 <alaski> vineetmenon: yes, that's okay.  I think there's agreement that that's where they should be
17:19:20 <bauzas> +1
17:19:35 <belmoreira> does anyone have the spec number... I can't find it now...
17:19:39 <bauzas> belmoreira: still happy with https://review.openstack.org/#/c/182715/2/specs/liberty/approved/cells-host-mapping.rst,cm ?
17:19:48 <alaski> https://review.openstack.org/#/c/182715/
17:20:13 <belmoreira> alaski: thanks
17:20:23 <bauzas> eh :)
17:21:15 <bauzas> belmoreira: so, there is an API model, a migration script to write and a NovaObject to provide
17:21:24 <alaski> belmoreira: to make sure we're on the same page, you're saying that you can help with that spec?
17:21:32 <vineetmenon> +1
17:21:38 <belmoreira> alaski: yes
17:21:42 <alaski> great, thanks
17:21:47 <bauzas> belmoreira: awesome
17:22:03 <alaski> anything else on specs?
17:22:33 <alaski> #topic Open Discussion
17:22:43 <alaski> anything else to discuss generally?
17:23:24 <belmoreira> what is blocking https://review.openstack.org/#/c/136490/ ?
17:24:07 <alaski> belmoreira: I just need to go through it again and refresh it
17:25:10 <alaski> my goal is that by the next meeting I will have refreshed all specs, and written more or at least have a list of specs I think need to be proposed
17:26:00 <alaski> anything else?
17:26:40 <alaski> thanks everyone!
17:26:45 <alaski> #endmeeting