21:01:24 #startmeeting nova_cells 21:01:24 Meeting started Wed Apr 20 21:01:24 2016 UTC and is due to finish in 60 minutes. The chair is alaski. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:01:27 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:01:29 The meeting name has been set to 'nova_cells' 21:01:29 anyone around today? 21:01:36 o/ 21:01:38 sort of 21:01:38 o/ 21:01:51 o/ 21:01:54 \o 21:02:05 #topic testing 21:02:36 haven't heard of any gate failures, so things are sailing smoothly these days 21:02:45 ccarmack1 had a thing to discuss though 21:02:50 I put some thoughts about testing in https://etherpad.openstack.org/p/nova-cells-testing 21:03:10 please add any comments 21:03:21 at the bottom it looks like 21:03:24 ah, my old friend that etherpad 21:03:29 yea, sorry at the bottom 21:03:47 does anyone object to me removing the big glob in the middle? 21:04:00 I was thinking we need something that approximates a real multi node cells env 21:04:22 alaski: not me. I totally forgot about this etherpad 21:04:35 melwitt: I had repressed it as well :) 21:04:40 haha :) 21:04:54 ccarmack1: yes, that would be ideal 21:04:55 amazing what google will find 21:05:07 how found that etherpad 21:05:16 impressive 21:05:40 orly? 21:05:57 so the question I have for multi-node is where will cell0 live 21:06:11 I would probably put it on the api node 21:06:25 that's what I was thinking. that would be like real life anyway, right? 21:06:33 I was thinking of a devstack with api + cell1 21:06:33 for a lot of people, yeah 21:06:42 and cell0 21:07:15 on one node? 21:07:15 for multi-node I thought we would have 2 cells, no? 21:07:31 api + cell1 + cell2 21:07:31 I was concerned about a 3 host multinode 21:07:40 but yes that would be better 21:07:56 does the gate support 3 hosts? 21:08:02 why not 2 nodes with (api, cell0, cell1) and (cell2) ? 21:08:11 thats where I left it 21:08:17 like an AIO-+1 21:08:22 ccarmack1: not sure. that's probably a sdague or -infra question 21:08:42 I was actually thinking (api, cell0), then cell1 separate 21:08:55 and eventually another node for cell2 21:09:01 but we're not quite up to a cell2 yet 21:09:10 thats what I have in the doc.. but I can ask infra about having 3 hosts in a multi-node test 21:09:25 not quite up to cell2 ? 21:09:40 yeah, I was thinking future, I guess I assumed multi-node means multi-cell 21:10:05 ccarmack1: we don't have all the pieces in place to make that work yet 21:10:27 but melwitt has a patch up right now with a big chunk of it 21:10:42 I think this is planning for testing when everything is working 21:11:02 okay 21:11:27 melwitt's patch in the rpc switching? 21:11:29 okay, so are we thinking we have one multi-node job now that has api + cell0 + cell1 and then in the future another multi-node job that has 2 cells or just amend the existing job to have 2 cells? 21:11:30 yeah 21:11:55 melwitt: I was thinking the latter 21:12:12 and each cell has 1 host? 21:12:25 aka node 21:12:32 yes, if infra can provide that 21:12:39 ok, I'll ask 21:12:54 we could do (api, cell0, cell1) and (cell2) as a reasonable test bed as well I think 21:13:16 that's 2 hosts, if infra can't do 3 21:13:17 I'm just thinking of people using All-in-One 21:13:29 ccarmack1: yeah. a fallback 21:13:30 given that, adding a new cell 21:13:49 wait, are we waiting until multi-cell to do multi-node or are we thinking of multi-node now with everything talking to the same db/mq? 21:13:58 hence 2 nodes, first with (api, cell0, cell1) and the other with only a cell 21:14:15 2 MQs 21:14:22 melwitt: and different DBs 21:14:32 but within the same infra node 21:14:57 melwitt: I'm thinking of multi-cell, when its ready 21:15:12 well, I was thinking of multi-node with one cell, and then expand to two when ready 21:15:35 but if everyone wants to wait we can do that 21:15:49 alaski: would that env test anything specific to cells? 21:16:18 it would be a non-AIO 21:16:30 it wouldn't 21:16:42 so maybe it's not important now 21:17:27 okay, so multi-node will be multi-cell 21:17:37 and it may be a while until that works 21:17:53 I was basically thinking that a cell DB could be close to the compute 21:17:57 I was thinking waiting for multi-cell and then have a config as we are saying — node1 has api, cell0, cell2 .. node2 has cell1 21:17:58 it would be figuring out the legwork now instead of later I guess. or rather setting it up now once to be expanded instead of potentially having to redo a lot to expand? 21:17:58 but yeah... 21:18:13 melwitt: yeah, that was my thinking 21:18:32 melwitt: alaski: well, that's just config stuff right? 21:18:53 meaning that we cut our deployment with the manage calls we want 21:18:59 ccarmack1: that works, but it won't pass anything now so the question is are you planning to work on it now, and should it run on all reviews? 21:19:03 not sure changing that could be difficult 21:20:05 alaski: I was planning to do some leg work to set up the jobs, etc 21:20:31 ccarmack1: okay. that would be great, but it probably needs to start in the experimental queue 21:20:39 yup 21:21:04 bauzas: you're thinking of doing two cells in AIO? 21:21:10 bauzas: I was thinking of having each cell's db on the same host has compute 21:21:17 alaski: nope 21:21:24 alaski: one cell in AIO 21:21:41 alaski: and one other cell in a separate node with another compute 21:21:52 bauzas: gotcha 21:21:53 alaski: but that means a cell DB could be close to the compute node 21:22:21 yeah, that's fine though 21:22:23 which could be a bit sad for performance... but we use AIO too :) 21:22:27 yep 21:22:40 so, maybe later a large-ops one 21:23:35 yeah 21:23:41 okay, I think we're all on the same page now 21:23:55 could you all comment in the etherpad 21:24:04 so I can chew on this later? 21:24:14 also, I'm not sure about the grenade stuff 21:24:19 #action all comment on https://etherpad.openstack.org/p/nova-cells-testing 21:24:52 alaski: is multi-tier cells in the future? 21:25:03 erh 21:25:05 or not for newton? 21:25:08 I think the only upgrade thing different between cells and non for grenade is testing the update from non cells to cells 21:25:13 ccarmack1: not planned at all 21:25:17 I think we agreed to only have one parent/child 21:25:42 horizontally, not vertically 21:25:47 alaski: and a cellsv1 to cellsv2 upgrade? 21:25:55 yeah, multi-tier would be cellsv3, or v2.5 or something. not being considered for this effort 21:26:02 ok 21:26:15 multi-tier means nested, right? 21:26:17 ccarmack1: yes, and we don't really have c1->v2 sorted yet 21:26:25 yes melwitt 21:26:28 ccarmack1: we kind of need multi-cell support first 21:26:42 okay. yeah, nested cells hasn't been a thing so far and maybe never will 21:26:51 ok 21:27:31 any more on testing? 21:27:38 I'm done 21:27:45 cool 21:27:51 #topic Open Reviews 21:27:52 ccarmack1: cellv1 to cellv2 path is clear now 21:27:59 https://etherpad.openstack.org/p/newton-nova-priorities-tracking 21:28:07 bauza: yes I saw your doc update 21:28:17 I think you did that… 21:28:33 multi cellsv1 can be only one cellv2 by default 21:28:40 for reviews just check out the etherpad 21:28:45 #topic Open Discussion 21:29:24 yeah, we have a plan for v1->v2, but without multicell support no one is realistically going to upgrade 21:29:27 and we can't fully test it 21:29:59 we can't it fully be tested? 21:30:13 why,,, 21:30:35 alaski: yup, agreed 21:30:38 just that we can't migrate two v1 cells to v2, which is really what we want to check 21:31:43 I was wondering how are we doing on the 'boot an instance' with a single cell v2. are all of the patches up for that or is there anything else still needed? 21:31:59 there's a few things still needed 21:32:18 I have a local WIP which moves instance creation to the conductor, that's the last big piece 21:32:40 there's also handling listing instances from a BuildRequest 21:32:46 which has no work on it yet 21:32:49 okay, cool. just wanted to make sure is it a matter of just reviews or if there's more to come 21:32:59 mostly those two things I think 21:33:13 aren't there still API DB migrations needed too? 21:33:47 I consider those more of the multi-cell story 21:33:59 but yes, plenty of those still needed 21:34:23 yeah I figured those weren't done yet :) 21:34:40 heh, not even close 21:34:44 what else for multi-cell? API DB migrations, is there anything more with the scheduling interaction or is that covered under the single cell work? 21:34:51 doffm did get some specs up though 21:35:15 Yep and they've come up on my objects reviews dashboard, but I've been ignoring them 21:35:21 melwitt: that should be covered under the single cell work, unless there's something being overlooked 21:35:28 alaski: great 21:36:02 alaski: well, I suppose it will be a matter of putting target_cell in the right place for the switching too 21:36:12 right 21:36:15 alaski: in my local testing I put it in conductor 21:36:49 using switching for db and rpc and db migrations. I think that's mostly what's left after current WIP is done 21:36:57 cool 21:37:19 and testing, and upgrade path for v1 21:37:37 *nod* 21:38:06 it probably goes without saying, but I will not be running this meeting next week 21:38:19 if you want to talk cells come find me in Austin :) 21:38:56 yeah, I assumed no meetings next week 21:39:31 also, hopefully everyone knows by now that I am not a locquacious person, so please come ready to help with the cells summit sessions 21:39:57 I'm not either but I will try :) 21:40:09 * rlrossit googles locquacious 21:40:24 dangit, spelled it wrong 21:40:41 melwitt: between the two of us we're almost a talkative person 21:40:52 alaski: lol, so true 21:41:12 that's all I have for this week 21:41:15 anyone have a topic? 21:41:24 nooope 21:41:27 see y'all in austin 21:41:32 getting my accent ready 21:41:41 great! see y'all there 21:41:42 great! 21:41:43 :) 21:41:45 o/ 21:41:47 #endmeeting