14:30:23 #startmeeting neutron nova-network parity 14:30:24 Meeting started Wed Jul 23 14:30:23 2014 UTC and is due to finish in 60 minutes. The chair is mestery. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:30:25 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:30:27 The meeting name has been set to 'neutron_nova_network_parity' 14:30:35 mestery: Error: Can't start another meeting, one is in progress. Use #endmeeting first. 14:30:41 Whoops 14:30:45 #link https://wiki.openstack.org/wiki/Meetings/NeutronNovaNetworkParity Agenda 14:31:00 hi everyone 14:31:09 * mestery waits a few minutes to let anyone else filter in. 14:31:52 #topic Gap Analysis Plan 14:31:57 #link https://wiki.openstack.org/wiki/Governance/TechnicalCommittee/Neutron_Gap_Coverage 14:32:13 So, maybe to start, we could quickly go over where we're at for each item in this plan? 14:32:27 want me to run through it? 14:32:31 markmcclain: Please do :) 14:32:42 #info Gap 0 is complete 14:33:51 we merged a healing migration that updates the various schema branches that had formed to the canonical version that includes all models 14:34:09 That was awesome work by the team working on gap 0! 14:34:34 yeah I was super happy to see everyone working to develop a solution for it 14:35:04 Gap 1 Tempest testing is nearly done 14:35:18 the lone holdout is enabling the full tempest job for voting 14:35:33 salv-orlando is working on that bit 14:35:47 Gap 2 is resumption of Grenade testing 14:36:15 #link http://lists.openstack.org/pipermail/openstack-dev/2014-July/040973.html Full Job Email Update 14:37:04 mestery: thanks for the link 14:37:19 markmcclain: np 14:37:46 for Gap 2 we have to make some changes to the way devstack sets up Neutron 14:38:20 basically we need to stop misusing enable_service 14:38:43 I'm working on those and should have them proposed shortly 14:38:57 Excellent! 14:39:04 For Gap 3 is dependent on Gap 1 and 2 14:39:34 #link https://review.openstack.org/#/c/105785/ Neutron as default in devstack 14:40:38 cool.. just need to track down the failures 14:40:51 Gap 4 is the where we are hurting the most 14:41:08 the spec within Nova did not get approved by the deadline 14:41:35 oops.. Gap 5 spec did not get approved 14:41:36 Wait, you meant gap 6 14:41:43 4 is missing API calls :) 14:41:46 * markmcclain needs more coffee 14:42:10 * mestery hands markmcclain coffee with kahlua 14:42:26 obondarev - you want to update on your progress? 14:42:31 Gap 4 API calls is complete 14:42:40 Gap 5: is DVR work which is on track 14:42:48 marun: sure I will 14:43:16 obondarev: This is for Gap 6 now (nova-net to neutron migration) 14:43:33 mestery: ok 14:43:47 so according to nova team feedback on the spec the overall design of the neutron migration was slightly changed 14:44:11 now instead of migrating to neutron within one compute host the idea is to migrate between hosts 14:44:27 and make neutron migration as part of existing live-migration mechanism 14:44:33 #link https://review.openstack.org/#/c/101921/ Neutron migration specification 14:44:49 which is reasonable as it is the usual way to perfom big upgrades 14:44:53 obondarev: ++ 14:44:54 I mean host-by-host 14:45:04 mestery: thanks for the link 14:45:24 currently I'm working on agreeng the design with the nova team and implementing POC in parallel 14:45:31 thanks to dansmith for his reviews and suggestions! 14:46:24 there are some difficulties with POC as now I'm not even able to perform an original live migration on a multinode devstack 14:46:27 I notice that daniel berrange has provided contradictory advice as to the chosen strategy on the most recent patch :/ 14:46:46 Hopefully we can hammer out the contradictory advise at the mid-cycle meetup next week. 14:46:52 marun: right 14:46:58 marun: ++, you and markmcclain will be busy with that next week 14:47:17 well, 14:47:23 I followed the guide from official openstack docs but still facing some issues with live-migration 14:47:33 he said that live migration can't always work, which is what I said regarding make sure that cold migration works as well 14:47:48 obondarev: as per dan's comment, maybe the starting point is more properly cold migration, since it doesn't have hypervisor dependencies? 14:47:54 dansmith: ah, gotcha. 14:48:20 dansmith: though there was a comment from him that talked about in-place network switch 14:48:25 is cold migration is that one that is not a "true" live migration? 14:48:32 dansmith: on line 119 14:48:33 I'm concerned that cold migration won't satisfy the downtime requirements the TC put forth though, just wanted to through that out there. 14:48:47 mestery: where is this requirement? 14:49:00 marun: Documented in the TC meeting minutes from last April I'm afraid :( 14:49:01 marun: yeah, I wonder if he reviewed the previous version of this and saw what that implied 14:49:03 markmcclain: Thoughts on this? 14:49:16 mestery: I think we should pull it out of the minutes and formalize it in the gap coverage page 14:49:24 marun: ++ I'll take na action to do that. 14:49:25 mestery: the lack of visibility is a problem both on the nova and neutron sides 14:49:25 mestery: I would expect that "zero downtime for any and all VM types" is probably not a reasonable requirement anyway 14:49:36 #action mestery to scour meeting minutes around downtime requirements for migration and add to coverage wiki 14:49:41 marun: agreed 14:49:57 dansmith: I agree, just throwing it out there from my memory of the TC meeting. 14:49:57 right… so originally we specified that network connections can/will drop 14:51:00 but committed to keeping the VMs running if it is impossible than we can always go back and say that keeping them running will be problematic and why 14:51:30 markmcclain: Makes sense to me. 14:52:15 keeping them running and doing it in-place would be awesome, but I'm not sure it's worth what we'll have to do in order to support it 14:53:24 dansmith: by cold migration do you mean "nova migrate..." one? 14:53:26 agreed… there are some that don't want to reboot everything to upgrade 14:53:34 obondarev: yes 14:54:05 I just think we have to document everything properly 14:54:08 there is some question, though, of just how valuable a migration mechanism is - who the target audience is, and what kind of use cases they have for migration 14:54:30 we need something, but it's not clear what without more involvement from deployers who want to use it 14:54:41 is it worth raising the question on the operators list? 14:54:56 right, I think we're missing some definition about these details and requirements 14:55:03 so part of the issue is that if we EOL nova-net then they operators have to hae something 14:55:41 markmcclain: sure. I don't think that precludes doing some research to see what 'something' should e 14:55:41 be 14:56:05 and maybe that should be driven from the TC side, given that its them that are setting the requirements 14:56:49 I'll work on narrowing the scope a bit 14:57:29 from the TC side you mean? 14:58:07 dansmith mentoined that there could be folks from metacloud at the mid-cycle next week (if we're lucky) 14:58:08 marun: yes 14:58:23 even if not, it might be worth engaging with them since they're a pretty heavy nova network user 14:58:30 yeah, 14:58:45 knowing what they'd expect a migration to have to look like before they'd be willing to take it would be good data 14:59:21 good point 14:59:26 markmcclain: so, we can leave it to you to drive narrowing the requirements from the TC side 14:59:45 mestery: I need to quit, I have to be in the DVR status meeting, if you need any info please ping me. 14:59:48 markmcclain: and I'd hope you'd raise the point of engaging with operators so that the effort is able to be grounded in actual requirements 14:59:54 Swami: will do, thanks! 15:00:10 marun: yeah.. I'll try to make items more specific 15:00:18 re cold vs live migraiton 15:00:42 markmcclain: also, it'd be good to know if the TC expects every possible vertex on the matrix of configurations to be migratable, without downtime, etc 15:01:25 markmcclain: because it could be that just providing a nova-manage command to tweak the database while everything control-plane-wise is shut down would be sufficient for folks that can't do migrations 15:01:41 dansmith: we didn't delve into that level of specifics at the meeting, there is a definite gray zone here. 15:01:48 we know there will be configurations that cannot be universally upgraded, so part of the process is documenting the ones that cannot be done 15:01:56 markmcclain: and then let nova-compute startup migrate the VIFs on the next startup 15:02:07 and then give the operator options to manually resolve 15:02:18 markmcclain: okay, well, we should be able to make serious progress on this next week, in terms of ideas and feasibility I think 15:02:51 dansmith: agreed 15:04:23 also Nachi Ueno has proposed another possible way for neutron migration recently 15:04:42 #link https://docs.google.com/presentation/d/12w28HhNpLltSpA6pJvKWBiqeEusaI-NM-OZo8HNyH2w/edit#slide=id.p 15:04:52 hi 15:05:17 hi Nachi 15:05:27 it's still in idea phase, but I think using nova-network manager code is also simple way 15:05:35 I'm not sure of the value of the proposed approach, since it still requires migrating responsibility between nova network and neutron 15:05:44 The idea looks nice as it requires minimal nova-side changes and seems can be implemented fairly quickly 15:05:47 what does this intermediate step buy us? 15:05:48 I think we can make no downtime 15:06:04 nati_ueno: this is more like what I was expecting us to have 15:06:05 but it is still not a true neutron migration I think 15:06:21 nati_ueno: changes to nova-network to bridge the gap until we could "go direct to neutron" after some amount of small transition 15:06:31 so current our approach is 100% neutron compat 50% nova compat. 15:06:42 I thinks we should start with 100% nova compat 50% neutron compat 15:06:50 then we can improve compatibility in neutron side 15:06:52 right 15:06:55 dansmith: ya 15:07:15 #link: https://review.openstack.org/#/q/status:open+project:openstack/neutron+branch:master+topic:bp/neutron-ovs-dvr,n,z 15:07:46 sorry wrong place. 15:09:01 so I believe north bound api and data plane downtime matter 15:09:52 nati_ueno: ah, this is why you were asking about the migrate_* methods :) 15:10:07 dansmith: yes. 15:10:44 _setup_network_on_host is missing in neutron side, but we can call it when we craete/delete port 15:11:04 I think this proposal is a nicer solution, but it's not clear to me what the implementation cost would be compared to the strategies already proposed. 15:11:06 we considered this approach earlier too… there are still some issues with how different elements that are currently shared between Nova and Neutron cooperate 15:11:32 and we'd need clarity from a user perspective to decide whether that cost was worth paying 15:11:52 (of course, if it's cheaper and simpler, it would be hard to argue against it) 15:11:55 markmcclain: what's issue did you have? 15:11:58 marun: ++ 15:12:14 iptables is one of them 15:12:39 markmcclain: For that part, we can use neutron side driver 15:12:49 markmcclain: nova-network has no security group code 15:13:06 Anyway, I agree with marun. I'll try to POC this. 15:13:19 may be, if it works, it can be one option, right? 15:13:40 it may take 1 week, or 3 month :) 15:13:41 it can definitely be an option 15:13:46 nati_ueno: I would recommend collaborating with obondarev to add details to the spec as a secondary step 15:14:03 marun: Yes, we should coorinate this as much as possible. 15:14:03 marun: sure 15:14:06 nati_ueno: we'll need comparison between proposed approaches if we're to make a decision 15:14:06 but we probably should to fail fast vs letting it linger 15:14:22 markmcclain++ 15:15:02 ya. anyway, this is still just an idea. Let me POC it 15:15:26 so team should go existing way 15:15:56 nati_ueno: Thanks! 15:16:16 #topic Open Discussion 15:16:21 That's all I had on the agenda for this week. 15:16:28 Going over the gaps and reporting progress. 15:16:33 Anything else from anyone? 15:17:03 that's all from me 15:17:31 OK, thanks everyone! 15:17:44 markmcclain and marun and dansmith: I hope you folks make some serious progress in person next week at the nova mid-cycle. 15:17:53 We'll have hte meeting next week as well. 15:17:56 Thanks everyone! 15:17:59 #endmeeting