15:00:24 #startmeeting neutron_dvr 15:00:25 Meeting started Wed Jul 6 15:00:24 2016 UTC and is due to finish in 60 minutes. The chair is haleyb. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:26 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:29 The meeting name has been set to 'neutron_dvr' 15:00:33 #chair Swami 15:00:37 Current chairs: Swami haleyb 15:01:38 #topic Announcements 15:02:09 there's finally a quorum - welcome back Swami 15:02:17 sure 15:02:58 let's get right to bugs 15:03:01 #topic Bugs 15:03:54 yes, there are couple of new bugs 15:04:18 #link https://bugs.launchpad.net/neutron/+bug/1597561 15:04:19 Launchpad bug 1597561 in neutron "L3 agent allows multiple gateway ports in fip namespace" [High,In progress] - Assigned to Brian Haley (brian-haley) 15:04:52 Carl actually did the work, I just edited the patch to fix a nit 15:05:07 https://review.openstack.org/#/c/335755/ 15:05:17 I did see it. 15:05:50 I was not sure, how this occurred. But the patch at least checks for the stale and removes it. Looks good. 15:05:59 and oleg just gave it a +A 15:06:36 The next in the list is. 15:06:40 #link https://bugs.launchpad.net/neutron/+bug/1599089 15:06:40 Launchpad bug 1599089 in neutron "DVR: floating ip stops working after reassignment" [High,In progress] - Assigned to Oleg Bondarev (obondarev) 15:06:53 There is also a patch for this bug pushed in by Oleg. 15:07:15 https://review.openstack.org/#/c/337591/ 15:07:27 Thanks for posting the patch. 15:08:03 I did see that you have already reviewed it. I will take a look at it again. 15:08:12 It should be good to go. 15:08:33 yes, that one looks good 15:08:42 The next in the list is 15:08:46 #link https://bugs.launchpad.net/neutron/+bug/1593354 15:08:46 Launchpad bug 1593354 in neutron "SNAT HA failed because of missing nat rule in snat namespace iptable" [Undecided,New] 15:09:11 This seems to be like a duplicate, but will check with adolfo, before we tag the priority on this bug. 15:11:06 assaf has posted a question in there to check if it is a duplicate of #link https://bugs.launchpad.net/neutron/+bug/1571113 15:11:06 Launchpad bug 1571113 in neutron "SNAT interface not created for dvrha in some scenarios" [Medium,Fix released] - Assigned to Adolfo Duarte (adolfo-duarte) 15:11:46 ok 15:11:50 adolfo-duarte has already addressed that bug, but will re-check with adolfo again and will triage the bug. 15:12:30 The next one is performance related to DVR and IPv6. 15:12:34 #link https://bugs.launchpad.net/neutron/+bug/1596473 15:12:34 Launchpad bug 1596473 in neutron "Packet loss with DVR and IPv6" [Undecided,New] 15:13:19 This need to be triaged. I am not sure we have seen such a degradation in performance. 15:14:07 right, looks like packets are going multiple places. i had asked about reproducing in a newer release but had no answer 15:14:25 haleyb: yes I saw your message. 15:15:03 So let us wait and see or check on the master branch and prioritize based on the findings. 15:15:20 IPv6 with DVR should only route externally via the CVR 15:15:57 haleyb: Is that what that was reported as performance degradation. 15:17:19 The next in the list is 15:17:21 i can't tell, but it seems like if they force a route via the CVR everything works 15:18:29 got disconnected. 15:18:38 The next bug is 15:18:54 #link https://bugs.launchpad.net/neutron/+bug/1599287 15:18:54 Launchpad bug 1599287 in neutron "Cleanup snat redirect rules when agent restarts after stale snat namespace is cleaned." [Undecided,In progress] - Assigned to Swaminathan Vasudevan (swaminathan-vasudevan) 15:20:10 https://review.openstack.org/337855 15:20:52 haleyb: I got some push back from assaf and oleg on this patch since it is too much to cache and use it later. 15:21:24 Let me see if there is a way to delete all the ip rule and default gateway without the snat_idx. 15:22:01 Swami__: yes, if you can somehow just cleanup all the rules it would be better, will be more work though 15:23:20 haleyb: I thought caching would help us, to re-instantiate if someone deletes the namespace and if we wanted to recreate those rules, but may be we can do a full-sync at that time to recreate the rules. 15:23:54 haleyb: This is again similar to the fip and its priority retained in the localfilesystem to handle reboots. 15:24:25 haleyb: But that might have a different use cases. 15:24:34 s/use cases/use case 15:25:07 yes, and i have a feeling if we proposed the fip priority code today it might get push-back. Best to try and sync if possible 15:25:27 haleyb: ok will try to see what can be done to it. 15:25:36 haleyb: one the same lines, #link https://review.openstack.org/#/c/326729/ 15:26:22 After removing the check for the namespace exists check in create, the jenkins is passing on this patch. What do you think. Is there any other work on this patch or it can merge. 15:27:05 Swami__: yes, i saw the update. I did just have a question on self.snat_ports usage since it's initialized differently 15:27:43 haleyb: I know I saw your comments, about the empty list and none used at different places. 15:27:58 haleyb: I think you lean towards the empty list rather than none. 15:28:17 So, should we make that fix in a different patch or add it to this patch. 15:28:39 Swami__: well, i think they should be the same, right? then maybe one of the changes isn't necessary? 15:29:23 haleyb: what do you mean one of the changes isn't necessary 15:29:56 So what you are saying is if we define the variable in dvr_local_router, we don't need to refine it in the dvr_edge_router 15:30:04 https://review.openstack.org/#/c/326729/18/neutron/agent/l3/dvr_local_router.py - you check self.snat_ports, but don't think you need to 15:30:06 s/refine/redefine 15:30:50 get_snat_port_for_internal_port() can take None or [] as an argument 15:31:49 I didn't know if it was related to one place being None and the other [] 15:32:00 so the best option is to change it to None. 15:32:25 I meant change it to None in dvr_local_router as well. 15:33:02 if you make it [] then we might not need another change in the dvr ha edge router code 15:33:24 because it does a 'for port in self.snat_ports' which will blow-up with None 15:33:48 haleyb: ok let me check that out and see what makes sense. 15:34:02 I know it's been a while since you made that change, so I was trying to figure out what error you saw 15:35:33 haleyb: Yes basically the only issue that I saw was unable to iterate on self.snat_ports, since it would be None. 15:35:59 That's all I had for bugs right now. 15:36:18 So perhaps it should be [], then there's just a question of should it be in the parent class, but we can discuss that in the review 15:36:33 haleyb: Ok, will address it in the review. 15:36:58 The other thing that is pending for a long time is #link https://review.openstack.org/#/c/275073/ 15:37:39 Swami__: did the tempest patch merge? 15:37:46 This is the nova patch and probably we should be pushing it before the nova mid cycle, I don't want this to caught up. 15:37:53 haleyb: no it did not merge. 15:38:40 haleyb: can you ping tempest core to take a look at that patch. 15:39:45 Swami__: i can once i find the patch 15:40:14 #link https://review.openstack.org/#/c/286855/ 15:40:19 tempest patch link above 15:40:38 thanks 15:41:56 Swami__: i saw you update the WIP patch for "fast exit", https://review.openstack.org/#/c/297468/ - is that ready for review? 15:42:05 haleyb: Once I finish of this patch, I might be focussing on the fast path exit. 15:42:15 great, thanks 15:42:21 Yes, one of the patch is ready, but after the rebase it is failing jenkins, I will fix it today. 15:42:39 The next one I will be working on is the static routes on the fip namespace. 15:42:55 I have a WIP patch, but need to revisit, since it was done long back. 15:43:48 Thats all I had on bugs to discuss 15:43:53 ok 15:44:15 haleyb: I had another question 15:44:27 sure 15:44:32 I had a couple of patches for addressing the floatingips for unbound ports. 15:44:59 that was an RFE, but yes, I remember 15:45:54 armax had a -2 on this because I was using a config option. I think I got rid of the config option. 15:46:00 at least i think it was an RFE 15:46:12 can you take a look at it. Meanwhile I will ping armax and see if he is ok to remove the -2 on it. 15:46:34 #link https://review.openstack.org/#/c/320669/ 15:46:39 yes, i'll look 15:46:51 #link https://review.openstack.org/#/c/323618/ 15:47:02 haleyb: yes that was an RFE. 15:47:33 Yes that's all I had for bugs. 15:48:32 ok, thanks 15:49:00 #topic Stable backports 15:49:52 Ihar sent a reminder to the list to not forget about pro-active backports 15:50:18 https://goo.gl/sx0KL5 has a link to the bugs, I've been working through them 15:50:24 haleyb: ok thanks 15:50:29 that's in addition to https://etherpad.openstack.org/p/stable-bug-candidates-from-master 15:51:03 we haven't merged a lot of DVR fixes, but there were some that needed a push along 15:51:20 haleyb: ok 15:51:32 #topic Open discussion 15:51:46 Any other topics to discuss? 15:51:54 I have not looked at the gate failures lately. 15:51:57 Is it stable. 15:53:08 it is much better. the multinode dvr job is still 2x in the check queue 15:53:37 http://grafana.openstack.org/dashboard/db/neutron-failure-rate 15:53:57 Yes we should shoot for the multinode stability and voting pretty quick 15:54:48 That's all I had for today. 15:55:24 any lurkers have comments? :) 15:56:29 thanks Swami__ 15:56:32 #endmeeting