14:08:46 <liuyulong> #startmeeting neutron_l3 14:08:47 <openstack> Meeting started Wed Jun 17 14:08:46 2020 UTC and is due to finish in 60 minutes. The chair is liuyulong. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:08:49 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:08:51 <openstack> The meeting name has been set to 'neutron_l3' 14:08:59 <liuyulong> Sorry, a bit late... 14:09:17 <liuyulong> slaweq, haleyb, ping 14:09:23 <slaweq> hi 14:09:30 <haleyb> hi 14:09:36 <liuyulong> hi 14:10:12 <liuyulong> Alright, let's start 14:10:13 <liuyulong> #topic Announcements 14:11:09 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015368.html 14:11:28 <liuyulong> This is the ptg summary from the Virtual PTG. 14:12:27 <liuyulong> Thanks slaweq for the detailed summary. 14:13:16 <liuyulong> #link http://kaplonski.pl/images/Virtual_PTG_2020/photo_3.png 14:13:56 <liuyulong> I saw you handsome guys. 14:14:24 <liuyulong> #link http://eavesdrop.openstack.org/meetings/networking/2020/networking.2020-06-16-14.00.log.html#l-13 14:15:38 <liuyulong> This is the announcements from the team meeting yesterday. 14:16:29 <liuyulong> We are in Victoria devloping cycle now, so each spec should be moved to Victoria folder. 14:16:54 <liuyulong> OK, no more from me now. 14:17:04 <slaweq> :) 14:17:25 <liuyulong> Neutron CI is down, any idea? 14:18:41 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883601 14:18:41 <openstack> Launchpad bug 1883601 in neutron "ovn based neutron gate jobs failing 100% of times" [Critical,In progress] - Assigned to Jakub Libosvar (libosvar) 14:19:14 <liuyulong> This is new bug, but seems the real problem is not fixed either. 14:20:11 <liuyulong> OK... 14:20:13 <liuyulong> #link https://review.opendev.org/#/c/735536/ 14:20:31 <liuyulong> This is the gatefix 14:20:53 <liuyulong> Next topic 14:20:56 <liuyulong> #topic Bugs 14:21:26 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015178.html 14:21:31 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015323.html 14:21:38 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015442.html 14:21:43 <liuyulong> We have a long list.... 14:24:45 <liuyulong> First one 14:24:48 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1880969 14:24:48 <openstack> Launchpad bug 1880969 in neutron "Creating FIP takes time" [Low,New] 14:25:23 <ralonsoh> IMO, the times spent by the server is ok 14:25:30 <ralonsoh> c#2 of this LP 14:25:41 <ralonsoh> (only the Neutron server times) 14:26:38 <liuyulong> ralonsoh, yes, agreed. The HTTP response time from the neutron server log should be considered first. 14:27:54 <liuyulong> "GET /v2.0/ports?network_id=55c74232-825a-4a4a-b53d-5b4b7aa4ad74&device_owner=network%3Adhcp HTTP/1.1" status: 200 len: 1272 time: 0.0676231 14:28:07 <liuyulong> A simple case from my deployment. 14:28:37 <liuyulong> A pattern for logstash should be useful. 14:28:56 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1880532 14:28:56 <openstack> Launchpad bug 1880532 in neutron "[RFE]L3 Router should support ECMP" [Wishlist,New] - Assigned to XiaoYu Zhu (honglan0914) 14:29:08 <liuyulong> I have reviewed the spec one time. 14:29:18 <liuyulong> #link https://review.opendev.org/#/c/729532/ 14:29:41 <slaweq> I have to review this spec too 14:31:24 <liuyulong> In general, the final use scenarios looks limited to the loadbalancer. The main point is not in the Neutron side. 14:31:44 <liuyulong> So let's continue the discussion on the gerrit. 14:31:59 <slaweq> yes, there are some suggestions that it can be done with existing neutron API IIRC 14:32:31 <ZhuJoseph> My current plan is to add a new function to extraroutedb.py to handle this requirement. 14:32:52 <liuyulong> Hi, you are here. 14:33:01 <liuyulong> "XiaoYu Zhu" it's you? 14:33:07 <ZhuJoseph> and use api like :/v2.0/routers/27757e09-fb6a-4196-957d-cdce604f087e/remove_ecmps 14:33:11 <ZhuJoseph> yes 14:33:20 <ZhuJoseph> I am 14:33:23 <liuyulong> Welcome 14:36:23 <liuyulong> ZhuJoseph, if there are some existing code or POC, you may submit it in parallel, that could also be useful for the upstream team to understand your real requirement. 14:37:12 <liuyulong> And do not forget to add the link to the spec. 14:37:29 <liuyulong> One more thing, you should move specs/ussuri/l3-router-support-ecmp.rst, to the Virtual folder. 14:37:45 <liuyulong> s/Victoria 14:37:49 <ZhuJoseph> ok 14:39:16 <liuyulong> OK, next 14:39:22 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995 14:39:22 <openstack> Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova) 14:39:54 <liuyulong> We already have some discussion on the LP, and here is a workaround fix: 14:40:03 <liuyulong> #link https://review.opendev.org/#/c/734070/ 14:41:10 <liuyulong> For the fix, IMO, it partially revert the fix of the original fix of https://review.opendev.org/#/c/692352/ 14:41:12 <ralonsoh> IMO this is a workaround 14:41:17 <liuyulong> in some case 14:41:51 <ralonsoh> but if accepted and does not clash with any other part of the code 14:41:53 <ralonsoh> I'm ok 14:42:31 <ralonsoh> you know better this code... 14:42:32 <liuyulong> The main problem is in the namespace deletion based on my current research. 14:43:05 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/7 14:43:05 <openstack> Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova) 14:43:12 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/8 14:43:48 <liuyulong> I will add some log for this issue as a start. 14:45:01 <ralonsoh> good finding in c#7 14:45:02 <liuyulong> ralonsoh, the pyroute2 namespace deleting could be related. I may need your help. : ) 14:45:08 <ralonsoh> sure 14:45:20 <ralonsoh> but where is this called? 14:45:34 <liuyulong> Wait a sec 14:45:40 <ralonsoh> no no 14:45:42 <ralonsoh> I mean 14:45:46 <ralonsoh> in this executing 14:45:54 <ralonsoh> why the namespace is deleted? 14:46:03 <ralonsoh> *execution 14:46:25 <liuyulong> #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L705 14:46:42 <liuyulong> #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L906 14:47:03 <ralonsoh> yes and the ns is deleted, so that's ok 14:47:11 <ralonsoh> but why the ns was deleted? 14:47:56 <liuyulong> And finally, https://github.com/openstack/neutron/blob/master/neutron/privileged/agent/linux/ip_lib.py#L542 14:48:17 <liuyulong> the qrouter namespace was not deleted successfully. 14:48:30 <liuyulong> bug/1881995/comments/7 14:50:22 <liuyulong> Or maybe it is concurrent query and deleting. 14:50:44 <liuyulong> Delete namespace does not have much log now, I will add some. 14:53:02 <liuyulong> OK, next one 14:53:05 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1882860 14:53:05 <openstack> Launchpad bug 1882860 in neutron "after FIP is assigned vm lost network connection" [Undecided,Incomplete] 14:53:31 <liuyulong> It's a ovn-router related report. 14:54:44 <liuyulong> Jakub has left a potential fix of the issue and some questions, no response for now. 14:55:21 <liuyulong> Next 14:55:23 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883321 14:55:23 <openstack> Launchpad bug 1883321 in neutron "Neutron OpenvSwitch DVR - connection problem" [High,New] 14:55:56 <liuyulong> This is really a complicated issue. 14:57:14 <liuyulong> As I said in the fix, there are tons of cases for the real deployment, for instance, DVR, DVR + HA, openflow firewall, network node mixed compute services... 14:57:26 <liuyulong> I have a long list. 14:58:04 <liuyulong> Let's continue the talk on LP bug. 14:58:08 <liuyulong> Last one 14:58:28 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883089 14:58:28 <openstack> Launchpad bug 1883089 in neutron "[L3] floating IP failed to bind due to no agent gateway port(fip-ns)" [Medium,In progress] - Assigned to LIU Yulong (dragon889) 14:58:29 <liuyulong> reported by me 14:58:48 <liuyulong> I have two patches. 14:59:04 <liuyulong> #link https://review.opendev.org/#/c/735432/ 14:59:10 <liuyulong> #link https://review.opendev.org/#/c/735762/ 14:59:43 <liuyulong> The test case should be simple, just create a fake external network, and create router/network/subnet/VM. 15:00:24 <liuyulong> Then just see the changes of fip-namespace on hosts and DvrFipGatewayPortAgentBinding in DB. 15:00:41 <liuyulong> #link https://review.opendev.org/#/c/702547/ 15:01:10 <liuyulong> IMO, this fix just missed that DVR related clean up action. 15:01:16 <liuyulong> OK, we are out of time. 15:01:28 <liuyulong> #endmeeting