14:01:15 <liuyulong> #startmeeting neutron_l3 14:01:17 <openstack> Meeting started Wed Dec 2 14:01:15 2020 UTC and is due to finish in 60 minutes. The chair is liuyulong. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:01:18 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:01:20 <openstack> The meeting name has been set to 'neutron_l3' 14:01:39 <liuyulong> long time no see 14:03:46 <liuyulong> Seems this will be another one persone meeting... 14:04:27 <liuyulong> No announcement from me today, let's scan the bug list. 14:04:35 <liuyulong> #topic Bugs 14:04:58 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-November/018782.html 14:05:02 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-November/019032.html 14:05:08 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-November/019119.html 14:05:16 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-December/019154.html 14:05:52 <liuyulong> First one: 14:05:53 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1903433 14:05:55 <openstack> Launchpad bug 1903433 in neutron "The duplicate route in l3 router namespace results in North-South traffic breaking" [Medium,In progress] - Assigned to yangjianfeng (yangjianfeng) 14:06:44 <liuyulong> The patch is done hereļ¼ 14:06:47 <liuyulong> #link https://review.opendev.org/c/openstack/neutron/+/761829/ 14:07:06 <liuyulong> Just start another recheck for it. 14:09:57 <liuyulong> The input CIDR overlapping status will be checked between external subnet and user internal subnet. The unit test cases are really good to verify this. 14:11:03 <liuyulong> Next 14:11:04 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1904564 14:11:06 <openstack> Launchpad bug 1904564 in neutron "GRE tunnels over IPv6 are created in wrong way" [High,In progress] - Assigned to Slawek Kaplonski (slaweq) 14:12:31 <liuyulong> I've not used gre for tunnels in our local cloud, but this is a valid bug. 14:14:04 <liuyulong> More like a ovs-agent mis-configuration, it's worthy to fix IMO. 14:15:03 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1904436 14:15:05 <openstack> Launchpad bug 1904436 in neutron ""gateway" option is not honored when creating a subnet with subnet pool" [Wishlist,New] 14:16:23 <liuyulong> Our IPv6 subnets are created from a subnet pool, but we have not see such issue. I will check this in my local deployment. 14:17:30 <liuyulong> But I wonder which IP the router internal gateway will set finally. 14:17:44 <liuyulong> I will keep an eye on that. 14:18:58 <liuyulong> Next one 14:18:59 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1905295 14:19:02 <openstack> Launchpad bug 1905295 in neutron "[RFE] Allow multiple external gateways on a router" [Wishlist,New] - Assigned to Bence Romsics (bence-romsics) 14:20:30 <rubasov> hi liuyulong 14:20:39 <liuyulong> This is a quite big proposal. 14:20:41 <liuyulong> hi 14:20:54 <rubasov> I saw you mentioned this 14:20:59 <liuyulong> Yes 14:21:21 <liuyulong> I saw the comments in the LP bug. 14:21:52 <rubasov> I'm also bit a afraid this could be big, that's why I would like to limit the scope as much as possible 14:22:48 <liuyulong> OK, so maybe we need some details on how to accomplish the traffic from VM to outside world, and vice versa. 14:23:12 <rubasov> by the way, does the use case make sense to you? 14:24:19 <liuyulong> I'm not sure. I think we need some details about the "multiple external gateways". 14:24:47 <liuyulong> What will this multiple external gateways use for? 14:25:16 <rubasov> basically I'd like to model a bgp+ecmp router 14:25:55 <liuyulong> The multiple external gateways are independent of each other? Or it will work together to do north-south traffic? 14:26:39 <rubasov> their faults are independent, but traffic is balanced by ecmp 14:28:33 <liuyulong> So, the router has multiple external gateways connected to multiple physical router in one single host? 14:29:04 <rubasov> yes, it does 14:29:37 <liuyulong> Or the router will be scheduled to multiple hosts, each scheduled instance has different external gateway to different physical router? 14:29:50 <rubasov> I was trying to draw it in the RFE, but launchpad ate my ascii art 14:30:02 <rubasov> but this etherpad has a better drawing 14:30:06 <rubasov> https://etherpad.opendev.org/p/neutron-multiple-external-gateways 14:30:31 <rubasov> (I just used this to prepare the RFE, nothing new there, but the drawing is better) 14:31:14 <liuyulong> Openning... 14:32:41 <rubasov> lines 28-48 14:32:42 <liuyulong> Alright, if it is the first topology. Why not add all NICs to one bond to achive the high availibility? 14:33:31 <liuyulong> And beyond that, the fault scope is still in one single host no matter how many external gateways. 14:34:19 <rubasov> I believe the people who came up with this design in my company don't like LACP recovery times in some situations 14:34:22 <liuyulong> That means if the host is done, the entire traffic will go done (if you have HA, the failover will happen). 14:34:28 <liuyulong> s/done/down 14:35:23 <rubasov> and some cases I believe they want ECMP not just between the 4 routers, but down to the compute host 14:37:21 <liuyulong> In such situation, I will suggest user to enable the DVR : ) 14:37:24 <lajoskatona> Hi 14:37:29 <liuyulong> #link https://github.com/gotostack/shanghai_ptg/blob/master/shanghai_neutron_ptg_topics_liuyulong.pdf 14:38:02 <liuyulong> In shanghai PTG, I'd raised a ACTIVE-ACTIVE mode for router in a DNAT nodes. 14:39:34 <liuyulong> That proposal will also use ECMP for the floating IPs routes. 14:40:10 <rubasov> regarding dvr, in telco we so far did not have much success selling that, the people in my company very much prefer dedicated hw networking equipment 14:40:39 <liuyulong> lajoskatona, hi 14:41:39 <liuyulong> rubasov, yes that could be true, the users sometimes require some dedicated hosts for netwoking because of some security concern. 14:42:00 <lajoskatona> liuyulong: Hi 14:42:09 <rubasov> and they'd like to model that design in neutron api 14:43:16 <liuyulong> Allow me to elaborate the ACTIVE-ACTIVE DNAT topology,. 14:43:22 <rubasov> sure 14:43:59 <liuyulong> When the user try to access one floating IP, the upstream physical router will have multiple next hop IP which direct to some scheduled Router namespace. 14:44:45 <liuyulong> And in each router namespace, the iptables rules, route rules will work as centriliazed routers do. 14:45:52 <rubasov> okay, so southbound traffic is over ecmp 14:45:57 <rubasov> right? 14:46:00 <liuyulong> And in the compute node, when the packet is back to the outside world 14:46:32 <liuyulong> the ovs-agent will add some port group which is also something like ecmp to send the return packet back to the router namespace. 14:46:52 <liuyulong> Due to all these traffic we take care are L3 only, 14:47:18 <liuyulong> so the taffic can be send back to any router namespace no matter the hash rule is. 14:47:22 <liuyulong> rubasov, yes 14:47:48 <rubasov> so northbound traffic is over LAG, maybe MLAG 14:48:16 <rubasov> I believe my people favor ECMP over MLAG for some reason 14:48:35 <rubasov> but I will need to go back to them to better understand their reasons for this 14:49:28 <liuyulong> Hmmm, actually we do not care about the underlay topoloy. 14:50:34 <liuyulong> In neutron's view, we can only assume there are some neutron virtual Routers and one Upstream physical router. 14:50:55 <liuyulong> The upstream physical router has multiple path to a single distination. 14:51:45 <liuyulong> We do not care about how many fabric the hosts are connected to the upstream router. 14:53:49 <rubasov> but don't I have to represent the links in my previous picture in lines 31-35 as neutron nets (used as external gw-s)? 14:54:38 <rubasov> I don't see how could I represent that as 1 neutron network 14:55:51 <liuyulong> So this ACTIVE-ACTIVE DNAT topology is what you want? 14:56:40 <liuyulong> If so, in the picture, you can just remove R4, and link R2 to R1, R1', R1'' etc. 14:56:54 <liuyulong> Sorry 14:57:09 <liuyulong> In the picture, you can just remove R4 and R2, and link R3 to R1, R1', R1'' etc. 14:57:50 <liuyulong> The R1, R1', R1'' mean one single neutron router has multiple scheduled instance in different host. 14:58:02 <liuyulong> All these R1 instances are ACTIVE. 14:58:11 <rubasov> I'm far from sure I fully understand the active-active, I will try to better understand your slides 14:58:39 <liuyulong> They work together to surve the north-south traffice. 14:58:51 <liuyulong> s/traffic 14:58:55 <liuyulong> OK 15:00:00 <rubasov> but thank you for the questions, it helps to better formulate the use case (and see if it's valid) 15:00:36 <liuyulong> #link https://m.pgb-fc.com/docs/en-us/netscaler/media/ns-cluster-ecmp-topology.png 15:00:59 <liuyulong> Ignore all the IPs/protocls/words in this picture. 15:01:32 <liuyulong> Something like that. 15:02:03 <liuyulong> ok, time is up 15:02:17 <liuyulong> #endmeeting