15:00:13 #startmeeting neutron_l3
15:00:18 Meeting started Thu Aug 30 15:00:13 2018 UTC and is due to finish in 60 minutes. The chair is mlavalle. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:19 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
15:00:21 The meeting name has been set to 'neutron_l3'
15:00:22 hi
15:00:24 hi
15:00:28 hey haleyb
15:00:32 hi manjeets
15:01:01 hi
15:01:19 welcome munimeha1
15:01:45 munimeha1: do you have a topic you want to discuss during the meeting?
15:02:48 #topic Announcements
15:02:52 o/
15:03:16 Yesterday we released Rocky: https://review.openstack.org/#/c/597529/
15:04:15 so for all practical purposes, we can think we are in the Stein cycle
15:04:25 welcome to Stein!
15:04:48 The PTG is a little more than a week away
15:05:26 I am consolidating all the proposed topics in a coherent agenda: https://etherpad.openstack.org/p/neutron-stein-ptg
15:05:52 manjeets: are you joining the team dinner on Thursday?
15:06:17 mlavalle, yes
15:06:36 manjeets: so add a Yes at the end of the line where your name is in the etherpad
15:07:08 any other announcements from the team?
15:07:14 done
15:07:18 :-)
15:07:45 ok, let's mlove on
15:07:56 yeah, let's love on
15:07:58 LOL
15:08:00 lol
15:08:04 #topic Bugs
15:08:46 The first one I have for today is https://bugs.launchpad.net/neutron/+bug/1788006
15:08:46 Launchpad bug 1788006 in neutron "neutron_tempest_plugin DNS integration tests fail with "Server [UUID] failed to reach ACTIVE status and task state "None" within the required time ([INTEGER] s). Current status: BUILD. Current task state: spawning."" [High,Confirmed] - Assigned to Miguel Lavalle (minsel)
15:09:26 I investigated this one and couldn't find anything wrong on the Neutron side
15:09:52 I traced it back to Nova. The compute receives the vif plugged event correctly from Neutron
15:10:27 but that doesn't propagate to the Nova API, so the instance stays stuck in state "spawning"
15:10:37 I opened a bug on the Nova side
15:11:12 however, during the CI meeting slaweq pointed out that it was weird that this is happening in DNS integration tests only
15:11:32 so I am digging into other cases
15:12:02 Next one is https://bugs.launchpad.net/neutron/+bug/1789434
15:12:02 Launchpad bug 1789434 in neutron "neutron_tempest_plugin.scenario.test_migration.NetworkMigrationFromHA failing 100% times" [High,Confirmed] - Assigned to Miguel Lavalle (minsel)
15:12:22 I had to fight hard with haleyb to get ownership of this bug
15:12:38 I finally prevailed ;-)
15:12:50 LOL
15:13:34 as shared yesterday with manjeets and hongbin, I now have a vagrant script where I can build dvr-ha environments in 50 minutes
15:14:00 so I am going to attempt to reproduce this issue locally in my environment today
15:15:10 Finally, I am looking at https://bugs.launchpad.net/neutron/+bug/1787919
15:15:10 Launchpad bug 1787919 in neutron "Upgrade router to L3 HA broke IPv6" [High,Confirmed]
15:15:26 I just assigned it to myself
15:15:44 and will also reproduce locally in my environment
15:15:49 that looks interesting
15:16:23 in this case there are notes from hongbin on how to reproduce locally
15:16:31 so I'll follow the recipe
15:17:05 like baking a cake :)
15:18:03 we have this critical one https://bugs.launchpad.net/neutron/+bug/1788185
15:18:03 Launchpad bug 1788185 in neutron "[Stable/Queens] Functional tests neutron.tests.functional.agent.l3.test_ha_router failing 100% times " [Critical,Confirmed] - Assigned to Miguel Lavalle (minsel)
15:18:15 really the assignee here is slaweq
15:18:32 he has been working on the issue for the past few days
15:20:10 fixes were merged for Ocata and Queens. I will follow up at the end of this meeting with slaweq to see where we are
15:20:10 i think there's still an ocata backport? but maybe it's still broken after reading the last comments?
15:20:20 yeah
15:20:27 that last comment gave me pause
15:20:41 I'll check with slaweq to see what he thinks
15:21:11 i thought i saw a strange HA func test failure today, maybe that's related
15:21:40 Those are the bugs I have at this point
15:21:59 there is another
15:21:59 are there any other bugs we should discuss today?
15:22:08 https://bugs.launchpad.net/neutron/+bug/1789403
15:22:08 Launchpad bug 1789403 in neutron "ip6tables rules for PD subnets not fully recreated at l3-agent restart" [High,Confirmed] - Assigned to Brian Haley (brian-haley)
15:22:40 posted https://review.openstack.org/#/c/597710/ - just got an update that it seems to fix the issue
15:22:56 that was the one that had the odd test failure
15:23:05 ahhh, ok
15:23:12 http://logs.openstack.org/10/597710/1/check/neutron-functional/9d0610b/logs/testr_results.html.gz
15:23:20 hi, sorry for being late :)
15:23:36 mlavalle: so i didn't know if it was my code, or keepalived
15:24:18 I can run that test locally. I'll let you know if it fails
15:24:37 mlavalle: thanks
15:25:14 hi slaweq. I was worried that you might have been eaten by the python ;-)
15:25:26 mlavalle: LOL, no
15:25:33 I eat python :D
15:25:41 you do
15:26:01 we were wondering about the status of https://bugs.launchpad.net/neutron/+bug/1788185
15:26:01 Launchpad bug 1788185 in neutron "[Stable/Queens] Functional tests neutron.tests.functional.agent.l3.test_ha_router failing 100% times " [Critical,Confirmed] - Assigned to Miguel Lavalle (minsel)
15:26:31 workaround patches for Queens and Ocata are merged
15:26:38 there is still an unmerged patch for Pike: https://review.openstack.org/#/c/596559/
15:27:07 but yesterday frickler was playing a bit with it and he had some comments about keepalived version
15:27:29 exactly, that is really my question
15:27:31 to him it looks like old keepalived was behaving wrong and we tested this wrong behaviour as a good one
15:27:41 I will have to check that once again
15:28:03 and also a strange thing is that on newer keepalived (1.3.9) the test is fine
15:28:03 ok
15:29:23 is it ok if I assign the bug to you?
15:29:28 sure
15:29:51 done
15:29:55 thanks!
15:30:32 any other bugs we should discuss today?
15:30:39 https://bugs.launchpad.net/neutron/+bug/1786272
15:30:39 Launchpad bug 1786272 in neutron "Connection between two virtual routers does not work with DVR" [Medium,In progress] - Assigned to Brian Haley (brian-haley)
15:30:59 it's only assigned to me since i updated the review, but i see new comments
15:31:47 slaweq made a change to schedule these shared-subnet DVR routers to allow traffic, but it requires more work
15:32:09 I'm trying to do a scenario test to cover such a case today
15:32:36 so, should we change the assignee to slaweq?
15:32:37 with such a scenario test it would be easier to be sure that the patch works
15:32:45 slaweq: looks like it's going to need an OF change?
15:33:10 locally I didn't do any changes in OF
15:33:16 and tbh I don't think so
15:33:42 basically if there is a vm from the "second" subnet on the host, qrouter- will be created now, and all will work then
15:33:48 oh, i thought from swami's comment it was needed
15:34:03 maybe that was without your change...
15:34:26 so if I change the logic of which qrouter namespaces should be created on a node, that should be enough to make everything work
15:34:46 but let's do the test and then check on CI :)
15:34:55 * haleyb was just confused with comments
15:35:27 cool
15:35:32 any other bugs?
15:35:41 we can re-assign, or just wait for slaweq to update and it will change to him :)
15:35:50 ok
15:36:48 ok, let's move to open agenda then
15:36:54 #topic Open Agenda
15:37:59 In building an environment to test dvr-ha related bugs, I followed this recipe: https://docs.openstack.org/neutron/latest/admin/config-dvr-ha-snat.html
15:38:32 and it refers in several places to br-ex
15:38:51 that is obsolete, isn't it?
15:39:25 Not yet I think
15:39:58 The patch still hasn't merged?
15:40:29 https://review.openstack.org/#/c/567369/
15:41:40 ok, but it is on its way to being obsolete
15:41:46 Thanks!
15:42:03 np!
15:42:21 and that patch by the way fixes the doc
15:42:35 so we are good
15:42:52 any other topics we should discuss today?
15:43:45 ok, thanks for attending
15:43:49 #endmeeting