16:00:19 #startmeeting neutron_ci 16:00:20 Meeting started Tue Mar 5 16:00:19 2019 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:21 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:24 The meeting name has been set to 'neutron_ci' 16:00:38 o/ 16:00:43 hi 16:01:00 lets wait few minutes for others to join 16:01:12 while we wait, did you ping rubasov about the patches yet to be revieiwed? 16:01:16 o/ 16:01:29 hi 16:02:20 mlavalle: yes I talked with rubasov about it 16:02:33 I will give You later log from what we talked, ok? 16:02:48 ok, lets start 16:02:52 o/ 16:02:54 hi 16:03:01 #topic Actions from previous meetings 16:03:08 first one was 16:03:10 njohnston to create etherpad with python3 status of stadium projects 16:03:22 #link https://etherpad.openstack.org/p/neutron_stadium_python3_status 16:03:47 I identified the jobs that look like they are still running py2 16:04:12 by going into the job log for every job and seeing what the full path to libraries were reported in tracebacks, seemed like the most reliable method 16:04:43 at this point all automatically generated changes are merged and py3 unit test jobs are present and passing 16:05:08 thx njohnston, great job 16:05:34 I started on the list with bagpipe trying a zuul v3 migration as well. 16:05:49 so with your approval I will send this to the ML 16:05:58 and work on a few of the changes mysqlf 16:06:02 *myself 16:06:09 ok, that sounds good for me 16:06:37 yes, great! 16:06:48 and IMO we should focus only on voting jobs, non-voting jobs should be taken care by project's team 16:06:52 what do You think? 16:07:00 the list does not look too bad nice (of course, no guarantee what will happen when trying to move some of them to py3) 16:07:08 I agree, I just made a note of them for the sake of completeness 16:07:41 so that if a project wants to mark a job voting then they know they should also make it py3 16:07:44 yes, only voting jobs 16:07:48 yes, that's good we have it on the list but lets take care only about voting jobs from this list :) 16:07:56 +1 16:09:03 ok, lets move on then 16:09:08 next one was 16:09:12 slaweq to check bug https://bugs.launchpad.net/neutron/+bug/1816489 16:09:14 Launchpad bug 1816489 in neutron "Functional test neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase. test_ha_router_lifecycle failing" [High,Fix released] - Assigned to Slawek Kaplonski (slaweq) 16:09:26 I did, and fix is merged: https://review.openstack.org/640400 16:09:51 next one was: 16:09:53 slaweq to create bionic test patches for stadium projects 16:10:20 test patches are done for neutron and stadium projects: patches done: https://review.openstack.org/#/q/topic:legacy-job-bionic+status:open+owner:%22Slawek+Kaplonski+%253Cskaplons%2540redhat.com%253E%22 16:11:10 I found some bugs, so reported it on launchpad 16:11:27 I also listed summary to ML: http://lists.openstack.org/pipermail/openstack-discuss/2019-February/003129.html 16:11:49 if someone wants to help, You can take a look at those bugs and propose fixes :) 16:12:07 for networking-ovn lucasgomes already proposed fix, so that is fine 16:12:31 questions, comments? 16:12:39 good job, thanks! 16:12:52 looks great! good work. 16:12:56 thx 16:13:00 ok, lets move on 16:13:08 slaweq to prepare etherpad and plan of moving tempest plugins from stadium projects to neutron-tempest-plugin repo 16:13:31 so I did etherpad https://etherpad.openstack.org/p/neutron_stadium_move_to_tempest_plugin_repo with "plan of work to do" 16:13:37 please check if that makes sense for You 16:14:13 generally it's only 5 projects which have some tempest tests to move 16:14:31 that's a manageable number 16:14:35 I didn't list here 3rd party projects like for vmware-nsx 16:14:38 very nice 16:14:43 it's only related to stadium projects 16:15:01 is that fine? or should we take care of 3rd party project too? 16:15:02 I think tidwellr can help us with dynamic routing 16:15:05 just FYI for the meeting minutes; the link for the bionic migration email was incorrect. The correct one is: http://lists.openstack.org/pipermail/openstack-discuss/2019-March/003479.html 16:15:31 njohnston: thx 16:15:33 I can ping tidwellr and ask him to help 16:15:43 I'd say 3rd party by definition means outside of common repo 16:15:56 bcafarel: yes, I also think that 16:16:20 but according to what QA team wants they also should be moved to separate repositories 16:16:20 that leaves 4 repos 16:16:21 I think 3rd party projects would be outside the scope of this upstream committee, but might be in scope for distro vendors like Red Hat or SuSE to work with as part of the distro offerings outside the scope of the official stadium support 16:16:40 if you know what I mean 16:16:56 njohnston: yes, I know 16:17:06 njohnston: that we can discuss downstream too :) 16:17:33 ok, so getting back to the list 16:17:51 mlavalle will ask tidwellr to help with dynamic-routing, that's good 16:17:56 any other volunteers? 16:18:04 before going on.... 16:18:10 if You want to help, please add You name in etherpad 16:18:11 I got confused 16:18:19 mlavalle: why? 16:18:28 to make sure 16:18:47 we are moving these plugins to the neutron-tempest-plugin repor, right? 16:18:54 yes 16:18:58 ok 16:19:08 we agreed on that on last meeting I think 16:19:25 it was already done for midonet some time ago, so we will do it for others too 16:19:43 it was a request from QA in Denver PTG right? 16:19:49 bcafarel: right 16:19:55 I know, but I got cofused by something that was said above... nevermind 16:20:07 mlavalle: sure, I understand :) 16:20:24 I can probably give a hand for sfc, I still remember a few things there :) 16:20:32 for bgpvpn.... have we asked tmorin? 16:20:41 I put my name down for fwaas 16:20:57 I took sfc 16:21:08 ok, I also put in this etherpad some "action plan" how to perform such migration but please read it and update if You think it should be done differently 16:21:13 unless bcafarel wants it 16:21:35 then I'd take vpnaas 16:21:37 mlavalle: no problem, I can do vpnaas 16:21:54 bcafarel: let's switch. you know sfc well 16:22:01 in case there are issues 16:22:02 I can ping tmorin about bgpvpn tomorrow 16:22:14 ^ sounds good, before "volunteering" him 16:22:22 mlavalle: ack :) 16:22:54 basically there shouldn't be big problems with it, it's "just" rehoming tests to other repo :) 16:23:09 but we also need to define new jobs for those tests 16:24:16 ok, can we move forward then? 16:24:17 so each volunteer is responsible for the jobs of his repo? 16:24:37 +1 16:24:45 +1 16:24:55 all good 16:25:56 ok, that was easy :) 16:26:01 thx guys :) 16:26:08 lets go to next topics 16:26:30 as we already talked about transiton to bionic and python 3, lets go to 16:26:33 #topic Grafana 16:26:40 #link http://grafana.openstack.org/dashboard/db/neutron-failure-rate 16:27:52 sometimes I think we should open this link in some tab when starting the meeting (to give it time to load) 16:28:05 bcafarel: that's good idea 16:28:18 I will try to remember to put it as first thing in the meeting next time :) 16:29:05 so basically we have "only" 2 problems now :/ 16:29:18 neutron-fullstack and functional tests are in very bad shape 16:29:22 fullstack/ functional? 16:29:48 yes, most of tempest/scenario jobs are fine 16:30:24 one thing to mention: I added py37 UT job to dashboard recently 16:30:54 and moved lower-constraints job to the same graph too as it's also UT job in fact 16:31:32 any questions/comments? 16:32:04 makes sense (the "new" UT graph) 16:32:26 thx bcafarel :) 16:32:33 lets than talk about functional/fullstack issues 16:32:39 #topic fullstack/functional 16:32:48 so, first functional tests 16:33:01 I recently noticed and reported 3 different bugs: 16:33:25 https://bugs.launchpad.net/neutron/+bug/1818334 - this one isn't very big issue as it happend "only" few times 16:33:26 Launchpad bug 1818334 in neutron "Functional test test_concurrent_create_port_forwarding_update_port is failing" [Medium,Confirmed] 16:33:53 I wanted to ask liuyulong_zzz that maybe he can take a look as he was doing this test IIRC 16:34:11 we can bring it up tomorrow in the l3 meeting 16:34:16 I'll do it 16:34:33 mlavalle: thx 16:34:49 so, we want to ask if he is still fixing it? 16:34:49 now, next bugs are much more urgent: 16:34:57 https://bugs.launchpad.net/neutron/+bug/1818613 16:34:58 Launchpad bug 1818613 in neutron "Functional/fullstack qos related tests fails often" [Critical,Confirmed] 16:35:50 this one is I think somehow related to ralonsoh patch https://review.openstack.org/#/c/406841/ 16:36:22 slaweq, how? 16:37:16 ralonsoh: http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22line%2052,%20in%20_minimum_bandwidth_initialize%5C%22 16:37:28 it looks that it started failing like that when we merged Your patch 16:37:55 but it's only from my quick look, so please don't take it as 100% sure thing :) 16:38:06 slaweq, I'll take a look at this now 16:38:11 ralonsoh: thx 16:38:26 in bug report You have link to example failure and to logstash query 16:38:27 gracias ralonsoh 16:38:39 como siempre, muy entron 16:38:58 I know that dougwig also wanted to look at them so please maybe sync with him 16:39:15 ok, and second of those bugs is related to L3 HA: https://bugs.launchpad.net/neutron/+bug/1818614 16:39:16 Launchpad bug 1818614 in neutron "Various L3HA functional tests fails often" [Critical,Confirmed] 16:39:32 in this case also many random tests are failing 16:39:54 and only common thing between them is that they are all failing when waiting for router to be transitioned to master 16:40:17 so from that I would say that first thing to check is keepalived and all things related to it 16:41:50 mlavalle: maybe You can also raise it on tomorrow's L3 meeting? 16:42:09 you read my mind.... I'll add a tage to it 16:42:14 mlavalle: thx 16:43:05 I saw in neutron channel that dougwig is looking at this now, I will sync later with him if he will need any help on that 16:43:26 and basically those are most urgent functional tests issues which fails a lot recently 16:43:30 i was adding elastic queries, i'm not sure how much time i have for these particular bugs. 16:43:43 dougwig: hi, ok 16:43:53 all help is welcome dougwig 16:43:55 i'll try to give it some cycles today, but i'm not sure. 16:44:05 dougwig: so if You would found something, please write it in bug report 16:44:08 ok. 16:44:14 I will tomorrow morning continue work on it 16:44:21 thx dougwig :) 16:44:44 dougwig, slaweq: let's just update the bug with whatever progress we make 16:44:50 mlavalle++ 16:44:53 so others can follow from that point 16:45:22 ok, and now fullstack tests 16:45:28 - https://bugs.launchpad.net/neutron/+bug/1818335 16:45:29 Launchpad bug 1818335 in neutron "Fullstack test test_dscp_marking_packets fails" [Medium,Confirmed] 16:45:44 * mlavalle aslo added a l3 dvr backlog tag to the port forwarding bug 16:45:44 I found quite many times some issues with this test 16:46:01 it always fails because no marked packet was received 16:46:21 is the priority right? 16:46:23 I don't know if that is issue in L2 agent or maybe in tcpdump which is checking those packets 16:46:37 mlavalle: what is the priority? 16:46:44 of the bug, medium? 16:47:00 yes, I marked it like that few days ago 16:47:06 ok 16:47:09 but now I think we can change to high 16:47:20 ahhh, that was my question 16:47:22 as it happens more times in last few days :) 16:47:29 changed 16:48:00 I guess that there will be no voluneers for that one so I will probably assign it to my self 16:48:15 but first I will focus on this bug with functional tests 16:48:58 from other bugs, I found also that we recently hit couple of times old bug https://bugs.launchpad.net/neutron/+bug/1799555 16:48:59 Launchpad bug 1799555 in neutron "Fullstack test neutron.tests.fullstack.test_dhcp_agent.TestDhcpAgentHA.test_reschedule_network_on_new_agent timeout" [Medium,Confirmed] 16:49:41 slaweq: if you need help with the dscp one, I can try to help 16:49:51 mlavalle: that would be great 16:49:59 ok, I'll take it 16:50:07 I can try to help with the dscp one as well, I have some expertise in DSCP :-) 16:50:11 actally looking at this last one now http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22line%20168,%20in%20test_reschedule_network_on_new_agent%5C%22 16:50:17 it happens also quite often 16:50:30 njohnston: in that case, I'll let you take a stab at it 16:50:33 so I will also change it to high 16:50:46 * njohnston adds it to my task list 16:50:58 mlavalle: so maybe You can take a look at https://bugs.launchpad.net/neutron/+bug/1799555 ? 16:51:00 Launchpad bug 1799555 in neutron "Fullstack test neutron.tests.fullstack.test_dhcp_agent.TestDhcpAgentHA.test_reschedule_network_on_new_agent timeout" [High,Confirmed] 16:51:00 :) 16:51:27 ok 16:51:34 I see at least 7 hits in last 7 days 16:51:37 thx mlavalle 16:51:52 ok, so to sum up 16:51:57 slaweq: i filed two new ci unstable bugs this morning, and added elastic rechecks for them. 16:52:10 njohnston will take a look at fullstack dscp issue, 16:52:24 #action njohnston Debug fullstack DSCP issue 16:52:25 mlavalle: will take a look at fullstack dhcp rescheduling issue 16:52:27 right? 16:52:31 +1 16:52:34 +1 16:52:36 thx 16:52:54 #action mlavalle to take a look at fullstack dhcp rescheduling issue https://bugs.launchpad.net/neutron/+bug/1799555 16:52:55 Launchpad bug 1799555 in neutron "Fullstack test neutron.tests.fullstack.test_dhcp_agent.TestDhcpAgentHA.test_reschedule_network_on_new_agent timeout" [High,Confirmed] 16:53:03 dougwig: do You have links to bugs? 16:53:16 one sec 16:53:37 https://bugs.launchpad.net/neutron/+bug/1818696 16:53:37 Launchpad bug 1818696 in neutron "frequent ci failures trying to delete qos port" [Undecided,New] 16:53:45 https://bugs.launchpad.net/neutron/+bug/1818697 16:53:46 Launchpad bug 1818697 in neutron "neutron fullstack frequently times out waiting on qos ports" [Undecided,New] 16:53:55 if either is a dup, i can update the elastic queries. 16:54:40 it may be that second one is dup of https://bugs.launchpad.net/neutron/+bug/1818613 16:54:40 Launchpad bug 1818613 in neutron "Functional/fullstack qos related tests fails often" [Critical,Confirmed] 16:54:52 but one is related to functional tests and second to fullstack 16:55:12 so I would say, lets keep open both - maybe they will be fixed with same patch 16:55:24 do You agree? 16:55:40 yes. one is not finding a port, the other is a timeout. they may be related, but i'm not sure yet. 16:55:57 dougwig: ok, thx :) 16:56:03 +1 16:56:47 ok 16:57:12 ok, lets move on quickly 16:57:18 #topic Tempest/Scenario 16:57:31 mlavalle: any updates on on https://bugs.launchpad.net/neutron/+bug/1795870 ? 16:57:32 Launchpad bug 1795870 in neutron "Trunk scenario test test_trunk_subport_lifecycle fails from time to time" [High,In progress] - Assigned to Miguel Lavalle (minsel) 16:57:41 the patches are ok 16:57:51 I am trying to find the best way to test 16:57:59 ready for review, right? 16:58:20 attempt this past Sunday on testing wasn\'t too good 16:59:04 so I need to iterate once more 16:59:16 ok, if You will need any help, ping me :) 16:59:23 ok 16:59:39 I'll probably ping you and dougwig in channel 16:59:45 I think we are running out of time now 16:59:49 thx for attending 16:59:53 o/ 16:59:53 #endmeeting