16:00:30 #startmeeting neutron_ci 16:00:31 Meeting started Tue Apr 17 16:00:30 2018 UTC and is due to finish in 60 minutes. The chair is slaweq_. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:32 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:34 The meeting name has been set to 'neutron_ci' 16:00:38 hi 16:01:06 o/ 16:01:31 o/ 16:01:43 jlibosva: haleyb: are You there? :) 16:01:51 I am here 16:01:53 o/ 16:01:56 hi, but i need to run home so will miss :20 16:02:03 yes :) 16:02:11 ihar: new nick? 16:02:16 assign me all the bugs :) 16:02:22 haleyb: sure 16:02:37 all the bugs, not just the Neutron ones 16:02:56 ok, let's start 16:03:01 #topic Actions from previous meetings 16:03:08 haleyb to continue testing why router migrations tests fails 16:04:04 haleyb: any update on this one? 16:05:09 ok, so I guess not, let's move on then 16:05:16 next one was 16:05:17 haleyb to continue testing why router migrations tests fails 16:05:21 undo 16:05:29 slaweq will check old gate-failure bugs 16:05:46 so, again I didn't have time to go through this list yet 16:06:03 but I will try to do it this week 16:06:40 #action slaweq will check old gate-failure bugs 16:06:50 next one 16:06:52 yamahata to fix issues with openstack-tox-py35-with-neutron-lib-master periodic job 16:07:30 I think it's fixed and it can be confirmed health check 16:07:48 yes, I saw that periodic jobs are passing now 16:07:51 thx a lot for help 16:08:15 ok, next one 16:08:16 haleyb will mark router migration tests are unstable 16:08:29 I think it was done already 16:08:39 I +2ed that patch yesterday 16:08:58 https://review.openstack.org/#/c/561322/ 16:08:58 patch 561322 - neutron-tempest-plugin - Mark DVR/HA migration tests unstable (MERGED) 16:09:04 it is this one probably 16:09:11 so it's done 16:09:17 yes 16:09:27 and last but not least 16:09:37 agreed after meeting with mlavalle :) 16:09:38 mlavalle to makes ovsfw scenario job voting 16:09:52 that was taken care of by jlibosva 16:09:54 I think that jlibosva did it and it's now voting 16:10:00 yep it is voting now 16:10:07 thx jlibosva 16:10:17 jlibosva: I told you, he was going to come after me ;-) 16:10:21 today I also pushed https://review.openstack.org/#/c/561930/ related to this one 16:10:22 patch 561930 - openstack-infra/project-config - Change label for neutron-tempest-ovsfw to "voting" 16:10:28 hehe :) 16:10:33 to change job name in grafana 16:11:10 ok, so that's all about actions from previous week 16:11:22 #topic Grafana 16:11:28 http://grafana.openstack.org/dashboard/db/neutron-failure-rate 16:12:08 I checked today graphs from last 7 days and it looks quite good 16:12:35 I like how dvr-multinode goes down after router migration tests are skipped :) 16:12:49 except problems with pip 10 and lower-constraints which caused (almost) all tests failing all the time for last few days 16:13:08 but those problems should be already fixed and jobs are getting better today again 16:13:31 jlibosva: yes, dvr-multinode job is much better now :) 16:13:32 why gate tempest jobs so bad? is it because not many patches merged? 16:14:21 ihar: might be - please not that in last 3 days nothing was merged probably 16:14:35 because of this issue with lower-constraints job and ryu package 16:14:55 yeah, we had one patch merged on 14th and then 2 patches merged today 16:15:04 and that's it 16:15:12 yeah. that's what I am saying, maybe that's because the number of data points is so low so average was skewed by an outlier. 16:15:18 ok 16:15:48 ok then :) 16:16:12 do You have anything to add? 16:16:35 or we can go to talk about some specific job types? 16:16:57 looks like we're good here 16:17:15 ok then 16:17:20 #topic Functional 16:17:59 today I saw failure like http://logs.openstack.org/67/556667/20/check/neutron-functional/71b2acc/logs/testr_results.html.gz in functional tests 16:18:14 bollocks 16:18:17 and I think that I saw same issue at least one more time last week 16:18:37 maybe it's not big problem yet but I wanted to mention about it 16:18:39 that might mean the firewall blink is not fixed 16:19:17 I'll have a look 16:19:29 or at least there is some corner case when it is failing 16:19:35 thx jlibosva 16:19:55 #action jlibosva take a look on failing ovsfw blink functional test 16:20:41 ok, next topic 16:20:46 #topic Fullstack 16:20:59 nothing very urgent on my side for fullstack currently 16:21:30 but I found today one failed security groups test http://logs.openstack.org/12/470912/38/check/neutron-fullstack/ba16b38/logs/testr_results.html.gz so I will take a look on it during the week 16:21:52 #action slaweq will check failed SG fullstack test 16:22:03 do You have anything to add according to fullstack? 16:23:10 no 16:23:36 so next topic 16:23:38 #topic Scenarios 16:24:00 as jlibosva told already dvr-multinode job is getting better with skipped migration tests :) 16:24:23 let's see this week what failure rate it will have without those tests 16:24:49 other scenario jobs looks quite good now IMO 16:24:53 ++ 16:24:55 * ihar thrilled 16:24:56 we still have the trunk failing occasionally that I'm looking at. I deployed multinode dvr environment and I'm not able to reproduce the issue locally. I ran the tests probably around 100 times 16:25:52 I know that it wasn't so common failure reason for this job so it might be hard to reproduce 16:26:40 jlibosva: it might be hard to reproduce it locally :/ 16:27:05 it never failed for me 16:28:09 did You checked logs from such failed job? Do You have any idea what could be the problem there? 16:29:08 I suspect wrong order of rpc messages about remote security group and prepare_port_filter. the logs contain error where vlan tag can't be found in ovsdb for given subport/trunk 16:29:31 that means ovs firewall cannot get correct zone to be used in conntrack as the zone number corresponds with the local vlan tag of given port 16:29:38 so then all traffic is dropped 16:29:46 which would explain the SSH connection issue 16:30:13 but I can't find a reason why the vlan tag is not present in the ovsdb 16:31:38 maybe You should add some additional debug logs in agent and wait for new failure to check it? 16:31:45 it also seems that the scenario job doesn't have indexed console output in logstash 16:32:18 yeah, I plan to log snapshot of openflows per ovs firewall action, so we can see the state of openflows at the time of failure 16:32:24 yeah, I was talking about some time ago but when I didn't need it anymore I forgot about it :/ 16:33:06 ok, so You will continue work or this, right? 16:33:13 jlibosva: why isn't it in logstash? 16:33:30 ihar: I do not know :) 16:33:48 you mean console.html not indexed? 16:33:54 or whatever it is named now 16:33:59 it's now the job-output.txt or something like that 16:34:00 but yeah, that one 16:34:12 weird. I can take a look at that one 16:34:20 it's job-output.txt.gz 16:34:28 ihar: first step would be to make sure I'm not lying :) 16:34:41 yeah sure :) 16:34:53 I will keep you honest and shame in public if you aren't! 16:35:04 thanks 16:35:07 :)) 16:35:16 slaweq_: add an action 16:35:55 #action jlibosva will check if job output are indexed in logstash 16:36:02 slaweq_: ihar :) 16:36:04 ihar: here You go :) 16:36:19 slaweq_: ihar will check 16:36:28 yes 16:36:29 ah, sorry 16:36:35 I mean, I just checked and I still cannot see it 16:36:44 #action ihar will check if job output are indexed in logstash 16:36:54 better? 16:36:56 :) 16:37:12 yes sir 16:37:20 ok :) 16:38:21 so, moving on? 16:38:26 yes 16:38:29 #topic Rally 16:38:49 as from grafana it looks that rally is fine now 16:39:14 so I think that we don't need to talk about it too much :) 16:39:25 at least You have something to add here 16:39:59 there was an email from rally folks about openstack plugin being spun off rally. shouldn't affect us but worth being aware. 16:40:20 apparently rally is more than openstack :) 16:40:54 thx ihar 16:41:02 good to know 16:42:18 ok, so can we move to next topic then? 16:42:36 I'd say so 16:42:37 YES 16:42:41 #topic Periodic 16:42:55 here I just wanted to mention that it looks fine currently 16:43:11 looks that yamahata's fix works fine :) 16:43:40 thx once again yamahata 16:43:50 :) 16:44:54 and the last topic is 16:44:55 #topic Gate 16:45:23 here I also don't have anything to talk for today - except this tempest job which ihar mentioned before it looks good IMO 16:45:50 do You want to talk anything here? 16:46:06 or do You anything else to talk about today? 16:46:37 I don't. I think if the chair believes there is nothing to cover we can as well skip sections and leave the rest for open discussion 16:47:20 #topic Open Discussion 16:47:50 so do You want to talk about something else related to CI? 16:47:57 I don't have anything 16:48:21 I have nothing 16:48:36 mlavalle: ? 16:48:39 nope 16:48:49 ok, so thank You 16:48:56 11 minutes back, yay :) 16:48:59 and enjoy Your free 11 minutes :) 16:49:00 #endmeeting