16:00:17 #startmeeting neutron_ci
16:00:18 Meeting started Tue Jan 7 16:00:17 2020 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot.
16:00:19 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
16:00:20 hi
16:00:21 The meeting name has been set to 'neutron_ci'
16:00:24 o/
16:01:13 hi
16:01:30 o/
16:02:28 ok, let's start as we have our regular attendees already
16:02:35 Grafana dashboard: http://grafana.openstack.org/dashboard/db/neutron-failure-rate
16:02:51 #topic Actions from previous meetings
16:03:13 actually this time those are "actions from previous year" even :)
16:03:22 and the first one is
16:03:24 njohnston to check failing NetworkMigrationFromHA in multinode dvr job
16:04:55 I think we lost njohnston now
16:05:41 sorry, network hiccup. I'll work on this later today
16:06:16 ok :)
16:06:24 so I will set it as a reminder for next week too
16:06:28 #action njohnston to check failing NetworkMigrationFromHA in multinode dvr job
16:06:45 as long as it is not for next year :)
16:07:33 bcafarel: LOL, this time no
16:07:36 only for next week
16:07:45 next one
16:07:47 slaweq to talk with gmann about finish of tempest plugins migration
16:07:54 I talked with gmann about it today
16:08:05 so he is aware of that already
16:08:35 and that's all actions from last meeting on my list
16:08:41 let's move on
16:08:43 #topic Stadium projects
16:08:53 njohnston: any updates about dropping py2 support?
16:09:03 I took a look at the outstanding work
16:09:31 We have work proceeding on networking-odl, networking-bagpipe, networking-midonet, and neutron-fwaas
16:09:41 although I am not sure the author of the fwaas change is engaged anymore
16:09:51 midonet in particular is close to a merge
16:10:02 so the stadium is in a good position
16:10:12 that's good
16:10:13 the only remaining ones are neutron-lib and neutron proper
16:10:31 I will file changes for those after the meeting
16:10:33 I can take care of those
16:10:36 ahh, ok
16:10:38 thx
16:10:44 so good progress overall then?
16:10:58 there is one existing for neutron-lib but it includes things like removing six and other 2.7-hostile changes, which I want to separate out
16:11:16 yes, I remember that one
16:11:47 so I think I will just edit that change
16:11:58 +1 for that
16:11:58 bcafarel: Yes, definitely good progress
16:12:23 thx njohnston for the update
16:12:28 that's really great work
16:13:07 as a team we are getting good at pulling together for these community-wide efforts :-)
16:13:58 anything else related to the stadium projects You want to discuss today?
16:14:38 For the zuulv3 migration, the standouts are networking-bgpvpn, midonet, neutron-dynamic-routing, and neutron-vpnaas
16:14:52 those are the ones that still need work, otherwise everything is covered
16:15:45 ok, I will finally try to find some time to work on at least some of them
16:16:11 ok, next topic
16:16:21 #topic Grafana
16:16:30 #link http://grafana.openstack.org/dashboard/db/neutron-failure-rate
16:17:35 we don't have too much data from the last days, especially from gate queue
16:17:54 yeah, just waiting for https://review.opendev.org/#/c/701018/
16:18:05 in check queue, the biggest problem IMO are those neutron-tempest-plugin failures
16:18:28 ralonsoh: exactly
16:18:33 yes, definitely, those are killer
16:19:14 I would delay any CI errors analysis until we have this patch
16:19:27 (but I'm very lazy too)
16:19:40 laziness can be good :)
16:19:46 ralonsoh: this bug affects "only" neutron-tempest-plugin jobs
16:20:09 and in fact other jobs are in pretty good shape (at least not worse than usual)
16:20:22 ralonsoh: Her, laziness is #1 on Larry Wall's three virtues of a great programmer
16:20:26 *hey
16:20:35 hahah
16:20:35 :)
16:21:34 anything else related to the grafana?
16:21:41 or can we move on to the next topic?
16:22:25 ok for me
16:22:32 +1 to move on
16:22:37 ok
16:22:39 #topic fullstack/functional
16:22:48 I was looking today at recent failures
16:23:02 and I found 3 different failed tests which we may check now
16:23:05 first one:
16:23:11 neutron.tests.functional.agent.test_firewall.FirewallTestCase.test_ingress_udp_rule
16:23:17 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f37/697317/8/check/neutron-functional/f37a608/testr_results.html.gz
16:25:38 did You ever see such failure before?
16:25:56 not very usual, do you want to implement some kind of retrial context?
16:26:26 that is new and different to me
16:26:37 but, IMO, the connection handler under the hood should be enough
16:28:31 ok, let's just be aware of it for now
16:28:55 maybe it will not happen often (or even will not happen at all anymore :))
16:30:29 ok, next one
16:30:54 those are 2 failures of different tests, but error looks kind of similar IMO
16:30:56 https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_ea8/677092/11/check/neutron-functional/ea8fe46/testr_results.html.gz
16:31:02 and https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_19d/701003/1/check/neutron-functional/19d8a0a/testr_results.html.gz
16:31:31 one is timeout during namespace creation (again)
16:31:33 and second is timeout during bridge creation
16:31:47 but it's linuxbridge
16:32:04 so IMHO netlink is common part of both of those issues
16:33:38 did You see similar issues before?
16:33:45 yes, many times
16:33:51 at least with the ns creation
16:34:07 not so many, but some of them, related to the br creation
16:35:02 ok, so I think we should open a bug for that, right?
16:35:24 yes, I'll do it
16:35:34 at least the one related to the bridge
16:35:53 the problem is I have spent days with those privsep/netlink errors
16:36:05 and I don't see the cause
16:36:18 ralonsoh: I know You were debugging and trying to fix it
16:36:30 it's not so often now but still happens from time to time
16:37:50 ralonsoh: thx for reporting this on LP
16:38:05 #action ralonsoh to report bug for timeout related to bridge creation
16:38:35 ok, that's all from my side about functional/fullstack jobs
16:38:41 anything else You want to add here?
16:38:46 no
16:39:59 me neither
16:40:02 ok, let's move on
16:40:12 #topic Tempest/Scenario
16:40:31 we already talked about issue with neutron-tempest-plugin jobs
16:40:43 I also reported today new bug for dvr jobs: https://bugs.launchpad.net/neutron/+bug/1858642
16:40:43 Launchpad bug 1858642 in neutron "paramiko.ssh_exception.NoValidConnectionsError error cause dvr scenario jobs failing" [High,Confirmed]
16:40:58 it happens quite often that tests fail due to such error
16:41:18 and I saw it only in dvr jobs (which are non-voting fortunately)
16:41:32 but if someone would have some time, it would be good to take a look
16:42:37 next problem is that neutron-tempest-plugin-fwaas job is unstable
16:42:48 I reported it here: https://bugs.launchpad.net/neutron/+bug/1858645
16:42:48 Launchpad bug 1858645 in neutron "Neutron-fwaas tempest job is not stable" [Critical,Confirmed]
16:43:06 and I proposed patch to make those jobs non-voting and remove from gate queue for now
16:43:12 https://review.opendev.org/#/c/701371/
16:43:30 if someone interested in neutron-fwaas could take a look, that would be great
16:44:03 will do
16:44:17 thx njohnston
16:44:18 despite my lack of interest in neutron-fwaas :-)
16:44:32 :)
16:44:49 and that's all I have for scenario jobs for today
16:46:08 anything else related to scenario jobs You have or can we move on?
16:47:06 +1 to move on
16:47:57 ok, last topic for today from me
16:47:58 #topic Periodic
16:48:25 still our mariadb job is failing 100% due to issue in Mariadb 10.1
16:48:45 is there anyone who has some cycles to check how to upgrade mariadb in this job?
16:48:54 https://review.opendev.org/#/c/698980/1/tools/fixup_stuff.sh
16:49:01 I think this patch won't fix it
16:49:12 because fixup_ubuntu is not being called
16:49:34 I'll try again this week
16:49:48 yes, I think it's called only in functional/fullstack jobs
16:50:24 ok, thx ralonsoh
16:50:32 if You would need any help with this, please ping me
16:50:45 #action ralonsoh to take a look how to use newer Mariadb in periodic job
16:50:57 thanks
16:51:00 ok, that's all from my side for today
16:51:15 anything else You want to discuss today?
16:51:20 no thanks
16:51:34 nothing more from me
16:51:35 nothing here (except happy new ci year)
16:51:53 happy new CI year :)
16:51:56 ahh, one thing
16:52:18 I just pushed my blog post to be online: http://kaplonski.pl/blog/failed_builds_per_patch/
16:52:28 it's about number of "rechecks" in our CI
16:52:37 +1
16:52:54 * bcafarel adds to the "to read" tabs
16:53:02 if You have a few minutes, please take a look and tell me if that makes any sense for You and if such metric may be useful for us somehow
16:53:02 +1
16:53:13 and thx njohnston for help with English in this post :)
16:54:19 my pleasure, your English is already very good :-)
16:54:26 thx njohnston :)
16:54:37 ok, I think we can finish our meeting now
16:54:41 thx for attending
16:54:45 and see You all online
16:54:47 o/
16:54:50 #endmeeting