15:00:37 #startmeeting neutron_ci 15:00:38 Meeting started Tue Mar 9 15:00:37 2021 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:39 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:41 The meeting name has been set to 'neutron_ci' 15:01:12 Hi 15:01:52 hi 15:01:56 hey again 15:02:43 ok, let's start 15:02:50 Grafana dashboard: http://grafana.openstack.org/dashboard/db/neutron-failure-rate 15:03:15 #topic Actions from previous meetings 15:03:21 slaweq to check failing qos migration tests in train neutron-tempest-dvr-ha-multinode-full job 15:03:26 Bug reported for nova for now https://bugs.launchpad.net/nova/+bug/1917610 15:03:27 Launchpad bug 1917610 in neutron "Migration and resize tests from tempest.scenario.test_minbw_allocation_placement.MinBwAllocationPlacementTest failing in neutron-tempest-dvr-ha-multinode-full" [Critical,Fix released] 15:03:28 Fixed in tempest https://review.opendev.org/c/openstack/tempest/+/778451 15:03:31 thx gibi for help with it :) 15:03:44 next one 15:03:50 ralonsoh to try to check how to limit number of logged lines in FT output 15:04:13 still checking this one, no progress yet 15:04:15 sorry 15:04:20 sure, np 15:04:32 can I assign it to You for next week? 15:05:13 sure 15:05:18 #action ralonsoh to try to check how to limit number of logged lines in FT output 15:05:20 thx 15:05:27 next one 15:05:29 ralonsoh to report bug with ip operations timeout in FT 15:05:36 one sec... 15:05:54 one patch: https://review.opendev.org/c/openstack/neutron/+/778735 15:06:01 LP: https://launchpad.net/bugs/1917487 15:06:02 Launchpad bug 1917487 in neutron "[FT] "IpNetnsCommand.add" command fails frequently " [Critical,New] - Assigned to Rodolfo Alonso (rodolfo-alonso-hernandez) 15:06:32 still working on 2) timeouts during the sysctl command execution 15:07:18 thx for that 15:07:31 I hope that with https://review.opendev.org/c/openstack/neutron/+/778735 functional tests will be a bit more stable 15:07:41 that would be nice 15:08:53 next one 15:08:54 bcafarel to check failing fedora based periodic job 15:09:17 so we had in fact a LP for that https://bugs.launchpad.net/neutron/+bug/1911128 15:09:18 Launchpad bug 1911128 in neutron "Neutron with ovn driver failed to start on Fedora" [Critical,In progress] - Assigned to Bernard Cafarelli (bcafarel) 15:09:47 I think main issue is ovs daemons do not run as root in Fedora, and so can not read the TLS certs (owned by stack) 15:10:09 I am testing this in https://review.opendev.org/c/openstack/neutron/+/779494 (could have had results if I had modified the correct job on first try...) 15:10:35 if it passes, it sounds like a good fix, we can have fedora+tls support added later, what do you think? 15:10:52 oh actually it passed zuul 15:11:00 that would be ok as workaround at least IMO 15:11:05 yes, it's green now 15:11:40 yes for proper support I am not sure how it would go in devstack, as "chmod 777" the certs is not really a nice fix :) 15:12:05 yes, but I think that it's perfectly valid to test it without ssl in that job 15:12:21 we don't want really to test ovs on fedora in that job 15:12:23 but neutron 15:12:27 :) 15:12:29 +1 15:12:45 if ralonsoh and lajoskatona are ok with that, I'm ok too 15:12:49 +1 15:12:58 +1 15:13:03 ok I will remove "wip" flag and then periodic can go back to green then 15:13:29 1 less periodic failure mail then? 15:13:34 ++ 15:13:36 cross fingers :) 15:13:38 thx a lot 15:13:56 lajoskatona: do You get emails about periodic jobs results? 15:15:07 yes, but recently too much 15:15:15 how to configure that? 15:15:21 I don't get such emails :/ 15:15:28 if there's only a few networking related I checked 15:15:36 I check it for you 15:15:42 thx 15:16:14 ok, lets move on 15:16:21 #topic Stadium projects 15:16:30 anything related stadium's ci? 15:17:00 I think this is where you can suscribe: http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-stable-maint 15:17:45 For stadiums: some patches are moving to the stable branches, nothing serious 15:17:55 thx lajoskatona 15:20:06 ok, thx 15:20:13 #topic Stable branches 15:20:27 bcafarel: except recent pip issue, anything else worth mentioning? 15:21:02 rest is mostly OK, I saw more grenade failures/timeouts than usual recently but not too bad (yet) 15:21:11 k 15:21:16 and thanks slaweq for all the CI improvement backports, they should help stable branches too! 15:21:33 yes, I made some but only up to train 15:21:40 in older branches we have many legacy jobs 15:21:50 and it would be too much to backport those things 15:22:55 sounds good, older EM branches if jobs get problematic, we can limit them 15:23:12 yeah 15:23:13 and though I stein needs some rechecks from time to time, rocky and queens are quite stable these days 15:24:33 let's move on 15:24:35 #topic Grafana 15:24:40 grafana.openstack.org/dashboard/db/neutron-failure-rate 15:25:15 in overall I think that things are pretty ok now 15:25:25 still functional/fullstack jobs are failing most 15:25:37 but they also went down a bit since last week 15:25:57 maybe it's due to mock of the ovn maintenance task there 15:26:22 do You have anything regading grafana dashboards for today? 15:28:08 ok, so lets talk about functional jobs then 15:28:10 #topic fullstack/functional 15:28:17 I have few things there 15:28:33 first one is interesting (for me) issue 15:28:50 I proposed some time ago patch to limit number of test workers in functional job 15:28:59 https://review.opendev.org/c/openstack/neutron/+/778151 15:29:09 and now I see that this job is failing 15:29:23 and many tests are failed due to "too many opened files" error 15:29:29 https://0bf054d7c7210f57ced8-38841c8dd9732a175234859ce574a8ea.ssl.cf5.rackcdn.com/778151/3/check/neutron-functional-with-uwsgi/6757358/testr_results.html 15:29:38 I have no idea why it is like that 15:29:45 should be the opposite... 15:29:48 do You maybe have any clues? 15:29:51 ralonsoh: exactly :) 15:30:09 but it's repeatable 15:30:17 I rechecked few times and had such problem 15:30:37 but only with zuul? 15:30:45 I have never seen locally 15:31:02 I didn't try to run all functional tests locally 15:32:48 I need to review that, I have no idea why this is happening 15:32:55 so, any help with that is more than welcome :) 15:33:02 thx ralonsoh 15:34:13 I also reported 2 new bugs 15:34:15 https://bugs.launchpad.net/neutron/+bug/1917487 15:34:16 Launchpad bug 1917487 in neutron "[FT] "IpNetnsCommand.add" command fails frequently " [Critical,New] - Assigned to Rodolfo Alonso (rodolfo-alonso-hernandez) 15:34:18 sorry 15:34:24 this one was reported by ralonsoh :) 15:34:35 I just found new occurence of that issue this week 15:34:37 :) 15:34:50 but I opened new bug https://bugs.launchpad.net/neutron/+bug/1918266 15:34:50 Launchpad bug 1918266 in neutron "Functional test test_gateway_chassis_rebalance failing due to "failed to bind logical router"" [High,Confirmed] 15:35:03 any volunteer to check that? 15:35:20 if not, I will ask jlibosva or otherwiseguy if they have some cycles to look 15:35:28 o/ 15:35:33 sorry, not this week, I has 6 bugs today for me 15:35:35 * jlibosva looks 15:35:47 ralonsoh: no need to sorry, I know You are busy :) 15:36:57 slaweq: I can have a look, tho I see we still don't collect OVN logs :-/ 15:37:07 we don't? 15:37:23 I thought we merged Your patch already 15:37:33 yeah, we did but the logs are not there 15:37:43 ah, sorry 15:37:46 the patch is not yet merged 15:38:01 wait :) 15:38:12 jlibosva: ok, so lets merge that patch first and then if the problem will happen again, I will ping You :) 15:38:17 fine for You? 15:38:36 yes, I'll check if perhaps the patch fixed some jobs only or if functional was inlcuded too 15:39:18 jlibosva: are we talking about https://review.opendev.org/c/openstack/neutron/+/771658 ? 15:39:24 if so, it's just for tempest jobs 15:40:24 slaweq: that's right 15:40:36 maybe I'm looking at wrong place 15:40:47 jlibosva: can You do the same for functional job? 15:40:58 slaweq: yes 15:41:08 You can make it "related to" to that LP mentioned above 15:41:24 #action jlibosva to fix collecting ovn logs in functional jobs 15:41:27 thx jlibosva 15:41:47 ok, generally it's all what I have for today 15:41:58 do You have anything else related to our CI to discuss? 15:43:08 nothing else from me 15:43:12 if not, I think we can finish meeting earlier today 15:43:13 nope 15:43:19 thx for attending the meeting 15:43:24 bye 15:43:26 and have a great week o/ 15:43:30 #endmeeting