15:00:26 #startmeeting neutron_ci 15:00:26 Meeting started Tue Apr 12 15:00:26 2022 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:26 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:26 The meeting name has been set to 'neutron_ci' 15:00:28 hi 15:00:32 hi 15:00:41 Grafana dashboard: https://grafana.opendev.org/d/f913631585/neutron-failure-rate?orgId=1 15:01:03 o/ 15:01:20 o/ 15:02:44 o/ 15:02:59 ok, I think we can start 15:03:01 #topic Actions from previous meetings 15:03:04 o/ 15:03:14 we have pretty long list of action items from last time 15:03:22 mlavalle will continue work on https://bugs.launchpad.net/neutron/+bug/1945283 15:03:41 I haven't found instances of that in openserach 15:03:47 opensearch 15:04:02 maybe I don't know how to use it well, yet 15:04:13 I'll keep searching and querying 15:05:18 ok 15:06:20 next one 15:06:22 slaweq to add extra logs to fullstack tests to investigate https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_366/830623/1/check/neutron-fullstack-with-uwsgi/366febd/testr_results.html 15:06:47 Patch https://review.opendev.org/c/openstack/neutron/+/834070 15:07:10 next one 15:07:11 slaweq to continue investigation on nova-neutron notifications issue https://bugs.launchpad.net/neutron/+bug/1963899 15:07:13 Fix merged https://review.opendev.org/c/openstack/neutron/+/834852 15:07:27 next one 15:07:29 lajoskatona to create yoga templates in neutron-tempest-plugin jobs 15:08:28 that is merged 15:08:37 great, thx 15:08:48 next one 15:08:49 mlavalle to add note about rechecking with reason 15:08:59 working on this 15:09:05 I'll finish this week 15:09:16 ok 15:09:17 cool 15:09:24 #action mlavalle to add note about rechecking with reason 15:09:50 next one 15:09:52 ralonsoh to propose patch to run mysql functional tests in serial 15:09:56 merged 15:10:39 great 15:10:50 next one 15:10:57 slaweq to investigate "inprogress" tests which are interrupted somehow https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_a75/833620/4/check/neutron-functional-with-uwsgi/a75e59b/testr_results.html 15:11:04 Bug reported https://bugs.launchpad.net/neutron/+bug/1966394 15:11:13 Patch https://review.opendev.org/c/openstack/neutron/+/836426 merged, but we also have now https://review.opendev.org/c/openstack/neutron/+/836120/ so we can investigate memory usage a bit more 15:11:37 and the last one from my list 15:11:39 slaweq to report bug about https://12ef4cf37bb4f9cf8615-49968699300828e6c9b78fd54dff75ef.ssl.cf5.rackcdn.com/828687/8/gate/neutron-functional-with-uwsgi/3688faf/testr_results.html 15:11:44 Bug https://bugs.launchpad.net/neutron/+bug/1966035 15:12:13 obondarev is investigating that (at least trying to) 15:12:26 and that's all actions from last meeting 15:12:36 I think we can move on to the next topic now 15:12:37 #topic Stable branches 15:12:43 any updates bcafarel ? 15:13:23 lajoskatona++ quickly fixed yoga branch last week, the rest is in good shape 15:13:38 victoria is going EM soon I have to check the pending backports to get a last release 15:13:46 ++ 15:13:53 ok 15:14:08 so we should also tag neutron-tempest-plugin for victoria when it will go EM 15:14:23 great point yes indeed! 15:14:31 isn't that done together with tempest? 15:14:39 lajoskatona: I'm not sure 15:14:58 I can check with kopecmartin or gmann 15:15:08 thx lajoskatona 15:15:36 #action lajoskatona to check with QA team if neutron-tempest-plugin tag for victoria EM will be done together with tempest 15:15:50 +1 15:16:08 ack, I will plan that for tempest this week sometime. 15:16:24 thx gmann 15:16:50 thanks 15:16:59 ok, I think we can move on now 15:17:06 #topic Stadium projects 15:17:10 lajoskatona: any updates here? 15:17:21 I checked weekly jobs today 15:17:26 https://zuul.openstack.org/buildsets?project=openstack%2Fnetworking-sfc&project=openstack%2Fnetworking-bagpipe&project=openstack%2Fnetworking-bgpvpn&project=openstack%2Fnetworking-odl&project=%09openstack%2Ftap-as-a-service&project=openstack%2Fneutron-vpnaas&project=openstack%2Fneutron-dynamic-routing&branch=master&pipeline=periodic-weekly&skip=0 15:17:41 and it seems that py39 job in networking-odl is failing 15:17:47 https://zuul.openstack.org/build/0412f7ed782d400598fe51a08949b08e 15:17:52 do You know about it? 15:17:56 I have tocheck that 15:18:16 another thing is that mnaser would like to work on vpnaas issues 15:18:31 I have to check with him what does this mean 15:18:42 that's good news 15:18:49 yeah, i can try and have a look at it, we use it in prod and we have a few atches we're running with locally (that are proposed upstream but would like to have them landed) 15:18:50 #action lajoskatona to check failing py39 job in networking-odl 15:19:28 mnaser: thanks 15:19:35 that's good news 15:20:04 anything else regarding stadium projects' CI? 15:20:07 mnaser: would be good to have somebody who can check time-to-time if the jobs are passing or if there's new bugs 15:20:08 or can we move on? 15:20:15 I think we can 15:20:25 lajoskatona: i will try and make that effort, as long as i can get +2's and +W's in return :) 15:20:50 mnaser: sure :) 15:20:52 thx 15:20:55 mnaser: I'll help with that 15:21:07 +1 15:21:12 will it be you submitting the patches? 15:21:27 mnaser: ^^^ 15:21:41 mlavalle: probably yeah, at least to fix the func jobs 15:21:46 mnaser: we can add you I suppose to the core group for vpnaas also 15:22:20 i'd be okay with that too, i think it would be also good to clear up the backlog of stuff there 15:22:42 still, he cannot +2 +W himself 15:23:15 mlavalle: that's true, but more eyes on hanging patches 15:23:25 +1 15:23:42 we can discuss that offline, and continue the ci meeting I think, sorry 15:23:45 +1 15:23:58 sure, no problem at all :) 15:24:06 ok, lets move one with CI meeting then 15:24:11 #topic Grafana 15:24:15 https://grafana.opendev.org/d/f913631585/neutron-failure-rate 15:24:57 it seems for me that still our biggest problem is functional tests job in check queue :/ 15:25:32 and in gate queue too 15:25:47 most of these failures are due to pyroute2 15:26:23 ralonsoh: do we have to use new version? 15:26:25 I saw many failures related to the timeout while waiting on the transition of the router 15:26:41 we should merge your retry patch 15:27:12 https://review.opendev.org/c/openstack/neutron/+/833015 15:28:08 ahh, ok, I forgot that, and thought it is merged :-) 15:28:16 approved :) 15:28:17 slaweq, btw, I'm working on a patch to replace the keepalived-state-change monitor 15:28:29 instead of using a python binary 15:28:32 to use a shell script 15:28:33 ralonsoh: great, I hope that will help too 15:28:47 (but I didn't have time to continue wiht that) 15:28:53 because that job is basically killing us currently 15:29:17 should we maybe mark those l3 ha tests as unstable temporary? 15:29:25 to make job more stable 15:29:27 wdyt? 15:29:31 I wouldn't 15:29:39 it is not blocking completely the CI 15:29:58 completly no, but number of rechecks is going high 15:30:03 ok then 15:30:42 ok, maybe it's not that bad yet 15:30:51 so lets wait for next week 15:32:38 +1 15:33:40 ok, so for functional job failures, list of examples is in the etherpad https://etherpad.opendev.org/p/neutron-ci-meetings#L67 15:34:07 but most of them (or even all) are related to those pyroute2 issues, like e.g.: 15:34:12 https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_170/836120/5/gate/neutron-functional-with-uwsgi/1701f9a/testr_results.html 15:34:27 https://b5e0246d2a61241093f5-52f9a55e019565a745f2bb8d45b0d6fa.ssl.cf1.rackcdn.com/836725/2/check/neutron-functional-with-uwsgi/ad2bdce/testr_results.html 15:34:36 or to the issue with router transition to primary state: 15:34:43 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_868/835786/1/check/neutron-functional-with-uwsgi/8688fff/testr_results.html 15:34:43 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_0e9/837146/1/check/neutron-functional-with-uwsgi/0e9f5a8/testr_results.html 15:35:53 so I think we can move on to the next topic now 15:35:57 #topic Tempest/Scenario 15:36:11 here I (again) found one old issue with linuxbridge job: 15:36:15 https://4de9725805f5e105f3a0-621008116cc26c58a3190713e2eda213.ssl.cf1.rackcdn.com/836863/3/check/neutron-tempest-plugin-scenario-linuxbridge/95dd563/testr_results.html 15:36:22 failed to transition device to DOWN state 15:36:29 I think lajoskatona was trying to fix it some time ago 15:36:37 right lajoskatona ? 15:37:18 slaweq: yeah, I think I remember 15:37:47 now, giving what we discussed during ptg, should we simply skip that test in the linuxbridge job? 15:37:51 wdyt? 15:38:02 +1 15:38:12 ok, I will propose patch for that 15:38:13 with a NOTE 15:38:26 maybe we should add this to the docs first 15:38:26 this is the patch for the above issue I think: https://review.opendev.org/c/openstack/neutron/+/827728 I have to check it again when I have some time 15:38:30 #action slaweq to skip failing linuxbridge scenario test 15:38:31 just the warning 15:38:38 ralonsoh: sure 15:38:41 perfect 15:38:46 +1 15:39:17 I also started patch https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/836912 15:39:32 to remote neutron-tempest-plugin-api job and run those tests in the "scenario" jobs 15:39:39 but it's not ready yet 15:39:47 I need to check why some tests are failing there 15:40:01 but please be ready for review of it soon :) 15:40:05 sure 15:40:21 ok, and last topic from me today 15:40:27 #topic Periodic 15:40:37 Jobs results: http://zuul.openstack.org/buildsets?project=openstack%2Fneutron&pipeline=periodic&branch=master 15:40:41 speaking of tempest, if you have a chabce, review https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/834411 15:40:42 we do have couple of issues to investigate there 15:40:49 it's an easy one 15:40:56 sure mlavalle 15:41:17 getting back to periodic 15:41:23 neutron-ovn-tempest-ovs-master-fedora - broken again 15:41:27 anyone wants to check it? 15:41:28 :) 15:41:37 again 15:41:38 and the same for neutron-ovs-tempest-slow 15:41:42 I'll look at it 15:41:48 mlavalle: thx 15:42:04 #action mlavalle to check neutron-ovn-tempest-ovs-master-fedora periodic job 15:42:11 so I will check neutron-ovs-tempest-slow 15:42:20 #action slaweq to check neutron-ovs-tempest-slow periodic job 15:42:40 there is also "propose-translation-update" which has some problem with pyroute2 version 15:42:57 https://zuul.openstack.org/build/8a44d67db5474acdb1cfce78d4ba04cd 15:43:06 anyone have cycles to check it this week? 15:43:13 i can check 15:43:18 thx ykarel 15:43:40 #action ykarel to check failing propose-translation-update periodic job 15:44:04 ok, and that's all what I had for today 15:44:12 do You have any other topics to discuss today? 15:44:20 ah, this is the doc/requirements.txt file 15:44:21 if not, I will give You few minutes back 15:44:32 well, not a short one but we didn't get to the top of the hour 15:44:35 not bad 15:44:38 just one little topic 15:44:40 https://review.opendev.org/c/openstack/neutron-lib/+/828738/18/.zuul.yaml 15:44:45 this is an example 15:44:59 I know we are always talking about reducing CI jobs 15:45:14 but having tempest jobs in n-lib is very useful 15:45:24 I agree 15:45:30 do you agree with adding them to the n-lib zuul deifnition? 15:45:34 that's why we added neutron-tempest-plugin-api job there some time ago 15:45:51 but if You think that adding scenario jobs too also make sense, I'm ok with that 15:45:59 me too 15:46:05 I can add only OVS native and OVN 15:46:17 agree 15:46:18 skipping hybrid and Linux Bridge 15:46:20 perfect 15:46:21 +1 15:46:22 thanks a lot 15:46:22 actually if You will remove neutron-tempest-plugin-api from check queue in that Your patch, it will help me with my "consolidation" patch :) 15:46:26 we should try to reduce the number of jobs but not to the point where we are opening holes 15:46:42 in our testing 15:46:44 perfect 15:47:09 #action ralonsoh to add neutron-tempest-plugin jobs to the neutron-lib CI 15:47:18 thx ralonsoh for bringing it up 15:47:37 small one: https://review.opendev.org/c/openstack/neutron/+/837552 to fix arm64 unit test jobs 15:48:20 ykarel: I try to check if the pkg is needed for the API tests for NDP proxy 15:48:38 but let's remove it from bindep.txt now 15:49:01 lajoskatona, yes bindep.txt is not getting used in devstack jobs, so i think it's fine 15:49:53 hold on 15:50:09 ok, but it is better to have an explanation from the commiter 15:50:37 * mlavalle just W+ it 15:50:50 ralonsoh: you mean from NDP proxy dvelopers? 15:51:14 I think this is a leftover, this is not needed for NDP, if I'm not wrong 15:51:24 in any case, this is a feature in development now 15:51:29 we can read it if needed 15:51:44 ++ 15:52:07 ok, ack 15:52:52 ok, I think that with this we can conclude the meeting for today :) 15:52:57 thx for attending 15:52:59 bye! 15:53:07 o/ 15:53:08 have a great week and see You online 15:53:10 bye 15:53:13 #endmeeting