15:00:39 #startmeeting neutron_ci 15:00:39 Meeting started Tue Aug 22 15:00:39 2023 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:39 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:39 The meeting name has been set to 'neutron_ci' 15:00:50 hi 15:01:05 o/ 15:01:09 ping bcafarel, lajoskatona, mlavalle, mtomaska, ralonsoh, ykarel, jlibosva, elvira 15:01:11 Grafana dashboard: https://grafana.opendev.org/d/f913631585/neutron-failure-rate?orgId=1 15:01:23 slaweq: irc or video? 15:01:26 o/ 15:01:26 o/ 15:01:36 mlavalle irc today 15:01:54 I think we can start 15:01:56 #topic Actions from previous meetings 15:02:09 ralonsoh to check failed neutron.tests.fullstack.test_l3_agent.TestHAL3Agent.test_keepalived_multiple_sighups_does_not_forfeit_primary test 15:02:25 no, I didn't start with this one 15:02:26 sorry 15:02:41 no worries 15:02:58 I didn't saw this issue recently so maybe we can just wait for new occurences 15:03:01 wdyt? 15:03:05 and maybe then get back to this 15:03:09 ok for me 15:03:49 ok, so next one 15:03:50 mtomaska to check failing neutron-functional-with-sqlalchemy-master periodic job 15:03:55 https://review.opendev.org/c/openstack/neutron/+/890939 15:04:08 should be fixed when that patch merges 15:04:12 o/ 15:04:15 thx for the fix mtomaska, I see it's in the gate now 15:04:54 so, last one from previous meeting: 15:04:55 lajoskatona will send DNM patch for neutron-dynamic-routing to check jobs 15:05:18 It was sent and frickler actually found the issue in os-ken 15:05:38 so I sent the DNM and frickler done the rest of the work 15:06:02 thx lajoskatona and frickler 15:06:07 so are we good now with it? 15:06:08 and quite a journey it was ;) 15:06:12 or is it in progress? 15:06:19 we still need to test after os-ken release 15:06:29 because it isn't self testing as mentioned earlier 15:06:29 if we have the os-ken release it should be fine 15:06:40 I only tested on the held node I used for debugging 15:06:41 ok :) 15:06:56 or may be fix the jobs so os-ken in patch get's used 15:06:56 thx a lot to both of You 15:07:40 ykarel do You mean to use os-ken from master in those jobs? 15:08:09 that was discussed in the previous meeting 15:08:25 slaweq, iiuc what frickler mean the jobs running against os-ken patches not using those patches but released version 15:08:25 n-d-r job in os-ken CI is not isntalling the tested patch 15:08:28 ahh, sorry. I probably missed it then 15:08:42 hmm i was also out last meeting so might be missing context 15:09:08 so i meant we should instead fix the jobs to work with os-ken patches and not wait for actual release to test :) 15:09:26 well the release is due this week anyway 15:09:39 but fixing the job would be a good task to do, too 15:09:41 me also not sure if this is a regression or it never worked for os-ken 15:09:56 ok, so this needs to be fixed indeed 15:12:37 frickler can You maybe check it this week and open LP if we need to fix jobs in os-ken? 15:12:47 so we can track it at least and not forget about it 15:13:04 well I'm pretty sure that it is broken 15:13:24 ralonsoh was the one who wanted to take another look and open the bug 15:13:33 ok, thx 15:13:42 so ralonsoh will You open LP for it? 15:13:44 yes, I'll check that is broken in the CI execution 15:13:50 yes, after checking the logs 15:14:09 thx 15:14:27 #action ralonsoh to check n-d-r os-ken jobs and open LP related to it 15:14:38 ok, I think we can move on 15:14:39 #topic Stable branches 15:14:49 bcafarel anything new/urgent? 15:15:07 no, all good overall :) 15:15:23 recent backports passed gates smoothly, up to ussuri 15:16:19 ok 15:16:24 so I think we can move on then 15:16:30 #topic Stadium projects 15:16:48 anything to discuss here? except n-d-r and os-ken which we already talked about 15:17:00 We discussed n-d-r, so that is one thing to keep an eye on 15:17:11 the other topic is bagpipe 15:17:27 it is failing with SQLAlchemy 2, I proposed a patch: https://review.opendev.org/c/openstack/networking-bagpipe/+/891325 15:17:59 but some test still fails randomly for the sfc driver, so I have to spend some more time with it 15:18:38 that's it for the stadiums 15:18:50 are those random failures also related to SQLAlchemy 2.0? or something different? 15:19:28 no I see them only with sqlalchemy2 15:19:57 ok, so maybe ralonsoh and/or stephenfin will be able to help with them somehow 15:20:08 I'll try to find the issue there 15:20:14 thx a lot 15:20:27 thanks 15:20:56 next topic then 15:20:56 #topic Grafana 15:22:00 https://grafana.opendev.org/d/f913631585/neutron-failure-rate 15:22:08 I see that rally jobs were broken last week but it's fixed on rally side already 15:22:17 other than that it's as usual 15:23:24 +1 15:24:13 I think we can move on then 15:24:15 #topic Rechecks 15:24:51 it was a bit better last week already but then there was this issue with rally and issue with GLOBAL_VENV in devstack which made it a bit worst 15:25:06 Merged openstack/neutron master: [sqlalchemy-20] TableClause.insert constructs Insert object https://review.opendev.org/c/openstack/neutron/+/890939 15:25:09 but those problems are already fixed so I think it's pretty ok in overall 15:25:38 so I think we can move on to talk about some specific failures 15:25:44 #topic fullstack/functional 15:26:01 here I found one new (for me) failure in the neutron.tests.functional.plugins.ml2.drivers.ovn.mech_driver.ovsdb.test_maintenance.TestMaintenance.test_port_forwarding 15:26:08 https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_8a2/periodic/opendev.org/openstack/neutron/master/neutron-functional-with-pyroute2-master/8a279fa/testr_results.html 15:26:36 it was in periodic job so it's not related to any patch in progress 15:27:07 anyone wants to check it deeper maybe? Or if not we can wait if it will happen more often 15:27:07 this is a callback, could be just a race condition 15:27:20 I can check it and maybe limit the check to the expected call 15:27:30 ralonsoh++ thx a lot 15:27:52 #action ralonsoh to check failure in the neutron.tests.functional.plugins.ml2.drivers.ovn.mech_driver.ovsdb.test_maintenance.TestMaintenance.test_port_forwarding 15:28:23 and that's all regarding functional/fullstack tests 15:28:25 #topic Tempest/Scenario 15:28:38 here I noticed kernel panic in guest vm (again?): 15:28:45 https://cbf8616008e0e2c2dfec-9346de3bff5d83c6d90eefafd8632b44.ssl.cf1.rackcdn.com/884474/13/check/tempest-integrated-networking/0e81b62/testr_results.html 15:29:02 I'm not really even sure what Cirros version was used there 15:29:22 so maybe it's not an issue at all but just wanted to highlight here that I saw it again 15:29:30 cirros 6.2 15:29:46 so should be good, right? 15:29:54 maybe it's new issue then, idk 15:30:00 no shouldn't be related to cirros 6.2 15:30:23 i recall it's an old issue and it's workedaround in our job by using uec images 15:30:55 ykarel - possibly as this issue was in the tempest-integrated-networking job which is in tempest repo 15:31:01 yeap 15:31:13 so if that will happen more often we will maybe need to propose same workaround in that job too 15:31:24 lets keep an eye on it for now 15:31:30 is that ok for You? 15:31:42 +1 15:31:50 +1 15:31:55 +1 15:32:07 thx 15:32:11 so next topic 15:32:13 #topic grenade 15:32:33 I saw (again just once but wanted to mention it) some issue related to keystone: https://53ec660a16b30e470118-779b81139f4f29276caf956abf2a020f.ssl.cf2.rackcdn.com/890939/3/gate/neutron-ovs-grenade-dvr-multinode/f868b9c/controller/logs/grenade.sh_log.txt 15:32:59 did You saw something like that already? Is it something what we should report maybe to the keystone team? 15:33:16 yes, could be usefull for them to know this 15:33:31 doesn't seem to be related to Neutron 15:33:42 ok, I will let know about it to knikolla 15:34:50 #topic Periodic 15:35:07 here I saw 2 issues which we need to handle somehow: 15:35:18 fullstack fips job broken: https://zuul.openstack.org/build/b87d8c3037a1417193c865bc576ac593 15:35:30 and Centos 9 Stream jobs broken: 15:35:30 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_cbf/periodic/opendev.org/openstack/neutron/master/neutron-ovn-tempest-ovs-master-centos-9-stream/cbf72a9/job-output.txt 15:35:30 https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_533/periodic/opendev.org/openstack/neutron/master/neutron-ovn-tempest-ovs-release-fips/5331cd4/job-output.txt 15:35:45 anyone wants to check those? 15:36:07 sounds related to GLOBAL_VENV thing 15:36:20 right 15:36:35 * haleyb noticed the centos9 job too trying to recreate a bug, but didn't dig into it 15:36:49 I'll check the centos9 error 15:37:07 cf. https://review.opendev.org/c/openstack/tempest/+/891517 15:37:24 ok, I will check fullstack fips job then 15:37:39 ok so the centos9 issue seems to be solve there 15:37:40 #action ralonsoh to check Centos 9 stream jobs failures 15:37:59 thx frickler 15:38:13 #action slaweq to check fips fullstack job failures 15:38:38 that's all regarding periodic jobs from me 15:38:43 #topic On Demand 15:38:51 do You have anything else to discuss today? 15:39:22 just one thing as more people are here 15:39:31 i raised it over the patch https://review.opendev.org/c/openstack/neutron/+/892134 15:40:05 I think I addressed your comment 15:40:08 right? 15:40:21 I removed the experimental job 15:40:29 ralonsoh, yeap related to duplicating jobs in periodic/experimental and check 15:40:48 yeah, let's have it only in check queue 15:41:05 but i had a concern to avoid blocking CI with such jobs if master commits from sqlalchemy and alembic 15:41:11 ralonsoh but I also agree with ykarel that this job maybe should be non-voting one in check queue 15:41:25 at this point, that should always work 15:41:35 we should not include anything not compatible with sqlalchemy 2.0 15:42:04 but if you agree on this, I'll mark it as non-voting 15:42:05 yeah, but the point is - will sqlalchemy not merge anything breaking for us? :) 15:42:31 ok, I'll push a new patch marking it as non-voting 15:43:01 but we should pay attention to job during review :) 15:43:30 yeap non-voting jobs might get unnoticed 15:43:51 so maybe keep it voting for now and we can always switch it to non-voting in case of any problems from sqlalchemy side 15:43:57 perfect 15:44:12 so as is now 15:44:19 +1 15:44:22 ok 15:44:23 ok and hope it get's all good +1 15:44:36 I also have one additional topic/announcement for today 15:45:08 as You probably noticed, I'm chair of this CI meeting for quite some time (6+ years already if I'm not mistaken) 15:45:20 and recently I though it would be good to pass it to someone else 15:45:35 so starting next week ykarel will be our new chair of the CI meeting 15:45:47 slaweq, thanks for all these years! 15:45:51 thx ykarel for stepping up in this role :) 15:45:56 and thanks ykarel for steeping up! 15:46:06 thanks for the efforts to keep these topics in focus 15:46:15 Thanks slaweq for all your efforts for all your efforts in those years 15:46:44 thanks for leading the meeting for so long slaweq 15:46:49 and welcome yka 15:46:53 ykarel: 15:47:06 thx everyone 15:47:12 and that's all from me for today 15:47:19 welcome ykarel as chair of this meeting:-) 15:47:26 if there are no other topics, I will give You back few minutes today 15:48:09 ok, thx for attending and have a great week everyone 15:48:13 #endmeeting