15:00:06 #startmeeting neutron_ci 15:00:06 Meeting started Tue Jun 28 15:00:06 2022 UTC and is due to finish in 60 minutes. The chair is slaweq. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:06 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:06 The meeting name has been set to 'neutron_ci' 15:00:32 hi 15:00:41 hi 15:00:51 hi 15:01:03 o/ 15:01:04 is this a video or irc meeting? 15:01:27 mlavalle today on irc 15:01:32 ack 15:01:55 I think we can start as lajoskatona is not available today 15:02:02 Grafana dashboard: https://grafana.opendev.org/d/f913631585/neutron-failure-rate?orgId=1 15:02:02 Please open now :) 15:02:07 #topic Actions from previous meetings 15:02:17 slaweq to fix functiona/fullstack failures on centos 9 stream: https://bugs.launchpad.net/neutron/+bug/1976323 15:02:39 I didn't made any progress on that one this week 15:02:50 I will assign it to me for next week again 15:02:55 #action slaweq to fix functiona/fullstack failures on centos 9 stream: https://bugs.launchpad.net/neutron/+bug/1976323 15:03:03 next one 15:03:05 ykarel to update Neutron-tempest-plugin jobs graphs in Grafana 15:03:47 hi 15:04:17 done 15:04:18 https://review.opendev.org/c/openstack/project-config/+/845975 15:04:18 https://review.opendev.org/c/openstack/project-config/+/845978 15:04:49 both merged already 15:04:52 thx ykarel 15:05:07 next one 15:05:09 ykarel to increase swap size in the neutron-tempest-plugin jobs 15:05:32 Done with https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/845888 15:05:38 thx a lot 15:05:49 I didn't saw similar issues in last days 15:05:59 next one 15:06:01 slaweq to move fedora periodic job to centos9 stream 15:06:16 I'm slowly progressing with this one in https://review.opendev.org/c/openstack/neutron/+/844335 15:06:33 but for some reason ovs-vswitchd is crashing there 15:06:43 I will need to check why it's like that 15:07:19 #action slaweq to move fedora periodic job to centos9 stream 15:07:31 next one 15:07:33 ykarel to propose fix/workaround for the fips jobs and missing rabbitmq-server package 15:07:51 Done with https://review.opendev.org/c/openstack/neutron/+/846001 15:08:09 thx a lot 15:08:19 and the last one 15:08:21 ykarel to fix propose-translation-update periodic job 15:08:58 done, that needed couple of iterations https://review.opendev.org/q/topic:fix-propose-updates 15:09:38 but it seems that it's fixed already as periodic jobs were green few days this last week 15:09:40 thx a lot for that 15:10:00 yatin proposed openstack/neutron stable/yoga: Set nslookup_target in FIPS jobs https://review.opendev.org/c/openstack/neutron/+/847995 15:10:03 any questions/comments regarding those action items? 15:10:04 one action item missing in this list is https://review.opendev.org/c/openstack/neutron/+/845181 15:10:20 ups, sorry mlavalle 15:10:21 for fips job needs backport in yoga too 15:10:27 just pushed 15:10:32 I just addressed ralonsoh's suggestioins. It should be good to go 15:10:37 +2 15:10:50 along with https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/845646 15:11:21 the first patch depends on the second 15:11:35 I added it to my review list for tomorrow 15:11:42 thanks :-) 15:12:17 thank You for working on that important stuff 15:12:44 ykarel I will also review Your backport :) 15:12:53 thx 15:13:59 ok, I think we can move on 15:14:01 #topic Stable branches 15:14:07 bcafarel any updates? 15:14:13 or new issues 15:14:41 none that I spotted, though there are not many backports in the queue right now (which is good also!) 15:14:59 openstack as a whole is moving to EOL pike but we already did for networking not so long ago, so nothing on us 15:15:37 ok, thx for the updates 15:15:53 as there is no Lajos today, I think we can skip stadium projects topic 15:16:01 and move directly to the next one 15:16:07 #topic Grafana 15:16:39 it looks pretty ok, except rally jobs which are broken totally 15:17:04 other jobs are pretty ok IMO 15:17:04 well, there is Lajos, just not here today :-) 15:17:32 mlavalle :D true, sorry 15:18:23 anything regarding grafana You want to discuss today? 15:18:56 for rally i pushed a patch, but rally CI is in bad shape 15:19:04 https://review.opendev.org/c/openstack/rally-openstack/+/847879 15:19:06 ykarel++ 15:20:06 yeah, I wanted to talk about it later in the meeting :) 15:20:12 but we can talk about it now 15:20:33 what if we would make those jobs non-voting temporary? 15:20:43 or do You expect that your fix will be merged soon in rally? 15:20:59 I don't want to block our gate for too long 15:22:09 +1 to unblock , ovn one is already non voting, temporary ok to make ovs too non voting 15:22:30 there are multiple issues in rally , so i doubt it will get merged soon 15:22:31 ykarel will You do it? 15:22:35 yes sure 15:22:38 thx a lot 15:22:47 ok, next topic then 15:22:51 #topic Rechecks 15:23:10 we are still below 1 recheck in average to get patches merged 15:23:23 even below 0.5 rechecks last week 15:23:33 so good job :) 15:23:35 that is really nice 15:24:16 #topic fullstack/functional 15:24:29 functional tests are in better shape recently 15:24:35 but still we have some failures there 15:25:02 test_agent_updated_at_use_nb_cfg_timestamp - AssertionError: Chassis timestamp: 1655824139000, agent updated_at: 2022-06-21 15:08:58+00:00 15:25:08 https://a725fc0d7f8b52d360c4-66ce5f117c645ca152390f12473225b2.ssl.cf5.rackcdn.com/797120/19/gate/neutron-functional-with-uwsgi/4b9fd54/testr_results.html 15:25:19 ykarel reopened https://bugs.launchpad.net/neutron/+bug/1974149 15:25:55 seen this only once 15:25:59 and there is fix proposed https://review.opendev.org/c/openstack/neutron/+/847349 15:26:37 another one 15:26:45 test_virtual_port_host_update - AssertionError: Expected 'update_virtual_port_host' to be called once. Called 0 times. 15:26:51 https://cb16041cfb3c54cedd2e-24bc61d83ed5aece64ab40b405cf025c.ssl.cf5.rackcdn.com/797121/16/check/neutron-functional-with-uwsgi/12ca1dc/testr_results.html 15:27:00 ykarel reopened https://bugs.launchpad.net/neutron/+bug/1971672 15:27:36 ralonsoh it seems that You were working on it in the past 15:27:42 yeah, not now 15:28:07 will You be able to check it again? 15:28:16 I don't think it's urgent this week 15:28:21 sure 15:28:28 thx a lot 15:28:39 next one 15:28:40 neutron.tests.functional.services.trunk.drivers.openvswitch.agent.test_trunk_manager.TrunkManagerTestCase.test_connectivity 15:28:45 https://cb16041cfb3c54cedd2e-24bc61d83ed5aece64ab40b405cf025c.ssl.cf5.rackcdn.com/797121/16/check/neutron-functional-with-uwsgi/12ca1dc/testr_results.html 15:30:55 anyone saw something like that already? 15:31:10 no sorry 15:32:09 no I haven't 15:32:26 but the error message there is strange 15:32:28 RuntimeError: Process ['ping', '-W', '1', '-c', '3', '192.168.0.1'] hasn't been spawned in 20 seconds. Return code: 0, stdout: PING 192.168.0.1 (192.168.0.1) 56(84) bytes of data.... (full message at https://matrix.org/_matrix/media/r0/download/matrix.org/HKNHjohjKrArohSCNEOkafNH) 15:32:41 "ping wasn't spawn" but there is result from that ping command 15:32:58 and it seems to be working fine 15:33:53 ok, I will try to take a look deeper into it early next week 15:34:11 #action slaweq to check trunk connectivity test failure https://cb16041cfb3c54cedd2e-24bc61d83ed5aece64ab40b405cf025c.ssl.cf5.rackcdn.com/797121/16/check/neutron-functional-with-uwsgi/12ca1dc/testr_results.html 15:34:23 next one 15:34:26 test_metadata_proxy_respawned 15:34:33 https://fd50651997fbb0337883-282d0b18354725863279cd3ebda4ab44.ssl.cf5.rackcdn.com/846960/1/gate/neutron-functional-with-uwsgi/baf4db6/testr_results.html 15:34:33 https://628f2b7919091567c7a1-482044f534933477a9da6fbd27b4ad69.ssl.cf1.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-functional/154a050/testr_results.html 15:34:42 this one failed twice at least this week 15:35:39 do you have a LP bug? just to document it 15:36:18 nope 15:36:25 I'll open it 15:36:28 thx a lot 15:36:35 and someone will need to check it 15:36:48 I'll try 15:36:55 thx a lot 15:36:58 ok, next one 15:37:01 https://fd50651997fbb0337883-282d0b18354725863279cd3ebda4ab44.ssl.cf5.rackcdn.com/846960/1/gate/neutron-functional-with-uwsgi/baf4db6/testr_results.html 15:37:11 yatin proposed openstack/neutron master: Temporary make rally job non voting https://review.opendev.org/c/openstack/neutron/+/847989 15:37:14 problem with db migration tests (again) 15:37:33 Arnau Verdaguer proposed openstack/neutron master: Migration revert plan https://review.opendev.org/c/openstack/neutron/+/835638 15:37:36 with Psql? 15:37:37 ralonsoh can You check that one when You will have some time? 15:37:49 yes, it's Psql test 15:37:54 yes, I think I only modified the mysql ones 15:38:01 ok 15:38:27 #action ralonsoh to check test_walk_versions failure https://fd50651997fbb0337883-282d0b18354725863279cd3ebda4ab44.ssl.cf5.rackcdn.com/846960/1/gate/neutron-functional-with-uwsgi/baf4db6/testr_results.html 15:38:49 next one 15:39:08 failure in test_ha_router_lifecycle 15:39:14 https://d2e721b9a6905a827b60-69bfa1706af4af4c0b48b8bfd809f2ca.ssl.cf2.rackcdn.com/835638/15/check/neutron-functional-with-uwsgi/56a2c78/testr_results.html 15:39:14 https://fb75ecc35c58d9fe2410-512bee2f5825275b34720067f00890fc.ssl.cf2.rackcdn.com/840419/8/check/neutron-functional-with-uwsgi/e15f93a/testr_results.html 15:39:39 I though that those tests should be skipped when fails like that 15:39:51 maybe that one is going through some other path and I missed it somehow 15:39:54 I will check it 15:40:16 #action slaweq to check why test_ha_router_lifecycle test wasn't skipped as it should be in case of failure 15:40:52 and the last one on that list 15:40:53 test_dvr_router_lifecycle_ha_with_snat_with_fips 15:40:53 https://7892a49fff80be41bd93-7937b4b8835d06e87bcc77aa86f44280.ssl.cf5.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-functional-with-uwsgi-fips/99f97f6/testr_results.html 15:41:13 again problem with device not found in the namespace 15:41:24 but I don't really know why it's like that 15:41:29 maybe someone wants to check it 15:41:45 I'll try this week 15:42:39 the common patter in those cases is that interface is added, deleted, added in the ovs-vswitchd log 15:42:43 see https://7892a49fff80be41bd93-7937b4b8835d06e87bcc77aa86f44280.ssl.cf5.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-functional-with-uwsgi-fips/99f97f6/controller/logs/openvswitch/ovs-vswitchd_log.txt 15:42:55 You can grep for qr-81815200-dd 15:43:03 2022-06-28T02:43:01.305Z|03101|bridge|INFO|bridge test-br32e76b53: added interface qr-81815200-dd on port 2 15:43:08 2022-06-28T02:43:01.465Z|03105|bridge|INFO|bridge test-br32e76b53: deleted interface qr-81815200-dd on port 2 15:43:13 2022-06-28T02:43:01.656Z|03107|bridge|INFO|bridge test-br32e76b53: added interface qr-81815200-dd on port 2 15:43:18 2022-06-28T02:43:01.749Z|03117|bridge|INFO|bridge test-br32e76b53: deleted interface qr-81815200-dd on port 2 15:43:41 but I have no idea what can be problem there really 15:43:46 so maybe we are processing the router events in the wrong order 15:43:58 ralonsoh maybe 15:44:35 if You can check it with fresh look, that would be great 15:44:50 sure 15:45:40 thx a lot 15:46:01 #action ralonsoh to check missing qr- device in the namespace 15:46:22 ok, that's all issues with functional tests for today 15:46:25 any questions/comments? 15:47:52 ok, lets move on 15:47:57 #topic Tempest/Scenario 15:48:05 here we have 2 issues for today 15:48:13 first one with live migration and trunk ports: 15:48:19 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_8c3/797121/16/check/neutron-ovs-tempest-multinode-full/8c3b8eb/testr_results.html 15:48:26 but it seems for me like it's nova issue 15:48:35 because migration wasn't done properly 15:49:13 if that will happen more often I think we can ask nova team to take a look but for now I wouldn't bother with that too much 15:49:16 we talked about something similar downstream yesterday 15:50:33 mlavalle do You want to check it together with d/s issue? 15:50:37 maybe it's the same one 15:51:07 slaweq: yeah, I would need a pointer to the downstream bugzilla 15:51:18 was someone assigned to it? 15:51:50 are You talking about https://bugzilla.redhat.com/show_bug.cgi?id=2097160 ? 15:52:09 if so, it's for sure different issue 15:52:12 that's it 15:52:24 I don't think both errors are related 15:52:32 the CI error has a live migration problem 15:52:32 ok 15:52:38 it just rang a bell 15:52:49 but the BZ happens once the migration failed 15:53:16 yeah, I also don't think those are the same issues 15:53:25 anyway, lets not bother with that too much for now 15:53:30 :) 15:53:41 ok 15:54:03 last issue for today on my list is qos scenario test failure 15:54:04 https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_4cd/847832/2/check/neutron-tempest-plugin-linuxbridge/4cd9322/testr_results.html 15:54:10 in linuxbridge job 15:54:25 I don't think anyone of us will have any cycles to check it 15:55:32 and that's basically all what I had for today 15:55:43 do You have any other ci related topics to discuss today? 15:55:53 no thanks 15:55:56 none from me 15:56:15 just please check non voting patch https://review.opendev.org/c/openstack/neutron/+/847989 15:56:16 ok, so I will give You 4 minutes back 15:56:18 neither do I 15:56:24 thanks! 15:56:35 thx for attending the meeting 15:56:42 ykarel I already +2 it 15:56:48 thx 15:56:53 have a great week and see You online! 15:56:56 #endmeeting