Tuesday, 2023-09-05

opendevreviewyatin proposed openstack/neutron master: [CI] Bump OVS_BRANCH in ovs/ovn source deploy jobs  https://review.opendev.org/c/openstack/neutron/+/89370005:01
opendevreviewLajos Katona proposed openstack/networking-bagpipe master: CI: Change focal nodeset to jammy  https://review.opendev.org/c/openstack/networking-bagpipe/+/89370908:40
*** ralonsoh_ is now known as ralonsoh08:41
ralonsohlajoskatona, I've reviewed (and approved) your patches for bagpipe and sfc08:44
ralonsohis there any other one missing?08:44
ralonsohhttps://review.opendev.org/c/openstack/networking-bagpipe/+/87946308:45
ralonsoh^ if I'm not wrong, this one depends on the sfc patch08:45
ralonsohso we'll wait until the other one merges08:45
*** dmitriis is now known as Guest187109:12
lajoskatonaralonsoh: I still strugling with finding the root cause of the failures of the bagpipe sfc driver with sqlalchemy 209:20
lajoskatonaralonsoh: there's another one for the tempest job of bagpipe: https://review.opendev.org/c/openstack/networking-bagpipe/+/89370909:21
lajoskatonaralonsoh: I just realized that this job uses focal, I hope it is replacable with jammy :-)09:22
ralonsohI'll follow this patch09:27
opendevreviewMerged openstack/networking-sfc master: [sqlalchemy-20] Add context wrapper to ppg operations  https://review.opendev.org/c/openstack/networking-sfc/+/89366009:54
opendevreviewRodolfo Alonso proposed openstack/neutron master: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355210:50
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.1: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355310:52
opendevreviewRodolfo Alonso proposed openstack/neutron stable/zed: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355410:52
opendevreviewRodolfo Alonso proposed openstack/neutron stable/xena: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355510:52
opendevreviewRodolfo Alonso proposed openstack/neutron stable/yoga: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355610:52
opendevreviewRodolfo Alonso proposed openstack/neutron stable/wallaby: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355710:52
ralonsohlajoskatona, slaweq ^^ sorry, that patch (and https://review.opendev.org/c/openstack/neutron/+/893447), introduced an error in the cold migration with OVN and trunk ports10:53
ralonsohwe need to revert them10:53
lajoskatonaralonsoh: ack10:58
opendevreviewRodolfo Alonso proposed openstack/neutron-dynamic-routing master: Replace "tenant_id" with "project_id"  https://review.opendev.org/c/openstack/neutron-dynamic-routing/+/88294011:27
opendevreviewRodolfo Alonso proposed openstack/neutron master: Add a new extension "security-groups-rules-belongs-to-default-sg"  https://review.opendev.org/c/openstack/neutron/+/88390711:33
ralonsohlajoskatona, slaweq, https://review.opendev.org/c/openstack/neutron/+/892564 if you have some mins. thanks! (last petition today)11:35
Continuity_Hello. all. I have a Zed based OVN+DVR environment which I'm having some "fun" with. When using vlan provider networks for Ironic. If an instance is given a FIP, it becomes uncontactable from the internet, or at least, a couple of pings will get through, and then nothing. Same from inside the network, i can ping outbound sometimes, a couple of pings work then nothing. 13:15
Continuity_I know there has been some work around VLAN + OVN + DVR13:15
Continuity_but i would love to try and get to the bottom of this.13:15
ralonsohContinuity_, please check that your neutron server has https://review.opendev.org/c/openstack/neutron/+/879296 and https://review.opendev.org/c/openstack/neutron/+/87567313:19
Continuity_ralonsoh: checking..13:20
Continuity_ralonsoh: the first fix is there in full, the second. We seem to be missing the utils.py check (line 629) and the corresponding part in ovn_client.py line 1577-157913:47
Continuity_our reads as13:47
Continuity_https://pastebin.com/LL6zcvwH13:48
ralonsohContinuity_, qq, when you said instance, that was an ironic instance?13:49
ralonsohPing list: bcafarel, elvira, frickler, mlavalle, mtomaska, obondarev, slawek, tobias-urdin, ykarel, lajoskatona, jlibosva, averdagu, amotoki 13:59
ralonsoh#startmeeting networking14:00
opendevmeetMeeting started Tue Sep  5 14:00:07 2023 UTC and is due to finish in 60 minutes.  The chair is ralonsoh. Information about MeetBot at http://wiki.debian.org/MeetBot.14:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.14:00
opendevmeetThe meeting name has been set to 'networking'14:00
mlavalleo/14:00
slaweqo/14:00
mtomaskao/14:00
obondarevhi14:00
haleybo/14:00
ykarelo/14:00
ralonsohhello all14:00
frickler\o14:00
ralonsohwe have quorum today, I'll wait 30 more seconds14:00
elodilleso/14:01
lajoskatonao/14:01
ralonsoh#topic announcements14:01
ralonsohthe schedule14:01
ralonsoh#link https://releases.openstack.org/bobcat/schedule.html14:01
ralonsohthis week is the14:01
ralonsoh* Election Email Deadline14:01
ralonsoh* Election Campaigning Begins14:01
opendevreviewMerged openstack/neutron master: Check the device ID and host ID during virtual port binding  https://review.opendev.org/c/openstack/neutron/+/89256414:01
ralonsohbut, of course, we know who is going to be our future PTL14:02
rubasovlate o/14:02
ralonsohso, in advance, thanks haleyb for proposing yourself14:02
lajoskatona+114:02
haleybo/14:02
mlavalle++14:03
ralonsohyou'll have my support during the next cycle, for sure14:03
haleyblooking forward to leading the project14:03
ralonsohwe had the library releases last week14:03
ralonsohn-lib created some problems in n-d-r14:04
ralonsohand a new Bobcat beta version of Neutron was created14:04
ralonsohthe 3rd one14:04
ralonsohfor now, so far so good14:04
ralonsohnow you need to make a last review of the highlights14:04
ralonsoh#link https://review.opendev.org/c/openstack/releases/+/89317414:04
ralonsohthat should be approved during this week14:04
ralonsohand the last topic I have for this section is a new episode in the openinfra web14:05
ralonsoh#link https://openinfra.dev/live/#all-episodes14:05
ralonsoh"Cyber Resilience Act: What Now?"14:05
ralonsohany other topic in this section??14:05
ralonsohok, let's jump to the next topic14:06
ralonsoh#topic bugs14:06
ralonsohlast week report is from elvira14:06
ralonsoh#link https://lists.openstack.org/pipermail/openstack-discuss/2023-September/034958.html14:06
ralonsohthere are still some pending bugs not assigned/to be discussed14:07
ralonsoh#link https://bugs.launchpad.net/neutron/+bug/203365114:07
ralonsoh[fullstack] Reduce the CI job time14:07
ralonsohI opened this bug, I think the goal is clear14:07
ralonsohfullstack job is taking between 2 and 3 hours, depending on the node14:07
ralonsohand sometimes it times out14:08
ralonsohso we need to find a way to reduce the tests, combine them or improve the code14:08
ralonsohbut I think that is a long term goal14:08
lajoskatonagood goal14:08
ralonsohand not easy, for sure, but we can do small steps14:08
mtomaskaright. seems more like tech depth than a bug.14:09
ralonsohanyone can help on this, so you are welcome14:09
ralonsohthe next one is14:09
ralonsoh#link https://bugs.launchpad.net/neutron/+bug/203329314:09
ralonsoh"dns integration saying plugin does not match requirements"14:09
ralonsohbut frickler couldn't reproduce it14:10
ralonsohso we are waiting for more logs14:10
ralonsohfrickler, any comment on this one?14:10
ralonsohok, we can keep the "incomplete" tag for now until new logs (neutron server) are provided14:11
ralonsohthe last one is14:11
ralonsoh#link https://bugs.launchpad.net/neutron/+bug/203368314:11
ralonsohopenvswitch.agent.ovs_neutron_agent fails to Cmd: ['iptables-restore', '-n']"14:11
ralonsohykarel commented on this one14:12
ralonsohthis job has been working fine for months14:12
ralonsohbut now it seems that a binary is missing14:12
ykarelbut yes nothing for neutron in this, i asked for more details and suggested what needs to be fixed in tripleo side14:13
ralonsohas you mentioned, in https://review.opendev.org/c/openstack/kolla/+/761182 "iptables-restore" is missing14:13
ralonsohto be honest, I don't know why that worked before14:14
ralonsohbut they maybe bumped the iptables library, that also affects these tools14:14
ykarelnot iptables-restore but  /usr/bin/update-alternatives14:14
ralonsohsorry, not iptables but nftables14:14
ralonsohyes, because of nftables14:14
ralonsohI'll mark this bug as "invalid" from the Neutron point of view14:15
ykarelok /me not aware about the relation b/w update-alternatives and nftables14:15
ykarel+114:15
ralonsohnftables provide all the iptables legacy support14:15
ralonsohso thanks for taking care of this one14:16
fricklerin kolla iirc we needed to fix a mismatch between what is happening inside the container and outside and what docker does14:16
fricklermaybe the situation in tripleo is similar14:17
fricklerbut not a neutron issue I agree14:17
ralonsohI think any command is executed from inside the containers, so we don't use any external binary14:18
ralonsohI only remember one issue related to "ip-netns" but due to some missing permissions14:18
ralonsohanyway, we can skip this one 14:19
fricklerbut docker does that, and then nftables and legacy iptables may collide14:19
frickleror rather the nftables rule do not get used or something similar14:19
ralonsohsorry, I used kolla many years ago and I'm not familiar with this tool, to be honest14:20
fricklernevermind, let's go on14:20
ralonsohI don't have any other bug in the list14:20
ralonsohdo you?14:21
fricklerregarding 2033293 I suspect a misconfiguration, but need more data as mentioned in order to confirm14:21
ralonsohyeah, we can wait for this information if you couldn't reproduce it14:21
ralonsohthis week ykarel is the deputy, next week will be mtomaska14:21
ralonsohack?14:21
mtomaskaack14:22
ykarelack14:22
ralonsohcool, thanks!14:22
ralonsohI'm jumping to the next section14:22
ralonsoh#topic community_goals14:22
fricklerany progress on the OVN MTU issue?14:22
ralonsohno, I had some internal issues and I coudbn't spend a single moment on this14:23
ralonsohwe are also in the release weeks14:23
ralonsohand everything seems to fail!14:23
ralonsohok, let's continue14:24
ralonsoh1) Add support for the service role in neutron API policies 14:24
ralonsoh#link https://review.opendev.org/c/openstack/neutron/+/88672414:24
ralonsoh(I don't think you spend much time on this one last week, right?)14:24
ralonsohslaweq, 14:24
slaweqno, nothing new this week14:25
lajoskatonait is still in merge conflict by gerrit14:25
ralonsohok, this will be a C release feature but you are free to review the patch ^^14:25
ralonsohthe next one is14:26
ralonsoh2) Neutron client deprecation 14:26
ralonsohlajoskatona, please14:26
lajoskatonaThere's 3 open patche that can fit to Bobcat:14:26
lajoskatonahttps://review.opendev.org/q/topic:bug/1999774+project:openstack/python-neutronclient+status:open14:26
lajoskatonathese are all for neutronclient, as SDK release happened with the SDK side of these14:27
lajoskatonathat's it for this topic from me14:27
ralonsohqq: you are not bumping the openstacksdk library in any of them14:27
ralonsohthat should be necessary to receive the dependant sdk patch14:28
lajoskatonaAs I know we got the highest from upper-constraints and that was bumped14:28
ralonsohperfect14:28
ralonsohI said that because we had an issue last week14:28
ralonsohrelated to this14:28
ralonsoh#link https://review.opendev.org/c/openstack/python-neutronclient/+/89334614:28
lajoskatonathis was the bump: https://review.opendev.org/c/openstack/requirements/+/89335114:29
lajoskatonayes last week there was some central issue with requ bumps14:29
ralonsohyeah, but I was talking about the neutronclient min reqs14:29
lajoskatonaah, true, that is necessary14:30
ralonsohplease check what sdk version has each patch14:30
ralonsohand bump it correspondingly 14:30
lajoskatonaack14:30
ralonsohthanks a lot14:30
ralonsohand that's all14:31
ralonsoh#topic on_demand14:31
ralonsohany topic you want to bring here?14:31
mlavallenot me14:31
lajoskatonaI added one14:32
lajoskatonait is a heads up:  Nova EOL-ed Train branch14:32
ralonsohyes, good topic14:32
ralonsohwhat should we do?14:32
lajoskatonaperhaps elodilles has more background14:32
ralonsohwe still accept Train patches14:32
elodilleswell, nova had some CVE fix that did not land on train, hence the team decided to EOL train14:33
elodillesso i'm not insisting anymore to keep train open for neutron either o:)14:33
lajoskatonayes something like that, so it can be that we do the same or keep the branch open14:33
ralonsohwe still don't have this problem. Some users asked us to keep it open, months ago14:33
ralonsohbut I didn't see any activity on this branch14:34
lajoskatonabut anyway as things going most projects close most of these branches soon I suppose14:34
ralonsohok, I can send a mail (again) to propose the EOL of Train in Neutron14:34
ralonsohand we can receive feedback of the community14:34
elodillesralonsoh: ++14:34
lajoskatona+114:35
elodilles(it still can be kept open, but then we have to create a patch for devstack to consume train-eol from nova)14:35
ralonsohIMO, this branch is old enough at this point14:35
ralonsohok then, I'll send the mail today. Thanks!14:35
ralonsohany other topic?14:35
elodillesthanks too14:35
ralonsohelodilles, thanks!14:35
elodillesmaybe this: since we have talked about python-neutronclient: https://review.opendev.org/c/openstack/releases/+/89361514:36
ralonsohso please remember the CI meeting is in 25 mins in this channel14:36
mlavallevideo or irc?14:36
ralonsohelodilles, ups, I missed this patch14:36
ykarelirc14:36
ralonsohwe can delay that some days14:36
ralonsohuntil we have lajoskatona's patches14:36
mlavalleack ykarel 14:36
ralonsohlajoskatona, so please, check your nclient patches asap14:37
elodillesralonsoh: no problem, you are not late :)14:37
ralonsohand then I'll update the release hash14:37
Continuity_ralonsoh: no this is currently a virtual instance, although we do see the same issue on BareMetal as well..14:37
elodillesnote that *client libs freeze was last week14:37
elodillesthis is just a stable/2023.2 branch cut patch14:38
ralonsohright14:38
lajoskatonaok, so the above 3 patches are anyway to C14:38
ralonsohactually for nclient we'll need to release a new stable version to have them in Bobcat14:38
ralonsohyes14:39
ralonsohlajoskatona, is that a problem or we need to have them in Bobcat?14:39
lajoskatonaI dont think so14:39
ralonsohok then14:39
lajoskatonanothing will break if don't have these, and we don't have a deadline for it as I know14:40
ralonsohright14:40
ralonsohso please folks check the non-client libs freeze https://review.opendev.org/c/openstack/releases/+/89361514:40
ralonsohand that's all14:40
ralonsohthank you all for attending14:40
ralonsoh#endmeeting14:41
opendevmeetMeeting ended Tue Sep  5 14:41:06 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)14:41
opendevmeetMinutes:        https://meetings.opendev.org/meetings/networking/2023/networking.2023-09-05-14.00.html14:41
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/networking/2023/networking.2023-09-05-14.00.txt14:41
opendevmeetLog:            https://meetings.opendev.org/meetings/networking/2023/networking.2023-09-05-14.00.log.html14:41
mlavalleo/14:41
lajoskatonao/14:41
elodillesthanks o/14:41
ykarelo/14:41
obondarevo/14:42
Continuity_ralonsoh: sorry didnt mean to comment in the middle of the meeting.. bad form :14:47
ralonsohno problem14:48
ralonsohthe issue in OVN with ironic nodes (and SRIOV ports) could be the scheduler14:49
ralonsohthese ports don't fall in the same chassis as the LRPs14:49
ralonsohbut this could be a performance issue14:49
ralonsohwhat you are describing is a communication problem14:50
ralonsohplease check the missing patch and apply it14:50
Continuity_so the odd thing is we build containers a few weeks ago (kolla-ansible) so I would have thought that we would have that second patch in our code... 14:53
Continuity_looking at that second patch it looks to say. if the network type is vlan, and DVR is enabled, set reside on redirect chassis to false14:54
ralonsohyes14:55
ralonsohand the bridge-type14:55
Continuity_so redirect is set to false, and the bridge type is set to bridged14:55
ralonsohyes14:56
Continuity_https://pastebin.com/az7aYS6714:57
Continuity_the current settings, which looks to match what should be set by the patch14:57
Continuity_*note I havent changed anything, this is how the openstack network create command deployed it14:57
*** dasm is now known as Guest192914:57
ralonsohdo you have port forwarding?14:58
Continuity_not sure what you mean. sorry15:00
ralonsohfloating IP port forwarding15:00
ykarelk meeting time15:00
ykarel#startmeeting neutron_ci15:01
opendevmeetMeeting started Tue Sep  5 15:01:04 2023 UTC and is due to finish in 60 minutes.  The chair is ykarel. Information about MeetBot at http://wiki.debian.org/MeetBot.15:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.15:01
opendevmeetThe meeting name has been set to 'neutron_ci'15:01
ralonsohhello15:01
lajoskatonao/15:01
ykarelping bcafarel, lajoskatona, mlavalle, mtomaska, ralonsoh, ykarel, jlibosva, elvira15:01
ykarelGrafana dashboard: https://grafana.opendev.org/d/f913631585/neutron-failure-rate?orgId=115:01
mtomaskao/15:01
mlavalleo/15:02
*** Guest1929 is now known as dasm15:02
ykarelk let's start with the topic as some of the folks are on PTO15:02
ykarel#topic Actions from previous meetings15:02
ykarellajoskatona to check failures for bagpipe and check use of master neutron in stadium projects15:02
lajoskatonaI still strugle with bagpipe15:03
lajoskatonait is only with sqlalchemy215:03
ykarelyes that's what i recall15:04
lajoskatonaIn the meantime I also found that the bagpipe tempest job is running with focal, so I changed the nodeset to jammy: https://review.opendev.org/c/openstack/networking-bagpipe/+/89370915:04
ykarellooking at periodic i saw some issue in other job, but that can checked in later section15:04
ykarelyeap i noticed that failure too, thx for fixing it15:04
ykarelother was https://zuul.openstack.org/builds?job_name=networking-bagpipe-openstack-tox-py310-with-sqlalchemy-main&project=openstack/networking-bagpipe15:05
lajoskatonathe issue is only with the sfc driver of bagpipe, so quite complex for me at least to understand the root cause15:05
lajoskatonayes that is the sqlalchemy2 issue15:06
Continuity_ralonsoh: yes i have allowed all ICMP and SSH for testing.15:06
lajoskatonaralonsoh: what is the timeline for the sqlalchemy2 introduction?15:06
slaweqo/15:06
slaweqsorry I was on different meeting15:06
ykarellajoskatona, do we have bug already for this issue?15:06
ralonsohit will be introduced, most probably, at the begining of C15:06
lajoskatonaok15:07
ralonsohthere is a requirements patch proposed already15:07
ralonsohContinuity_, we can talk after the meeting15:07
lajoskatonaralonsoh: not for bagpipe, I used as reference the big sqlalchemy2 bug15:07
lajoskatonaI can open a bagpipe bug to track it15:08
ralonsohyeah, much better15:08
ykarel+1 15:08
lajoskatonaack15:08
ralonsohto be honest, I don't know what is failing in this CI15:08
ykarellet's check this offline15:09
ykarel#topic Stable branches15:09
ykarelBernard is out this week, but stable branches looks good considering patches merge15:09
ykareli din't noticed any consistent failure on stable branches15:10
ykarelanything to add for stable branches?15:10
ykarelsounds all good then, moving to next topic15:12
ykarel#topic Stadium projects15:12
ykarellajoskatona, anything else apart from that sqlalchemy and focal issues for stadium projects?15:12
lajoskatonaexcept bagpipe things seems to be green15:12
ykarelk good15:12
ykarel#link https://zuul.openstack.org/builds?job_name=networking-bagpipe-openstack-tox-py310-with-sqlalchemy-main&project=openstack/networking-bagpipe15:13
ykarel#link https://zuul.openstack.org/builds?job_name=networking-bagpipe-tempest&project=openstack%2Fnetworking-bagpipe&branch=master&skip=0 15:13
ykarel#topic Grafana15:13
ykarelhttps://grafana.opendev.org/d/f913631585/neutron-failure-rate15:13
ykarellet's give a minute to it if we observe anything abnormal there15:13
ykareli see a spike at tempest job, but that was a known issue already fixed15:14
slaweqtoday morning there was some "spike" but on all jobs15:14
slaweqso it's probably not an issue on our side really15:14
ralonsohneutron-ovn-tempest-ipv6-only-ovs-release had a 66% of failures15:15
ykarelyes i noticed there was quite of failures today morning but most of them were related to series of patches pushed together for l3-ovn iirc15:15
ralonsohright, perfect then15:15
ralonsohyes, I see the same spikes in other jobs15:16
ykarelyeap, let's move to next section, will keep monitoring it if something new comes in15:18
ykarel#topic Rechecks15:18
ykarelit was better last week15:19
ykarelthere were some known issues last week which might have resulted into those rechecks15:19
ykarelbare rechecks were also not much 3/17, so good15:19
ykarellet's keep avoiding bare rechecks15:20
ykarel#topic Unit tests15:20
mlavallewhat's 3/17?15:20
ralonsoh3 out of 1715:20
ykarel17 total rechecks, 3 out of them were bare15:20
ykarelyeap15:20
mlavalleahhh!15:20
mlavalleLOL15:20
ykarel#info There was issue with unit test job running with sqlalchemy/alembic main branches15:21
ykarelIt's already fixed with https://review.opendev.org/c/openstack/neutron/+/89360215:21
ralonsoh+115:21
ykarel#topic fullstack/functional15:22
ykarelneutron.tests.functional.services.trunk.drivers.ovn.test_trunk_driver.TestOVNTrunkDriver.test_subport_delete15:22
ykarelAttributeError: 'NoneType' object has no attribute 'status'15:22
ykarelSeen twice in a master/wallaby and failure looks related to the patch itself already merged15:23
ykarelso some race in the test as not happening always15:23
ykarel#link https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_b18/892890/2/gate/neutron-functional-with-uwsgi/b18c02c/testr_results.html15:23
ykarel#link https://e872331dabdf974ff450-5a66e2fcfa24aae6b75c2058251d7e58.ssl.cf5.rackcdn.com/periodic/opendev.org/openstack/neutron/master/neutron-functional-with-uwsgi-fips/c431c66/testr_results.html15:24
ykarelhttps://review.opendev.org/#/q/I2370ea2f96e2e31dbd43bf232a63394388e6945f15:25
ralonsohI'll ping Arnau to check these errors, look related to ^15:25
ykareli see this being reverted, so likely the failures would also go away15:25
ralonsohah no15:25
ralonsohthis could be another issue15:25
ralonsohin any case, we are reverting the patch you mentioned15:25
ralonsohso please wait until the next week15:25
ykarelk +1, will keep an eye15:26
ykarelif it still happens will open a bug15:26
ykarelnext one is neutron.tests.functional.agent.l3.test_keepalived_state_change.TestMonitorDaemon.test_read_queue_change_state15:26
ykarelAssertionError: Text not found in file /tmp/tmpuh6gesvz/tmp50ei9qfp/log_file: "Initial status of router".15:26
ykarelhttps://157ba513c840b85e5d0e-e65fbda5c4a8fc14eb81d398bd7b0a80.ssl.cf5.rackcdn.com/892896/1/gate/neutron-functional-with-uwsgi/0acdecd/testr_results.html15:26
ralonsohthis is something recurrent, the monitor doesn't start in some tests15:27
ykarelk so it's already a known issue?15:28
ralonsohI would need to investigate how to make these tests more stable15:28
ralonsohyes, that has been happening for years15:28
ralonsohnot very often15:28
ykarelok i see 3 failures in last 15 days across branches15:28
ykarelk thanks, will keep an eye on this and if it's start happening more frequently will open a bug for it15:29
ralonsohsure15:29
ykarelneutron.tests.functional.services.ovn_l3.test_plugin.TestRouter.test_router_gateway_port_binding_host_id15:30
ykarelTimeout exception with self.mech_driver.nb_ovn.ovsdb_connection.stop()15:30
ykarel#link https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_2a0/892897/1/gate/neutron-functional-with-uwsgi/2a03caa/testr_results.html15:30
ykarelany idea for this?15:31
ralonsohno, just a random timeout during the cleanup phase15:31
ykarelseen 4 hits as per opensearch across stable/zed and yoga15:31
ralonsohin the same test?15:32
ykarelno different test also hit it, like test_gateway_chassis_rebalance_max_chassis15:34
ykarelbut that's also in same class neutron.tests.functional.services.ovn_l3.test_plugin.TestRouter15:34
ralonsohlet me open a LP bug for this one. We can try, maybe, checking if the ovsdb_connection is still open during the cleanup15:36
ykarelk thanks15:36
ralonsohif the connection is not active, then we don't need to stop it15:36
ykarel#action ralonsoh to open bug for Timeout exception with self.mech_driver.nb_ovn.ovsdb_connection.stop()15:36
ykarelneutron.tests.fullstack.test_connectivity.TestUninterruptedConnectivityOnL2AgentRestart.test_l2_agent_restart(LB,VLANs)15:37
ykarel#link https://2df199e43476e3c732e7-3130556d487e5cec46a1ba3d1eaa7fda.ssl.cf5.rackcdn.com/892890/2/gate/neutron-fullstack-with-uwsgi/209726b/testr_results.html15:37
ykarel#link https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_da8/892869/3/check/neutron-fullstack-with-uwsgi/da88e54/testr_results.html15:37
ykarelanyone recall this failure, from meeting history i see it was seen in past too15:38
ralonsohno, sorry15:38
slaweqthis is related to LB so we can simply mark this test as unstable or simply skip it if it's not stable15:39
slaweqas LB is marked as "experimental" since some time15:39
lajoskatona+115:39
slaweqI can propose patch for that15:39
ykarelk thanks, i think i saw it in stable branches15:40
ykarel#action slaweq to check failures with fullstack test test_l2_agent_restart15:40
ykarelthx slaweq 15:40
ykarelneutron.tests.fullstack.test_agent_bandwidth_report.TestPlacementBandwidthReport.test_configurations_are_synced_towards_placement(Open vSwitch agent)15:41
lajoskatonado you have link for this one? I can check it15:41
ykarellajoskatona, may be you recall something for ^?15:41
ykarel#link https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_4fb/893543/1/check/neutron-fullstack-with-uwsgi/4fba5fb/testr_results.html15:41
ykarel#link https://c7cd1f2001ec5f5b729f-4854ed941a7816d1225c43ae9b456d0e.ssl.cf2.rackcdn.com/893143/1/check/neutron-fullstack-with-uwsgi/9137417/testr_results.html15:42
lajoskatonaI will check this one15:42
ykarelthx lajoskatona 15:42
ykarel#action lajoskatona to check failures with fullstack test test_configurations_are_synced_towards_placement15:42
ykarelalso generic one15:42
ykarelwe recently seeing quite frequent timeouts in fullstack job https://zuul.openstack.org/builds?job_name=neutron-fullstack-with-uwsgi&project=openstack%2Fneutron&result=TIMED_OUT&skip=015:43
ykareleven after timeout increase to 3 hours as part of isolated db per test patch15:43
ralonsohwe can start removing the LB tests, for example15:44
lajoskatonafor this ralonsoh opend the bug: https://bugs.launchpad.net/neutron/+bug/203365115:44
lajoskatonaso we can track the patches under the lp15:44
ykarelyeap +1 15:44
ykarelok let's move to next topic15:45
ykarel#topic Tempest/Scenario15:45
ykarelthere was an issue but already fixed with https://review.opendev.org/c/openstack/nova/+/89350215:45
ykarel#topic Periodic15:45
ykarel#link https://zuul.openstack.org/builds?job_name=devstack-tobiko-neutron&branch=master&skip=015:46
ykarelthe job was running with ubuntu focal15:46
ykarelthere is already a patch to move it to jammy with https://review.opendev.org/c/x/devstack-plugin-tobiko/+/89366215:46
ykarel#link https://zuul.openstack.org/builds?job_name=neutron-ovn-tempest-ipv6-only-ovs-master&job_name=neutron-ovn-tempest-ovs-master-centos-9-stream&project=openstack%2Fneutron&skip=015:46
ykarelovs/ovn source deploy jobs with OVN_BRANCH=main are broken https://bugs.launchpad.net/neutron/+bug/203409615:47
ykarel#link https://review.opendev.org/c/openstack/neutron/+/89370015:47
ykarelthat's it for periodic, please review the above fixes15:48
ralonsoh+2 (high priority one015:48
ykarel#topic On Demand15:48
ykarelanything else to discuss?15:49
lajoskatonanothing from me15:51
slaweqnope15:51
ykarelk thanks everyone, let's close then and have everyone few minutes back15:52
mlavallenothing15:52
ykarel#endmeeting15:52
opendevmeetMeeting ended Tue Sep  5 15:52:15 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)15:52
opendevmeetMinutes:        https://meetings.opendev.org/meetings/neutron_ci/2023/neutron_ci.2023-09-05-15.01.html15:52
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/neutron_ci/2023/neutron_ci.2023-09-05-15.01.txt15:52
opendevmeetLog:            https://meetings.opendev.org/meetings/neutron_ci/2023/neutron_ci.2023-09-05-15.01.log.html15:52
mlavalleo/15:52
slaweqo/15:52
lajoskatonao/15:52
ralonsohbye15:52
mtomaskao/15:52
Continuity_ralonsoh: sorry about that I should pay more attention...15:55
Continuity_one thing that does strike me as odd, we dont have an ha-chassis group for this new provider network.16:02
ralonsohthat is created when the network is16:03
ralonsohplease, open a LP bug with the information and upload the logs16:03
ralonsohthat will help to find the issue, most probably16:03
Continuity_ok ill open a bug. as currently we dont see a ha-chassis group for newly created provider vlan networks.16:10
Continuity_to confirm - ovn-nbctl ha-chassis-group-list16:10
Continuity_does not return an item for the newly created network16:11
Continuity_any ideas on how we could force a sync or creation of that16:11
ralonsohhold on, this is only for external ports16:14
ralonsohha-chassis-group16:14
ralonsohwhat version are you using?16:15
ralonsohthe default ha chasiss group is no longer used16:16
Continuity_we are running ZED, containers built 2023-08-2316:18
Continuity_we have some vlan provider networks which are working. They have a mix of ironic and virtaul intances on them. They have ha-chassis groups show in ovn16:18
Continuity_a newly created vlan provider network, with virtual instances on them, exhibit the issue where outgoing pings from the machine work twice then stop, (with or without a fip) when a fip is attached, incoming pings do the same, two then nothing. 16:19
Continuity_occasaionly it just starts working. then it stops again16:19
Continuity_if "feels" like a traffic centralisation problem, or a pathing problem16:20
Continuity_but we are at a loss.16:20
ralonsohok, at this point we are still providing support for non tunnelled networks and DVR in OVN16:21
ralonsohso please, don't use DVR with VLAN for now16:21
Continuity_ok..... how do i switch back?16:21
Continuity_i assume i need to disable DVR?16:22
ralonsohunset the enable_distributed_floating_ip flag in the config and restart the neutron servers16:23
Continuity_how damaging will that be to currently running workloads?16:25
Continuity_will we need to rebuild anything or will it just work?16:25
Continuity_will it affect north south traffic during the restart?16:25
ralonsohbut you said the FIPs are currently not working16:25
ralonsohyes, it will affect N/S traffic16:26
Continuity_only on instances connected to these provider vlan networks16:26
Continuity_other geneve tenant networks are working fine16:26
ralonsohso that will be a problem16:26
ralonsohwe don't have a way to unset DVR only for VLAN16:26
Continuity_this bug https://review.opendev.org/c/openstack/neutron/+/875673 which you mentioned earlier talks about "ovn-chassis-mac-mappings", which are not configured in our environment. should they be?16:28
ralonsohbut this is something working now16:29
ralonsohyou should have the mac-mappins for VLAN16:29
Continuity_ok. 16:29
Continuity_i *think* it was removed as part of a kolla-ansible change a while back16:30
ralonsohthat is a configuration step for the phsycail bridges16:30
Continuity_maybe it hasnt been readded16:30
ralonsohactually this is not controlled by Neutron, but by puppet (TripleO), for example16:31
Continuity_yeah, so i *think* we are missing the ovn-chassis-mac-mappings on the controllers.16:36
Continuity_we shall try adding that tomorrow16:40
Continuity_ralonsoh: thanks for your assistance. ill let you know how we get on. and i may be back with more questions :D16:40
ralonsohsure16:40
opendevreviewMerged openstack/neutron master: Revert "[OVN][Trunk] Set the subports correct host during live migration"  https://review.opendev.org/c/openstack/neutron/+/89355216:50
opendevreviewMerged openstack/neutron stable/xena: Update dns_assignment attribute documentation  https://review.opendev.org/c/openstack/neutron/+/89354416:50
opendevreviewMerged openstack/neutron stable/wallaby: Update dns_assignment attribute documentation  https://review.opendev.org/c/openstack/neutron/+/89354516:50
opendevreviewMerged openstack/neutron master: [CI] Bump OVS_BRANCH in ovs/ovn source deploy jobs  https://review.opendev.org/c/openstack/neutron/+/89370016:50
TheJuliaralonsoh: any answer on SNAT + tftp from ovn folks?17:25
opendevreviewBrian Haley proposed openstack/neutron stable/zed: DNM: Testing Zed gate  https://review.opendev.org/c/openstack/neutron/+/89380722:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!