Friday, 2025-02-07

opendevreviewMaor Blaustein proposed openstack/neutron master: Fix LRP.external_ids router name prefix  https://review.opendev.org/c/openstack/neutron/+/94044505:57
opendevreviewMaor Blaustein proposed openstack/neutron master: Fix LRP.external_ids router name prefix  https://review.opendev.org/c/openstack/neutron/+/94044507:37
opendevreviewVladimir Prokofev proposed openstack/neutron-specs master: Add spec for random-fully per-FIP feature RFE.  https://review.opendev.org/c/openstack/neutron-specs/+/93265009:53
opendevreviewKonstantin Eremin proposed openstack/neutron stable/2024.2: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94094910:03
opendevreviewKonstantin Eremin proposed openstack/neutron stable/2024.1: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94095010:06
opendevreviewKonstantin Eremin proposed openstack/neutron stable/2023.2: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94095110:07
opendevreviewRenjing Xiao proposed x/whitebox-neutron-tempest-plugin master: Add new nested snat test to fetch ovn config dynamically  https://review.opendev.org/c/x/whitebox-neutron-tempest-plugin/+/94090810:38
opendevreviewRenjing Xiao proposed x/whitebox-neutron-tempest-plugin master: Add new nested snat test to fetch ovn config dynamically  https://review.opendev.org/c/x/whitebox-neutron-tempest-plugin/+/94090811:01
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN][FT] Use MySQL backend for ``Test*Maintenance`` classes  https://review.opendev.org/c/openstack/neutron/+/94095911:34
sahidlajoskatona: "all works well" on devstack with os-ken patches12:20
sahidI had to redeploy devstack in totally I don't know if you where like me but mine one a bit old12:21
sahidperhaps you have to fetch last change from devstack, clean it, remove repo or reclone and stack again12:21
sahidfrom the CI point of view, the job with os-ken-master, only 3 tests are failing12:25
sahidhttps://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_053/937765/53/check/neutron-ovs-tempest-with-os-ken-master/0538d30/testr_results.html12:25
sahidI have triggered it a second time, just to check12:25
opendevreviewMaor Blaustein proposed openstack/neutron master: Fix LRP.external_ids router name prefix  https://review.opendev.org/c/openstack/neutron/+/94044512:32
opendevreviewMaor Blaustein proposed openstack/neutron master: Fix LRP.external_ids router name prefix  https://review.opendev.org/c/openstack/neutron/+/94044512:41
lajoskatonasahid: cool, I try to check it this afternoon12:59
priteauHello ralonsoh. I have just seen this issue happen in CI again: https://bugs.launchpad.net/neutron/+bug/2093258. We are using Kolla images built from neutron master, but since your fix was a revert in neutron-lib, we are not running it yet because we use the latest pip release: 3.18.0. I know you just released neutron-lib last week, but would you consider making another14:19
priteaurelease soon so we can verify that the revert does indeed fix the issue?14:19
ralonsohpriteau, I don't think we have any other patch apart from this one 14:22
ralonsohbut I think it could worth a minor release14:22
priteauMaybe haleyb can give his input too.14:23
ralonsohhaleyb, https://review.opendev.org/c/openstack/releases/+/94097214:26
ralonsohas PTL, must approve it14:26
priteauThanks14:27
haleybyes, either me or the release liaison (slaweq), i'm looking now14:38
opendevreviewMichel Nederlof proposed openstack/ovn-bgp-agent master: Fix running sync method for every external_ids update.  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/94012914:41
otherwiseguyralonsoh: on the hash ring stuff, on my system all of the extra ovn_hash_ring nodes 'updated_at' columns are being refreshed, not just the ones that are still "up".14:43
ralonsohotherwiseguy, yes, that is the goal14:44
ralonsohwe assume al hash ring workers of the same node are running at the same time14:44
otherwiseguybut I have 2 workers and 10 hash ring entries, all saying that they are up and refreshed?14:45
ralonsohotherwiseguy, the use case you commented (change the config once started with an error) is a non recoverable error14:45
otherwiseguyit happens w/o changing the config.14:45
ralonsohotherwiseguy, how that happened?14:45
otherwiseguykill -9 $worker_pid14:45
otherwiseguy(there is a script in the comments that does it)14:45
ralonsohbut wsgi will restart this worker14:46
otherwiseguyyes. and leave the ovn_hash_ring entry in there. and it keeps getting updated_at set.14:46
otherwiseguyso now we have more hash ring nodes than workers.14:46
ralonsohno no, the hash ring register should be reused by the restarted worker14:46
otherwiseguyit is not14:47
ralonsohthat's a problem I think I solved14:47
ralonsohlet me check14:47
otherwiseguythis is what I have been trying to say14:47
ralonsohI was not understanding your point14:47
ralonsohis a valid issue, of course14:47
ralonsohand I think I fixed that, let me check14:47
ralonsohotherwiseguy, I found it: https://review.opendev.org/c/openstack/neutron/+/940140/3/neutron/plugins/ml2/drivers/ovn/mech_driver/mech_driver.py#37214:49
ralonsohthat's the problem of the many POCs I've been implemented14:49
ralonsohI had this somewhere but not in the correct place14:49
ralonsohlet me push a patch under https://review.opendev.org/c/openstack/neutron/+/940256/14:50
otherwiseguyas far as the "non-recoverable error" that was just a simple example I could easily come up with for "one worker dies and keeps getting restarted, while the others are fine" so assuming that they are all running/up would be incorrect. I'm sure there are others. The world is a complex place. :)14:53
otherwiseguyIn any case, I'll try to make sure I fully understand the entire patchset before bothering with too many design comments.14:53
otherwiseguyJust saw a +2 last night and was seeing this particular bug and so stayed up to make a script to make it easy to show the issue.14:54
ralonsohotherwiseguy, let me first push something to fix your comment14:54
otherwiseguy+114:54
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: GRE/ERSPAN mirroring for taas  https://review.opendev.org/c/openstack/tap-as-a-service/+/88535714:55
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: Tap Mirror OVS driver  https://review.opendev.org/c/openstack/tap-as-a-service/+/89308614:55
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: TAAS tap-mirror OVN driver  https://review.opendev.org/c/openstack/tap-as-a-service/+/89350914:55
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: CLI for Tap Mirrors  https://review.opendev.org/c/openstack/tap-as-a-service/+/88608514:55
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: Documentation for tap-mirrors  https://review.opendev.org/c/openstack/tap-as-a-service/+/90722014:55
opendevreviewMerged openstack/ovn-octavia-provider stable/2024.1: Fix pep8 with pylint 3.3.0  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/94086715:02
opendevreviewMerged openstack/ovn-octavia-provider stable/2023.2: Fix pep8 with pylint 3.3.0  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/94086815:04
otherwiseguyralonsoh: re: non-recoverable error--my main problem is that assuming all nodes are up on the host means that a failure that takes out one worker (broken config example where you don't restart neutron-api, or maximum db connections reached, but existing workers are fine because they have connections, file descriptors, whatever) means that instead of it just affecting that one worker that is down, it affects everything that is 15:24
otherwiseguystill up because the number of "up" hash ring nodes is different than the number of "up" workers, so events end up randomly not processed.15:24
otherwiseguyabsolutely, it's an edge case. and working most of the time > not working at all. Given how many edge cases I know I miss, I worry when I can find them early. :)15:29
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: GRE/ERSPAN mirroring for taas  https://review.opendev.org/c/openstack/tap-as-a-service/+/88535715:29
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: Tap Mirror OVS driver  https://review.opendev.org/c/openstack/tap-as-a-service/+/89308615:29
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: TAAS tap-mirror OVN driver  https://review.opendev.org/c/openstack/tap-as-a-service/+/89350915:29
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: CLI for Tap Mirrors  https://review.opendev.org/c/openstack/tap-as-a-service/+/88608515:29
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: Documentation for tap-mirrors  https://review.opendev.org/c/openstack/tap-as-a-service/+/90722015:29
ralonsohotherwiseguy, but that already happens, for example, if another controller dies15:30
ralonsohfor a period of time, the hash ring leftovers are in the DB until they rot15:30
otherwiseguyYes, but nothing would be refreshing their 'updated_at'15:31
ralonsohyeah...15:31
ralonsohI think this is a corner case, IMO15:32
ralonsohif we have a zombie worker, the API in general (in this node) should not be working15:32
otherwiseguyIt is definitely a corner case. But it's one the previous design handled (at least with eventlet)15:32
ralonsohis it possible to have some kind of event from the OVN DB?15:33
ralonsohwe can update something (I don't know what) and all workers in all nodes will receive this event15:33
ralonsohthat will update the local hash ring manager15:33
ralonsohthat will be faster than having a refresh time (15 secs is now)15:34
ralonsohthen we need to handle the case of having leftovers (if the worker dies)15:34
ralonsohcan we monitor the DB connections?15:35
otherwiseguyWe have to be careful w/ sending extra ovsdb events--it can get kind of messy since they get broadcast everywhere and can trigger work, etc.15:35
otherwiseguylike when we try using them for metadata agent API keepalives, etc.15:36
ralonsohyes but pubsub is ideal for this job15:36
otherwiseguyand basically just disable that feature downstream with a huge timeout15:36
ralonsohright15:36
otherwiseguyI'll spend some time while looking at the patchset today to look at what's going on with the API and that initial blockage on the hash ring periodic as well, just because I want to really understand what's happening.15:38
otherwiseguyI feel like I'm throwing darts in the dark on some of this.15:38
ralonsohif we can figure out why the periodic thread is now fired if the API is idle, that will be perfect15:38
opendevreviewLajos Katona proposed openstack/tap-as-a-service master: CLI for Tap Mirrors  https://review.opendev.org/c/openstack/tap-as-a-service/+/94097915:47
opendevreviewTakashi Kajinami proposed openstack/neutron master: Use native interface to explain choices  https://review.opendev.org/c/openstack/neutron/+/94098015:52
opendevreviewMerged openstack/ovn-bgp-agent master: Add BGP NIC IP cleanup logic for the OVN driver  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/94022416:01
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: agent/linux: remove eventlet usage from of_monitor  https://review.opendev.org/c/openstack/neutron/+/94068416:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: agent/linux: remove eventlet usage from ip_conntrack  https://review.opendev.org/c/openstack/neutron/+/94068516:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: agent/linux: remove eventlet usage from ip_lib  https://review.opendev.org/c/openstack/neutron/+/94068616:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: ovs: reimplement signals handling  https://review.opendev.org/c/openstack/neutron/+/93932116:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: ovs: remove the usage of eventlet in the OVS agent  https://review.opendev.org/c/openstack/neutron/+/93776516:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: ofswitch: remove usage of eventlet timeout  https://review.opendev.org/c/openstack/neutron/+/94098216:03
opendevreviewSahid Orentino Ferdjaoui proposed openstack/neutron master: osdb: remove usage of eventlet in dbhandler  https://review.opendev.org/c/openstack/neutron/+/94098316:03
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add LB sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/92532416:11
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Listener sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93125016:11
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Pool sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126616:11
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Member sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126716:12
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Health monitor sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93128816:12
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add sync floating IP support  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/92903916:12
opendevreviewBrian Haley proposed openstack/neutron-specs master: Update to use openstack-python3-jobs template  https://review.opendev.org/c/openstack/neutron-specs/+/94098616:16
opendevreviewRenjing Xiao proposed x/whitebox-neutron-tempest-plugin master: Add new nested snat test to fetch ovn config dynamically  https://review.opendev.org/c/x/whitebox-neutron-tempest-plugin/+/94090816:18
opendevreviewRenjing Xiao proposed x/whitebox-neutron-tempest-plugin master: Add new nested snat test to fetch ovn config dynamically  https://review.opendev.org/c/x/whitebox-neutron-tempest-plugin/+/94090816:21
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add LB sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/92532416:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Listener sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93125016:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Pool sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126616:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Member sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126716:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Health monitor sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93128816:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add sync floating IP support  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/92903916:22
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider stable/2023.2: Remove join on helper request daemon thread  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/94079416:23
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider stable/2024.1: Remove join on helper request daemon thread  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/94079316:23
opendevreviewBrian Haley proposed openstack/neutron-specs master: Update to use openstack-python3-jobs template  https://review.opendev.org/c/openstack/neutron-specs/+/94098616:28
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Listener sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93125016:41
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Pool sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126616:41
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Member sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93126716:41
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add Health monitor sync logic  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/93128816:41
opendevreviewFernando Royo proposed openstack/ovn-octavia-provider master: Add sync floating IP support  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/92903916:41
opendevreviewTakashi Kajinami proposed openstack/neutron master: Use native interface to explain choices  https://review.opendev.org/c/openstack/neutron/+/94098016:58
opendevreviewTakashi Kajinami proposed openstack/neutron master: Use native interface to explain choices  https://review.opendev.org/c/openstack/neutron/+/94098017:01
lajoskatonasahid: Hi, I checked again (os-ken series: https://review.opendev.org/c/openstack/neutron/+/937765 & neutron: https://review.opendev.org/c/openstack/os-ken/+/938337 )and it is still not working for me (no ovs-agent in the agent list as 1st simptom)17:15
sahidlajoskatona: it's strange17:15
sahidi have mine well registered with metaga and dhcp17:16
sahidin all cases, in a cases there are still some issue with neutron-ovs-tempest-with-os-ken-master 17:18
sahidhttps://zuul.opendev.org/t/openstack/build/0538d30960b8465e9cf7b0bb2f05d60f/logs17:18
sahidI have split my last patch in 2 smaller patches to try to identify what is making the tests failing17:19
sahids/2/317:19
sahidbtw thanks a lot for your tests :-)17:19
sahidin the logs of q-agt, do you see any error?17:20
lajoskatonasahid: no error logs, all seem quiet17:25
sahidbut you see the log message form the event loop?17:26
sahidlog messages17:26
sahidFeb 07 18:27:10 devstack01 neutron-openvswitch-agent[1134689]: DEBUG neutron.agent.common.ovsdb_monitor [None req-cc5087cf-5540-4259-acc7-4aa4b0cbf12f None None] Current size of new_events: added=0 modified=0 removed=0. New events: {'added': [], 'removed': [], 'modified': []} {{(pid=1134689) process_events /opt/stack/neutron/neutron/agent/common/ovsdb_monitor.py:147}}17:27
sahidno..17:27
sahidFeb 07 18:27:10 devstack01 neutron-openvswitch-agent[1134689]: INFO neutron.plugins.ml2.drivers.openvswitch.agent.ovs_neutron_agent [None req-cc5087cf-5540-4259-acc7-4aa4b0cbf12f None None] Agent rpc_loop - iteration:9229 completed. Processed ports statistics: {'regular': {'added': 0, 'updated': 0, 'removed': 0}}. Elapsed:0.00817:27
opendevreviewMerged openstack/ovn-octavia-provider stable/2024.2: Fix pep8 with pylint 3.3.0  https://review.opendev.org/c/openstack/ovn-octavia-provider/+/94086618:28
opendevreviewMerged openstack/neutron stable/2024.2: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94094918:31
opendevreviewMerged openstack/neutron stable/2024.1: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94095018:31
opendevreviewMerged openstack/neutron unmaintained/yoga: Set trunk parent port as access port in ovs to avoid loop  https://review.opendev.org/c/openstack/neutron/+/91350520:07
opendevreviewMerged openstack/neutron stable/2023.2: OVS: Set log level to INFO for unconfigured ofport cases  https://review.opendev.org/c/openstack/neutron/+/94095120:31
opendevreviewRodolfo Alonso proposed openstack/neutron master: WIP == [OVN] Create a deterministic hash ring node UUID generator  https://review.opendev.org/c/openstack/neutron/+/94100821:02
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Create a deterministic hash ring node UUID generator  https://review.opendev.org/c/openstack/neutron/+/94100821:29
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Refresh host nodes before notifying  https://review.opendev.org/c/openstack/neutron/+/94025621:33
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Refactor ``HashRingManager`` sync method  https://review.opendev.org/c/openstack/neutron/+/94034221:33
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Remove the hash ring maintenance task  https://review.opendev.org/c/openstack/neutron/+/94042721:33
opendevreviewRodolfo Alonso proposed openstack/neutron master: DNM - Test "neutron-ovn-tempest-ipv6-only-ovs*" with WSGI  https://review.opendev.org/c/openstack/neutron/+/93997721:33
opendevreviewBrian Haley proposed openstack/neutron master: Add basical functionalities for metadata path extension  https://review.opendev.org/c/openstack/neutron/+/88153521:36
opendevreviewMerged openstack/neutron master: Add meter bandwidth limit support  https://review.opendev.org/c/openstack/neutron/+/86076723:40

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!