Wednesday, 2024-10-09

opendevreviewMerged openstack/neutron-lib master: Use py312 for all neutron-lib jobs  https://review.opendev.org/c/openstack/neutron-lib/+/93168000:07
opendevreviewyatin proposed openstack/neutron stable/2024.2: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186304:01
opendevreviewyatin proposed openstack/neutron stable/2024.1: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186404:02
opendevreviewyatin proposed openstack/neutron stable/2023.2: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186504:03
opendevreviewyatin proposed openstack/neutron stable/2023.1: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186604:03
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2024.1: [OVN] Check metadata HA proxy configuration before restart  https://review.opendev.org/c/openstack/neutron/+/93179106:02
opendevreviewyatin proposed openstack/neutron unmaintained/zed: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93187106:17
opendevreviewRodolfo Alonso proposed openstack/neutron master: DNM - Test "neutron-ovn-tempest-ipv6-only-ovs*" with eventlet  https://review.opendev.org/c/openstack/neutron/+/93184206:35
opendevreviewSlawek Kaplonski proposed openstack/neutron master: [Fullstack] Add active waiting for one HA agent to be active for router  https://review.opendev.org/c/openstack/neutron/+/93176607:17
opendevreviewSlawek Kaplonski proposed openstack/neutron master: [Fullstack] Add active waiting for one HA agent to be active for router  https://review.opendev.org/c/openstack/neutron/+/93176607:19
slaweqhaleyb ralonsoh ykarel: based on our discussion during last CI meeting, I added topic about Postgresql to the TC etherpad: https://etherpad.opendev.org/p/oct2024-ptg-os-tc07:29
slaweqplease add your notes there if you have anything more there07:29
ykarelthx slaweq 07:30
ralonsohthanks07:32
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2024.2: Use the declarative attribute ``standard_attr_id``  https://review.opendev.org/c/openstack/neutron/+/93187707:40
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2024.1: Use the declarative attribute ``standard_attr_id``  https://review.opendev.org/c/openstack/neutron/+/93187807:40
opendevreviewRodolfo Alonso proposed openstack/neutron stable/2023.2: Use the declarative attribute ``standard_attr_id``  https://review.opendev.org/c/openstack/neutron/+/93188007:42
ykarelralonsoh, slaweq got the root cause for too many open files functional failures08:04
ykarelit's infra :)08:04
ykarela new cloud with different node config https://review.opendev.org/c/openstack/project-config/+/92832108:06
ykareland since then only we hitting the issue08:06
ykarelopensearch also confirms all failures on provider raxflex-SJC308:07
ykarelso 4096 ulimit is good enough for us08:07
ralonsohykarel, but are the CI VMs configured in a different way?08:09
ralonsohI mean, the ulimit is a OS config parameter08:09
ralonsohwhy this node is affecting us?08:10
ykarelthese new nodes are 4 cpu vs usual 8 cpus08:10
ralonsohreally??? pffffff08:10
ykareltempest run longer and with more runs per worker fd utilization increases and when stestr worker finishes then the fds are released08:11
ralonsohyeah, we are doing something wrong in the FTs, leaving some unnecessary FDs open08:11
ralonsohbut yes, that could be a plausible cause08:12
ralonsohthanks a lot!08:12
ykarelwith latest debug logs we have open fd lists now08:12
ralonsohyes, most of them related to the ovn and ovs databases08:13
ralonsohwell, not most of them but too many08:13
ralonsohykarel, check that: https://review.opendev.org/c/openstack/neutron/+/93184208:14
ralonsohno errors... pfffff08:14
ralonsoh2 jobs with eventlet, 1 with wsgi08:15
slaweqgreat catch ykarel 08:50
noonedeadpunkhey ! I was just reading through the meeting logs and around for discussion: https://meetings.opendev.org/irclogs/%23openstack-neutron/%23openstack-neutron.2024-10-08.log.html#t2024-10-08T14:26:3309:02
noonedeadpunkI think that snat already works in this case, with the https://review.opendev.org/c/openstack/neutron/+/90919409:03
noonedeadpunkat very least, just avoid binding NAT rule to port does work for us (ie smth like https://review.opendev.org/c/openstack/neutron/+/931495)09:04
noonedeadpunkas the current behaviour I find just wrong - there's no reason to point NAT to the port, which is not residing on chassis - it just can not work09:05
noonedeadpunkeven if access from top-layer network is not smth that should be supported, I'm pretty sure that routers and FIPs that just created on geneve should be supproted09:06
noonedeadpunkAs logic around - if FIP will work or not is built around that it should be either VLAN or FLAT network09:06
noonedeadpunkbut we can discuss that on the drivers meeting indeed09:07
ralonsohnoonedeadpunk, I think your patch makes sense since the LRP is not bound but the router09:09
ralonsohin that case maybe https://bugs.launchpad.net/neutron/+bug/2035281 won't work but it will the FIP09:09
ralonsohwe need more testing on this09:10
opendevreviewDmitriy Rabotyagov proposed openstack/neutron master: [OVN] Do not supply gateway_port if it's not binded to chassis  https://review.opendev.org/c/openstack/neutron/+/93149509:11
noonedeadpunkYeah, I'm gonna work on testing for sure, just trying to confirm we totally get an intended behaviour and this covers the usecase described in https://bugs.launchpad.net/neutron/+bug/208352709:13
ykarelralonsoh, ack thx, let's see if hit its with recheck09:14
noonedeadpunkSo I have an impression, that https://bugs.launchpad.net/neutron/+bug/2035281 is not applicable for usecase, where LR is binded to chassis instead of LRP09:14
noonedeadpunkas you won't get flows in this case anyway?09:15
ralonsohyour patch at least if fixing an issue that I think the code has: the FIP doesn't work for routers with tunnelled GW networks09:15
noonedeadpunkor well, from what I saw, NAT is being just ignored09:15
noonedeadpunk++09:15
ralonsohI'm not saying the LP bug you created, with multiple GW networks nested should work09:15
ralonsohbut at least your patch is solving another issue09:15
ralonsohI'm writting this topic now in the drivers meeting for this friday09:16
noonedeadpunkTo mention that, I'm not 100% getting the usecase which is intended to be done, as I don't think the intention is to access the A-net from real provider network...09:17
noonedeadpunkBut so far the team working on implementation reported that this patch seems to solve the issue they had. So it could be I mis-phrased what the intention after all is in the bug report.09:18
ykarelslaweq, can you check these CI backports https://review.opendev.org/q/Iff86599678ebdd5189d5b56d11f3373c9b13856209:18
noonedeadpunkBut "how to reproduce" section is still valid and what they do check atm09:18
opendevreviewEduardo Olivares proposed openstack/ovn-bgp-agent master: Refactor ovn_bgp_agent.privileged.ovs_vsctl  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/93174309:32
opendevreviewGabriel Adrian Samfira proposed openstack/neutron master: Bind internal router port for VLAN network  https://review.opendev.org/c/openstack/neutron/+/93189209:42
gsamfiraralonsoh: Hey! I uploaded the patch I mentioned yesterday. I also added some context on the bug.09:43
gsamfiranot sure if it's correct, but it seems to work in our lab09:45
gsamfirafor context, we have an ironic deployment with the genericswitch mechanism driver, 3 nodes set up with enable-chassis-as-gw and a bunch of VLAN networks09:46
ralonsohgsamfira, first, the scheduling of the router GW ports does not use "ha_chassis_group" but "gateway_chassis" 09:49
ralonsohthis is why I proposed a new OVN L3 scheduler in https://review.opendev.org/c/openstack/neutron/+/87203309:49
ralonsohsecond, you are applying this change to all router LRPs09:49
ralonsohwe bind only the GW LRP09:50
gsamfirayep. That's the problem. If we don't bind the internal LRP of the VLAN network that the external ports are attached to, if the ha_chassis_group of the VLAN network differs from the gateway_chassis of the external port, nothing works.09:51
gsamfiraif we have a vrouter without an external port on the vrouter, routing between a VLAN network and a geneve network, doesn;t wok09:52
ralonsohgsamfira, so, because external ports use the OVN "internal" scheduler used by HA_Chassis_Group, we should use the same for the router GW LRPs09:52
ralonsohso far, the OVN L3 scheduler uses "manual" scheduling, done by Neutron creating several gateway_chassis and adding them to the LRP09:53
ralonsohin other to sync the external ports and the LRPs, both should use the same scheduling method, that is HA_Chassis_Group09:53
ralonsohbut not in this way but implementing a new OVN L3 scheduler09:54
ralonsohplease check my patch09:54
ralonsohand feel free to rebase and continue with it (I don't have time right now)09:54
gsamfirawill do. I need to see about the difference between the existing schedulers and "internal" schedulers. Not yet familiar enpough with the codebase09:55
gsamfirathe scheduler in your patch seems to apply only to external schedulers09:55
gsamfirathe scheduler in your patch seems to apply only to external gw ports09:56
ralonsohright, this is what you need09:56
gsamfiradamn. Forgot irc doesn't have edit message :D09:56
gsamfiracool. Thanks for the pointers ralonsoh !09:56
gsamfiraI need to push through a deployment for now, but as soon as I finish, I will look into making the needed changes.09:57
gsamfiraralonsoh: just to confirm. We can't only bind the external GW port. We also need to bind the internal port of VLAN networks. Otherwise, if we connect 2 or more VLAN networks to the same vrouter, each with their own ha_chassis_group potentially on different sets of chassis, we will basically need to make sure that both VLAN networks have the exact same ha_chassis_group with the exact same priorities. Otherwise we end up with a small world probl10:12
opendevreviewRodolfo Alonso proposed openstack/neutron master: Use the constant UNLIMITED_QUOTA=-1 in quota engine  https://review.opendev.org/c/openstack/neutron/+/93114310:12
gsamfiraall ports on the same router10:13
gsamfira*same chassis10:13
ralonsohgsamfira, why do you need that? If you have ports in internal networks (tunnelled or not), the traffic from this VM ports with be sent to the GW LRP chassis, if the traffic needs to go the external network10:15
ralonsohbut there is no need to bind the internal LRP10:15
gsamfirathe traffic works from vm --> BM10:15
gsamfirabut not the other way around10:15
gsamfirawithout binding the internal port to the same chassis as the external port in that same VLAN 10:16
gsamfirait gets no arp reply10:16
gsamfirafrom the gw10:16
gsamfirain tcp dump we just see arp who has 10.31.1.110:16
gsamfiraand no reply10:16
ralonsohwhy do you use internal VLAN networks?10:16
gsamfirawe'd like to be able to connect ironic managed servers to other tenant networks10:17
ralonsohok, I don't know if we test this configuration10:18
ralonsohbut that's another topic (maybe another bug)10:18
gsamfirak8s cluster running on VMs, a few large databases running on bare metal, they should be able to communicate and benefit from security groups and all that10:19
gsamfiraI can open another bug and perhaps kick off a discussion. But I'm new to neutron; not sure what the best way to do that would be. 10:20
gsamfirathis feels like a change that should be discussed first :)10:21
ralonsohopen a LP bug: https://bugs.launchpad.net/neutron/10:21
gsamfiraand I don't want to waste people's time10:21
gsamfirasounds good10:21
ralonsohat least to report the use of VLAN networks as internal ports in a router10:21
gsamfirahttps://bugs.launchpad.net/neutron/+bug/1995078/comments/34 <-- seems like the same scenario I described. They created a router, set a GW and attached a VLAN network10:33
gsamfirabut I will open a new bug and add more context10:33
ralonsohmuch better to have this is a separate context10:34
gsamfirayup10:34
ralonsohthis is a problem that has entity by itself10:34
gsamfiraI appreciate the guidance ralonsoh . I know you folks are swamped :)10:34
gsamfirais there a neutron + ironic channel or meeting that takes place?10:35
ralonsohthere is a team meeting on tuesday10:41
ralonsohhttps://meetings.opendev.org/#Neutron_Team_Meeting10:42
opendevreviewRodolfo Alonso proposed openstack/neutron master: add pyproject.toml to support pip 23.1  https://review.opendev.org/c/openstack/neutron/+/89995610:58
opendevreviewTakashi Kajinami proposed openstack/neutron master: Replace deprecated is_advsvc  https://review.opendev.org/c/openstack/neutron/+/93157411:29
opendevreviewJakub Libosvar proposed openstack/ovn-bgp-agent master: Refactor ovn_bgp_agent.privileged.ovs_vsctl  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/93174311:52
gsamfiraawesome! Thanks!11:56
opendevreviewEduardo Olivares proposed openstack/ovn-bgp-agent master: Refactor ovn_bgp_agent.privileged.ovs_vsctl  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/93174312:45
opendevreviewBrian Haley proposed openstack/neutron stable/2024.1: [OVN] Check metadata HA proxy configuration before restart  https://review.opendev.org/c/openstack/neutron/+/93179114:43
opendevreviewMerged openstack/neutron master: [Fullstack] Add active waiting for one HA agent to be active for router  https://review.opendev.org/c/openstack/neutron/+/93176616:19
opendevreviewMerged openstack/ovn-bgp-agent master: Refactor ovn_bgp_agent.privileged.ovs_vsctl  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/93174316:20
opendevreviewMerged openstack/neutron stable/2024.2: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186316:53
opendevreviewMerged openstack/neutron stable/2024.1: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186416:53
opendevreviewMerged openstack/neutron stable/2023.1: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186617:27
opendevreviewMerged openstack/neutron unmaintained/zed: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93187117:48
opendevreviewMerged openstack/neutron master: Use the constant UNLIMITED_QUOTA=-1 in quota engine  https://review.opendev.org/c/openstack/neutron/+/93114319:13
opendevreviewMerged openstack/neutron stable/2024.2: Ensure the OVSDB and ovn-northd process stop  https://review.opendev.org/c/openstack/neutron/+/93101419:13
opendevreviewMerged openstack/neutron stable/2024.1: [OVN] Check metadata HA proxy configuration before restart  https://review.opendev.org/c/openstack/neutron/+/93179119:13
opendevreviewMerged openstack/neutron stable/2023.2: [CI] Functional: Increase Ulimit to 4096  https://review.opendev.org/c/openstack/neutron/+/93186520:52
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: neutron/plugins/ml2/drivers/ovn/mech_driver/ovsdb/*  https://review.opendev.org/c/openstack/neutron/+/92986621:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: neutron/db/l3*  https://review.opendev.org/c/openstack/neutron/+/92986721:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: use dataclass for AgentConsumer  https://review.opendev.org/c/openstack/neutron/+/92986821:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: neutron/extensions/*  https://review.opendev.org/c/openstack/neutron/+/92986921:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: neutron/agent/*  https://review.opendev.org/c/openstack/neutron/+/92987021:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: mypy: neutron/objects/*  https://review.opendev.org/c/openstack/neutron/+/92987121:38
opendevreviewIhar Hrachyshka proposed openstack/neutron master: Remove dhcp.PORTS attribute  https://review.opendev.org/c/openstack/neutron/+/93199321:47

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!