Thursday, 2023-09-21

opendevreviewlikui proposed openstack/neutron-vpnaas master: Update UPPER_CONSTRAINTS_FILE URL  https://review.opendev.org/c/openstack/neutron-vpnaas/+/89599002:04
opendevreviewlikui proposed openstack/neutron-vpnaas master: Update TOX_CONSTRAINTS_FILE URL  https://review.opendev.org/c/openstack/neutron-vpnaas/+/89599002:05
*** ravlew is now known as Guest80804:18
fricklerhaleyb: ralonsoh: please check https://review.opendev.org/c/openstack/os-ken/+/894134 (usually I think bot patches can be single-approved, not sure how neutron handles this in general?)07:00
ralonsohlet me check07:25
opendevreviewMerged openstack/os-ken master: Update master for stable/2023.2  https://review.opendev.org/c/openstack/os-ken/+/89413407:38
sahidlajoskatona: o/08:05
sahidregarding the rpc workers change, is that only related to neutron server or should we also update that on agents aswell?08:06
lajoskatonasahid: Hi, the workers' settings are for the server(s), so I would change it only on the server side08:14
sahidfor a given cluster of 4 neutron-server, we are agree we don't have any notion of a master which could get more traffics or doing more work?08:29
sahidone of how neutron server is doing twince of the work compared to the others08:29
sahids/how/our08:29
opendevreviewRodolfo Alonso proposed openstack/neutron-lib master: Add the "cancellable" flag to the ``CallbacksManager`` events  https://review.opendev.org/c/openstack/neutron-lib/+/89594008:31
opendevreviewRodolfo Alonso proposed openstack/neutron master: Make ``OVNMechanismDriver.post_fork_initialize`` callback cancellable  https://review.opendev.org/c/openstack/neutron/+/89594608:32
ralonsohslaweq, hi! if you have 1 minute: https://review.opendev.org/c/openstack/neutron/+/89344708:32
ralonsohah, and https://review.opendev.org/c/openstack/neutron/+/887977 (if you have time)08:42
sahidi'm suspecting something when a agent is reconnecting to the server08:46
sahidit looks like it's always the same that read the rpc08:46
*** elodilles_pto is now known as elodilles08:47
opendevreviewLucas Alvares Gomes proposed openstack/neutron master: [OVN] External ports scheduling (WIP)  https://review.opendev.org/c/openstack/neutron/+/89476709:02
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Add a log message after the "post_fork_initialize" method  https://review.opendev.org/c/openstack/neutron/+/89600909:11
mohanhi10:12
mohanneutron-bug-deputy: could you have a look at : https://bugs.launchpad.net/neutron/+bug/202833810:37
fricklermohan: which version of neutron are you seeing this on?12:09
mohanfrickler: it's been a while I've raised this bug , maybe ussuri not sure !! 12:42
racostaHello folks, has anyone who uses OVN seen or has any idea why neutron-ovn-metadata-agent uses a high CPU load? (~10% - OpenStack Yoga).  With approximately 60 networks/processes on a chassis I'm seeing that the processes are receiving port updates every 3 seconds.12:43
racostarecvfrom(13, "{\"id\":null,\"method\":\"update3\",\"params\":[\".............",{\"Port_Binding\":{\"...........12:43
fricklermohan: well if the issue isn't reproducible, there's not much chance of getting it resolved. the "incomplete" status calls for more information to make it reproducible, being able to state at least one affected version is a pretty important part of that12:46
racostaI'm talking about "idle" operation, without creating/removing VMs. It seems to me like unusual CPU usage in these conditions. Have you seen this felixhuettner[m]?12:49
mohanfrickler: I'm sure it's reproducable . Let me setup devstack and test it 12:49
ralonsohmohan, hi, please check in the logs the message "Agent has just been revived. Doing a full sync." or any other message in L3NATAgentWithStateReport._report_state, in the L3 agent logs 12:55
ralonsohyou can also check the "report_state" RPC call in the  Neutron API12:55
ralonsohracosta, what do you see in the logs?12:55
ralonsohare the VMs doing http requests for metadata?12:55
ralonsohand how many metadata workers do you have?12:56
racostaralonsoh, the neutron-ovn-metadata-agent log for the last 5 minutes is empty (I'm running without debug in production).12:58
racosta60 worker (per network)12:58
ralonsohno no, metadata workers12:58
ralonsohnot neutron api workers12:58
racostalet me check12:59
ralonsohif the system is not very active, you can reduce the number of "metadata_workers" to 013:00
ralonsohthat will 1) reduce the number of OVN DB connections and 2) will spawn the HTTP server on the same process as the metadata agent13:00
ralonsohif the number of http request is low (not too much activity in the compute node, regardless of the number of VMs) that will reduce the metadata cpu load13:01
racostaI'm not configuring metadata_workers, it's default.13:03
ralonsohwhat version are you using? that value depends on the version13:04
racosta16.x (Ussuri) and  20.x (Yoga). 13:09
racostaIt seems that in cluster mode (OVN) it increased... in this case with more OVN DB connections13:11
ralonsohracosta, yoga can have metadata_workers=0, spawning the http server in the same process13:19
ralonsohussuri doesn't13:19
ralonsohcheck how many metadata processes you have per compute node13:20
racostasorry, I confirmed that on Yoga, even with frequent port update events, CPU usage is very low.(with more than 40 metadata processes on a compute node).13:23
ralonsohwhat does it means 40 metadata processes?13:24
ralonsohdidn't you say that you didn't overwrite the "metadata_workers" variable?13:25
ralonsohif you have 40 metadata workers, you are killing the OVN DB, in particular the SB13:25
ralonsohplease, check that, 40 workers is unnecessary13:25
racostaExactly, that seems to be the problem.13:26
racostaThe Yoga deploy uses one metadata and N haproxy processes... In Ussuri it is N to N (no way)... 13:27
ralonsohno, we spawn one haproxy per metadata worker13:28
ralonsohthat didn't change13:28
*** jlibosva is now known as Guest84413:33
racostaok, but when haproxy is spawed in the same process it replaces the old nneutron-ovn-metadata-agent process, right?13:34
ralonsohno, the ovn metadata agent is running in one process. This process will start as many metadata_workers as configured. Each worker is a haproxy that receives the http requests and proxies them13:38
ralonsoheach worker has a OVN SB DB connection too13:38
racostaThanks, it makes sense. I see a lot of metadata_workers in ussuri because the default is very high (default=host.cpu_count() // 2)13:43
racostaI'll set this metadata_workers=0 in Ussuri and test asap. Thanks a lot for the help ralonsoh :)13:47
opendevreviewJakub Skunda proposed openstack/neutron-tempest-plugin master: Remove test duplication between tempest and n-t-p RoutersDVRTest  https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/89612214:05
opendevreviewRodolfo Alonso proposed openstack/neutron master: Add a "port" child table "porthardwareoffloadtype"  https://review.opendev.org/c/openstack/neutron/+/88283214:16
ralonsohslaweq, please check https://review.opendev.org/c/openstack/neutron/+/887977. Trivial stuff, no rush14:16
opendevreviewBrian Haley proposed openstack/neutron master: Fix pointless-string-statement pylint warning  https://review.opendev.org/c/openstack/neutron/+/89597514:44
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Populate the "router.distributed" flag in ML2/OVN  https://review.opendev.org/c/openstack/neutron/+/88699214:57
opendevreviewRodolfo Alonso proposed openstack/neutron master: Replace "tenant_id" with "project_id" in IPAM engine  https://review.opendev.org/c/openstack/neutron/+/87753315:01
opendevreviewLuis Tomas Bolivar proposed openstack/ovn-bgp-agent master: Add initial support for local OVN cluster instead of kernel-networking  https://review.opendev.org/c/openstack/ovn-bgp-agent/+/88177915:01
opendevreviewRodolfo Alonso proposed openstack/neutron master: [OVN] Remove backwards compatibility with OVN < v20.09  https://review.opendev.org/c/openstack/neutron/+/88489815:02
ralonsohbcafarel, hi! if you have 2 mins: https://review.opendev.org/c/openstack/neutron/+/89543315:03
ralonsohthanks!15:03
jlibosvahaleyb: hey Brian :) thanks for commenting on https://review.opendev.org/c/openstack/neutron/+/895849/1/tools/configure_for_func_testing.sh - it was just to test provide testing results for the depending patch but perhaps we can bump it and merge. Is there a plan which versions we intend to use, do they have to be LTS?15:10
ralonsohjlibosva, for functional doesn't need to be the LTS one, this is why we are compiling it15:12
ralonsohwe use the distributed version in the OS in the integrated networking job (and other n-t-p ones)15:13
jlibosvaralonsoh: I think it would be good to have those aligned but the distribution one is quite "old" for master branch. Thinking about scenario where we work on a feature that requires OVN bits we may need fairly new OVN15:17
ralonsohjlibosva, but this is why we need to override the distributed version. We need to keep compatibility with 22.03 (during this release will remove compatibility wiht 20.03)15:17
ralonsohbut FT will need to check newer features implemented in newer versions15:18
jlibosvawhy would we want FT to be different? it's just a different level of testing. imho we should claim "This current stable release works with this OVN version"15:19
jlibosvaif we use 2 different versions of OVN in different testing suites then we may be missing coverage for the version we claim works15:19
ralonsohto test new features, as commented15:19
jlibosvabut then we won't test them with other jobs15:20
ralonsohwe can't be stuck in 20.03 or 22.03 during 2 years15:20
jlibosvaI agree with that15:20
haleybjlibosva: hey Kuba. I just noticed the LTS comment there, and v22.03.x is Ubuntu LTS for cloud, which is the underlying OS for most things here. If the goal is new features it doesn't fit15:20
ralonsohno, in the other jobs will check using the released OVN version, checking that there are no regressions15:20
haleybthe previous version was v23.03.3 which is stable15:20
haleyberr, maybe to me 23.03 == LTS, habit15:21
jlibosvaso the idea is to stay on LTS Ubuntu until the next LTS, with all the packages that come with it (e.g. OVN being 22.03) and then have FT with some new compiled stable OVN?15:24
jlibosvaand we won't test a Neutron feature, if it relies on some late OVN code, until the next Ubuntu LTS release?15:25
jlibosvatest end-to-end that is15:25
ralonsohFor example: chassis_private support. It was implemented after 20.03, but we started supporting and testing it in FT15:27
ralonsohusing newer versions15:27
jlibosvabut with tempest we stayed with chassis?15:27
ralonsohyes when using Focal (ovn 20.03)15:27
ralonsohnow we test it with Jammy (ovn 22.03)15:27
ralonsohin any case, we have "two" tempests: we have "tempest-integrated-networking", that uses always the released OVN in the OS15:28
jlibosvathanks for explanation - so now with the FT, I understand it would be ok to bump it to stable non-LTS version?15:28
ralonsohand n-t-p, where we alse override the OVN version15:28
haleybjlibosva: well, in this case we're building OVN by default, and looking at the changelog it went from main/20.03/20.06/22.03... so i guess in this case 22.06.1 is ok.15:29
jlibosvagot it15:29
ralonsohyes for FT if we need to test new features15:29
haleybwhat rodolfo said :)15:29
jlibosvaok, so now what would be a good version? the latest stable or the earliest that is required for a feature?15:30
haleybmaybe it was just 22.06 seemed random, if it was 23.0x that would be newer, not sure how bleeding edge we want to go15:31
-opendevstatus- NOTICE: The lists.openinfra.dev and lists.starlingx.io sites will be offline briefly for migration to a new server15:31
jlibosvaI put 22.06 as this was the first release with additional_chassis column that I needed15:32
jlibosvabut still, this is more than a year old release15:33
jlibosvaancient :)15:33
haleybah, makes sense. v23.09.0 just dropped, as did v23.06.215:33
jlibosvaso maybe we want newer. I don't know. I'm asking15:33
ralonsohin FT, I would use the newer one, testing that this version is not breaking anything in neutron15:34
jlibosvamy thinking is if we use 23.09 then maybe we won't need to change it for another year, perhaps just the minor version for bugfixes if any15:34
ralonsohwe have tempest to avoid regressions in the Neutron code15:34
haleybi don't know either, anything newer should work, if not we change when we find a bug15:34
jlibosvabut we have conditions in Neutron that depends on OVN schema version, so the regressed code might not be hit until we have the updated schema15:34
jlibosvadepend*15:35
jlibosvaok, so I'm gonna make that patch more "official" and will use the v23.09.0 tag, how about that?15:35
ralonsoh+115:35
haleyb+1, and that requires 1d78a3f3164a6bf651b34f52812f38655b28a9ce ovs (v3.2.0-20-g1d78a3f31)15:36
haleybfrom 'git submodule'15:36
jlibosvaperfect, thanks a lot! :)15:36
haleybi'm assuming 'git submodule update' did the right thing15:36
ralonsohyes15:37
opendevreviewRodolfo Alonso proposed openstack/neutron master: Add a new extension "security-groups-rules-belongs-to-default-sg"  https://review.opendev.org/c/openstack/neutron/+/88390715:50
opendevreviewRodolfo Alonso proposed openstack/neutron master: Add a new extension "security-groups-rules-belongs-to-default-sg"  https://review.opendev.org/c/openstack/neutron/+/88390716:37
opendevreviewMerged openstack/neutron master: Create a single method to set the quota usage dirty bit  https://review.opendev.org/c/openstack/neutron/+/88797717:52
opendevreviewJakub Libosvar proposed openstack/neutron master: ovn-metadata: Refactor events  https://review.opendev.org/c/openstack/neutron/+/89616320:32
opendevreviewJakub Libosvar proposed openstack/neutron master: ovn-metadata: Refactor events  https://review.opendev.org/c/openstack/neutron/+/89616320:47
*** damian___921832 is now known as damian___9218320:56
opendevreviewJakub Libosvar proposed openstack/neutron master: ovn-metadata: Refactor events  https://review.opendev.org/c/openstack/neutron/+/89616321:00

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!