14:00:43 #startmeeting networking 14:00:43 Meeting started Tue Jan 21 14:00:43 2025 UTC and is due to finish in 60 minutes. The chair is haleyb. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:43 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:43 The meeting name has been set to 'networking' 14:00:54 Ping list: bcafarel, elvira, frickler, mlavalle, mtomaska, obondarev, slaweq, tobias-urdin, ykarel, lajoskatona, jlibosva, averdagu, amotoki, haleyb, ralonsoh 14:00:55 \o 14:01:00 hello 14:01:00 ralonsoh, I couldn't login to edit the wiki page, so if there will be one-two minute for an ad-hoc announce in the end of the meeting, would be nice. 14:01:20 ralonsoh, sure, there will be time 14:01:22 o/ 14:01:38 o/ 14:01:58 o/ 14:01:59 svinota: there will be an on-demand time at the end, thanks for attending 14:02:06 thanks 14:02:13 #announcements 14:02:35 We are in week R-10 if my math is correct 14:02:41 #link https://releases.openstack.org/epoxy/schedule.html 14:03:15 Rodolfo Alonso proposed openstack/neutron master: WIP == [eventlet-deprecation] Remove the usage of eventlet in the Neutron API https://review.opendev.org/c/openstack/neutron/+/938659 14:03:20 in R-6 we will have non-client library freeze 14:03:32 Rodolfo Alonso proposed openstack/neutron master: DNM - Test "neutron-ovn-tempest-ipv6-only-ovs*" with WSGI https://review.opendev.org/c/openstack/neutron/+/932601 14:03:33 in R-5 will be E-3 milestone 14:03:38 late o/ 14:05:07 Reminder: If you have a topic for the drivers meeting on Friday, please add it to the wiki @ https://wiki.openstack.org/wiki/Meetings/NeutronDrivers 14:05:18 And of course a reminder to use the priorities dashboard for patches in the "ready to merge" state 14:06:05 I sent the Linuxbridge removal email to the ML last week, and it seems people have accepted is was the right thing to do 14:07:01 great :) thx haleyb for doing this finally 14:07:02 there was a related email from Cern regarding knowledge sharing 14:07:06 good question if heores will move the code to x/ namsspace, let's see:-) 14:08:52 i will not hold my breath on that, i would hope we can spend any time helping with migration questions and OVN adoption 14:09:12 +1 for pushing toward migration 14:09:47 i did not have any other announcements 14:10:16 #topic Bugs 14:10:21 lajoskatona was deputy last week 14:10:26 #link https://lists.openstack.org/archives/list/openstack-discuss@lists.openstack.org/thread/JIRUHN7HMJUDB7KB55UYUZ5Q2IK2MC6X/ 14:10:36 there were quite a few 14:11:19 Sahid Orentino Ferdjaoui proposed openstack/neutron master: async_process: fix potential race condition with respawn https://review.opendev.org/c/openstack/neutron/+/939627 14:11:20 Sahid Orentino Ferdjaoui proposed openstack/neutron master: async_process: remove usage of eventlet for AsyncProcess https://review.opendev.org/c/openstack/neutron/+/939348 14:11:20 Sahid Orentino Ferdjaoui proposed openstack/neutron master: ovs: reimplement signals handling https://review.opendev.org/c/openstack/neutron/+/939321 14:11:21 Sahid Orentino Ferdjaoui proposed openstack/neutron master: common: fix wait_until_true to support native thread https://review.opendev.org/c/openstack/neutron/+/937843 14:11:21 Sahid Orentino Ferdjaoui proposed openstack/neutron master: ovs: remove the usage of eventlet in the OVS agent https://review.opendev.org/c/openstack/neutron/+/937765 14:11:33 lajoskatona: were there any bugs you wanted to highlight? 14:12:31 not really there were few without owners 14:12:55 but as I remember the high prio ones are under control 14:13:15 ok 14:13:18 https://bugs.launchpad.net/neutron/+bug/2094842 14:13:27 this one in the high list without owner 14:13:37 "ovs.db.error.Error: ovsdb error: 0 values when type requires between 1 and 9223372036854775807" when router is attached to network without subnet 14:13:40 that looks familiar, or at least we just fixed something related in quota 14:14:10 at least it looks like the quota one, but it wasn't in ovsdb of course 14:14:40 the cause is explained in the description 14:14:41 oh, maybe this is related to a fix ralonsoh had regarding gw_port without an IP address? 14:14:51 we can't create a GW port wihtout subnet in the GW network 14:15:13 haleyb, I don't have anything yet 14:15:25 there is a quick attempt https://review.opendev.org/c/openstack/neutron/+/939253 14:15:32 but this is something that needs to be tested 14:16:12 ralonsoh: ah, that was it, it was ihrachys not you, sorry i mis-remembered 14:17:03 it is visible in zuul logs also not just for master 14:17:13 see the opensearch logs I added as comment 14:18:16 ack, i guess we can wait until that change is updated to see if it fixes it 14:18:33 +1 14:18:56 another one ihar filed was 14:19:02 #link https://bugs.launchpad.net/neutron/+bug/2094846 14:19:22 trunk driver functional failure 14:20:26 according to codesearch there was only 1 failure so far, but it looks like something that shouldn't have happened 14:20:32 it happened only once 14:20:40 and this is a functional test 14:20:53 I would reduce the importance, but I'll keep it in the list 14:21:17 yes, let's see if it happens again first 14:21:21 it is medium at the moment due to low frequesncy, but we can dereease 14:22:19 sure 14:22:38 there was another medium i started to take a look at 14:22:42 #link https://bugs.launchpad.net/neutron/+bug/2095097 14:22:49 500 error creating resource with emoji for name 14:23:11 it happens for any string field, for example, description too 14:23:28 that's a funny one, but really happens 14:23:50 my original thought was to just fix the network db code, but now i think we should change neutron-lib to add a validator for such fields 14:23:59 but i wanted to get others thoughts on that 14:24:16 an extra check for printable chars is ok 14:24:23 haleyb: yes a validator sounds good 14:24:30 it won't break anything using "normal" chars 14:24:50 and with the n-lib validator, we'll have a printable exception, not error 500 14:24:50 and i'm not sure it's a medium as i originally tagged it, since it only affects the caller 14:25:21 the only problem here is to return a valid exception 14:25:22 yes, a quick hack of the network db code and i could get a 400, 14:25:23 not 500 14:25:33 so I would lower the priority 14:25:37 i.e. InvalidInput such exception 14:25:43 exactly 14:26:26 nova gives back 400 for such input for example 14:26:54 as it should be, this is a bad request, not a server error 14:27:19 exactly, so good to have simlar response from Neutron API also 14:27:53 ++ for some 4xx error in such case 14:28:23 i took it and can work on a validator 14:28:40 next unowned bug 14:28:45 #link https://bugs.launchpad.net/neutron/+bug/2095185 14:29:03 ihrachys was busy last week 14:29:33 test_legacy_router_conntrack_helper failure 14:29:35 even on Friday I got a bunch of bugs from Him :-) 14:29:51 he gave a possible solution, but would need more investigation 14:29:58 this one was seen in zuul logs only on stable/ 14:30:07 last sentence of the description is a clue for a fix 14:30:19 so I would give a try 14:30:26 but as he wrote can happen on master 14:30:34 Ihar Hrachyshka proposed openstack/neutron master: QinQ implementation for the ML2/OVN backend https://review.opendev.org/c/openstack/neutron/+/937633 14:30:51 right, just needs an owner if someone can take it 14:31:38 the last bug without owner is 14:31:44 #link https://bugs.launchpad.net/neutron/+bug/2095091 14:32:28 ykarel_: you filed this, does it happen frequently enough to debug it further? 14:32:39 yesterday we discussed on the CI meeting with ykarel and slwaq and mlavalle 14:32:49 haleyb, happened only once 14:32:50 ah 14:33:02 mlavalle, will be working on alternative 14:33:02 * haleyb was out yesterday as it was a holiday 14:33:32 ok, as long as someone is working on it 14:33:45 any other bugs to discuss? 14:34:19 ykarel is the deputy this week, mtomaska next week 14:34:26 is that ok? 14:35:33 i will follow-up later so we can move along with other items 14:35:37 #topic community goals 14:36:04 haleyb ykarel I'll keep an eye on that failure 14:36:09 ack 14:36:22 lajoskatona: seems forward progress on neutronclient deprecation? 14:36:28 #link https://review.opendev.org/q/topic:%22bug/1999774%22 14:36:43 haleyb, ack for deputy, thx for the reminder 14:36:44 not this week, I had no time for it 14:37:16 well, at least both open changes have zuul +1, i will take a look at the second in the chain 14:37:26 thanks 14:38:22 and evenlet deprecation 14:38:35 this is not going well 14:38:47 I can't identify what is happening 14:38:59 initially the hash ring manager was a possible culprit 14:39:22 but since last week (we merged a patch), the timeouts are increased and we don't have idle nodes 14:39:49 now the problem is, sometimes, that the events are processed in different order, DB locks and other problems I can't explain 14:40:02 I'm trying to open the related bugs 14:40:09 and I'm trying with this: 14:40:16 https://review.opendev.org/c/openstack/neutron/+/938659 14:40:42 there are many places in the code (python-ovs, ovsdbapp, Neutron, etc) where the libraries have workarounds for eventlet 14:40:44 so I' 14:40:46 sorry 14:41:01 so I would prefer testing Neutron API without eventlet 14:41:07 Merged x/whitebox-neutron-tempest-plugin master: Add config dict dataplane_podified_services https://review.opendev.org/c/x/whitebox-neutron-tempest-plugin/+/939519 14:41:10 because this is actually the expected behaviour 14:41:29 and that's all (I'm overwhelmed right now) 14:43:20 ralonsoh: what else can we do to help? i know ihrachys was helping last week, and i can look at things this week 14:43:36 but i realize it is very complicated 14:43:47 haleyb, first of all, trying to check what is happening in CI patches like https://review.opendev.org/c/openstack/neutron/+/932601/12?tab=change-view-tab-header-zuul-results-summary 14:44:17 even without eventlet, I see common errors (ports not active, PG not present, etc) 14:44:53 so, for example, to take one CI log and try to squeeze it to find what was the problem 14:45:30 ack, and i would ask anyone with spare cycles to take a look so we come up with some ideas 14:46:17 I ry to allocate some time to it, anyway the n-d-r issue is also there with the same root 14:46:24 thanks 14:46:41 i want to move to on-demand as svinota is here 14:46:44 #topic on-demand 14:46:52 svinota, hello! 14:46:56 hello 14:47:06 and thanks for your time :) 14:47:17 thanks for pyroute2 :) 14:47:26 Welcome :-) 14:47:38 so shortly, the pyroute2 project is undergoing a big (to say the least) shift in the core 14:48:07 I continue to support the old synchronous API, it will not be dropped 14:48:28 but from 0.9.1 the core is based on asyncio 14:48:49 related docs: https://docs.pyroute2.org/asyncio.html 14:48:58 also: https://docs.pyroute2.org/iproute_intro.html 14:49:15 thanks to ykarel , I start to monitor the zuul as well 14:50:02 so I hope not to miss major compatibility errors; beside of that I run own integration tests 14:51:15 I don't know how this shift can affect the neutron core, so as I said — I keep the sync API anyways 14:51:54 but if the project can provide something more within the async paradigm — let me know 14:52:01 is there a point in time where we should start migrating things? will there be gotcha's, etc? 14:52:05 that's all folks :) thanks 14:52:12 this is a bug change, of course. We use pyroute "only" in the agents (luckily, because asyncio is not compatible with wsgi) 14:52:35 and most of the pyrouter calls are executed inside a root daemon (privsep) 14:52:59 so we'll need to refactor the non-admin calls and the admin calls independetly 14:53:08 can both coexist right now? 14:53:29 0.9.1 is scheduled for the beginning of February, but as I see, the current neutron code works with the master branch 14:53:42 and the master branch is already async-enabled 14:53:46 perfect 14:53:53 so I believe no migration is required 14:54:07 but eventually this is the desired implementation, right? 14:55:05 async is the first class citizen now, so the sync code is simply a set of wrappers around it; so you can use either variant, totally up to you, both will coexist 14:55:24 perfect 14:55:32 in the best case the library users should not notice any change 14:55:59 svinota thx for sharing all of that with us 14:56:05 (it took like some months to fix it in that way, but I hope it pays back) 14:56:14 but neutron is not the only user of the pyroute2 in openstack 14:56:25 os-vif, right 14:56:27 did you maybe had any chance to test it with other projects as well? 14:56:34 and other projects in networking 14:56:38 octavia I see is using it a bit 14:57:06 I tested kuryr. Octavia should work as before. 14:57:08 so this should be on us to create pyroute-master CI jobs 14:57:11 https://codesearch.opendev.org/?q=pyroute2&i=nope&literal=nope&files=&excludeFiles=&repos= 14:57:16 I see many of them really 14:57:16 I forgot os-vif, thanks 14:57:33 I don't want you to test them all on your own 14:57:58 ralonsoh: we do have functional and fullstack pyroute2 master jobs 14:58:05 slaweq, thanks for the link. I will try to go through the list and see if we can include them in the integration testing 14:58:07 but maybe you could send email to the openstack-discuss ML to raise awarness of this change in different teams 14:58:14 haleyb, yes but for all projects? 14:58:43 ralonsoh: i only know of neutron 14:58:50 perhaps some common tempest integration job? 14:58:59 that an cover a lot of projects 14:59:25 svinota thx a lot, if you will have any questions about it, please ping me :) 14:59:36 slaweq, 10x 15:00:01 ok, let's create a bug in LP to handle this 15:00:11 to create the needed CI jobs for pyroute-master 15:00:16 functional or tempest 15:00:37 and, of course, svinota thanks a lot 15:00:55 and advertise it on mail list to have a flag for all users not just for us 15:01:09 thanks, colleagues 15:01:22 svinota thanks for the good cooperation and for the news :-) 15:01:38 yes, thanks a lot svinota! 15:02:01 does anyone want to file the bug, etc? or is this on the PTL :) 15:02:32 always the one who asks :-) 15:02:43 haha 15:02:58 ok, i have to drop for another meeting 15:03:07 thanks everyone for attending, have a good week 15:03:12 o/ 15:03:14 #endmeeting