Wednesday, 2026-06-03

*** ykarel__ is now known as ykarel04:46
ralonsohsean-k-mooney, gibi hello folks. Yesterday I was looking at the issues with pyroute2 and asyncio in the nova-compute agent07:12
ralonsohI know this is not a solution but it could help, as a workaround, to wrap all pyroute2 operations with a privsep context07:12
ralonsohthese operations will be executed in the daemon process instead of running in the main process07:13
ralonsohthat will avoid the asyncio problem experienced when executing non-privileged commands07:13
ralonsohthat will allow us to bump the pyroute2 version07:14
gibiralonsoh: as I'm not sure about the root cause of the issue I cannot judge if moving the pyroute2 calls to another process will help or not. Have you tried to see if it helps?07:24
ralonsohgibi no, but I can propose an os-vif patch, at least for the OVS related commands08:20
ralonsohlet me try it08:20
gibicool08:21
opendevreviewRodolfo Alonso proposed openstack/os-vif master: Make ``PyPyroute2._lookup_interface`` private  https://review.opendev.org/c/openstack/os-vif/+/99129208:34
opendevreviewRodolfo Alonso proposed openstack/os-vif master: Make ``PyRoute2.exists`` privileged in the OVS library  https://review.opendev.org/c/openstack/os-vif/+/99129308:47
ralonsohgibi, I've proposed https://review.opendev.org/c/openstack/requirements/+/973210, depending on ^^08:54
ralonsohnova-alt-configurations-os-vif should install this os-vif patch08:55
opendevreviewdalekseev proposed openstack/nova master: Restrict machine type check to QEMU instances  https://review.opendev.org/c/openstack/nova/+/99113708:55
opendevreviewJoan Gilabert proposed openstack/nova master: move compile earlier  https://review.opendev.org/c/openstack/nova/+/95051609:12
opendevreviewJoan Gilabert proposed openstack/nova master: Add mtty/mdpy support for testing fake mdevs  https://review.opendev.org/c/openstack/nova/+/89810009:12
opendevreviewribaudr proposed openstack/nova master: Add regression test for bug #2120927  https://review.opendev.org/c/openstack/nova/+/99129409:17
opendevreviewribaudr proposed openstack/nova master: Fix shelve-offload/unshelve race wiping instance host  https://review.opendev.org/c/openstack/nova/+/99129509:17
sean-k-mooneyralonsoh: im not really a fan of that idea by the way10:17
sean-k-mooneyi would prefer to just run the command in a futurist process pool10:18
sean-k-mooneythan to pretend the call needs privilege escalation10:18
sean-k-mooneyor provide a cli driver that we use when eventlet is enabled10:19
sean-k-mooneynova-compute already defaults to threaded mode. in september we can start removing eventlet support for the 2027.1 cycle if we choose to10:20
sean-k-mooneyat which point we wont need the privsep hack10:20
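A minimal sketch of the process-pool idea above, using the standard library's `concurrent.futures` as a stand-in for futurist. The helper name `run_in_worker` and the choice of the `fork` start method are assumptions for illustration; this is not os-vif or nova code.

```python
import concurrent.futures
import multiprocessing


def run_in_worker(func, *args):
    """Run func(*args) in a short-lived child process and return its result.

    Hypothetical helper: the callable executes outside the calling process,
    so any event loop or socket state it creates stays in the child.
    """
    # Assumption: "fork" keeps the sketch self-contained on Linux; a real
    # service would have to pick a start method that suits its threading model.
    ctx = multiprocessing.get_context("fork")
    with concurrent.futures.ProcessPoolExecutor(max_workers=1, mp_context=ctx) as pool:
        future = pool.submit(func, *args)
        # result() blocks until the child finishes and re-raises its exception, if any.
        return future.result()
```

The callable and its arguments must be picklable (for example a module-level function), since they are sent to the child over a queue. No privilege escalation is involved, which is the point of preferring this over a privsep context for unprivileged calls.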
sean-k-mooneyfrickler: so the next issue after you fix the first-boot being interrupted is apparently getting scheduled to a rax xen host where the cirros image takes 11 seconds for the initramfs to start, 19 seconds for /etc/init.d/rc.sysinit: and 58 seconds to get to the login prompt when we only wait 15 seconds for the system to boot...10:26
sean-k-mooneyso i dont think we can fix random slow nodes but thats progress at least10:27
ralonsohsean-k-mooney, but if you are running in threaded mode, why does pyroute have this issue?10:30
sean-k-mooneyim not sure it does have it in threaded mode10:30
sean-k-mooneybut we support both modes and we only enabled threading by default this cycle10:30
sean-k-mooneyas in its only been that way for like 2 months10:31
sean-k-mooneyralonsoh: i have not had time to look at the actual issue in quite a while10:32
sean-k-mooneybut perhaps there are other approaches we could take10:32
sean-k-mooneyis there a way to reproduce this issue locally?10:32
sean-k-mooneymy understanding was it was only seen in ci intermittently10:32
ralonsohno, that happens always in the CI when bumping pyroute to 0.9.610:34
ralonsohI'm not sure exactly in what command10:34
sean-k-mooneyoh ok if its repeatable can we trigger it with tempest locally in devstack10:36
sean-k-mooneyim asking because if we can i can maybe take a look at it in a vm and see if i can debug it a bit10:36
opendevreviewJoan Gilabert proposed openstack/nova master: WIP : Add mtty support to nova-next  https://review.opendev.org/c/openstack/nova/+/92214012:04
opendevreviewAshish Gupta proposed openstack/nova master: tests: file-backed SQLite with WAL in threading mode for Database and CellDatabases Fixtures  https://review.opendev.org/c/openstack/nova/+/98858312:45
opendevreviewRodolfo Alonso proposed openstack/os-vif master: Make ``PyPyroute2._lookup_interface`` private  https://review.opendev.org/c/openstack/os-vif/+/99129213:31
opendevreviewRodolfo Alonso proposed openstack/os-vif master: Make ``PyRoute2.exists`` privileged in the OVS library  https://review.opendev.org/c/openstack/os-vif/+/99129313:32
opendevreviewribaudr proposed openstack/nova master: Add reproducer for bug #2117544  https://review.opendev.org/c/openstack/nova/+/99135013:34
opendevreviewribaudr proposed openstack/nova master: Deserialize JSON properties from volume_image_metadata  https://review.opendev.org/c/openstack/nova/+/99135113:34
opendevreviewribaudr proposed openstack/nova master: Deserialize JSON properties from volume_image_metadata  https://review.opendev.org/c/openstack/nova/+/99135113:36
opendevreviewTakashi Kajinami proposed openstack/nova-specs master: libvirt: AMD SEV-SNP support  https://review.opendev.org/c/openstack/nova-specs/+/98337613:50
UgglaI guess there is a dependency resolution issue with grenade and stable/2025.1  https://zuul.opendev.org/t/openstack/build/23ff7cd6c0e649a9b83bb7df6f9dfc7b14:40
Ugglaelodilles are you aware of ^ 14:40
elodillesUggla: yes, rpds-py does not support python 3.10 anymore. on QA channel it was discussed the other day and the plan is to release tempest and pin it for jobs that still use Ubuntu Jammy 22.04 that has python3.10 by default14:50
Ugglaelodilles thx, so I will wait for the fix.14:52
elodilles+114:56
gmaanelodilles: Uggla which job failing, I can fix that as i sent in email14:59
Ugglagmaan it is nova-grenade-multinode15:00
Ugglagmaan you can see it from here : https://review.opendev.org/c/openstack/nova/+/98815415:01
elodillesgmaan: i can update this patch with your fix to unblock the gate https://review.opendev.org/c/openstack/nova/+/98957915:02
elodilleso:)15:02
UgglaReminder upstream bug triage in ~30mn15:02
gmaanelodilles: Uggla ohk grenade on 2025.1 which i suggest to remove it :) but if you are fixing it is ok15:03
elodillesUggla: sorry, i didn't have enough time for bug triaging this time, so i'll just listen in today :S15:03
elodillesgmaan: yes, yes, that we agreed to remove o:)15:04
elodillesgmaan: my fix was about to land when the upper-constraints were bumped and struck again on the grenade o:)15:05
Ugglaelodilles, no worries, tbh I do not expect much this week on the triage due to review activities.15:06
*** ralonsoh is now known as ralonsoh_ooo15:09
UgglaUpstream bug triage: https://meet.google.com/zjr-rxus-hzj15:28
sean-k-mooneythat's in 20 minutes right?16:11
sean-k-mooneyoh no it was an hour ago16:11
sean-k-mooneyis it 15:30 UTC ?16:12
melwittsean-k-mooney: yes 15:30 UTC16:23
gmaanelodilles: commented on grenade job fix. you still need to pin the tempest https://review.opendev.org/c/openstack/nova/+/98957917:28
gmaantempest change to support that is merged so no depends-on needed anymore17:28
opendevreviewGhanshyam Maan proposed openstack/nova stable/2025.1: [CI][stable-only] nova-grenade-multinode fix  https://review.opendev.org/c/openstack/nova/+/98957917:44
sean-k-mooneyso while debugging with the new cirros image i noticed something interesting18:04
sean-k-mooneyhttps://tinyurl.com/2k63h77v18:04
sean-k-mooneywhile it does not always cause a failure https://paste.opendev.org/show/b5V9zieRqmJVFjyb5uz9/18:05
sean-k-mooneynova is triggering that traceback like 50 times a day18:06
sean-k-mooneyor rather the volume_snapshot_delete assisted volume snapshot test18:06
sean-k-mooneyis triggering that often18:06
sean-k-mooneyif i remove the filter on nova that triggered 2341 times in the last 10 days18:07
sean-k-mooneythat's with lvm or ceph as the backend18:08
sean-k-mooneyit looks like if the snapshot is not found we explode because we dont have the type info18:09
melwittthat is a very old issue18:14
sean-k-mooneyit felt familiar 18:14
sean-k-mooneyi dont see a bug report for it quickly so im going to file one18:14
melwittyeah I'm looking for it, sec18:15
melwittok that works too18:15
sean-k-mooneyhttps://bugs.launchpad.net/nova/+bug/215518718:15
sean-k-mooneyi kind of hate "standard tracebacks" in logs18:16
sean-k-mooneythat's partly why i try to pretend os-brick is not a thing18:17
dansmithagree, tracebacks in logs for normal occurrences is extremely uncool18:17
dansmithwe had an effort like ten years ago to get rid of them all18:17
dansmithbut as you say, brick is like a traceback generator :)18:17
melwittyeah. I know there was a fix proposed for this exact problem and I can't remember why it stalled out. and I can't even find it right now 18:18
sean-k-mooneywe have definitely fixed other cases where we assume a key was always present18:18
sean-k-mooneyi dont recall if it was this exact one but we have had that class of issue with the cinder path a few times18:18
sean-k-mooneywhat i find interesting about this is we seem to mostly handle this internally18:19
sean-k-mooneyas in its a key error but of the 2200 or so hits only like 180 of those failed18:19
melwittit was this https://bugs.launchpad.net/nova/+bug/2033541 and this https://review.opendev.org/c/openstack/nova/+/90078318:20
sean-k-mooneyso at least in the volume delete path this is just ugly in the logs18:20
sean-k-mooneyah cool i can close it as a dupe then18:20
melwitt(that was what I was thinking of)18:20
melwittis the one you saw with ceph also?18:20
sean-k-mooneyyep18:21
melwittI swear google is working less and less. launchpad search found me that when google couldn't18:21
melwittyou might consider closing the older one as a dupe, it clearly is not getting searched easily haha18:22
sean-k-mooneyhttps://tinyurl.com/2k63h77v18:22
sean-k-mooneyso that's the case where the trace happened but the job passes18:22
sean-k-mooneybut if you widen that out18:22
melwittyeah. it absolutely happens all the time. and there was/is a patch proposed but it got stalled out for reasons i don't remember and I think that discouraged everything18:23
sean-k-mooneyits present in cinder cinder-plugin-ceph-tempest devstack-plugin-nfs-tempest-full18:23
sean-k-mooneyand cinder-tempest-lvm-multibackend18:23
sean-k-mooneyjob18:23
sean-k-mooneyso it seems to be backend independent18:23
sean-k-mooneyif for any reason the snapshot is not found 18:23
sean-k-mooneyit will explode18:23
melwittyeah it is the arbitrary dict thing which really sucks18:24
sean-k-mooneywell in my case there was a 40418:24
melwittsome backends use X keys some use Y keys and it's a wild west18:24
sean-k-mooneyi.e. the snapshot was not found in cinder during the delete test18:24
melwittthere's no standard for that payload, last I checked18:24
sean-k-mooneyand then we didn't have the info as a result18:24
sean-k-mooneymelwitt: ya i have noticed that too18:25
sean-k-mooneyi.e. that the payload is very backend specific18:25
melwittI remember digging into it a bit and getting really discouraged18:25
melwittyeah. and I think they are not documented either. so I'm not sure what the first step in the right direction is even18:25
melwittthat isn't just papering over something, I mean18:26
melwittit could probably be its own PTG topic18:26
sean-k-mooneyhave you heard of microversions :P18:26
melwittheh18:27
melwittI mean like for this, we could catch and ignore KeyError for this specific thing but the problem is bigger than that18:27
sean-k-mooneyskimming the patch comments im unsure if this just stalled out because of a 2 month delay in updating the patch for your -1 or if there was any other outstanding issue with it18:29
sean-k-mooneyhttps://review.opendev.org/c/openstack/nova/+/900783/13/nova/virt/libvirt/driver.py18:30
sean-k-mooneyi think is fine for this specific case18:30
sean-k-mooneyit just does not resolve that class of issue generally18:30
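The narrow fix discussed above, tolerating a missing key in the snapshot delete payload, could look roughly like this. The function name `get_snapshot_type` and the `type` key are hypothetical placeholders chosen for illustration; this is not the content of the linked nova patch.

```python
def get_snapshot_type(delete_info):
    """Return the payload's 'type' entry, or None if the backend omitted it."""
    try:
        return delete_info["type"]
    except KeyError:
        # Treat a missing key as "unknown" rather than raising; the caller
        # decides how to handle None (for example log a warning and skip).
        return None
```

As noted in the discussion, this only covers one key in one code path; it does not address the broader problem of backend-specific, undocumented payloads.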
opendevreviewRajesh Tailor proposed openstack/nova master: Fix KeyError on assisted snapshot call  https://review.opendev.org/c/openstack/nova/+/90078318:31
melwittok I see18:31
sean-k-mooneyi just rebased that in the ui lets see if it passes or not18:31
melwittcool thanks18:31
sean-k-mooneythat was july 2024 which was right around the time we were dealing with https://security.openstack.org/ossa/OSSA-2024-001.html18:34
sean-k-mooneyi.e. we were fixing the qemu image stuff and the fallout from breaking iso etc18:35
opendevreviewGhanshyam Maan proposed openstack/nova stable/2025.1: [CI][stable-only] nova-grenade-multinode fix  https://review.opendev.org/c/openstack/nova/+/98957921:07