Thursday, 2025-06-19

*** tosky is now known as Guest1839707:37
*** tosky_ is now known as tosky07:37
jlejeunehello, I'm also impacted by that bug : https://bugs.launchpad.net/nova/+bug/2084238, do you know if someone begin to work on it ?09:41
jlejeuneI confirm that a request-id is set for new instances (created after migration to 2023.1) but that field stays at NULL for instances created before the migration10:03
gibijlejeune: I don't know about any work on that10:03
jlejeuneok thanks, maybe it's something the migration tools should take care10:04
jlejeuneto fill random request-id in for allocated devices10:05
gibijlejeune: i don't have the time to dig into it right now. But I think this is a valid bug that needs to be fixed10:12
opendevreviewribaudr proposed openstack/nova master: Reproducer for bug 2114951  https://review.opendev.org/c/openstack/nova/+/95289414:10
opendevreviewribaudr proposed openstack/nova master: Fix bug 2114951  https://review.opendev.org/c/openstack/nova/+/95289514:10
opendevreviewAndriy Kurilin proposed openstack/nova master: Reuse 'detail:get_all_tenants' policy in server get api  https://review.opendev.org/c/openstack/nova/+/95289614:33
sean-k-mooneythe request_id fucntionalty was added a very very long time ago14:57
sean-k-mooneyjlejeune: this feels like you skipped runnign an online data mighration14:58
sean-k-mooneyoh the one i was thinking of was a diffent fild https://github.com/openstack/nova/blob/master/nova/cmd/manage.py#L22114:59
sean-k-mooneyits the uuid on the device not the requst id15:00
jlejeunesean-k-mooney: exactly15:01
sean-k-mooneythe request id should alwasy be set in recent version of openstack15:02
sean-k-mooneyit was added 11 years ago15:03
elodillessean-k-mooney: sorry for pinging you directly but you already reviewed the patch on master branch o:) could you please take a quick look/review on the stable/* backports of this patch? https://review.opendev.org/q/I56705bce8ee4354cd5cb1577a520c2d1c525f57b15:03
jlejeunein recent version, yes it's filled, but not on stein release for example15:03
sean-k-mooneyjlejeune: it soudl have been there in stein too15:04
jlejeunesean-k-mooney: hm15:04
jlejeuneit's not the case a t all15:05
sean-k-mooneyelodilles: sure 15:05
elodillessean-k-mooney: o// thanks in advance!15:05
sean-k-mooneythe filed is nullable, when the reques is form a neuton port the request id is the neutron port uuid i belvie15:09
jlejeunesean-k-mooney: indeed, the request_id field has been added 11 years ago 15:09
jlejeuneI don't understand why it's null in my situation15:09
sean-k-mooneywhen the request sorce is the flavor i belive it is expecte dto be null but i woudl have to dig itno this in detail15:09
sean-k-mooneywell there is request id and requeter_id15:10
sean-k-mooneyyour speicificly asking about request.request_id)15:10
sean-k-mooneythe requester id is the one that hold the neutron port butalso backfil the request_id with a new uuid when we do that15:11
sean-k-mooneyjlejeune: is this a pci device related to a pci alis or a neutorn port15:13
jlejeuneto a pci alias15:15
sean-k-mooneyack so the alisas was the older usage of this field as far as i recal which is whwere we woudl least expect to need ot back file15:17
sean-k-mooneyjlejeune: this is where it shoudl be generated https://github.com/openstack/nova/blob/64ca204c9cf497b0dcfff2d3a24b0dd795a57d1d/nova/pci/request.py#L26115:20
sean-k-mooneyhttps://github.com/openstack/nova/commit/ccab6fed463337c029459469c76e92af3b96fa0615:20
jlejeunesean-k-mooney: ok, thanks for your commit, indeed I don't have backported it in my stein sources... that can explain15:28
sean-k-mooneyoh you did a feature backprot of the pic in placment code15:32
sean-k-mooneyya this was new in zed 15:32
jlejeuneho there are a lot of missing commits: https://review.opendev.org/q/topic:%22bp/pci-device-tracking-in-placement%2215:35
sean-k-mooneyit was a non triival feature which is why we did nto backport it downstream in redhat15:35
sean-k-mooneywell not the only reaon15:35
sean-k-mooneywe are actully still n the process of fully graducatign it to full support in our antelope based product15:36
sean-k-mooneywhile the final QE works is happeing its "tech preview" althoug it does work15:36
sean-k-mooneygibi: stephenfin: if ye have time for a very short patch please review https://review.opendev.org/c/openstack/nova/+/952306 to enable the memballon optimisation when you have time15:45
sean-k-mooneyUggla: and when you have time can you mark https://blueprints.launchpad.net/nova/+spec/automatic-memballoon-freeing as approved15:45
Ugglasean-k-mooney sure.15:46
opendevreviewMerged openstack/nova stable/2025.1: [tool] Fix backport validator for non-SLURP  https://review.opendev.org/c/openstack/nova/+/95196815:54
Ugglasean-k-mooney I speak a bit too fast. I tracked your SLBP in the etherpad document as approved. But atm I can't set it in launchpad, I'm lacking rights to do it. So I'll set all approved SLBP as soon as I can.16:07
gibisean-k-mooney: is this memballon bp was approved? or you will bring it up for approval on the next nova meeting?16:11
gibimy memories are vague16:11
Ugglagibi, I approved it.16:14
Ugglasorry I = we16:14
gibicool16:18
gibithanks16:18
opendevreviewElod Illes proposed openstack/osc-placement master: DNM: gate health test  https://review.opendev.org/c/openstack/osc-placement/+/95291316:23
gibisean-k-mooney: left some questions in the patch. The impl looks good btw.16:24
opendevreviewElod Illes proposed openstack/python-novaclient master: DNM: gate health test  https://review.opendev.org/c/openstack/python-novaclient/+/95292816:27
gibisean-k-mooney: FYI I think this will be the first use of the round robin placement a_c strategy out in the field https://bugs.launchpad.net/nova/+bug/211494716:31
opendevreviewStephen Finucane proposed openstack/nova master: api: Add response body schemas for images APIs  https://review.opendev.org/c/openstack/nova/+/95228416:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Separate volume, snapshot and volume attachments  https://review.opendev.org/c/openstack/nova/+/95234716:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Add response body schemas for volumes APIs  https://review.opendev.org/c/openstack/nova/+/95234816:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Add response body schemas for snapshots APIs  https://review.opendev.org/c/openstack/nova/+/95234916:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Add response body schemas for volume attachments APIs  https://review.opendev.org/c/openstack/nova/+/95235016:37
opendevreviewStephen Finucane proposed openstack/nova master: tests: Use valid UUIDs for cinder resources  https://review.opendev.org/c/openstack/nova/+/95293516:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Only apply "soft" additionalProperties validation to requests  https://review.opendev.org/c/openstack/nova/+/95293616:37
opendevreviewStephen Finucane proposed openstack/nova master: api: Correct expected errors  https://review.opendev.org/c/openstack/nova/+/95164016:41
sean-k-mooneygibi: di  it result in a bug or is it the fix for the bug16:46
sean-k-mooneyUggla: i can set it as approved if you want in launchpad. at least in this case16:47
sean-k-mooneyupdated https://blueprints.launchpad.net/nova/+spec/automatic-memballoon-freeing16:47
Ugglasean-k-mooney, no it is ok, when I will be able to do it, I will check all approved SLBP and set them accordingly.16:48
gibisean-k-mooney: they just hit the bug, I suggested to tune placement, we will see if this will help them16:51
sean-k-mooneygibi: ah you saying it can be fixed by it, i think16:51
gibithe reporter is pretty responsive so hopefully we get feedback16:51
sean-k-mooneycool16:52
sean-k-mooneyim not entirly sur if this is the same since it appears to hvae allcoated a device with the worng? resouce class16:52
gibiwhich reminds me that we still not have upstream testing with breadth-first strategy due to the blocked devstack patch I have to go back to 16:52
sean-k-mooneyoh they are not doing it properly16:53
gibisean-k-mooney: originally they had a config issue not having pci_in_placement enabled16:53
sean-k-mooneywel16:53
gibinow they enabled it and hitting timeout 16:53
sean-k-mooneythey seam to have both a cutom trait and resouce class16:53
sean-k-mooneydevice_spec = { "vendor_id":"10de", "product_id":"26b1", "address":"0000:4F:00.4", "resource_class":"CUSTOM_NVIDIA_RTX6000_ADA_48Q", "traits":"CUSTOM_NVIDIA_RTX6000_ADA_48Q", "managed":"no" }16:54
sean-k-mooneythey shoudl just be useign the reouces class not the triat16:54
sean-k-mooneyit wont break anything to have both16:54
gibiyeah16:54
gibiwe can follow up with that once they see it working16:55
sean-k-mooneybut the other thing they are trying to do that kind fo unsupproted16:55
sean-k-mooneyis allcoatign mutlipel vFs16:56
sean-k-mooneyso these are gpus16:56
sean-k-mooneyand they are suing pci passthough to use the VFs16:56
sean-k-mooneywhich i guess means they are trying to use the new way of doing vgpu that we added last cycle16:56
sean-k-mooneywith managed = no16:57
sean-k-mooneybut whiel you shoudl be able to have mulitpel devices in one vm16:57
sean-k-mooneywe have not actully tested that16:57
sean-k-mooneyi guess there is no really reason form a nova perspecitve why that woudl not work16:58
sean-k-mooneywe know it did o r does work for generic vfs16:58
sean-k-mooneyso the only reaosn for it not to work with gpus woudl eb due to a hardware/driver limitaiton16:58
sean-k-mooneyoh ok they are doign bad things16:59
sean-k-mooneyopenstack flavor create 8xRTX-ADA-48Q --private \16:59
sean-k-mooney  --ram 4096 --vcpu 4 --disk 0 \16:59
sean-k-mooney  --property "resources:CUSTOM_NVIDIA_RTX6000_ADA_48Q"=1 \16:59
sean-k-mooney  --property "trait:CUSTOM_NVIDIA_RTX6000_ADA_48Q"="required" \16:59
sean-k-mooney  --property "pci_passthrough:alias"="rtx6000-ada-48q:8"16:59
sean-k-mooneyopenstack flavor set --project admin 8xRTX-ADA-48Q16:59
sean-k-mooneythey are requesting 8 of the gpus VF via the alisa btu also orverwriting the resouce: and traits request16:59
sean-k-mooneythey shoudl not be settign 17:00
sean-k-mooney  --property "resources:CUSTOM_NVIDIA_RTX6000_ADA_24Q"=1 \17:00
sean-k-mooney  --property "trait:CUSTOM_NVIDIA_RTX6000_ADA_24Q"="required" \17:00
sean-k-mooneyor well the relevent lines form the same flavor17:00
opendevreviewMerged openstack/nova stable/2024.2: [tool] Fix backport validator for non-SLURP  https://review.opendev.org/c/openstack/nova/+/95196917:16
gibiyepp they should not set the rc and trait on the flavor just in the alias17:16
sean-k-mooneyi commented on the bug to that effect17:24
opendevreviewsean mooney proposed openstack/os-resource-classes master: Add VCPU_SHARES resource class for CPU performance tiering  https://review.opendev.org/c/openstack/os-resource-classes/+/95295121:50
opendevreviewMerged openstack/nova master: Fix disable memballoon device  https://review.opendev.org/c/openstack/nova/+/94562122:51

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!