Wednesday, 2024-07-10

*** bauzas_ is now known as bauzas01:05
*** bauzas_ is now known as bauzas03:02
opendevreviewmelanie witt proposed openstack/nova-specs master: Propose config option for unified limits of zero  https://review.opendev.org/c/openstack/nova-specs/+/92380703:38
opendevreviewmelanie witt proposed openstack/nova master: Config option to treat unified limit of zero as unlimited  https://review.opendev.org/c/openstack/nova/+/92381305:55
opendevreviewmelanie witt proposed openstack/nova master: Config option to treat unified limit of zero as unlimited  https://review.opendev.org/c/openstack/nova/+/92381305:57
*** bauzas- is now known as bauzas06:00
opendevreviewRobert Hoffmann proposed openstack/nova master: Fix: clean up volume attachments  https://review.opendev.org/c/openstack/nova/+/92364606:56
opendevreviewAbhishek Kekane proposed openstack/nova master: DNM: Test glance new location api  https://review.opendev.org/c/openstack/nova/+/89120708:43
fricklersean-k-mooney: did you have a chance to look at the failures on the iso stable backports yet? the grenade failure seems unrelated, I'm more worried about the barbican one08:59
sean-k-mooneyfrickler: not yet but ill do that shortly, thanks for the heads up08:59
sean-k-mooneyfrickler: im not really sure how any fo this could be related to barbican given we do not currently have any encyped local storage support in nova09:00
sean-k-mooneyat least not in the context of barbican09:00
sean-k-mooneyJayF: they do not need to be seperate i just dont blan to work on the metadta feature this cycle. so if its ok with you can you squash your changes into min and take over that patch?09:01
sean-k-mooneyfrickler: oh interesting i wonder if that was failing in the backport of the orginal cve fix09:03
fricklersean-k-mooney: well this does sound somehow related to me, maybe not your patch but one of the other cve fixes? though not sure why it would only be failing sometimes? "Image ... is unacceptable: Image content does not match disk_format"09:03
fricklerhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f2e/923724/1/check/barbican-tempest-plugin-simple-crypto/f2e7279/testr_results.html09:03
sean-k-mooneyits a non voting job and we merged those quite quickly09:03
sean-k-mooneyyep it sounds related but i wonder if this is on the glance side or the nova side09:04
sean-k-mooneygiven its a snapshot related test09:04
sean-k-mooneyim going to look into the logs now and see09:04
fricklerwell that msg is in a 500 response to create_server afaict09:05
sean-k-mooneyya so maybe we have an issue with bootign form a snapshot or somehting with the image signing09:05
sean-k-mooneyso i think the instance has a cidner volume associated with it i wonder if its on nfs09:11
sean-k-mooneyok no so cinder is configure for lvm09:15
sean-k-mooneyif it was nfs i know there are specific snapshot related issue where the glance and actual image format can differr09:15
sean-k-mooneyfrickler: i may need to add some more debug logging which might be useful in general09:16
sean-k-mooneynova.exception.ImageUnacceptable: Image 93f91dc1-8c6f-416e-ab0d-abab96d1d03e is unacceptable: Image content does not match disk_format09:17
sean-k-mooneyis well an good but it would be nice if it printed the requested and actual format09:17
sean-k-mooneyits possibel that this si a bug in the barbican plugin too so ill look at the test09:19
sean-k-mooneyfrickler: so it using a uec image file img_file = /opt/stack/devstack/files/cirros-0.6.2-x86_64-uec.tar.gz09:21
sean-k-mooneyhttps://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_f2e/923724/1/check/barbican-tempest-plugin-simple-crypto/f2e7279/controller/logs/tempest_conf.txt09:21
sean-k-mooneythe image format is not set in the config09:22
sean-k-mooneyand that default to qcow https://github.com/openstack/tempest/blob/master/tempest/config.py#L118309:23
sean-k-mooneyfrickler: so the nova code is correctr the imnage is not what it says it is 09:23
sean-k-mooneyfrickler: this is because we are overriding the image https://github.com/openstack/nova/blob/stable/2024.1/.zuul.yaml#L92809:28
fricklerbut why doesn't it fail 100%, then? otoh maybe we don't want to care too much about barbican anyway, I just saw that the barbican-grenade job was last passing 2y ago ...09:28
sean-k-mooneyfrickler: i need to backport https://github.com/openstack/nova/commit/eed3e2b47ffea24d08ad7a85a4e9c36ef56d815e first09:29
sean-k-mooneyfrickler: i think this is the first backprot that has run on that branch sicne we merge the check09:29
sean-k-mooneythe previous two in the seriese did not trigger the barbican job since they were test only09:29
sean-k-mooneyill cherry-pick https://github.com/openstack/nova/commit/eed3e2b47ffea24d08ad7a85a4e9c36ef56d815e back and see if that passes if it does then we can proceed with that and recheck the iso backport09:30
opendevreviewsean mooney proposed openstack/nova stable/2024.1: Stop using split UEC image (mostly)  https://review.opendev.org/c/openstack/nova/+/92383109:38
sean-k-mooneyfrickler: ok i expect that should pass, if it does we only need that in 2024.1 as that was the only release use used uec images to work around gate insablitiy09:38
sean-k-mooneythey didnt actully fix anything they just chagne the timing slightly which looks like panics happend less often but they still happend so we went back to useing qcow a few months ago09:40
fricklerah, ok, that explains why neither master nor older stable branches were affected09:43
sean-k-mooneyya we ment to revret that after FF but before the final RC and didnt get around to it09:44
sean-k-mooneyby revert i mean do what the patch im now backporting does of keep one job with uec images for coverage but go back to qcow for the rest09:44
elodillessean-k-mooney: sorry for the late response, so about the skip test patch: it's OK to have it separately, maybe a 'related-bug' would be good, but it is good as it is10:07
sean-k-mooneyelodilles: ack thanks i can add that to the backports sure10:25
sean-k-mooneyi think it merged on master lastnight and i have not created the cherry picks yet so ill do that shortly10:26
elodilles+110:28
mnaserelodilles: sorry to ping here but I dont se you in glance :) do you mind kicking https://review.opendev.org/c/openstack/glance/+/923310 in and I can work on getting the rest to pass and ready to +W13:59
elodillesmnaser: let me answer at the glance channel o:)14:30
*** bauzas_ is now known as bauzas14:55
melwittsean-k-mooney: this is lame and late but fyi I proposed the spec about a config option for being able to treat missing or zero unified limits for resources as unlimited https://review.opendev.org/c/openstack/nova-specs/+/92380715:58
melwittand super basic WIP code https://review.opendev.org/q/topic:%22bp/unified-limits-nova-zero-as-unlimited%2215:59
*** bauzas_ is now known as bauzas16:33
*** bauzas_ is now known as bauzas17:28
sean-k-mooneymelwitt: :) better late then never, do you think it could happen thsi cycle or should we premetivly mvoe that to 2025.117:51
sean-k-mooneyor backlog i guess17:51
melwittsean-k-mooney: I think this cycle. unless I'm massively missing something about it 😛 17:54
sean-k-mooneythats more a question of do you think we cna do that and your other feature but if you think we can review both im not agiasnt doign that now17:55
sean-k-mooneyi would love to see us make unifed limits the default in 2025.117:55
sean-k-mooneyso making incremental progress to that is good17:55
melwittthe WIP code is all it should take but perhaps people will find other issues about it17:55
sean-k-mooneyoh cool i have not looked at that btu the fact it exists is promising17:56
sean-k-mooneyyoru not refering to the oslo.limits poc right, you have a nova poc for this17:57
sean-k-mooneyah https://review.opendev.org/c/openstack/nova/+/92381317:57
melwittit should be only two patches, one for nova-manage limits migrate_to_unified_limits (to make it give more information like advice on what limits need to be created after running it) and one patch to check the config option and consider none/zero limit as unlimited if configured17:57
sean-k-mooneyhum that is not quite what i was expecting but it proably would work17:58
sean-k-mooneyi have not read the spec yet but i was expecting somethign like, ask keystone for all limit it supprot then check only those limits17:59
melwittok, if you can add a comment on the spec or patch I can potentially change it. I proposed what first came to my mind17:59
sean-k-mooneyinsead of "oh i got zero ignore if config option is set"17:59
melwittah ok. that might work too, have to try it18:00
sean-k-mooneyso i guess there is only one edge case that matters18:00
sean-k-mooneywhat happens in these two cases18:00
sean-k-mooneyi recuest a resouce and no limit is defied18:01
sean-k-mooneyi request a resouce and its defied as 018:01
sean-k-mooneywith your code i think those will be treated the same 18:04
sean-k-mooneybut im not sure if they should be18:04
sean-k-mooneyimo i think no limit defiend shoudl be unlimeted and limit defied as 0 mean you can not use it.18:06
sean-k-mooneybut i know others had different views18:06
sean-k-mooneymelwitt: ill refect on your spec but not tonight, probaly friday18:08
sean-k-mooneybut thats what my intition expects i just dont recall all the context18:08
sean-k-mooneyif we take the approch im suggesting im not even sure it requries a cofnig option beyond the quota driver18:09
sean-k-mooneysince registering the limits with 0 or any value starts enforcing that limit18:10
sean-k-mooneyalthough i could see use wanting this to be explict too in case anywone is currently using the unifed limits driver in production18:11
opendevreviewsean mooney proposed openstack/nova stable/2024.1: fix qemu-img version dependent tests  https://review.opendev.org/c/openstack/nova/+/92387818:32
melwittsean-k-mooney: sorry, was on a call. yeah, I hear you. where I'm coming from ultimately is I'm concerned about the fallout potentially of when we switch the default to unified limits. I keep envisioning the scenario where as soon as they install or upgrade, all server creates start failing because they forgot or missed setting one limit18:51
melwittso at the core of it I want to prevent operator pain and I'm sure there's other ways to handle it that I haven't thought of18:52
sean-k-mooneyyep i want systems to more or less "just work" when we switch but i dont think we shoudl force peole to define limtis for things that previously coudl not have a limit18:55
sean-k-mooneyso that why im usggeting treating no registered limit as unlimited and registered with 0 as not allowed18:56
sean-k-mooneyor over quota when you try ot request one of that thing18:56
sean-k-mooneythe problem of couse if if we do that when we change the default everythign becomes unlimited18:57
sean-k-mooneyunless you run the migration tool18:57
melwittsean-k-mooney: ok, yeah that makes sense18:57
melwittyeah :/18:57
sean-k-mooneyi think thats where the idea was if any limti is regesterd18:57
sean-k-mooneythen we know they have started configuring it18:57
melwittah, right18:57
sean-k-mooneyso there are 3 cases. no limits at all -> they didnt migrate18:58
sean-k-mooneythere are some registered limits -> only those resocue classes are enforced18:58
sean-k-mooneyall resources classes in the request have limits -> all are enforced18:58
melwittthat makes sense18:59
sean-k-mooneyyour patch treats no limit and limit of 0 as the same. so that is simpler then i was suggesting above19:03
sean-k-mooneyim not sure if that is enough or if we need the extra complexity of above19:03
melwittsean-k-mooney: it's likely oversimplified ... I thought of it because oslo.limit literally returns 0 for no limit, and doing something smarter will need additional keystone API calls. but not being able to differentiate between intentionally 0 and not is probably too restricting19:16
sean-k-mooneyi think an extra call while not ideall is ok19:33
sean-k-mooneyespciclaly if its opt in vai a cofnig option and you can turn it off once your fully upgraded19:33
sean-k-mooneyits like the placement fallback query19:33
sean-k-mooneyfor pcpus as vcpus19:33
melwittyeah. although making it opt in means the default behavior will be the "if you missed anything everything fails until you fix it"19:35
sean-k-mooneyi was thinking opt out for this release and deprecate it next release then drop it in 2025.219:36
sean-k-mooneyso thise release old quota driver by default and opt out if you use unified limit19:37
sean-k-mooneynext release default to unifed limits and deprecat the extra option in the slurp19:37
sean-k-mooney2025.2 or later drop the config and extra query19:38
sean-k-mooneythat makes 2025.1 the cross over release where most will upgrade to unified limtis19:38
sean-k-mooneyim not sure if that is too fast or not19:38
melwitthm, yeah. not sure19:40
sean-k-mooneydo you think it would take much to poc the limt checking approch like you did with the 0 as unlimited patch20:29
sean-k-mooneyi.e. with the extra api call so we can compare and contrast20:29
sean-k-mooneyim not saying no to what you propsoed but im wondering how easy it is to do one vs the other20:30
melwittsean-k-mooney: sure I can do that. not a problem at all20:36
opendevreviewsean mooney proposed openstack/nova stable/2024.1: fix qemu-img version dependent tests  https://review.opendev.org/c/openstack/nova/+/92387822:39
opendevreviewsean mooney proposed openstack/nova stable/2023.2: fix qemu-img version dependent tests  https://review.opendev.org/c/openstack/nova/+/92390722:40
opendevreviewsean mooney proposed openstack/nova stable/2023.2: fix qemu-img version dependent tests  https://review.opendev.org/c/openstack/nova/+/92390722:40
opendevreviewsean mooney proposed openstack/nova stable/2023.1: fix qemu-img version dependent tests  https://review.opendev.org/c/openstack/nova/+/92390822:41
*** haleyb is now known as haleyb|out22:50
*** bauzas_ is now known as bauzas23:00
opendevreviewJay Faulkner proposed openstack/nova master: [ironic] Factor out metadata and  send to ironic  https://review.opendev.org/c/openstack/nova/+/92391023:03
JayFsean-k-mooney: ^ the new unified patch, it's passing unit tests locally but I haven't reviewed it from a gerrit-perspective yet as it's my EOD (I won the race to get the tests passing :D)23:03
opendevreviewJay Faulkner proposed openstack/nova master: [ironic] Factor out metadata and  send to ironic  https://review.opendev.org/c/openstack/nova/+/92391023:05

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!