Monday, 2025-02-24

mikalsean-k-mooney: I am unsure if I should recheck the final spice-direct patch now that melwitt's tempest fix has landed or not. Discuss.07:48
opendevreviewSasha Malberg proposed openstack/nova master: Update SHELVED VM az when attaching disk  https://review.opendev.org/c/openstack/nova/+/94256109:50
opendevreviewVasyl Saienko proposed openstack/nova master: Fix serial console for ironic  https://review.opendev.org/c/openstack/nova/+/94257511:38
opendevreviewAmit Uniyal proposed openstack/nova master: Reproducer for cold migration on shared storage  https://review.opendev.org/c/openstack/nova/+/94030412:02
opendevreviewVasyl Saienko proposed openstack/nova master: Fix serial console for ironic  https://review.opendev.org/c/openstack/nova/+/94257512:23
opendevreviewTakashi Kajinami proposed openstack/nova master: Use consistent program name for wsgi scripts and entry points  https://review.opendev.org/c/openstack/nova/+/94260515:35
opendevreviewTakashi Kajinami proposed openstack/nova master: Use consistent program name for wsgi scripts and entry points  https://review.opendev.org/c/openstack/nova/+/94260515:41
sean-k-mooneyUggla: can you appove this https://review.opendev.org/c/openstack/releases/+/941862 to do the final release of os-vif15:46
Ugglasean-k-mooney, I'll have a look right after the call and will approve it.15:48
sean-k-mooneyack15:48
sean-k-mooneyzull is still runing on it anyway 15:48
Ugglasean-k-mooney, +1 for me.16:13
sean-k-mooneycool thanks :)16:13
danfaihi, do you think there is a way to make nova aware of a change of the vif_type when live migrating? I'm currently testing live migrations between linuxbridge and OVS/OVN, which seems doable (for our case) but the bridge name is wrong16:17
sean-k-mooneydanfai: we do not allow changing the vif_type for port attached to a nova instnace16:46
sean-k-mooneydanfai: so you cannot change it form linxu bridge ot ovs/ovn16:46
sean-k-mooneythat is unsupproted16:47
sean-k-mooneythere are som hacky script to do that on the neutron side16:47
sean-k-mooneybut its offically unsuprpoted form a nova perspective16:47
pas-ha[m]Hi all, I'm interested in the status of the 'local disk encryption' work. https://review.opendev.org/q/topic:%22specs/yoga/approved/ephemeral-encryption-libvirt%22 . AFAIU it was kind of abandoned after that CVE flurry last summer... Are there any plans to continue woking on it? can I help with it somehow (rebasing, fixing comments etc)? cc melwitt 16:59
sean-k-mooneywe coudl add support for it but we might  need to chnage the multiple port bidning api respoce to incude vif_type17:03
* melwitt pas-ha[m]: hi, yes it has been postponed due to the CVE stuff and the nova team agreed some amount of refactoring of the existing disk image code should be done before we resume the disk encryption series. I have some proposal for the refactor part that I have not formally pushed yet but will be pushing in the near future. once that is reviewed and agreed, local disk encryption can be picked up again. I don't yet know if it will17:17
* melwitt be prioritized however17:17
sean-k-mooneypas-ha[m]: where we are with that is before adding that functionality there is a set of technial debt that we would like to clean up 17:23
sean-k-mooneybasically our concern is the CVE showed there is a lot of tech debt in the iamge backend code and we are concerned about making it even more complext by adding local disk encyption17:24
pas-ha[m]yeah, I bet at least one of those patches there is my own.. (the ISO one)17:25
sean-k-mooneyas melwitt noted the priority of the feature was downgraded and the piority fo the refactorign raised but so far we have not commited to when eitehr will be done17:25
sean-k-mooneypas-ha[m]: which iso one17:25
sean-k-mooneythere is at least one i was hoping to land this cycle17:25
pas-ha[m]is there any discussion /docs/aims on the planned refactor you could point me to?17:25
pas-ha[m]sean-k-mooney: https://review.opendev.org/c/openstack/nova/+/90961117:26
sean-k-mooneyah yep thats the one im +2 on. so im hoping melwitt or dan will have time to review that before FF or at least before RC117:26
pas-ha[m]bottom line - we are really interested in this feature, as the only proper alternative ATM for local ephemeral encryption is LVM and it starts to be a major operational PITA (no migrations/resize etc).. so if there's anything I could help with I'm all ears 🙂 17:29
melwittpas-ha[m]: to my understanding, the refactor will be basically the first 3 patches here (maybe just the first 2, will depend on review) https://review.opendev.org/c/openstack/nova/+/925635 obviously I need to rebase and clean up them and I'll write up a blueprint about it. if a spec is needed I will also write a spec17:29
pas-ha[m]thanks, will dive in17:30
melwittsean-k-mooney: ack I will look at the iso patch17:30
sean-k-mooneymelwitt: tldr it will make sure we attach isos as a cdrom rather then as a block device17:30
sean-k-mooneyits more subtle then that but that the main part17:31
melwittok, thanks17:32
pas-ha[m]fwiw, I recently tried to implement a cold migration / resize for LVM image backend... and it even kind of worked but was ugly AF... really not worth actually doing for sure17:32
melwittah, interesting17:34
sean-k-mooneypas-ha[m]: it used to wrok a very long time ago17:35
sean-k-mooneyit broke at some point but i have no idea when17:35
sean-k-mooneyi dont see why we woudl not supprot it for lvm again eventurlly17:36
sean-k-mooneywe just need to scp/rsync the data17:36
sean-k-mooneyit very similar to raw/qcow local storage migration17:36
sean-k-mooneywe have apis to do this17:36
sean-k-mooneyvia the remotefs dirvers17:37
sean-k-mooneythere is a copy image fucntion for this that lvm shoudl be implementing17:37
sean-k-mooneyor using17:37
sean-k-mooneythe lvm image backend dirver is the highest perfromign stroage option nova currently supports17:38
sean-k-mooneyespically for write intensive operations17:38
pas-ha[m]yeah, kind of what I did. the main problem is that I did not thought of a correct way of dd  | ssh | dd to skip the intermediate file, so I was dd to local file, scp to remote, on remote make a new lvm and dd local file into that.. 17:38
sean-k-mooneyyou do not need an a intermediate file17:39
sean-k-mooneydd directly to the remote lvm volume shoudl be enough17:39
sean-k-mooneythat is an optimistaion howver17:40
pas-ha[m]also, it seems the libvirt itself does support the migration with lvm, even live one, but for that this lvm must be managed by libvirt itself as a storage pool... which is to much of a rewrite for how nova manages local storage I guess..17:41
sean-k-mooneyi think you can live migrate without the storage pool 17:41
sean-k-mooneybut for cold migration it need it yes17:41
sean-k-mooneyand ya we have no plans to use libvirt sotrage pools in the future17:42
sean-k-mooneylibivrt is one of those things that is a doubel edge sword. it provides some useful abstration over qemu but at a cost of addtioanl complexity in the tech stack17:44
sean-k-mooneyfor the most part its been a win17:44
sean-k-mooneybut not alwasys17:44
melwitthi all, I need a second reviewer on this not-yet-released-regression fix (partial revert) related to tpool.Proxy + libvirt listDevices() if anyone could please look https://review.opendev.org/q/topic:%22bug/2098892%2217:57
sean-k-mooneyoh right i have reviewd that already so someone other then melwitt  or i17:58
sean-k-mooneybauzas: gibi  this impacts gpus and pci passthough so perhaps ye could find time dansmith may also have context since we chatted about it a little when it was found downstream17:59
sean-k-mooneymelwitt: thats needed on stable too yes?18:00
sean-k-mooneyi dont think we have done a release with it yet18:00
sean-k-mooneybut we sould avoid doing one until the latests patches are included18:00
melwittyes, it is needed on 4 stable branches also :(18:00
gibisean-k-mooney: I'm out of energy today, made a note to check it tomorrow 18:04
sshmlbrgsean-k-mooney: o/ Could u plz review this mr https://review.opendev.org/c/openstack/nova/+/942561 ?18:22
dansmithmelwitt: question hre: https://review.opendev.org/c/openstack/nova/+/942334/6/nova/virt/libvirt/host.py18:32
dansmithah, nevermind, it's in the commit message below the fold18:36
melwittthanks dansmith! 18:41
danfaisean-k-mooney: thanks for the response re vif_type change. sorry only saw the message now. what I wondered was during a migration, the target hypervisor could have a different model. from what I see nova picks up the new vif_type after the migration, but the bridge_name is at that time wrong18:48
sean-k-mooneyso it used to be possibale a long time ago to live migrate between linux bridge and ovs18:50
sean-k-mooneydanfai: and i had patches up to fix the bridge name18:51
sean-k-mooneybut i dont think neuton ever merged them18:51
sean-k-mooneyill see if i still have the fixes in gerrit18:51
danfaithanks, I saw some comments in the code base from you and mriedem :)18:52
sean-k-mooneyhttps://bugs.launchpad.net/neutron/+bug/178800918:52
sean-k-mooneywe fixed it for ml2/ovs18:52
sean-k-mooneybut ml2/ovn didnt populated it proeprly18:52
sean-k-mooneyi think that is still a gap in tehr migration tooling but it might have got fixed in the last year or two18:53
danfaioh, ok. I'll have a more detailed look on the neutron side then. We are still in yoga, so...18:53
sean-k-mooneydanfai: for what ite work codl migration shoudl still work i think18:54
sean-k-mooneyits been a long time since i tried to fix this18:54
mikalMorning18:54
sean-k-mooneydanfai: if your using vlan networking with linux breige yoru move will be less painful18:54
danfaicold migration works flawlessly indeed. also two live migrations (we only have provider networks though)18:54
sean-k-mooneydanfai: but just be aware ovn does not supprot vxlan or gre tunnes18:54
mikalsean-k-mooney: I feel like I should recheck the spice-direct final patch now that melwitt's tempest fix has landed. Do you see any reason to not do that?18:54
sean-k-mooneymikal:i already did18:55
sean-k-mooneydid it fail again18:55
sean-k-mooneymikal: i have started merging your code18:55
mikalsean-k-mooney: doh, I mismashed the refresh button because I am pre coffee.18:55
mikalNah, it passed.18:55
mikalUgh, arm passed.18:56
mikalI will be quiet now.18:56
sean-k-mooneymikal: its still in check i think18:56
sean-k-mooneyso it failed on the funtional jobs and weill need a recheck18:56
sean-k-mooneybut https://zuul.opendev.org/t/openstack/build/1098d7514ba843e28a5477c69892b176 the hybird plug job passed again18:56
danfaiyes, regarding vxlans, this was mentioned in the meeting we had with operators. 18:57
sean-k-mooneymikal: rechecks wont work until it repors back so lets just let it complete18:57
mikalsean-k-mooney: will do18:57
sean-k-mooneydanfai: the main issue iwht the migration scripts that neutron provides is geneve requires a small mtu on the client or extra overhead on the underlying network then vxlan18:58
danfaihmm, I thought there was more, like ID mappings. I'll add the MTU to the notes, thanks18:59
sshmlbrgsean-k-mooney: Hi! Did u see my request on review?18:59
sean-k-mooneydanfai: nova does not supprot updatign the mtu on conencted ports for what its worth19:00
sean-k-mooneydanfai: the mtu change wil only take effect after a hard reboot19:00
sean-k-mooneydanfai: https://github.com/openstack/neutron/commit/baaf240ce3f7802fe1431cc13913b9d93fc7f742 that is the ovn bridge name patch19:00
sean-k-mooneysshmlbrg: yep. we will need to add some nova functioanl tests to replciate the manual procedure you followed in the bug report19:01
danfaiThank you very much19:02
sean-k-mooneysshmlbrg: unit test are good to have but for operations like shelve we try to also have functional tests. and for bugs we prefer to create a functional test that show the broken behavior as an addtion patch and then have the fix as a second commit 19:02
sean-k-mooneysshmlbrg: without loading all the context im not sure that the finally block is corect19:03
sean-k-mooneysshmlbrg: you should not be modifying the az of the instnace directly19:04
sshmlbrgsean-k-mooney: ok, thanks. I will add functional tests as a first commit19:04
sean-k-mooneysshmlbrg: we do not really use the az colume in the cell database to store the az of the instance anymore.19:04
sean-k-mooneysshmlbrg: cool i would od it be creating a new file in https://github.com/openstack/nova/tree/master/nova/tests/functional/regressions19:05
sean-k-mooneyfor the bug ill see if i can quickly find an exmaple for you to copy19:05
sean-k-mooneysshmlbrg: perhaps https://github.com/openstack/nova/blob/master/nova/tests/functional/regressions/test_bug_2025480.py19:06
sshmlbrgsean-k-mooney: then will need to change the methods used and add the az argument to them19:06
sean-k-mooneyupdatign the object in memory may be ok its the save that is a little suspect19:07
sean-k-mooneywe have a temporay mutation context manager19:07
sean-k-mooneyi that we may want to condier19:07
sean-k-mooneywe can dicuss that on the review19:07
sshmlbrgsean-k-mooney: thank you for example! ok, sounds good19:08
sean-k-mooneyif you look at similar files in the same directory there shoudl be a boot form volume example19:08
sean-k-mooneyor other shelve example so it should not be too hard to create a repoducer following the steps in the bug19:09
sshmlbrgsean-k-mooney: i got it. I am gonna return with changes tomorrow. Have a nice day or evening :)19:11
sean-k-mooneysshmlbrg: https://github.com/openstack/nova/blob/375d95565d594182c5472f710b3f7bd53f0746d4/nova/tests/functional/regressions/test_bug_1675570.py#L2619:12
sean-k-mooneythat a better example19:12
sean-k-mooneysshmlbrg: o/19:13
sean-k-mooneysshmlbrg: that actully attaching a server to a shelve offloaded instnace19:14
sean-k-mooneythe reason its not failing is cross az attach is allowed by defult19:14
sean-k-mooneyso if you were to set that to false in yoru version i woudl eeepct the attach to fail and repoduce the bug :)19:14
sean-k-mooneymikal: https://zuul.openstack.org/status?change=924844&change=926876&change=926877 we will need to do a few rechecks... but we can kick thos off again shortly19:16
sshmlbrgsean-k-mooney: cool idea, I'll look into this19:16
sean-k-mooneysshmlbrg: let us know if you have issues there is a self.flags function aviable on the test case to set fconfig options19:17
sean-k-mooneyif you grep for flags you will see examples of how to use it19:17
sean-k-mooneybauzas: RuntimeError: detected leaked greenlets in nova.tests.functional.libvirt.test_vgpu.VGPUMultipleTypesTests.test_create_servers_with_specific_type19:19
sean-k-mooneyi dont know how often tthat fails but that ocatinal does19:20
sean-k-mooneymikal: ^ that the error in the funtional test on the third patfch so its not related ot your code19:21
sean-k-mooneyUggla: https://review.opendev.org/c/openstack/releases/+/941862 need your +1 again i updated the version to 4.1.0 based on elodilles comment on the release channel19:24
mikalsean-k-mooney: are you going to do the rechecks or should I?19:25
sean-k-mooneyeither works ill be finsihing in the next 20 mins or so but ill preodiclly check in19:26
mikalWell, we'19:26
mikalWell, we're still waiting for them to finish right now anyways.19:26
sean-k-mooneyya. if multiple people recheck it wotn kick it out of check19:27
sean-k-mooneyi dont it migh kick it out of gate i dont recally so we should be carful in that case19:27
sean-k-mooneymikal: over all the results look pretty green the nova-grenade-multinode failure in gate on the first patch was a kernel panic19:29
sean-k-mooneythat not related to yoru code so just bad luck19:29
sean-k-mooneymikal: sometime rechecking at a less busy time help with things like that but its a bug in our cirros image we think19:30
mikalsean-k-mooney: without checking, I feel like cirros hasn't done a release in ages?19:31
mikalHuh, more recent than I expected. September last year.19:31
sean-k-mooneyits been a while there is a 6.x branch based on ubuntu 22.04 or maybe 24.0419:31
mikalThe author now works at chainguard IIRC.19:32
sean-k-mooneyi dont recall if they have compelte the rebase on the 24.04 kernel yet19:32
sean-k-mooneywe have been considering other distos for testign but few if any really meet the min ram requirements we have19:32
sean-k-mooneygento recently started providign cloud-images iwth cloud ini but they are like 930+ MBs19:33
sean-k-mooneyvs the like 20-50mb of a cirros image19:33
mikalI have been toying with the idea of a specialised UEFI binary for SPICE testing and skipping the OS entirely. I guess it depends what functionality you need from the instance.19:33
mikalAlso, I have way more random tangents than I will ever have time to persue.19:34
sean-k-mooneyya so ironic were not supper ok with the idea of alpine because they worried about he use of musle libc19:34
sean-k-mooneynova does not really care about that which is why i was prposin git19:34
sean-k-mooneymikal: for tempest we need a small usbset of utils19:35
sean-k-mooneytoybox/busybox mostly covers it19:35
sean-k-mooneyalthough i belive we use gnu core utils version in cirros today19:35
sean-k-mooneymikal: i think frickler has been somehwat invovled in cirros mantaince lately but im not reallly sure19:36
sean-k-mooneyfrickler: do you know if they are still working on a 0.7.x release with 24.04 kernel and package rebase?19:37
mikalsean-k-mooney: they certainly did the most recent release.19:37
sean-k-mooneymikal: i know we are hoping to do a release with https://github.com/cirros-dev/cirros/pull/122 after epoxy is released19:39
sean-k-mooneybefore would also be fine but as of expoxy nova will be able to create vms with igb nic which supprot emulated sriov in the guest19:39
sean-k-mooneymikal: we are hopping to be able to do some sriov tesing in ci once our providers are capabliy fo supproting that19:40
sean-k-mooneyit will be a while before thats realsitc however19:40
Ugglasean-k-mooney, elodilles, https://review.opendev.org/c/openstack/releases/+/941862   +1 again.20:31
fricklersean-k-mooney: I'm not aware of anyone actively working on cirros, but I could do a bit if needed. that PR should be in 0.6.3 though?20:36
opendevreviewLuis Enrique Morales Mendez proposed openstack/nova master: Add 'LOG' to archive_deleted_rows function  https://review.opendev.org/c/openstack/nova/+/94244120:39
opendevreviewMerged openstack/nova master: Reproducer for bug 2098892  https://review.opendev.org/c/openstack/nova/+/94240221:25
sean-k-mooneyfrickler: 0.6.3 is using hte latest (at the time) 22.04 hwe kernel21:57
sean-k-mooneyfrickler: but not 24.0421:57
sean-k-mooneyfrickler: unless you ment the igb change which might be in 0.6.321:57
opendevreviewMerged openstack/nova master: libvirt: Fix regression of listDevices() return type  https://review.opendev.org/c/openstack/nova/+/94233423:25
opendevreviewmelanie witt proposed openstack/nova stable/2024.2: Reproducer for bug 2098892  https://review.opendev.org/c/openstack/nova/+/94265123:36
opendevreviewmelanie witt proposed openstack/nova stable/2024.2: libvirt: Fix regression of listDevices() return type  https://review.opendev.org/c/openstack/nova/+/94265223:36
opendevreviewmelanie witt proposed openstack/nova stable/2024.1: Reproducer for bug 2098892  https://review.opendev.org/c/openstack/nova/+/94265323:37
opendevreviewmelanie witt proposed openstack/nova stable/2024.1: libvirt: Fix regression of listDevices() return type  https://review.opendev.org/c/openstack/nova/+/94265423:37
opendevreviewmelanie witt proposed openstack/nova stable/2023.2: Reproducer for bug 2098892  https://review.opendev.org/c/openstack/nova/+/94265523:38
opendevreviewmelanie witt proposed openstack/nova stable/2023.2: libvirt: Fix regression of listDevices() return type  https://review.opendev.org/c/openstack/nova/+/94265623:38

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!