*** mhen_ is now known as mhen | 01:49 | |
aravindh7murugesan | sean-k-mooney: The issue I was talking about yesterday about the VM not booting is only if I set the VM to UEFI boot mode with image meta. Its working fine with BIOS mode. Is there anything specific for GPU passthrough and UEFI I should take care of? | 06:38 |
---|---|---|
*** tosky_ is now known as tosky | 07:33 | |
opendevreview | Pierre Riteau proposed openstack/nova master: doc: Fix typo in nova-manage command https://review.opendev.org/c/openstack/nova/+/955566 | 07:58 |
opendevreview | Biser Milanov proposed openstack/nova master: StorPool: Pass the instance UUID and device_name to os-brick https://review.opendev.org/c/openstack/nova/+/930297 | 10:01 |
opendevreview | Biser Milanov proposed openstack/nova stable/2024.1: StorPool: Pass the instance UUID and device_name to os-brick https://review.opendev.org/c/openstack/nova/+/954115 | 10:02 |
sean-k-mooney | aravindh7murugesan not that i know of specificly but apprently yes | 10:09 |
opendevreview | Florian proposed openstack/nova master: Add check for PCIe devices attach limit for volume and ports https://review.opendev.org/c/openstack/nova/+/955584 | 11:20 |
opendevreview | Florian proposed openstack/nova master: Add check for PCIe devices attach limit for volume and ports https://review.opendev.org/c/openstack/nova/+/955584 | 12:58 |
opendevreview | Merged openstack/nova master: api: Add response body schemas for server password APIs https://review.opendev.org/c/openstack/nova/+/945736 | 13:48 |
opendevreview | Dan Smith proposed openstack/nova master: Remove eventlet timer from multi_cell_list https://review.opendev.org/c/openstack/nova/+/954990 | 14:27 |
noonedeadpunk | fwiw, tunnelled migrations are around 20-30% faster then native ones with TLS enabled... | 15:01 |
bauzas | folks, I need to skip today's upstream nova meeting | 15:24 |
bauzas | can someone run it ? | 15:24 |
bauzas | gibi ? | 15:24 |
gibi | bauzas: I can run it but I will run it with a quick "does anybody has anything to raise to the team" style | 15:29 |
gibi | and if silence then I will not iterate the agenda | 15:29 |
jssfr | I have two questions, which may be answerable outside the team meeting: (a) the AMD SEV-ES patches are at the top of the review list, is there anything we as a contributing company (not to that patchset specifically, but in general) can do to help? | 15:33 |
jssfr | (b) I pointed out the loss of vTPM state upon start/stop of an instance the other day and I have something which resembles a patch. Do I need to file an issue first, or is it enough if I submit a patchset to gerrit? | 15:33 |
bauzas | thanks gibi | 15:34 |
dansmith | jssfr: B will probably warrant a spec | 15:34 |
gibi | jssfr: a) honestly I don't have a good answer for that. Uggla noted that it is ready for review and even pinged us downstream about it. I'm think most of our cores are busy with other review promises. Keep pinging us (I know it is tiring) | 15:34 |
jssfr | dansmith, uh-uh, how so? | 15:34 |
jssfr | gibi, oh I am fine with pinging, I just don't want to come off as annoying. :-) | 15:35 |
dansmith | jssfr: IIRC there are some gotchas around both vTPM and NVRAM state that cause us to dump them at certain times, and there's an overlap with the planned/ongoing vTPM live migration work | 15:35 |
jssfr | okay, that's good to know. fwiw, the patch is basically just passing VIR_DOMAIN_UNDEFINE_KEEP_TPM if destroy_secrets == False, which seems logical enough to me. | 15:35 |
jssfr | maybe it makes sense to submit the patch first and then you can still tell me it needs a spec :-). | 15:36 |
jssfr | (always easier to talk about an actual diff than about a hypothetical one, in my experience) | 15:36 |
gibi | jssfr: you need to find the balance of being seen but not beeing annoying :) Anyhow I can tell you we are aware of your patches waiting for review | 15:36 |
jssfr | (not my patches, but thanks :)) | 15:36 |
dansmith | jssfr: there's no reason not to push the code. ever. | 15:36 |
gibi | (don't push a code if you feel it is a security bug :) | 15:37 |
dansmith | gibi: dammit gibi :) | 15:37 |
jssfr | fair point :D | 15:37 |
gibi | otherwise I'm fully agree with dansmith of course :) | 15:38 |
jssfr | thanks for the feedback :) | 15:38 |
gibi | #startmeeting nova | 16:00 |
opendevmeet | Meeting started Tue Jul 22 16:00:47 2025 UTC and is due to finish in 60 minutes. The chair is gibi. Information about MeetBot at http://wiki.debian.org/MeetBot. | 16:00 |
opendevmeet | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 16:00 |
opendevmeet | The meeting name has been set to 'nova' | 16:00 |
fwiesel | o/ | 16:00 |
sp-bmilanov | o/ | 16:01 |
dansmith | o/ | 16:01 |
gibi | Uggla is on a well deserved PTO and his stand in cannot be here today. So I agreed to run something like a nova meeting but I'm not prepared. | 16:01 |
gibi | lets wait a bit to see if other cores join but so far we are low on quorum | 16:02 |
elodilles | o/ | 16:02 |
gmaan | o/ | 16:02 |
gibi | (an I have a dinner invitation that helps me prioritize) | 16:03 |
gibi | lets get roling | 16:03 |
gibi | #topic Bugs (stuck/critical) | 16:03 |
gibi | any fresh critical bug we need to look at? | 16:03 |
gibi | the on one the agenda https://bugs.launchpad.net/nova/+bug/2116852 is not critical any more as we disable the single tempest test that caused the blockade | 16:04 |
gibi | I filed the upstream bug to Ceph https://tracker.ceph.com/issues/72203 they are silent so far. I have way to ping them downstream which I will use next week if no reaction on the upstream tracker | 16:05 |
gibi | any other critical adjacent bug? | 16:05 |
gibi | #topic Gate status | 16:05 |
gibi | any issues with our gate? | 16:05 |
gibi | I'm not tracking anything major on my side at least | 16:06 |
gibi | #topic tempest-with-latest-microversion job status | 16:06 |
gibi | it is red | 16:06 |
gibi | :) | 16:07 |
gibi | - Failed: 27 | 16:07 |
gibi | I have no other info. Anybody wants to comment? | 16:07 |
gmaan | yeah, my last fix is still not merged but that make 6 more test green and 21 still failing | 16:07 |
gmaan | I did not get chance to continue this one. | 16:07 |
gibi | gmaan: cool. Thanks | 16:07 |
gibi | #topic Release Planning | 16:08 |
gibi | #link https://releases.openstack.org/flamingo/schedule.html | 16:08 |
gibi | anybody has any comment here? | 16:08 |
gibi | #topic Review priorities | 16:08 |
gibi | #link https://etherpad.opendev.org/p/nova-2025.2-status | 16:08 |
gibi | any comments? | 16:09 |
gibi | #topic OpenAPI | 16:09 |
gibi | #link: https://review.opendev.org/q/topic:%22openapi%22+(project:openstack/nova+OR+project:openstack/placement)+-status:merged+-status:abandoned | 16:09 |
gibi | 16 open patches mostly stuck on gate or waiting for rebase | 16:10 |
gibi | any comments? | 16:10 |
gibi | #topic Stable Branches | 16:11 |
gibi | elodilles: give us what you have! | 16:11 |
elodilles | ACK :) | 16:11 |
elodilles | actually state is pretty same as last week | 16:11 |
elodilles | stable branches seems healthy | 16:11 |
elodilles | and stable releases are waiting for release liaisons | 16:11 |
elodilles | gibi: back to you | 16:12 |
gibi | who are our liaisons to ping? | 16:12 |
* gibi feels bad not knowing | 16:13 | |
elodilles | Uggla Amit and Sylvain | 16:13 |
gibi | bauzas: ^^ auniyal ^^ | 16:13 |
gibi | please look at the stable release requests | 16:14 |
gibi | #topic vmwareapi 3rd-party CI efforts Highlights | 16:14 |
gibi | fwiesel: any news? | 16:14 |
fwiesel | Hi, nothing from my side. | 16:14 |
gibi | fwiesel: OK cool | 16:14 |
gibi | #topic Gibi's news about eventlet removal. | 16:14 |
gibi | hey thats me :) | 16:14 |
gibi | we are slowly landing patches from the scheduler series | 16:15 |
gibi | I got a nice set of reviews from bauzas on the doc patch. I have to go back there and touch up the doc | 16:15 |
gibi | sambork's patch https://review.opendev.org/c/openstack/nova/+/949754 logic looks good to me but I found some extra cleanup pieces and a bit of test issues | 16:16 |
gibi | and I'm following Dan's series starting https://review.opendev.org/c/openstack/nova/+/954990/4 | 16:17 |
gibi | I still have the intention to go back making our unit tests run with threading | 16:18 |
gibi | that is it | 16:18 |
gibi | #topic Open discussion | 16:18 |
gibi | (sp-bmilanov) Bug #2092391: duplication instances when nova compute service restart: https://bugs.launchpad.net/nova/+bug/2092391 | 16:18 |
sp-bmilanov | hi :) | 16:18 |
gibi | I guess this is a review request for https://review.opendev.org/c/openstack/nova/+/938223 | 16:19 |
gibi | am I correct? | 16:19 |
sp-bmilanov | not exactly | 16:19 |
gibi | ohh | 16:19 |
gibi | then tell us :) | 16:19 |
sp-bmilanov | I wonder if it would be better to bring this up when more core people are around but still -- we hit this bug recently and it was not due to a graceful Nova agent shutdown | 16:20 |
gibi | what was the trigger? | 16:20 |
sp-bmilanov | the tldr; is that during a migration, if a nova-agent crashes at the correct moment, it is possible to have the same VM running on the source and destination hypervisor | 16:20 |
dansmith | I think it has already been noted on that bug that nova-compute doesn't really have any graceful shutdown support, and what the bug describes during a live migration is pretty much expected at the moment | 16:21 |
gibi | even with graceful shutdown a crash would not be handled | 16:22 |
sp-bmilanov | dansmith: right, I read Sean's comment as "it is not supported to ask nova-compute to shutdown during live migration" | 16:22 |
dansmith | gibi: I think the problem is likely on restart we re-activate the instance | 16:22 |
gibi | so I guess we need a solution where a compute starting up can fix the situation | 16:22 |
gibi | dansmith: yeah | 16:22 |
sp-bmilanov | yes, as gibi said, it's about when it crashes | 16:22 |
dansmith | and the review mentioned above would only be the non-crash situation | 16:23 |
gibi | yepp | 16:23 |
gibi | I feel that nova-compute during statup can be smarter about this to remove the VM duplication | 16:23 |
sp-bmilanov | the VM was seen in an error state after the nova-compute got back up because of a mismatch in what libvirt was reporting and the contents of the nova DB | 16:24 |
sp-bmilanov | a teammate suggested it would be better to have this as an separate error state which has more obstacles to get around until you are able to start the VM again | 16:25 |
sp-bmilanov | else nova-compute recreates the libvirt domain on VM start on the source hypervisor | 16:26 |
gibi | whichever compute puts the VM to error could be smarter and try to abort the live migration I guess | 16:26 |
dansmith | any solution for this is going to be something we need to document as a spec I think, because there are a lot of factors in play.. it's hard to know what to do when a live migration fails and in the past, we've basically said "lean on the operator to clean it up" | 16:26 |
gibi | dansmith: make sense | 16:27 |
gibi | it is complicated | 16:27 |
dansmith | preventing nova from re-starting on startup if it's not sure is good, but barriers to prevent the user from doing something bad are part of the complexityt | 16:27 |
gibi | I agree | 16:28 |
gibi | Also having a spec would force us to load context around this codepath (I don't have it loaded) | 16:28 |
gibi | sp-bmilanov: could you draft a spec even if it is just the problem statement with more details about what exactly happening and why | 16:29 |
dansmith | +1 | 16:29 |
gibi | I think that would help us brainstorming on a list of potential solutions | 16:29 |
sp-bmilanov | sure can | 16:30 |
gibi | cool. thanks. | 16:30 |
sp-bmilanov | thanks gibi dansmith! | 16:31 |
gibi | Is there anything else to discuss? | 16:31 |
gibi | then thanks for joining today. Next week we will have Uggla back. | 16:32 |
gibi | #endmeeting | 16:32 |
opendevmeet | Meeting ended Tue Jul 22 16:32:35 2025 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 16:32 |
opendevmeet | Minutes: https://meetings.opendev.org/meetings/nova/2025/nova.2025-07-22-16.00.html | 16:32 |
opendevmeet | Minutes (text): https://meetings.opendev.org/meetings/nova/2025/nova.2025-07-22-16.00.txt | 16:32 |
opendevmeet | Log: https://meetings.opendev.org/meetings/nova/2025/nova.2025-07-22-16.00.log.html | 16:32 |
elodilles | thanks o/ | 16:32 |
gmaan | o/ | 16:33 |
fwiesel | o/ | 16:36 |
opendevreview | Merged openstack/nova master: Rename DEFAULT_GREEN_POOL to DEFAULT_EXECUTOR https://review.opendev.org/c/openstack/nova/+/948086 | 16:47 |
opendevreview | Merged openstack/nova master: Make the default executor configurable https://review.opendev.org/c/openstack/nova/+/948087 | 16:48 |
opendevreview | Merged openstack/nova master: api: Add response body schemas for server group APIs https://review.opendev.org/c/openstack/nova/+/952281 | 18:13 |
mikal | Heya. https://review.opendev.org/q/topic:%22libvirt-vdi%22 has been sitting with a single set of +2s for a couple of weeks. Does anyone have a minute to take a look at them please? | 19:25 |
opendevreview | Merged openstack/nova master: api: Address issues with server group APIs https://review.opendev.org/c/openstack/nova/+/953281 | 21:14 |
opendevreview | Ghanshyam proposed openstack/nova master: Add project manager role in Nova API policy rule https://review.opendev.org/c/openstack/nova/+/953063 | 21:49 |
Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!