16:00:02 #startmeeting nova 16:00:02 Meeting started Tue Apr 25 16:00:02 2023 UTC and is due to finish in 60 minutes. The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:02 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:02 The meeting name has been set to 'nova' 16:00:13 hey everyone 16:00:17 o/ 16:00:21 o/ 16:00:22 we are quite busy today so let's try to be quick 16:00:26 o/ 16:00:33 o/ 16:00:37 #link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting 16:00:46 #topic Bugs (stuck/critical) 16:00:51 #info No Critical bug 16:00:54 #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 17 new untriaged bugs (+4 since the last meeting) 16:01:00 #info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster 16:01:20 honestly, I think I forgot to tell melwitt to look at the bugs, so it's on me :) 16:01:46 the next person in the roster is artom 16:01:55 ohhai 16:01:57 artom: fancy triaging some upstream bugs if you like ? 16:02:05 Sure 16:02:10 thanks a lot 16:02:17 CLOSED RUSTWILLFIXIT 16:02:19 I'll also try to cherry-pick some 16:02:42 artom: well, we're on Launchpad 16:03:15 so it'd be 'Closed' only with a comment saying you'd think Rust would work 16:03:20 or even 'Wontfix' 16:03:22 anyway 16:03:28 #info bug baton is being passed to artom 16:03:32 #topic Gate status 16:03:41 grab your popcorns 16:03:47 #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:03:52 #link https://etherpad.opendev.org/p/nova-ci-failures 16:03:54 but alas, 16:04:17 #link https://lists.openstack.org/pipermail/openstack-discuss/2023-April/033454.html Gate is blocked 16:04:46 last status : https://lists.openstack.org/pipermail/openstack-discuss/2023-April/033468.html 16:05:03 dansmith: sean-k-mooney: elodilles: wanting to add anything else on that ? 16:05:18 probably not 16:05:30 we are mainly waiting on ci 16:05:36 so no i think that fine 16:05:42 i've just updated the nova patches according to the discussions (if i did not miss anything) 16:05:51 we have two concurrent patches afaics https://review.opendev.org/c/openstack/nova/+/881409 and https://review.opendev.org/c/openstack/nova/+/881339 16:05:53 if those will be the chosen ones :) 16:06:07 we may want to only use one :) 16:06:19 yeah 16:06:51 elodilles: your missing the depend on agains the devstack patch but those need to be squashed first 16:06:51 anyway, we'll sort that out of the meeting 16:06:53 so its fine for now 16:07:21 ++ 16:07:55 yup and thanks to dansmith and sean-k-mooney for working hardly on the ceph patches 16:08:04 mainly dansmith 16:08:05 -EOLDGREEK to me 16:08:18 i have been busy with other things other then checking back every now and then 16:08:27 (I mean, I know what ceph does and all the things, but that is a different story) 16:08:42 anywya we can move on 16:08:46 cool 16:08:51 https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&pipeline=periodic-weekly 16:09:01 unsurprisingly they are all green 16:09:15 but by their runs we didn't had the py39 updates :)à 16:09:31 good timing i guess 16:10:31 well, tooz upgraded to 4.0 the week before 16:10:32 Merged openstack/nova stable/xena: Fix rescue volume-based instance https://review.opendev.org/c/openstack/nova/+/875343 16:10:46 so that's fortunate we haven't merged the u-c patch during the weekend 16:10:58 anyway, moving on 16:11:04 #info Please look at the gate failures and file a bug report with the gate-failure tag. 16:11:09 #info STOP DOING BLIND RECHECKS aka. 'recheck' https://docs.openstack.org/project-team-guide/testing.html#how-to-handle-test-failures 16:11:14 #topic Release Planning 16:11:18 #link https://releases.openstack.org/bobcat/schedule.html 16:11:23 #info Nova deadlines are set in the above schedule 16:11:28 #info Bobcat-1 is in 2 weeks 16:11:51 (we'll have a stable branch review day on Bobcat-1 but I'll explain this the next week) 16:12:02 #topic Review priorities 16:12:09 #link https://review.opendev.org/q/status:open+(project:openstack/nova+OR+project:openstack/placement+OR+project:openstack/os-traits+OR+project:openstack/os-resource-classes+OR+project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/osc-placement)+(label:Review-Priority%252B1+OR+label:Review-Priority%252B2) 16:12:14 #info As a reminder, cores eager to review changes can +1 to indicate their interest, +2 for committing to the review 16:12:19 #topic Stable Branches 16:12:28 elodilles: take the mic 16:12:43 yepp 16:12:53 beyond the usual stuff, 16:13:04 (unblocked gate & many rechecks) 16:13:19 auniyal prepared release patches for yoga and zed 16:13:30 they haven't merged yet 16:13:53 and meanwhile 1-1 patches merged to stable/yoga and zed 16:14:07 yes, got 1 +2 yet :), thanks for review elodilles 16:14:12 otherwise they are good as they are 16:14:26 auniyal: thanks for proposing the patches :) 16:14:41 cool 16:15:54 anything else ? 16:16:04 nope, i think that was all 16:16:07 from my side 16:16:15 sorry :) 16:17:10 cool, moving on 16:17:19 #topic Open discussion 16:17:29 (ykarel) Allow to add tb-cache size libvirt option for qemu, context https://bugs.launchpad.net/nova/+bug/1949606 16:17:33 ykarel: go for it 16:19:20 hi 16:19:41 so qemu-5.0.0(included in ubuntu jammy) update default tb cache size to 1 GiB(from 32 MiB) for system-emulated guest vms and with that each guest VM is using much more system memory(1 GB+ memory for guest), resulting into oom-kill issues when creating multiple guest vms concurrently in neutron scenario jobs using ubuntu guest vms 16:20:17 libvirt-8.0.0 added an option to configure it per guest vm 16:20:25 Currently testing WIP nova patch https://review.opendev.org/c/openstack/nova/+/868419 in https://review.opendev.org/c/openstack/neutron-tempest-plugin/+/881391 16:20:46 should this be a specless RFE or can continue as a bug? 16:20:54 ykarel: did you file the specless pluprint i asked for 16:20:56 ah 16:20:58 its not a bug 16:21:19 sean-k-mooney, i just added to meeting agenda to get clearity 16:21:29 but if it's a way to go then i will file it 16:21:30 we might conider it to be a small enough workaround that we might want to backport it upstream 16:21:33 to enable ci testing 16:22:27 hmm mainly would be needed for releases running on jammy 16:22:30 the issue is that i dont think 22.04 has libvirt 8.0.0 16:22:37 it has 16:22:44 that's what we testing 16:22:54 ah ok 16:23:07 we damn need update of https://docs.openstack.org/nova/latest/reference/libvirt-distro-support-matrix.html 16:23:19 yes 16:23:30 so your patch is mising a min libvirt version check 16:23:33 we're still having libvirt 6.0.0 as a min, right? 16:23:35 so that will need to be added 16:23:40 from job logs libvirt-daemon 8.0.0-1ubuntu7.4 16:23:55 bauzas: yes and im hoping to bump that to 7.0.0 this release 16:23:58 sean-k-mooney, yes will include that in next update 16:24:10 but we will sitll need to check 8.0.0 supprot 16:24:15 ack 16:24:16 sean-k-mooney: but we would still need to only set it if libvirt >=8 16:24:32 do we want to schedule on it, or is it just a performance fix ? 16:24:34 ya either only set it or preferable check this in init-host 16:24:43 and raise an error if you set the config option on an old libvirt 16:24:58 will there be any config knob ? 16:25:07 yes there shoudl be 16:25:12 yes that's second question 16:25:14 to set the cache size 16:25:15 Default setting for this new option, unconfigured or set to defaults like 32MB or 128 MB etc? 16:26:07 ykarel: if that's a config option, that indeed needs to be an hard fail if the operators sets this config value 16:26:18 and libvirt isn't recent enough 16:26:23 its not somethign we should hardcode 16:26:28 bauzas, sure will take care that in the patch 16:26:29 for that reason, I'm not happy with a default value except none 16:26:47 so it shoudl eb a config option im ok with a low default or leaving it unset 16:27:02 I'd prefer it unset for upgrade reasons 16:27:04 okk Thanks will keep it like that no default and let user configure it 16:27:05 bauzas: to your poitn it woudl be nice to have a triat for this but not required 16:27:28 ykarel: s/user/operator but I think I get your point 16:27:50 so will propose the blueprint and update the patch with as per all the suggestions, Thanks 16:27:50 in this case really it will be the zuul job 16:28:01 yes 16:28:03 this is of interest to peopel usign qemu for emulation 16:28:06 sean-k-mooney: man, we could recommend the operators to provide custom traits for this, exactly like vgpu types 16:28:36 I mean, eventually all the computes will support that, right? 16:28:44 yes 16:28:53 after a couple of releases, once we cap libvirt to >=8 16:29:10 so its not goign to break live migration 16:29:13 so I'm not a big fan of adding some scheduling thing for something that will eventually be supported mid-term 16:29:24 because we do not allwo live migration from a newer to older microversion 16:29:36 and for cold migration we will regenerate the xml on the approiate host 16:29:41 based on what it has avaiable 16:29:43 s/microversion/libvirt version but yeah 16:29:55 so i dont think we need anythign sepcial here 16:30:01 so ya custom trait woudl eb fine with me 16:30:13 they can use provider.yaml to set that if they want 16:30:17 cool, so that only seems a config knob to add, a check on init_host to fail if set and some magic in the driver to enable it 16:30:30 amirite ? 16:30:34 more or less 16:30:48 then, I'm OK for specless 16:30:56 +docs test ectra but its effectivly self contaied in the libvirt driver + the config tweak 16:30:58 we had precedents 16:31:11 + a relnote obviously 16:31:24 ykarel: do you agree with the direction ? 16:31:30 bauzas, yes 16:31:31 im ok with specless too assuming all of the above are done 16:31:41 anyone disagreeing ? 16:32:07 looks not 16:32:20 Thanks folks \o/ 16:32:41 the only other thin i would suggest is once this is done a devstack patch shoudl be added to set this by default to say 32mb if qemu is used instead of kvm 16:33:37 thats out of scope fo this meeting and coud be doen on a per job basis too 16:33:39 #agreed enabling tb cache seems a specless blueprint, provided it only adds a config knob defaulting to unset, init_host failing on an older libvirt and just libvirt config tweak 16:34:01 #action ykarel to ping bauzas once the blueprint is created so that he can approve it 16:34:34 ykarel: and yeah, the scope of this feature can include devstack change and testing, for sure 16:34:53 like we could enable it in nova-next 16:35:41 bauzas: it will reduce the memory pressue in all our jobs so ocne we know it does not have a negitive impact we will proably want to have it enable din all of them 16:35:42 anything else to add on this item ? 16:35:55 but we can take it slow like the mariadb reduced memroy 16:36:00 sean-k-mooney: oh yeah, but I'd be in favor of testing it first 16:36:08 yah 16:36:33 ok, fwiw, I have another item 16:36:53 (bauzas) Can https://blueprints.launchpad.net/nova/+spec/cold-migrate-to-host-policy be specless ? 16:37:05 tl;dr: we discussed this at the PTG 16:37:29 assuming there is no change in the default policy then yes i think so 16:37:32 operators want a better granularity and maybe change the cold-migrate action to be admin_or_owner 16:37:46 but, here, we're just adding a new policy which is admin-only 16:37:55 so no API change, and no policy changes 16:38:11 it will just go check a separate policy if host is set 16:38:32 (literally a one-liner patch besides the policy file) 16:38:55 any objections to have it specless ? 16:38:55 so basicaly there will be two poicies now for cold migration one for migratoin with a host adn one without but admin only by default 16:39:01 and then operators can choose 16:39:18 +1 16:39:37 correct, like we have for os_compute_api:servers:create:forced_host 16:40:10 except I won't change the defaut rule for os_compute_api:os-migrate-server:migrate 16:40:34 there will be os_compute_api:os-migrate-server:migrate and os_compute_api:os-migrate-server:migrate:host 16:40:34 both being admin-only 16:40:48 (and operators can decide to open os_compute_api:os-migrate-server:migrate to endusers) 16:41:10 so I reiterate, any objection to have it specless ? 16:41:40 looks not 16:41:49 if so, 16:42:08 as long ast there is at least a blueprint im happy. i dislike changing policy without any tracker to works for me 16:42:17 #agreed https://blueprints.launchpad.net/nova/+spec/cold-migrate-to-host-policy accepted as a specless feature for Bobcat 16:42:31 sean-k-mooney: there is a blueprint, and there will be a relnote 16:42:45 and there will be functional tests covering this 16:42:53 yep all good. 16:43:15 I don't think we need a tempest change, do you think it's a nice to have ? 16:43:29 i dont think tempst shoudl test non default policy 16:43:41 yeah, that was my question 16:43:48 I'm not a QA expert 16:43:55 tempest is branchless 16:44:05 so that would be a bit hard to test it with tempest 16:44:22 i suspect you could reuse some fo the exsiting test with the right config if you needed too 16:44:24 anyway, I think we're done on this 16:44:44 about tempest, we could discuss this on the review time 16:44:50 thanks folks 16:44:51 Artom Lifshitz proposed openstack/nova master: Reproduce bug 1995153 https://review.opendev.org/c/openstack/nova/+/862967 16:44:51 Artom Lifshitz proposed openstack/nova master: Save cell socket correctly when updating host NUMA topology https://review.opendev.org/c/openstack/nova/+/862964 16:45:01 any other item to add before we end the meeting ? 16:45:22 small thing o/ 16:45:27 CI on yoga: this one keep failing for different reasons, mostly 16:45:27 https://review.opendev.org/c/openstack/nova/+/839922 16:45:28 shot 16:45:42 mostly volume tests 16:46:23 ya that kind fo a pain im not sure there is anythin we can do beyond recheck 16:46:29 is it the volume detach tests 16:46:45 auniyal: gibi found some tests that are not waiting properly 16:46:47 I think this is also tracked on the stable CI failures etherpad 16:46:50 yes, attach and detach , but they are always different 16:47:04 sometime tomeout 16:47:15 yoga is not EM right so its still using tempest master? 16:47:20 no 16:47:46 tbc no, its not EM 16:47:50 ok 16:47:58 so it still can get tempest fixes if we fix those tests 16:48:09 yup 16:48:40 are we done ? 16:48:59 sorry I didn't get, any action on above 16:49:20 we need to fix tempest tests ? 16:49:47 i think just continue to reheck it. gibi found at least on test that is not waiting for sshable 16:49:53 no, we have some tempest patches up 16:49:57 and notice other dont appear to eb waiting but i dont have the context 16:50:01 and yoga would benefit from those 16:50:10 since tempest is branchless 16:50:14 oh do you have a link? 16:50:16 ack thanks 16:51:54 I was referring to gibi's recent discoveries of testing gap for ssh wait 16:52:13 (sorry was looking at the -tc meeting) 16:52:23 -tc chan* 16:52:35 can we close this meeting now ? 16:52:38 its fine we can wrap this here and chat after 16:53:06 cool 16:53:08 thanks all 16:53:12 #endmeeting