16:00:05 #startmeeting nova 16:00:05 Meeting started Tue Jun 22 16:00:05 2021 UTC and is due to finish in 60 minutes. The chair is gibi. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:05 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:05 The meeting name has been set to 'nova' 16:00:25 \o 16:00:54 o/ (kinda) 16:01:00 o/ 16:01:35 o/ 16:01:41 \o 16:02:08 o/ 16:02:18 #topic Bugs (stuck/critical) 16:02:25 No critical bugs 16:02:30 #link 21 new untriaged bugs (+7 since the last meeting): #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 16:02:43 o/ 16:02:51 our backlog is growing so please if you have some time then look at it and triage some bugs 16:03:36 is there any specific bug that we need to discuss now? 16:04:10 gibi: i am pretty sure that https://bugs.launchpad.net/nova/+bug/1933096 16:04:16 is a bug in ther ci env 16:04:24 so hopefully we can close that soon 16:04:44 sean-k-mooney: cool. thanks for looking into that 16:05:53 if nothing else then 16:05:53 #topic Gate status 16:05:58 Nova gate bugs #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure 16:06:01 gibi: I'll look at open bugs 16:06:16 bauzas: thank you 16:06:23 I don't see fresh gate bug in that list 16:06:31 last week we merged couple of fixes 16:06:46 how do you feel, does the gate improved? 16:06:46 am any suggestions on who i should ping to move https://review.opendev.org/c/openstack/devstack/+/796826 along in devstack 16:07:06 thanks lyarwood btw. 16:07:13 for the gate failures fixes 16:07:44 sean-k-mooney: gmann could be one 16:08:09 i assume that we are still seeing the intermitant failures due to the agent haning 16:08:31 so if we can merge that sooner rather then later it should help with gate stablity 16:08:32 I was off yesterday and did not push any code today so I don't have the view what is failing ont he gate 16:08:46 but I agree that devstack patch is a good step forwa4rd 16:09:17 anything else about the gate? 16:10:34 Placement periodic job status #link https://zuul.openstack.org/builds?project=openstack%2Fplacement&pipeline=periodic-weekly 16:10:38 placement jobs are green 16:10:47 #topic Release Planning 16:10:51 Milestone 2 is 15 of July which is spec freeze 16:10:55 Spec review day is 6th of July #link http://lists.openstack.org/pipermail/openstack-discuss/2021-June/023083.html 16:11:01 anything else about the coming milestone? 16:12:47 #topic Stable Branches 16:12:52 copying notes from elodilles 16:12:56 stable/ussuri is blocked (fix needs to be merged: https://review.opendev.org/c/openstack/nova/+/794675 ) 16:13:00 other branches should be OK (however, there are quite frequent volume detach failures for example on stable/victoria) 16:13:05 stable/ocata branches of nova are EOL, branches are deleted ( https://review.opendev.org/c/openstack/releases/+/795664 ) 16:13:08 EOM 16:13:25 sorry, I was lost in the ussuri gate fixing patches, 16:13:35 the detach failure has a fix on master 16:13:36 i will try to sort it out in the coming days 16:13:59 https://review.opendev.org/c/openstack/nova/+/796255 16:14:05 this needs to be backported I think 16:14:10 hm 16:14:23 no\ 16:14:24 sorry 16:14:32 I'm not sure 16:15:28 that fix helps with the detach issue on master where we have the new notification based detach 16:15:46 it would need to be backported to any release using the events? 16:15:56 but not where we poll? 16:16:12 so if we backport the events based detach we need to backport both 16:16:30 sean-k-mooney: that is my understanding yes 16:16:42 I saw lot of failures in victoria as I said, can it be valid backport there? 16:17:18 i think its a relitivly safe backport in either case 16:17:36 elodilles: the detach modification is pretty huge patch but it does not change external interfaces 16:17:51 gibi: did we backport your events patch to victoria? 16:17:53 so from process perspective it is backportable 16:17:57 sean-k-mooney: no 16:18:02 I dont think so 16:18:20 anyway, that sound good (except that it's a huge patch :S), I'll add this to my TODO to investigate :) 16:18:29 so the failure on vicotria are proably not related to OPERATION_FAILED 16:18:44 sean-k-mooney: yeah 16:18:45 elodilles: wehre they OPERATION_FAILED specifcialy or was it just detach failures 16:19:20 well, the usual error: volume ... failed to detach (in XXXX state ) something 16:19:46 ok then that is likely not related to https://review.opendev.org/c/openstack/nova/+/796255 16:19:50 sean-k-mooney: I dont think nova logs the sepcific error code 16:19:53 :-/ 16:19:58 we can review after the meeting if you have an example 16:20:23 sure, thanks! 16:20:25 ack 16:20:28 gibi: we might see it in the libvirt log in the ci run not sure but just a guess 16:20:34 ok 16:20:42 any other stable related issue? 16:21:16 nothing else i think :X 16:22:37 #topic Sub/related team Highlights 16:22:42 Libvirt (bauzas) 16:22:45 just one thing, we discussed about https://review.opendev.org/c/openstack/nova/+/769614 about how to help operators to know about a behaviour modification for a libvirt issue 16:23:05 we had some argument about some relnote reno section 16:23:08 but, 16:23:16 eventually, I +Wd the fix 16:23:53 bauzas: what is the summary? 16:24:00 do we need to create some better way to tell operators that we no longer verify the cpu topologies ? 16:24:09 yes just noticed if you really wanted me to update it i was ok with that just wanted you and stephenfin to agree on a path forward 16:24:26 gibi: this ^ 16:24:58 basically, flavors could continue to have the same extraspecs 16:25:06 bauzas: I'm still not sure what you're asking for. Are you suggesting we drop the 'upgrade' section? 16:25:18 stephenfin: no, not about this 16:25:45 stephenfin: about whether we would need more some kind of documentation for this 16:25:50 oh, that 16:26:03 oh ok you were conserened if we needed to document it at all 16:26:09 not how we did it 16:26:10 yup 16:26:29 ok i tought you just wanted it in the fixes section so i missed that in your previous comments 16:26:33 My personal opinion is no. We never documented this behaviour and we never agreed to guarantee it. I wanted to include the release note simply as a courtesy 16:26:39 basically, the flavors would continue to be the same, it's just the libvirt driver that won't longer verify them 16:26:54 similar to how we announce virt API changes, even though we don't support out-of-tree virt drivers 16:27:16 I think you've misunderstood things. Nothing changes in what we verify 16:27:21 well its not about verificaiotn is that we change unspecified behavior to different unspecifed behavior 16:27:27 stephenfin: fair enough, I'm having the same opinion but I'm asking the community here if they have concerns 16:27:34 yes, what Sean said 16:28:05 if we are changing undefined behavior then reno is just an extra, but not at all mandatory. 16:28:06 we had a hidden behavior where we would try to match the number of guest CPU "threads" to the number of host CPU threads 16:28:25 but it was hidden. We didn't even know it existed until sean-k-mooney started fixing this bug 16:28:28 yes, just a performance kind of help 16:28:37 exactly. An optimization at most 16:28:47 okay, any concerns ? 16:28:52 if not, I'm OK 16:28:59 I'm Ok too 16:29:20 stephenfin, sean-k-mooney: is is settled from your perspective too? 16:29:25 no issues from me 16:29:33 yep 16:29:41 i did not have a stong opiion 16:29:42 k, that's it for me 16:29:50 ok, thanks 16:29:57 #topic Open discussion 16:30:03 there is one topic 16:30:07 (wenpingsong) vGPU spec reviewing: as we discussed during ptg, cyborg-managed/nova-managed gpu should have a trait indicating who owns this gpu, thanks for bauzas's comments with different thoughts about the trait. We need to have an agreement on that. https://review.opendev.org/c/openstack/nova-specs/+/780452 (can't attending the meeting due to the time slot, but will check the logs later) 16:30:33 as far as I understand we originally suggested the owner traits 16:30:43 but bauzas now suggest using separate RC 16:30:49 for cyborg managed vgpus 16:30:56 this way the trait would not be needed 16:30:56 both could work 16:31:10 my -1 was just for discussing about it 16:31:25 I guess we don't have the use case: Give me a vgpu I don't care if it is cyborg or nova managed 16:31:30 given I was also working on the mdev spec 16:31:40 I thought about it 16:31:43 owner traits i guess could have upgrade issues 16:31:49 and I'm not sure we need a "nova" trait 16:31:50 e.g. if you upgrade cyborg first 16:31:59 without upgrading nova to support owner traits 16:32:07 I know tho we said we would do it at some PTG 16:32:19 so I'm sorry to reopen the can 16:32:44 well no i think there is a usecase for a nova triat 16:32:51 or well RP ownwer in general 16:32:56 not nessiarly a trait 16:33:33 it's the other way of a consumer type :) 16:33:45 something like a inventory type :) 16:33:49 so its a provider type :) 16:33:58 yeah 16:34:24 tbh, I don't really like us marking traits for things unnecessary 16:34:43 am would it be better to continue this discsssuon on the spec. was tehre a summary you wanted to give in real time 16:34:51 agreed 16:34:59 not sure we can have a consensus now 16:35:11 agreed too, I don't have the brainpower to think this through right now 16:35:12 but I want us to think more about this 16:35:26 and again, I'm sorry to hold a bit the spec 16:35:39 no worries 16:35:45 once we add a trai we cant remove them so we shoudl get this right 16:35:53 but if we are about to be generic, we could also help cyborg I think 16:36:06 sean-k-mooney: unfortunately yes 16:36:34 unless we used CUSTOM_OWNER i guess we cloud i will try and read the spec again tomorow 16:36:44 maybe it's also the fact that a 'nova' trait seems to me bizarre 16:36:50 OK, continue this in the spec 16:37:07 i dont think we shoudl treat nova as special in placment 16:37:09 any other topic for today? 16:37:17 with jay's mind, I would transform this into "I can support 'nova'" 16:37:19 not form me 16:37:32 gibi: nope, I'm done 16:38:33 bauzas: well it would mean "i can support consumtion by nova" but that just a detail 16:38:45 sean-k-mooney: that's why I think the name is wrong 16:38:51 if nothing else then I close the meeting and you can continue :) 16:38:52 #endmeeting