16:02:02 #startmeeting nova 16:02:02 Meeting started Tue May 17 16:02:02 2022 UTC and is due to finish in 60 minutes. The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:02:02 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:02:02 The meeting name has been set to 'nova' 16:02:10 sorry, was late 16:02:11 o/ 16:02:14 o/ 16:02:16 o/ 16:02:25 o/ 16:02:33 #link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting 16:02:47 * bauzas was having his brain locked due to some previous meeting 16:03:12 and the heat wave here makes me melting 16:03:25 let's start then 16:03:31 #topic Bugs (stuck/critical) 16:03:36 #info No Critical bug 16:03:40 #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 14 new untriaged bugs (-5 since the last meeting) 16:03:49 kudos to Uggla for this excellent work 16:04:26 he wrote a triage etherpad (yet again, this wasn't mandatory to do it) but in case you wanna know what he triaged https://etherpad.opendev.org/p/nova-bug-triage-20220510 16:04:46 #link https://storyboard.openstack.org/#!/project/openstack/placement 26 open stories (0 since the last meeting) in Storyboard for Placement 16:04:53 #info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster 16:05:20 which leads me to the last point 16:05:51 sean-k-mooney: are you okay with holding the bug baton for this week ? 16:06:01 yes 16:06:06 gracias 16:06:12 #info Next bug baton is passed to sean-k-mooney 16:06:38 any bugs to discuss before we move on ? 16:07:03 #topic Gate status 16:07:10 #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:07:15 #link https://zuul.openstack.org/builds?project=openstack%2Fplacement&pipeline=periodic-weekly Placement periodic job status 16:07:19 #link https://zuul.opendev.org/t/openstack/builds?job_name=nova-emulation&pipeline=periodic-weekly&skip=0 Emulation periodic job runs 16:07:25 all the above seems fine ^ 16:07:37 and not fine like in a fire 16:07:51 I think Uggla had a bug he wanted to double-check? Dunno if you want to leave that for the open discussion 16:07:59 triaged 16:07:59 1959186 (appears to be valid TBC tomorrow) 16:08:01 artom: we can 16:08:52 https://bugs.launchpad.net/nova/+bug/1959186 16:08:59 I'll leave it for open discussion then 16:09:06 people can start digesting it meanwhile 16:09:16 * bauzas has zero context yet 16:09:25 initally this does not seam like a bug 16:09:28 #info Please look at the gate failures and file a bug report with the gate-failure tag. 16:09:29 but know behavior but ya 16:09:34 #info STOP DOING BLIND RECHECKS aka. 'recheck' https://docs.openstack.org/project-team-guide/testing.html#how-to-handle-test-failures 16:09:35 lets wait till later 16:09:46 #topic Release Planning 16:09:52 #link https://releases.openstack.org/zed/schedule.html 16:09:56 #info Zed-1 is due in 2 days 16:09:58 \o/ 16:10:09 before we discuss about this milestone 16:10:13 #info Spec review day happened last week on May 10th 16:10:18 #info 2 specs were approved but a lot of them were reviewed. Kudos to the team for this hard work. 16:10:26 <3 16:10:42 I've seen lot of feedback there 16:10:50 very happy 16:10:54 now, 16:10:57 zed-1 16:11:20 in theory, we only have one thing to take care of for this milestone 16:11:31 libraries releases 16:12:29 yep the release patches have been proposed 16:12:36 i have not looked at them yet 16:13:07 are theree any patches peopel would like to wait for? 16:13:18 that's my humble question 16:13:32 there is one patch in flight in os-vif but i dotn think we have to wait for that 16:13:32 do people care of something they'd like to see published before end of this cycle for the libs ? 16:13:44 no need to wait, we can push out a new lib release any time it is freeee 16:13:45 os-traits seems interesting to look at 16:14:08 yep we use cycle with intermdiaries 16:14:16 which requries at least 1 release per cycle 16:14:22 but we can have as many as we want 16:14:29 https://review.opendev.org/q/project:os-traits+is:open is empty :) 16:14:48 oh shit 16:14:48 yep i think we close to approving spec that would add some 16:14:52 https://review.opendev.org/q/project:openstack/os-traits+is:open 16:14:56 better now:) 16:15:26 so the manial share spec is still pending 16:15:30 ok, only the manila-related traits 16:15:34 so Uggla's patch cant merge yet 16:15:36 and yeah the spec is pending 16:15:38 correct 16:15:46 so, let's release os-traits by now 16:15:52 we can merge teh runtim patch but that is not urgent 16:16:00 +1 16:16:04 and we'll publish a newer os-traits release once the spec is approved 16:16:07 Uggla: ^ 16:17:16 note: os-traits is in independent release model, so release patch won't be generated by release team 16:17:21 https://review.opendev.org/q/project:openstack/os-resource-classes+is:open is a bit more crap 16:17:32 oh shit you're right 16:17:44 I was looking at https://governance.openstack.org/tc/reference/projects/nova.html#deliverables 16:17:50 bauzas, \o/ 16:17:52 but it's not describing the release models 16:17:54 elodilles: os-traits isnt independint is it 16:18:09 I'm lost with all the references 16:18:20 oh it is 16:18:22 for release models, this is another website unrelated to governance 16:18:24 well that does not make sense 16:18:34 because placment is tightly coupled ot it 16:18:36 sean-k-mooney: it is 16:18:53 elodilles: placmentn currently only works with exactly one version of os-triats 16:18:55 ok, found the right doc https://releases.openstack.org/teams/nova.html 16:19:03 https://github.com/openstack/releases/blob/master/deliverables/_independent/os-traits.yaml 16:19:24 this leaves os-r-c and os-traits be independent 16:19:49 right but placment assert the exact numober of triat and resouce classes in its test 16:19:55 https://releases.openstack.org/teams/nova.html#independent 16:20:13 sean-k-mooney: we discussed this at the PTG right ? 16:20:15 so it only works form a unit test perspecive with exactly one version of os-traits/os-rescoue classes 16:20:16 and we agreed on a proposal 16:20:17 yep 16:20:22 relax the test 16:20:25 yup 16:20:29 so let's do it 16:20:33 yep 16:20:45 just pointing out that indpenent did not make sense wiht that context 16:21:00 but ok we can wait to release it until we have new traits 16:21:08 and fix the test in the interim 16:21:22 point is, we should only care of os-vif and client releases for placement and nova, that's it 16:21:40 (for zed-1 I mean) 16:21:58 yep 16:23:06 gmann: btw. this is not going well for https://review.opendev.org/c/openstack/os-vif/+/840020 16:23:15 but let's discuss this on open discussion ^ 16:23:26 and let's continue the agenda 16:23:35 #topic Review priorities 16:23:40 #link https://review.opendev.org/q/status:open+(project:openstack/nova+OR+project:openstack/placement+OR+project:openstack/os-traits+OR+project:openstack/os-resource-classes+OR+project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/osc-placement)+label:Review-Priority%252B1 16:23:59 #link https://review.opendev.org/c/openstack/project-config/+/837595 Gerrit policy for Review-prio contributors flag. Naming bikeshed in there. 16:24:08 reviews are welcome on ^ 16:24:21 I see gibi having concerns on the naming 16:24:49 I have no good suggestion 16:24:53 so feel free to ignore me 16:27:03 my problem on the current naming is that is sounds like a core approves that something is a prioirty 16:27:19 but the aim is instead that the core says "I will review this" 16:27:33 gibi: I can propose something else 16:27:39 to unblock the patch 16:27:52 #link https://docs.openstack.org/nova/latest/contributor/process.html#what-the-review-priority-label-in-gerrit-are-use-for Documentation we already have 16:27:55 bauzas: sure go for it 16:28:24 next topic, 16:28:34 #topic Stable Branches 16:28:41 elodilles: I haven't seen you updating the section 16:28:45 so far, so good ? 16:29:06 #info ussuri and older branches are still blocked, newer branches should be OK 16:29:32 I think we said last week, newer branches *are* OK :) 16:29:35 i haven't got any further with the investigation of ussuri branch :/ 16:30:10 bauzas: yes, unfortunately nothing news :( 16:30:15 damn, looks like we have a netsplit at the time of the meeting 16:30:27 yepp, it seems :S 16:30:35 people will be back soon proably 16:30:55 hopefully yes 16:31:00 looks like mainly NA folks 16:31:01 let's continue then 16:31:24 sean-k-mooney: yeah, one OFTC server seems to have killed the conns 16:31:34 nothing more to add for now :X 16:31:55 elodilles: at one time, we'll need to cut the rope on the older branches 16:32:38 older starting form ? 16:32:38 bauzas: indeed. now ussuri is in bad shape for weeks now... 16:32:50 hum 16:32:52 sean-k-mooney: ussuri 16:33:00 so we woudl be droping train and below too 16:33:23 actually it fails due to multiple intermittent failures 16:33:27 we need help or some action in order to mitigate the blocker 16:33:42 elodilles: yup, constant rechecks 16:33:45 bauzas: what is the blocker exactly 16:34:01 the failure ratio is so high it becomes unpractical to merge 16:34:15 right but on what issue 16:34:22 a couple of them 16:34:28 * sean-k-mooney has not looked at stable/ussiri 16:34:29 elodilles: we tracked them, right 16:34:46 ok i was wonderign if there was one we shoudl look at specificly 16:34:57 sean-k-mooney: guest kernel panics, volume timeouts, 16:35:12 as in volume detach 16:35:19 is it resize related? 16:35:44 becasue we noticed today that reize is not suing hte sshable change yet 16:35:56 gibi proposed a patch to cover that 16:36:08 also this bug comes up often: https://bugs.launchpad.net/nova/+bug/1901739 16:36:39 im not familar with that 16:37:22 looks like lee fixed that in V but it has not been backported 16:37:52 it is a mix of fixes already due to zuul migration and ubuntu version cahgne 16:37:55 chnage 16:37:56 ok we can proably move on but if you have a list elodilles please share 16:38:48 sean-k-mooney: well it's on the recheck list on this patch: https://review.opendev.org/c/openstack/nova/+/838033 16:38:51 :/ 16:39:28 can we ask infra to force merge that 16:39:57 sean-k-mooney: it won't help to other patches, they will need the same rechecks 16:40:17 yes i know 16:40:35 in general, we can try to release the jobs 16:40:50 in order to merge one specific patch we want 16:41:05 yes, one way is to decrease the test coverage 16:41:05 +1, better than force merge 16:41:06 we did that in some past 16:41:19 ok 16:41:26 here that would mean to cut out devstack based jobs 16:41:29 we can make some non voting for now 16:41:35 an rely on unit and functional 16:41:43 *and 16:41:45 this was ages ago 16:41:47 yeah just for temporary time 16:41:53 gibi: or some volume related tests. though i don't know which is worse :S 16:41:53 but I still remember the magic formula 16:42:22 gmann: it would not be temporary if we disable jobs we will never fix tem 16:42:25 them 16:42:34 well i asusme we have a limited set of patches that we want to merge and we woudl turn these abck on like droping lc jobs 16:42:54 so i was epecting disable patch then revert 16:43:02 gibi: the idea is to relax the jobs before merging patches we want and then enabling again the jobs 16:43:19 bauzas: and will we do it for every patch we want to merge? 16:43:21 and see whether the failure ratio drops again 16:43:23 yeah that is what I thought, enabling them again 16:43:40 bauzas: it would work iff we would have patches in queue that fixes those breaks but we dont 16:43:48 gibi: no, we need to identify a set of patches we'd like to merge in order to help with CI stability 16:44:01 we don't have the fixes :/ 16:44:33 it is not like merge these 5 patches to stabilize ussuri, we don't have those 5 patches ready 16:44:50 we have random patches backported and waiting 16:44:51 humm 16:45:25 ok, we won't obviously solve this problem by now 16:45:33 one thing to note if we are removing integration tests coverage then it is better to make branch EOL like we did for ocata 16:45:38 so something like: 1) collect the issue to be fixed in ussurit to stabilize CI 2) create fixes for them 3) force merge them 4) profit 16:45:39 let's state we all know about this problem and we'll figure out the solutions later 16:45:58 gibi: yeah, sounds 1/ and 2/ are not done yet 16:45:59 maybe if that "SSHABLE" fix helps things in ussuri we have one, but still we probably have other failures 16:46:30 elodilles: a buncs of SSHABLE already landed, so we should see them help 16:46:38 elodilles: that is again difficult as we are not supposed to use tempest master for ussuri so not all master change will go for ussuri testing 16:46:43 elodilles: ideally an etherpad of doom may help gathering ideas and feedback 16:46:48 gmann: ahh 16:46:50 that explains 16:46:55 I have patch up to cap tempest for ussuri but that is not merged yet 16:46:58 gmann: wait we are not? 16:47:08 gmann: temest is ment to be branchless upstream 16:47:13 but we have stopped ussuri support in tempest master 16:47:31 hum 16:47:37 because of what reason? 16:47:41 sean-k-mooney: yes and master tempest is for SUpported branch and we cannot support EM branches 16:47:42 python version ? 16:48:04 shoudl that not be a project choice 16:48:16 EM branches - #link https://docs.openstack.org/tempest/latest/stable_branch_support_policy.html 16:48:17 tempest should not have to support it 16:48:42 but project that support em branches shoudl still be able to choose which tempest to run 16:48:46 sean-k-mooney: we have to pin tempest for EM branch so that using master there can break them anytime 16:49:06 sean-k-mooney: yeah, you can but no guarantee if master tempest will work 16:49:19 as long as it is passing it is ok to use 16:49:58 ack 16:50:12 so i think i would prefer to keep tempest uncapped until ti breaks 16:50:14 (well, i guess changes on tempest master can affect not only EM branches, but they are affected most likely) 16:50:30 elodilles: well nova uses microversion 16:50:40 so it should work form our point of view 16:50:53 elodilles: for supported branches we make sure tempest does not break them but for EM it is not the case 16:50:57 chagnes to tempest should not break our stable branches expcti if there ar package dep issues 16:51:14 plan is : by default devstack will cap the tempest in ussuri and project can override it in jobs 16:51:15 I need to timestop the conversation 16:51:15 which i guess is why this was done 16:51:41 yeah, we can discuss it in qa or nova channel after meeting 16:51:49 +1 16:51:51 +1 16:51:52 gmann: sean-k-mooney: gibi: you're all free to continue the conversation after the meeting 16:51:57 cool 16:51:57 sure 16:52:01 moving on quickly 16:52:05 * gibi will pass out after the meet :/ 16:52:08 #topic Open discussion 16:52:21 Uggla: do you want to discuss https://bugs.launchpad.net/nova/+bug/1959186 16:52:22 ? 16:52:44 Uggla: you left this bug with comments on your triage etherpad 16:52:53 yep 16:53:18 It appears valid to me, but I would like a crosscheck. 16:53:26 i dont think it is 16:54:02 there is a long runnign knwon issue that you cant delete in use image form glance if it uses ceph 16:54:14 that only happens if you use raw images 16:54:27 at that enabel our copy on write shallow clone code path 16:54:34 sounds at least unrelated to nova 16:54:42 which i suspect is what is happening here with the snapshot 16:54:45 maybe valid but not on our side 16:54:52 right? 16:55:21 it's really desired behavior even 16:55:27 there have been some feature request in this area 16:55:39 some peopel woudl like too break the shallow copy 16:55:43 I think it'd be a feature on the glance side, IIRC 16:56:07 ya so they have not providded enough info in the bug 16:56:12 we do not know the image type 16:56:21 nova doesn't even know about the linkage after it's created right? so if the link is broken, nova is fine with it 16:56:21 to confirm wone way or another 16:56:39 am im not sure 16:57:00 dansmith: hence my point, unrelated to the project 16:57:04 => Opinion 16:57:14 have a feature proposal, but not in the agenda yet. may I ask here and gauge interest/feasibility? 16:57:15 bauzas: yeah 16:57:24 there is some metadata on teh image but i dont know if we read that when we create the new vm 16:57:29 or tack it on our side 16:57:43 sean-k-mooney: we can add glance as a service project in the bug report 16:57:50 and mark it invalid on our side 16:58:02 we'll see it coming back if that's really a nova bug 16:58:21 Uggla: works for you ? 16:58:26 yes 16:58:57 OK, sold 16:59:12 #agreed https://bugs.launchpad.net/nova/+bug/1959186 to be punted to Glance 16:59:56 ill do that and ask for more info in the bug 17:00:02 I wanted to discuss https://review.opendev.org/c/openstack/os-vif/+/840020 but we're at time 17:00:14 thanks all 17:00:16 #endmeeting