16:00:04 #startmeeting nova 16:00:04 Meeting started Tue Jun 6 16:00:04 2023 UTC and is due to finish in 60 minutes. The chair is gibi. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:04 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:04 The meeting name has been set to 'nova' 16:00:11 #chair bauzas 16:00:11 Current chairs: bauzas gibi 16:00:15 #chair dansmith 16:00:15 Current chairs: bauzas dansmith gibi 16:00:16 o/ 16:00:22 o/ 16:00:28 o/ 16:01:02 bauzas has an appointment so I try to chair this 16:01:14 #link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting 16:01:26 #topic Bugs (stuck/critical) 16:01:30 lets see 16:01:40 #info No Critical bug 16:02:25 #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 18 new untriaged bugs (+3 since the last meeting) 16:02:31 #info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster 16:02:52 last week the baton was at bauzas 16:03:14 so I'm not sure if we have any news from him now 16:03:50 the next on the roster is me 16:04:02 I'll take it next week 16:04:08 but I will be mostly away next week 16:04:15 o/ 16:04:15 I didn't had time to look at them this week 16:04:38 Ditto due to the summit 16:04:48 so moving down the list the next on it is melwitt 16:04:48 But I can try to look at them 16:04:56 (sorry on my phone) 16:05:18 melwitt: could you take the baton? 16:05:24 gibi, last to last week I looked into this bug: https://bugs.launchpad.net/nova/+bug/2018719, I could not reproduce so added comment to ask for more info 16:06:09 auniyal: ack 16:06:42 I guess logging in to the rescue image depends on the actual image so you are right 16:07:15 I will ping melwitt later about the bug baton 16:07:27 any other bugs we need to discuss? 16:08:45 nothing from my side, thanks 16:09:35 #topic Gate status 16:09:56 #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:10:01 #link https://etherpad.opendev.org/p/nova-ci-failures 16:10:16 lots of little test failures lately which is making it challenging to get a clean result 16:10:27 but nothing outstanding as a super common thing to go tackle that I've seen 16:10:50 I saw two different guest failures one case an disk io error 16:10:59 the other was probably some metadata error 16:11:08 but I agree I did not see a pattern yet 16:11:22 I have seen some IO errors related to volumes yeah, but I don't know what that's coming from 16:12:43 I don't see any new bug reported tagged with gate-failure. If I see a pattern in tomorrows reject then I will file some 16:13:04 s/reject/recheck/ 16:13:44 haven't seen any gate failure 16:13:52 (still otp) 16:14:14 #link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&pipeline=periodic-weekly Nova&Placement 16:14:34 periodics look good 16:15:11 any other gate issues to raise? 16:15:21 #info Please look at the gate failures and file a bug report with the gate-failure tag. 16:15:59 nothing from me 16:16:14 I'm back 16:16:21 then the usual announcement 16:16:22 #info STOP DOING BLIND RECHECKS aka. 'recheck' https://docs.openstack.org/project-team-guide/testing.html#how-to-handle-test-failures 16:16:47 fwiw, 16:17:11 I think we're doing quite well on the blind recheck thing.. not that we shouldn't remind people, but we could probably un-ALL-CAPS-ify that now :D 16:17:13 gibi: wants me to take again the chair seat ? 16:17:29 dansmith: cool. I'm OK to uncap it :) 16:17:41 it's been tracked in the TC meeting and we' 16:17:45 bauzas: the char is yours :) 16:17:49 we seem to be settling around pretty good behavior 16:17:49 dansmith: lol, I'll change it :) 16:19:01 #topic Release Planning 16:19:07 #link https://releases.openstack.org/bobcat/schedule.html 16:19:15 #info Nova deadlines are set in the above schedule 16:19:20 #info Nova spec review day today 16:19:25 as a reminder ^ 16:19:39 I deeply missed that :/ 16:19:42 tbh, I wasn't able to do my duty but I'll do this later tonight 16:20:00 (some internal discussion ate my whole afternoon) 16:20:12 so, yeah, would be nice 16:20:24 I saw that there is a spec proposal for continuing the PCI in placement work 16:20:24 nothing to tell apart this 16:20:33 gibi: indeed, someone proposed 16:20:40 I need to review that 16:20:43 cool 16:20:57 but other can chime in there too :) 16:20:58 as a reminder, if folks don't have time to review specs today, that's fine (c) 16:21:09 but please try to look at them this week 16:21:10 there is alway tomorrow :) 16:21:22 at least before the Summit in case people discuss there 16:21:31 anyway, good related point, 16:21:40 #topic pPTG Planning 16:21:45 #info please add your topics and names to the etherpad https://etherpad.opendev.org/p/vancouver-june2023-nova 16:21:57 crickets in there ^ 16:22:12 so I'll write an -discuss ML thread for this 16:22:20 nah, I added one thing now :D 16:22:23 in case ops or devs want to discuss with us 16:22:30 hehe 16:23:23 + I'll tell ops during our forum meet&greet about our PTG 16:23:36 #info The table #24 is booked for the whole two days. See the Nova community thereĀ ! 16:23:40 that's it 16:23:42 moving on 16:23:48 #topic Review priorities 16:23:56 #link https://review.opendev.org/q/status:open+(project:openstack/nova+OR+project:openstack/placement+OR+project:openstack/os-traits+OR+project:openstack/os-resource-classes+OR+project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/osc-placement)+(label:Review-Priority%252B1+OR+label:Review-Priority%252B2) 16:24:01 #info As a reminder, cores eager to review changes can +1 to indicate their interest, +2 for committing to the review 16:24:06 #topic Stable Branches 16:24:12 elodilles is maybe afk 16:24:18 so lemme add his points 16:24:31 #info stable gates should be OK (from stable/2023.1 to stable/train) 16:24:37 #info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci 16:24:42 huzzah for this 16:24:51 and my point 16:24:57 #info train-eol patch proposed https://review.opendev.org/c/openstack/releases/+/885365 16:25:10 I'd appreciate if nova-cores could comment it ^ 16:25:19 I will 16:25:20 * gibi just proposed a train backport today 16:25:31 I noticed the cinder people are taking a more aggressive approach 16:25:40 gibi: okay, then -1 my patch and I'll modify it to await for your backport merge 16:25:47 dansmith: yup, saw it too 16:25:53 bauzas: I think I'm OK to drop that backport 16:26:05 I did that to see if the patch works 16:26:12 gibi: as you want, just tell me your insights in the train-eol patch 16:26:18 ack 16:26:44 dansmith: fwiw I'm afraid of EOLing the whole EM branches 16:27:09 shrug.. my argument for train applies to all the EM ones too 16:27:16 maybe we wait and see how it goes for cinder ;) 16:27:38 but since we haven't backported the os-brick CVE fix in Ussuri and Victoria, I could understand 16:28:01 but yeah, let's see what it's happening for cinder :D 16:28:09 * bauzas takes his popcorn :) 16:28:26 anyway, moving on 16:28:35 #topic Open discussion 16:28:45 none in the agenda 16:28:54 anything that someone wants to tell ? 16:30:01 one thing 16:30:05 shoto 16:30:11 shoot even 16:30:34 there was a request for opinion about openstack on k8s 16:30:40 let me find the link 16:30:55 its a thing that people do 16:31:36 okidoki, let's wait 16:32:15 13:01 < mdbooth> I'll be running a forum session on Kubernetes on OpenStack in Vancouver next week. It's for users and developers of all related projects to talk to each other. Etherpad is here if there's anything you'd like to discuss: https://etherpad.opendev.org/p/openinfra-2023-kubernetes-on-openstack 16:32:20 13:01 < mdbooth> I'll be running a forum session on Kubernetes on OpenStack in Vancouver next week. It's for users and developers of all related projects to talk to each other. Etherpad is here if there's anything you'd like to discuss: https://etherpad.opendev.org/p/openinfra-2023-kubernetes-on-openstack 16:32:30 sorry for the duplicate 16:32:47 im wondering why mdboot is runnign that session but ok 16:32:51 that's k8s on openstack not what you said right? 16:32:57 sorry I mixed up 16:32:59 that k8s on openstack 16:33:01 ya 16:33:05 ok, so lemme add the link 16:33:17 that makes more sense why mdbooth is involved 16:33:26 #link https://etherpad.opendev.org/p/openinfra-2023-kubernetes-on-openstack OpenInfra Forum session for discussing about k8s on openstack 16:33:37 gibi: the other way btw. :) 16:33:41 there is some nova related question in the etherpad 16:34:24 about getting notified if anything changed with servers 16:34:30 I offered the nova notification inteface 16:34:35 yeah and it's a public API :) 16:34:38 but apperantly they want something public 16:34:58 the notifcation are the only interface we have currently 16:34:58 the API is documented but the message bus access then to be non public 16:35:03 since someone worked on notifications objects like 6 years ago (guess who :p ) 16:35:14 bauzas: hah :D 16:35:24 gibi: are you sure that the message bus can't be public ? 16:35:33 it shouldn't be 16:35:39 yeah 16:35:43 we have instance events.. that's what they want I think 16:35:48 the notification bus is semi privladged 16:35:50 and our notifications tend to contain infra information 16:35:52 I think they just need an async way to get those 16:35:58 the notifications can have private infor in them 16:36:02 depening on what you configure 16:36:04 like the bdms 16:36:30 yeah, bottom line the notification API is designed for consumed by admins or other openstack service not endusers 16:36:43 ah they want it to be consumable by endusers ? 16:36:43 right, it would leak bad things between tenants for sure 16:36:47 not just infra things 16:36:52 yeah 16:36:58 urth 16:37:01 urgh 16:37:01 you could have a multi tentant service that converts the notificaton in to a webhook callback or similar 16:37:09 instance events/actions is the right thing I think, it's just only polling currently 16:37:13 but that still not greate 16:37:28 dansmith: ya the event stream would work but im not sure that 16:37:37 yeah, a websocket around instance actions wouldbe nice 16:37:41 even if it was event based they woud lwant ot liste per instance 16:37:53 more liek open a websocket and get all instance events for a project? 16:38:09 they probably want to be able to register a handler with a scope (one instance, all my instances) that lives for a period of time that we call when there's a new event 16:38:14 or that you are allowed to see based on teh scope of the keystone token 16:38:17 I think everytime someone asks us to monitor some instance action, we tell them 'lookup the notifications' 16:38:25 but this is for admin usage 16:38:29 websocket will require a lot of standby resources that I think would be hard for us to manage 16:38:45 true 16:38:45 anyway, 16:38:58 so, they want some enduser public subscription mechanism for asynchronously being notified on my instance state changes ? 16:39:00 not sure how many people will be there to make any sort of headway on that topic, since I think those people are likely here :) 16:39:11 bauzas: yeah 16:39:24 sounds a client thing to mre 16:39:26 me 16:39:31 dansmith: I will be there hence collecting ideas here now :) 16:39:32 lets see if they acan at least expand on the usecases 16:39:58 bauzas: a client can poll (or long poll) but that's much less efficient 16:39:59 yeah I will pull out some specific use case and try to limit the scope to something very simple on our side 16:40:04 especially when instance actions could be days apart 16:40:21 but yeah, someone could provide some tool that would listen to the notification bus and scramples all the admin-only data 16:40:29 bauzas: exactly 16:40:39 sorry, by client I meant something unrelated to nova 16:40:45 so the way to do this in the past was ceilometer put the relevetn events in AODH 16:40:51 they could, but that's basically re-constructing the tenant isolation that nova already has, so it's a big new surface to secure and new services to run 16:40:54 and then you woudl set up alarms on the events you cared about 16:41:15 sean-k-mooney: that's all intended to be operator-focused, not for users to get status/events on their instances right? 16:41:27 no 16:41:34 if they only need a trigger to re-read the instance action API then most of of the data can be hidden from our notifications 16:41:39 aodh and celoimeter provided user facing events/metrics 16:41:51 okay I didn't realize 16:41:52 yeah 16:41:53 they didnt actully expose the full notificaiton 16:42:01 ceilometer was the fit 16:42:04 jsut instance boot started and instance boot finsined events 16:42:19 and that's why we never had this in nova 16:42:27 honestly, I feel like this is probably something nova can/should be doing 16:42:40 nowadays this is how stuff plugs together 16:42:51 ya i think its something we coudl do 16:42:56 but we need to think about how 16:42:58 making an external tool reconstruct what we already know is kinda :/ 16:43:18 yeah 16:43:24 it could be a service like console that you run if you want, and scale separately to handle the amoun tof load you want to tolerate 16:43:31 dansmith: I'm still struggling to find how we would ensure the tenancy isolation by the message bus, but I'm open to ideas 16:43:45 unless we create a bus per tenant 16:43:46 if its in nova 16:43:47 bauzas: we wouldn't? 16:43:48 we can just filter 16:44:36 I'm maybe misunderstanding the proposal, but I thought we were about saying that we may emit project-related notifications 16:44:46 not at the rabbit level 16:44:51 let's let gibi collect some data, 16:44:59 and then we probably need a high-bandwidth conversation about options 16:45:01 bauzas: we def need to understand their use case better 16:45:02 dansmith: ah ok 16:45:18 dansmith: then we need to construct some HTTP/2 layer with keystone auth 16:45:33 or something like that 16:45:38 bauzas: not necessarily 16:45:56 it just depends.. but it should be HTTP-something, either an event stream or callbacks 16:46:07 you would do somethign like "openstack project event subsribe (instnace.action.)*" which woudl return a websock url that would only stream the relevent events for the current project based on the keystone token 16:46:27 yep, could be something like that 16:46:38 sounds very console-ish 16:46:40 so like the console you woudl first create it and then get a handel for where to collect the data 16:46:42 but okay 16:46:44 I can simplify it down to give me a server uuid and I stream you data about notifications affecting the server, but only with very limited data provided 16:46:58 bauzas: exactly.. it's the same sort of arrangement, and the same target audience 16:47:38 ok, then it sounds we have an agreement on the direction, let's not overpaper the technical details 16:47:42 and if its a seperate binary nova-event-proxy 16:47:53 gibi: yeah I just don't think you should need 100 websockets you have to read from if you have 100 instances in your wordpress deployment 16:47:58 sean-k-mooney: right 16:48:02 then its scalablity and wether its deployed is up ot the operator 16:48:07 yup 16:48:10 dansmith: ahh true, we can do it pre project then 16:48:13 per 16:48:18 yeah, devil is in the details of the productization 16:48:21 gibi: or even server group 16:48:42 anyway lets see what they actully bring up 16:48:42 gibi: honestly, the granularity sounds per project to me 16:48:53 and see if this type of solution would work for them or not 16:48:57 well, nfv people are all one project in some cases, so that probably won't work for them 16:49:01 gibi: are you done with this topic now that we drafted a solution for you ? :D 16:49:11 I'm done 16:49:15 cool 16:49:16 thanks for the discussion\ 16:49:22 I will link this to the etherpad 16:49:27 and I will report back from the summit 16:49:27 cool 16:49:37 I'll be back watching you at the Summit anyway 16:49:51 so if you promise too many things, I could yell :p 16:49:58 bauzas: please do so 16:49:59 :D 16:50:12 I don't need another 3 years of "notification" work 16:50:22 heh 16:50:28 still got scars eh/ 16:50:30 ok, I was balancing the idea to paperwork the scaphandre and manila series but I'm exhausted of today 16:50:54 so, let's skip it and pretend it will be discussed in two weeks from now 16:51:01 dansmith: time makes all these memory nicer and nicer actaully 16:51:06 heh 16:51:18 so bauzas' has a good point watching me :) 16:51:32 gibi: https://www.youtube.com/watch?v=dLjNzwEULG8 16:51:48 gibi: you're fortunate, canadians don't open carry 16:52:05 sorry, was a terrible joke :) 16:52:16 dansmith: I need to check this out after the meeting :) 16:52:20 anyway, I think we're done for today 16:52:29 indeed 16:52:40 thanks all 16:52:45 and thanks gibi for the chair 16:52:50 #endmeeting