16:00:06 #startmeeting nova 16:00:06 Meeting started Tue Mar 14 16:00:06 2023 UTC and is due to finish in 60 minutes. The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:06 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:06 The meeting name has been set to 'nova' 16:00:12 howdy folks 16:00:18 o/ 16:00:21 #link https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting 16:01:04 ok, let's start, people can join meanwhile 16:01:10 o/ 16:01:10 #topic Bugs (stuck/critical) 16:01:15 #info No Critical bug 16:01:19 #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 16 new untriaged bugs (+2 since the last meeting) 16:01:26 o/ 16:01:30 o/ 16:02:02 elodilles: I saw you did a bit of triage 16:02:07 yepp 16:02:08 I also looked at the new bugs todayt 16:02:16 thanks! 16:02:21 (to see whether we were regressing) 16:02:31 elodilles: nothing you want to discuss ? 16:02:43 bauzas, regarding this - https://bugs.launchpad.net/nova/+bug/2011567 16:02:52 The message right now is, "fill this in after the PTG", shall we update it to something else. 16:02:57 o/ 16:03:21 auniyal: as I said as a comment, we don't have cycle priorities 16:03:40 so, either we delete the Bobcat file, or meh 16:04:07 we have kept it for other release 16:04:17 its been blank for years at this point 16:04:18 yup, that's what I wrote 16:04:33 we can provide some upstream priorities if we want 16:04:40 okay, the msg seems like we missed to update after PTG 16:04:47 but, if we don't have anyone, should we delete the file ? 16:05:03 im ok either way 16:05:10 we could discuss this in the PTG if we want 16:05:14 sure 16:05:18 kk 16:05:18 ack 16:05:21 let's do this then 16:05:42 auniyal: I also closed your other bug report https://bugs.launchpad.net/nova/+bug/2011564 16:05:56 auniyal: as I said, those links are just examples 16:05:57 yes, that I missed the example 16:06:03 my bad 16:06:09 ok 16:06:15 moving on then 16:06:20 about your question bauzas : there was a serialproxy TR. serialproxy example client seems very old and not really functional (bug: https://bugs.launchpad.net/nova/+bug/2009956 ) 16:07:07 i guess low prio is OK for that 16:07:16 but correct me if i was wrong 16:07:32 elodilles: mmm, good question 16:07:46 'low' is good for me 16:07:57 we expect the serial proxy to be used with netcat in generalright 16:08:48 oh actully we expeort it as a websocket 16:08:59 so you need a websocket client 16:09:04 sean-k-mooney: netcat or any script like https://docs.openstack.org/nova/latest/contributor/testing/serial-console.html 16:09:07 yes, rather a websocket client 16:09:29 ya so i dont really consider the example in the docs to be maintained 16:09:38 if its not tested its broken by default 16:09:41 and we dont test that 16:09:56 ack 16:10:24 well, then should we deprecate this API ? 16:10:27 so we proably shoudl fix this and by either providing a new nova console script that we test or delete the current example and document using a diffent client 16:10:29 as a signal 16:10:32 no 16:10:36 it works fine in horizon 16:10:40 saying "sorry, we don't really test it" 16:10:42 or it did the last time i used it 16:10:56 we shoudl remove the example script 16:11:05 sean-k-mooney: looks to me elodilles tested this with master and it wasn't longer working 16:11:18 elodilles: did you test it with horizon 16:11:24 or that test script 16:11:55 I can try to test it with https://github.com/vi/websocat 16:11:56 sean-k-mooney: well, I don't know whether my devstack was properly configured, but it didn't work 16:12:13 neither with any old clients 16:12:46 so i don't know whether this is a problem in serial proxy or the clients 16:12:51 ack we shoudl look into this more i dont currently have a devstack but i can try and find time to deploy one and see 16:13:11 elodilles: if you can try configuring horizon to use it that would be the main way its used 16:13:16 me too 16:13:22 sean-k-mooney: ack 16:13:42 maybe let's put the importance to Medium, as this is an API 16:13:53 https://docs.openstack.org/nova/xena/admin/remote-console-access.html#serial 16:14:02 its not an api 16:14:18 but it is a fully supprot compenet 16:14:43 ah yes 16:14:45 you're right 16:14:56 a specific service 16:15:00 yep 16:15:08 with another port 16:15:11 its just an alternivte to the novnc proxy 16:15:15 anyway, i've changed its prio to Medium 16:15:24 anyway, let's not try to find the solution here 16:15:30 ++ 16:15:45 elodilles: if you want, I can help you, ping me tomorrow 16:16:05 I don't think this is an Antelope regression btw. 16:16:36 bauzas: ack, thanks 16:16:44 moving on 16:16:54 #info Add yourself in the team bug roster if you want to help https://etherpad.opendev.org/p/nova-bug-triage-roster 16:17:00 #info bug baton is being passed to auniyal 16:17:05 auniyal: you're ok with that ? 16:17:11 yes, 16:18:16 cool 16:18:25 #topic Gate status 16:18:29 #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure Nova gate bugs 16:18:35 #link https://etherpad.opendev.org/p/nova-ci-failures 16:18:55 I'll be honest, I haven't uploaded a lot of implementation changes this week 16:19:08 so I had not found any problem 16:19:22 so looks to me the gate is now better :) 16:19:33 :] 16:19:34 yeah, lots of small changes have improved things a lot 16:19:37 but we'll see after a few time, when we're back 16:19:56 for the moment, the gate doesn't run a lot of jobs 16:20:02 job runs* I mean 16:20:29 and in general, we know the gate becomes bad around the milestones 16:20:39 so we'll see how it goes later 16:20:46 #link https://zuul.openstack.org/builds?project=openstack%2Fnova&project=openstack%2Fplacement&pipeline=periodic-weekly Nova&Placement periodic jobs status 16:20:50 all greens here 16:20:56 #info Please look at the gate failures and file a bug report with the gate-failure tag. 16:21:00 #info STOP DOING BLIND RECHECKS aka. 'recheck' https://docs.openstack.org/project-team-guide/testing.html#how-to-handle-test-failures 16:21:05 #topic Release Planning 16:21:10 #link https://releases.openstack.org/antelope/schedule.html 16:21:15 #info This week is the last time for creating a new RC if we need it. 16:21:23 #link https://etherpad.opendev.org/p/nova-antelope-rc-potential 16:21:33 #link https://bugs.launchpad.net/nova/+bugs?field.tag=antelope-rc-potential no open regression bugs were found. 16:21:46 so, yeah I looked at the open bugs and I haven't found any regression 16:22:30 I also tried to look at the 'In Progress' bugs 16:22:32 https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=In%20Progress&orderby=-datecreated&start=0 16:22:34 by Age 16:24:17 amirite by saying none of the 8 in-progress bug reports we have since 2 months are not regressions ? I hope so 16:25:30 anyway, will continue to look at those bugs 16:25:47 because as I said, this is the last week for a new release candidate 16:26:12 #link https://releases.openstack.org/bobcat/schedule.html 16:26:26 would be nice to have a release where we dont need RC2 16:26:41 you'll be interested in the below : 16:26:46 #link https://review.opendev.org/c/openstack/releases/+/877094 Proposed deadlines for Bobcat 16:27:15 we'll discuss those proposal dates at the PTG 16:27:21 vPTG I mean 16:27:49 but you can start to write comments if you wish 16:28:41 moving on I guess 16:28:43 oh 16:28:51 I was about to forge 16:28:53 forget 16:29:16 I created all bobcat launchpad series for placement, novaclient and nova 16:29:51 now, we said to try to remove placement from storyboard 16:30:01 so I started to look and it looks difficult 16:30:26 #info as a reminder, please no longer use Storyboard for creating feature requests or bug reports https://storyboard.openstack.org/#!/project/list?q=placement 16:30:45 but I'll try to see if I can do anything with https://opendev.org/ttygroup/boartty/ 16:31:03 moving on 16:31:55 #topic vPTG Planning 16:32:06 usual reminder 16:32:08 #link https://www.eventbrite.com/e/project-teams-gathering-march-2023-tickets-483971570997 Register your free ticket 16:32:15 #link https://etherpad.opendev.org/p/nova-bobcat-ptg PTG etherpad 16:32:20 now this is time 16:32:24 #info please add your topics before end of this week, so I could provide an agenda on Monday. 16:32:33 this is important 16:32:49 as I don't see a lot of topics yet in the ptg etherpad 16:33:00 (well, I actually wrote the majority of them) 16:33:26 also, if you want to discuss with other teams, please add your topics in the cross-project agenda 16:33:43 and then I'll ping the other PTLs in order to find some time 16:34:09 as a reminder, I asked for 4 days of 4 hours 16:34:47 but if the agenda remains quite empty as it is, I'll probably unbook the Friday rooms 16:35:26 if it was in person i would suggest using the firday slot for an unconfernce/hackaton 16:35:38 but i kind of hate the idea of doing that virutally 16:35:44 don't get me on that direction :) 16:35:52 so sure 16:36:04 we could keep the slot and decied durign the ptg 16:36:09 I mean, after 3 vPTGs, I'm quite done with them 16:36:12 i.e. if there is a topic we ant to come back too 16:36:23 surely 16:36:44 and I'm pretty sure people will add topics on the last time like every cycle.. 16:36:56 (tbc, I'm also doing this game) 16:37:19 but even with that, I just feel Friday will be either off or hackathon 16:37:45 fwiw, my main priorities for the beginning of Bobcat is to review some series we accepted 16:37:56 like the manila one, which I'll probably help too 16:38:00 if we have noting to do on firday im fine with just say "thanks for coming folks" and doing a spec review day or something 16:38:13 sure 16:38:20 that's an idea 16:38:23 anyway 16:39:18 fwiw, the ptg main agenda is at the moment not large https://ptg.opendev.org/ptg.html 16:39:37 hopefully projects will book their rooms next week 16:40:25 (one thing I also think we miss with *virtual* PTGs is those kind of cross-project large discussions we had before) 16:40:48 but meh, moving on 16:41:09 #topic Review priorities 16:41:14 #link https://review.opendev.org/q/status:open+(project:openstack/nova+OR+project:openstack/placement+OR+project:openstack/os-traits+OR+project:openstack/os-resource-classes+OR+project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/osc-placement)+(label:Review-Priority%252B1+OR+label:Review-Priority%252B2) 16:41:18 #info As a reminder, cores eager to review changes can +1 to indicate their interest, +2 for committing to the review 16:41:25 #topic Stable Branches 16:41:27 elodilles: your time 16:41:40 well, nothing special 16:41:49 i mean, not so many patches merge 16:41:59 but as far as i see: 16:42:09 #info stable gates seem to be OK - though it's hard to merge patches due to intermittent failures 16:42:16 #info stable branch status / gate failures tracking etherpad: https://etherpad.opendev.org/p/nova-stable-branch-ci 16:42:31 that's all :X 16:43:15 elodilles: dansmith: afaik, we haven't backported the mysqld memory reduction changes into stable branches ? 16:43:15 a bunch of the things we've fixed haven't been backported to stable, 16:43:21 so I expect that will remain challenging 16:43:22 hah, jinx 16:43:40 we could work on that 16:43:48 I haven't been monitoring the mysql memory thing, but it seems like that _has_ helped yeah/ 16:44:27 Alexey Stupnikov proposed openstack/nova master: Don't remove cached base images for failed resize ops https://review.opendev.org/c/openstack/nova/+/877410 16:44:31 there was a potential for negative impacts, but I've heard no complaints 16:45:19 dansmith: do you have a topic set for those patches? so that I could see whether some could be backported? 16:45:21 mmmh 16:45:56 elodilles: this in devstack plus the flag enabled: https://review.opendev.org/c/openstack/devstack/+/873646 16:46:26 dansmith: thanks, i'll have a look 16:46:29 example here https://review.opendev.org/c/openstack/nova/+/874664 16:47:02 dansmith: but that means we would need to backport the devstack change too right? 16:47:12 that's why I said "this in devstack" 16:47:21 meaning you need it in the devstack branc you're running on 16:47:25 the nova patch is at least part of stable/2023.1 :) 16:47:27 which is kinda :/ 16:48:13 so the memory patch proably shoudl be backported in devstack 16:48:14 yeah 16:48:23 becuase i think that might be useful on other stable branches 16:48:28 https://review.opendev.org/c/openstack/devstack/+/873646/3/lib/databases/mysql I understand dansmith's concerns 16:48:52 but it looks to me mysql doesn't bubble up the memory 16:48:54 I don't have specific concerns 16:49:08 if it's really that impactful, it's probably worth it, 16:49:18 do we have some mysql monitoring in devstack ? 16:49:19 it's just that it takes the devstack patch on each branch, plus job changes to enable 16:49:31 bauzas: we have my performance.json which has the info in it 16:49:56 gmann specifically didn't want to enable by default until bobcat (understandable) 16:50:02 so backporting to stable is kinda the opposite of that :) 16:50:14 yeah, I can understand 16:50:24 we shouldn't default this to all the jovs 16:50:26 obs 16:50:31 damn, jobs even 16:50:32 why not 16:50:38 the concerns are that it could slow down mysql and introduce other performance regressions that manifest as failures 16:50:44 it hasn't seemed to have done that in practice, 16:50:46 did we see a change in the job execution time 16:50:57 right 16:51:00 but the thought was to minimize the risk, in an already risky environment 16:51:05 sean-k-mooney: because of the fact we don't really monitor mysqld runs 16:51:12 in the logs 16:51:15 bauzas: what does that mean? 16:51:15 and reducing memory pressure might actully reduce swapping and speed up the job 16:51:51 bauzas: I think we do monitor it plenty, it's just that it's a pretty core function and breaking it could have lots and lots of wide impacts, both obvious and non-obvious 16:51:52 dansmith: correct me if I'm wrong, but do we trace the mysql performance in the logs, you said we have that in performance.yaml 16:52:06 I should take a look at this file 16:52:10 (tbh) 16:52:17 performance.json 16:52:30 there's a lot of data in there, but memory is probably the only relevant bit 16:52:39 we dont messure query reponce time as far as i know but we moditor memory usage 16:52:45 I'm just saying I don't know what "monitor mysqld runs" means in this context 16:53:16 yeah, no query time logging, but that would need to be done in aggregate to have any sort of meaningful result I think 16:53:29 as long as we are not seeing errors form teh services (timeouts) or longer overall job runs that all we really need to know 16:53:30 *and* it's time-based which is nearly impossible to compare across runs in the gate 16:54:01 sean-k-mooney: right, well, not having it on by default in master yet, we don't really have that large of a sample 16:54:13 we have a few jobs that are already atypical opt-ed into it 16:54:15 dansmith: I'll look at what we get from performance.yaml 16:54:31 anyway, I was good to go default on, but there's definitely risk so we just have to keep that in mind 16:54:48 bauzas: please stop saying performance.yaml :) 16:54:56 it's performance.json dammit :D 16:55:00 my question was more about the fact that given we have less large temporary tables and innodb pool sizes, I'd love to see some mysql insights about botyh 16:55:14 oh f, you're right 16:55:17 pardon my YAML 16:55:28 ... 16:55:38 anyway, we're quite at the end of the meeting 16:55:58 #topic Open discussion 16:56:10 the agenda is free from any item 16:56:16 so, anything to say ? 16:57:22 looks not 16:57:29 thanks all 16:57:34 #endmeeting