14:00:46 #startmeeting nova 14:00:47 Meeting started Thu Feb 6 14:00:46 2020 UTC and is due to finish in 60 minutes. The chair is efried. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:48 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:51 The meeting name has been set to 'nova' 14:00:57 \o (in 1:1 meeting also) 14:01:00 o/ 14:01:02 o/ 14:01:21 o/ (in TC meeting also ) 14:02:20 #link agenda https://wiki.openstack.org/wiki/Meetings/Nova#Weekly_Nova_team_meeting 14:03:19 o/ 14:03:21 #topic Last meeting 14:03:21 #link Minutes from last meeting: http://eavesdrop.openstack.org/meetings/nova/2020/nova.2020-01-30-21.00.html 14:03:23 o/ 14:03:31 We had some action items from last time... 14:03:41 * efried sean-k-mooney to do the train implemented specs business 14:03:54 damit ill do that now 14:04:03 :) Thanks 14:04:17 * efried lyarwood to propose train & stein releases 14:04:33 ^ is done and released 14:04:34 this got done. I think I saw both merged today. 14:04:45 yup 14:04:45 Thanks lyarwood 14:04:45 np 14:04:46 * efried lyarwood to curate rocky EM list 14:05:16 ^ this hasn't started due to stable/rocky being blocked by tempest issues 14:05:22 once these pass I'll get on with this 14:06:27 great, thanks. 14:06:41 #topic Bugs (stuck/critical) 14:06:41 No Critical bugs 14:06:53 However, the other bug numbers (see project heartbeat) have been creeping up 14:06:59 #help triage bugs! 14:07:01 can we close this bug fix https://review.opendev.org/#/q/status:open+project:openstack/nova+branch:master+topic:bug/1663456 14:08:12 stephenfin: that ^ was where dansmith had jumped in and redirected at some point, right? 14:08:23 so he should probably be the +A? 14:09:21 brinzhang: let's follow up on that in the main channel when dansmith is about. 14:09:52 #topic Reminders 14:09:52 SPEC FREEZE IS NEXT WEEK 14:09:59 yeah, that need get dansmith's comments 14:10:34 Last week I scrubbed the 14:10:34 #link ussuri blueprints https://blueprints.launchpad.net/nova/ussuri 14:10:35 and the specs I could find 14:11:06 #link etherpad (feel free to leave notes etc.) https://etherpad.openstack.org/p/nova-ussuri-planning 14:11:53 I've been making noise about scrubbing the list once we pass the milestone. 14:12:40 As a reminder, this would be an exercise where we make up *some* kind of criteria to pick which blueprints we're going to pursue and which we're going to defer for Ussuri. 14:13:33 Right now we have 41 blueprints against ussuri, of which seven are already implemented. 14:13:59 IMHO, given the size of the team, that's too many to land 14:15:02 but 20 are still not design:approved at this point, so I'm guessing we'll be starting with a smaller number anyway. 14:15:25 Probably a good idea to set up a spec review day for early next week. Tuesday work for everyone? 14:15:33 so 21 are design approve and 7 of those are done 14:15:41 yes 14:15:42 leaveing 14 remaining 14:15:53 design approve means spec not merged right? I always get confused with LP mapping 14:16:01 *design not approve 14:16:24 sorry gmann, it's a new thing. Yes, when we merge the spec we change the blueprint to design:approved. 14:16:51 efried: count me on it 14:16:53 for specless blueprints we can change to design:approved once we have agreement on the *how*. 14:17:18 direction:approved speaks to the *when* 14:17:23 https://review.opendev.org/#/c/706276/ is the review to update the implemented spec in train by the way 14:17:23 #action efried to ML about spec scrub day 14:17:25 efriedok, yeah you old last time also but somehow i forget 14:17:31 there is a spec, I think it's ready to go, I sended ML of this spec https://review.opendev.org/#/c/623120/, ML:http://lists.openstack.org/pipermail/openstack-discuss/2020-February/012352.html 14:17:32 thanks sean-k-mooney 14:18:25 Probably given who's here (and not here) it'll be best to take specific spec review requests/discussions offline. 14:18:50 +1 14:19:30 anything else, process-wise, about spec freeze? 14:19:47 Rocky EM (lyarwood) 14:19:47 #link open rocky patches https://review.opendev.org/#/q/status:open+(project:openstack/os-vif+OR+project:openstack/python-novaclient+OR+project:openstack/nova)+branch:stable/rocky 14:19:47 we already talked about this. 14:20:00 #topic PTG/Summit planning 14:20:00 Please mark **attendance** and topics on 14:20:00 #link PTG etherpad https://etherpad.openstack.org/p/nova-victoria-ptg 14:20:35 I've been asked to fill out the survey thingy to let the planners know how much space we're going to need, so please do at least the attendance bit of that ^ as soon as you can. 14:21:56 Tough to say based on the shanghai PTG, but how big a room do folks think we'll need? 14:22:41 I think at least 15 people sits :P 14:23:05 agree; that would seem to be the bare minimum to me. 14:23:24 it was more dynamic attendance also, so expected +5 should be counted 14:23:35 that would cover the active memebers barely 14:23:39 agree 14:24:22 if we want to be able to have a cross team meeing in the nova room 14:24:26 the layout options jump from 20 to 40, which is weird, I would expect us to be somewhere in between at max capacity. I guess I'll ask for the bigger one. 14:24:42 yeah, good call 14:24:45 okay, moving on. 14:24:47 #topic Sub/related team Highlights 14:24:47 Placement (tetsuro) 14:25:10 I think the only thing going here is consumer types, which tetsuro and melwitt have been working through. 14:25:46 There was some discussion about whether we should try to close the gaps in anti-affinity expressiveness so we can cover a few remaining corner cases in NUMA modeling. 14:26:03 but I think we agreed that could be deferred, right sean-k-mooney bauzas? 14:26:27 correct, PCI devices affinity can't be done in Ussuri 14:26:30 we could defer if we keep the numa toplogy filter to check the numa of numa nodes 14:26:31 (at least from me) 14:26:39 for now we'll use group_policy=none and let the NTF do the anti-affinity 14:26:49 yep 14:26:51 we're not talking about devices yet. 14:26:58 okay 14:27:07 API (gmann) 14:27:07 gmann did not send the updated report this week. Last updates on ML: http://lists.openstack.org/pipermail/openstack-discuss/2020-January/012260.html 14:27:11 i did not get chance to send updates on ML. But working on few API spec review and policy work. 14:27:26 i will appreciate review on this - #link https://review.opendev.org/#/q/project:openstack/nova+branch:master+topic:bp/policy-defaults-refresh 14:27:44 changes are easy and on same line as first set of os-service API merged 14:28:12 I will be pushing more API changes soon once bring up the stable gate up 14:28:19 that's all from my side 14:28:22 gmann: that review of the code not the spec right 14:28:28 the design is already approved 14:28:34 sean-k-mooney yes, code 14:28:38 cool 14:29:05 first set of changes also merged with agreement with john and how we should do for other APIs 14:29:57 ready to move on? 14:30:09 #topic Stuck Reviews 14:30:18 [huaqiang] the spec 'mixed cpu policy instance spec' still needs more input 14:30:19 #link Use PCPU and VCPU in one instance : https://review.opendev.org/#/c/668656/ 14:30:19 placement-ess style interface ('resources:(P|V)CPU') vs legacy flavor style (eg: 'hw(:|_)numa_pinned_cpus') interface through a CPU mask 14:30:35 we've been back and forth on this one. 14:30:38 I think that needs dansmith to resolve 14:30:39 stephen gives a -1 14:31:03 he recomends the legacy flavor style interface 14:31:16 it does? 14:31:35 by the way, I thought the design work for NUMA topo brought out some great examples of why we should prefer the hw: style over the placement syntax. 14:31:36 dansmith: Yup. fwict, you're asking for us to mix placement-style syntax with traditional extra specs. I absolutely don't want that 14:32:33 sounds like there's no room for compromise, and I'm not even -1 on that spec, so .. not much to discuss I guess? 14:33:13 I was guessing you'd a good reason for preferring the thing I don't like 14:33:33 well at one point supporting both was an option. 14:33:41 Yup, I'm fine with that too 14:33:52 ok for support both too 14:33:52 just not bits of one style and bits of another 14:33:53 the consensus on the NUMA topology spec is to *not* ask for placement-ished flavor syntax 14:34:03 yes 14:34:10 we'll provide a translation mechanism 14:34:12 but 14:34:31 the question remains, should we block flavors already expressing placement-esque ? 14:34:56 let's figure that out in that spec 14:34:56 because I'd like to see the same agreement for huaqiang's spec 14:35:04 oh, okay 14:35:11 I don't especially have a problem if the code supports the placement-ese syntax, but IMO we should not encourage it. IOW we should document the other way for any given feature. 14:35:15 i think that is a seperate question. i spoke to dansmith about this and i think long term no we dont block 14:35:24 but i would not encurage the use of groups 14:35:45 if we say that operators are enough grown-up for using placement-esque syntaxes, then we aslo agree that the translation mechanism is just for legacy reasons 14:35:47 and we need to consider evolving the current support to add a translation layer for the plamcenet syntax 14:36:05 I've no objections to someone doing e.g. '... --propety resources:PCPU=N --property resources:VCPU=M' 14:36:11 right, that's why I tie the two specs together 14:36:21 not sure how useful it is but if people want it, why not 14:36:38 we're talking of flavor extra specs 14:36:43 yes 14:36:55 the only point i object to for placement syntax is using it to expres toplogy via groups 14:36:56 but I disallowed '--property hw:cpu_policy= --property:(V|P)CPU=' for cpu-resources 14:37:06 and I still think disallowing that makes sense 14:37:06 stephenfin: if you're okay with validating the craziness of it with your own spec, then okay :) 14:37:06 my major complaint is that we have a hundred bespoke DSLs encoded into 255 characters with little consistency, and that we are (IMHO) reversing course on the preferred way to specify those things. I want it to be highly abstract and consistent, and placement syntax is at least a step in that direction 14:37:33 but I think that's the thing that most of you actually want, because it's easier on *us* instead of the *users* 14:37:42 I think we have a way to restrict the fooliness of extra specs with stephenfin's spec about flavor validation 14:37:44 it's the abstract-ness I have a problem with, exactly because it's harder on the users. 14:37:55 yes, well, we disagree obviously 14:38:17 dansmith: what's easier on us? the legacy extra spec stuff? 14:38:20 honestly, the current NUMA extra specs are a poor DSL 14:38:27 anyway, I said _many_ times in the original conversation that I would not block on this, and haven't even lodged a -1 so I'm not sure why we're talking about this 14:38:33 I'd prefer placement-esque syntax 14:38:34 with hw:* only *we* have to understand the shape of the provider model. With resources:* etc, the *user* would have to understand it. And how placement works. 14:38:38 butg 14:38:44 DSL is an abstract which is better than placement synatx 14:38:56 numa vs a set of request groups... 14:39:08 but, between a standard alloc candidates query and a larger one asking for in_tree and member_of, there is a big change 14:39:12 ah, then we can move on. I just didn't want huaqiang rewriting stuff only to have to undo it again 14:39:22 ++ 14:39:23 we can move on, but the question remains 14:39:50 seems I can move on now... 14:39:57 and I'd appreciate we get a consensus on this either in the NUMA topology spec or in the huaqiang's spec 14:40:01 what about the bitmask? 14:40:17 huaqiang: ^ that is still question? 14:40:25 we are goign to support that right? 14:40:31 alex_xu: I _think_ that's what huaqiang is suggesting still, yeah 14:40:40 ok cool 14:40:45 I like the cpu mask solution 14:40:57 agree 14:41:12 yea, me too, just want to ensure 14:41:25 that is consitent with the numa toplogy in plamcent spec too 14:41:50 we shoudl revisit the question of placement syntax as a ptg topic i think 14:42:09 good idea 14:42:21 actually sean-k-money offers a good comments on cpu mask interface I'll follow that 14:43:03 efried: i thnk we can move to the next topic 14:43:09 #topic Open discussion 14:43:09 https://blueprints.launchpad.net/nova/+spec/config-tsc-freq 14:43:09 https://etherpad.openstack.org/p/invtsc 14:43:13 umbSublime: ^ 14:43:22 proposal is to add the ability to expose an invariant tsc cpu flag and specify the frequency to allow live migration. 14:43:22 we previously discussed this and said it could be a specless blueprint if the blueprint content was sufficient. do we agree that it is? 14:44:10 so looking the blueprint it has nto been updated yet but the proposed text is in the etherpad 14:45:28 umbSublime: could you summerise you usecase for this 14:46:16 Yes, in our openstack platform we host many game servers, some only run on windows VMs and require this flag to be present. If it isn't the process either crashes or the game server is really slow because it relies on another clock 14:47:11 am i correct in saying you currently work around this with the cpu_extra_flags config option 14:47:33 but that approch causes issues since you cannot live migrate for maintance 14:47:58 My work-around it to manually virsh edit the VM after it's lauched, but obvisouly that doesn't persist any operations and I also loose the ability to live-migrate 14:48:29 this seems like a straightforward spec.. does it need discussion? 14:49:04 yeah, that's need a spec 14:49:11 we had said it could be adress specless if we agreed the blueprint contianed the enough info 14:49:18 back in november 14:49:48 but if it needs a spec it would noe be hard to submit it and fill out the remaining parts of the template 14:49:54 it adds two new extra_specs, image props, and some virt specific behavior..seems like a spec to me 14:50:22 agreed 14:50:44 ok umbSublime if that is ok with you ill help you prepare that 14:51:09 sure this is fine with me 14:51:10 only allow to migrate on hosts with same TSC frequency? 14:51:22 #agreed need spec for https://blueprints.launchpad.net/nova/+spec/config-tsc-freq 14:51:38 huaqiang: I think so 14:51:41 huaqiang: isn't the tsc frequency specified in the flavor/image? 14:51:55 huaqiang: if you need to inspect something about the host like that, it will be a much larger effort 14:52:25 dansmith: I don't need it, just curious about the answer. 14:52:41 some CPUs with differnt model has same TSC 14:52:51 huaqiang: not the tsc frequecse is not the host tsc frequence 14:52:58 its the emulated frequency 14:53:01 huaqiang: I'm confused.. what happens if you migrate to a host with a cpu that isn't compatible? 14:53:04 and how do you know? 14:53:26 KVM does some math based on the TSC value presented to the VM and the one on the host. to present an invariant TSC to them VM even after migrating to another host with different base freq 14:53:31 sean-k-mooney: right, I thought it was pure emulation 14:53:42 right, okay... 14:53:47 well its usign https://lkml.org/lkml/2015/9/28/15 14:54:32 anyway we can call that out in the spec i guess 14:54:39 Note that time is short here. In order to have a shot, I think the spec should be ready for serious review by start of business Tuesday for the review push 14:54:48 Any other open topics before we end? 14:55:10 Okay, thanks all. 14:55:10 o/ 14:55:10 #endmeeting