14:00:18 #startmeeting nova 14:00:18 Meeting started Thu Aug 11 14:00:18 2016 UTC and is due to finish in 60 minutes. The chair is mriedem. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:19 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:22 The meeting name has been set to 'nova' 14:00:28 o/ 14:00:30 o/ 14:00:30 o/ 14:00:31 o/ 14:00:31 o/ 14:00:34 o/ 14:00:43 o/ 14:00:48 0/ 14:00:49 oj <-- headset emoticon, finishing another call 14:00:51 o/ 14:01:16 dansmith: i thought you were snorkling 14:01:19 * edleafe is shocked to learn that dansmith is oJ! 14:01:30 \o 14:01:49 ok let's get started 14:01:52 #link agenda https://wiki.openstack.org/wiki/Meetings/Nova 14:01:59 #topic release news 14:02:07 #link Newton release schedule: https://wiki.openstack.org/wiki/Nova/Newton_Release_Schedule 14:02:12 #info Aug 22-26: final non-client library release (oslo) - so if you need things released from dependent libraries get them done soon 14:02:19 ^ just another reminder 14:02:26 #info Call for extra ATCs for Newton: http://lists.openstack.org/pipermail/openstack-dev/2016-August/101360.html 14:02:46 if you're looking for ATC credit for summit w/o a commit, see ^ 14:02:54 and let me know 14:03:05 mikal had posted some names in the ML thread and some i was actually surprised by 14:03:09 since i thought they had commits 14:03:23 any questions on the release? FF is 9/2 so we have about 3 weeks left 14:03:37 #topic bugs 14:03:53 gate CI status is OK 14:04:09 the check queue was super backed up yesterday, 500+ 14:04:27 but i'm not aware of any blocking failures 14:04:42 #link 3rd party CI status http://ci-watch.tintri.com/project?project=nova&time=7+days 14:04:48 #info Intel NFV CI is failing on stable branches because it doesn't install Tempest in a venv 14:04:55 wznoinsk: do you have any updates on that? 14:05:33 anyway, they are aware and are working on it 14:05:43 i'm not aware of any critical bugs, 14:05:57 there were two yesterday and fixes were released 14:05:59 we did have a regression with one of the config option cleanup changes reported in channel yesterday 14:06:29 auggy: ok, 14:06:34 this is the one i was thinking of https://bugs.launchpad.net/nova/+bug/1611940 14:06:34 Launchpad bug 1611940 in OpenStack Compute (nova) "vncserver_proxyclient_address changed from stropt to ipopt, breaking backwards compat without deprecation" [High,Fix released] 14:06:40 mriedem: yeah, that was a good catch 14:06:50 I added another for the service.py file for listen_addresses 14:07:06 the ones fixed yesterday were https://bugs.launchpad.net/nova/+bug/1609625 and https://bugs.launchpad.net/nova/+bug/1609691 14:07:06 Launchpad bug 1609625 in OpenStack Compute (nova) "The 'create:forced_host' policy is set to 'rule:admin_or_owner' by default" [Critical,Fix released] - Assigned to Takashi NATSUME (natsume-takashi) 14:07:08 Launchpad bug 1609691 in OpenStack Compute (nova) "Non-admin users can lists VM instances of other projects (tenants) by default" [Critical,Fix released] - Assigned to Takashi NATSUME (natsume-takashi) 14:07:25 here is my patch around that: https://review.openstack.org/#/c/353917/ 14:07:28 ah, ok, so policy slips 14:07:57 johnthetubaguy: cool, i'll look after the meeting 14:08:02 yeah, not sure if some where v2.1 back to beginning of time 14:08:08 mriedem: I'm sorry to be late, yes, the stable branches were affected because of a not-so-great workaround we had in place, it's now removed and the tempest/urllib issue is now gone 14:08:19 wznoinsk: great 14:08:24 thanks for quickly fixing that 14:08:33 mriedem: using a hostname is a bit unreliable (does dns resolve, uses first ip it gets back), but legit and we used to allow, so we still should 14:09:09 johnthetubaguy: yeah so maybe for these cases we should open a bug to eventually deprecate them from using hostname, but give that signal before we switch to IPs 14:09:36 moving on 14:09:38 mriedem: yeah +1, give me the action to sort that out 14:10:08 #action johnthetubaguy to sort out the deprecation plan for changing StrOpt configs to IPOpt for things that allow hostnames today 14:10:18 thanks 14:10:20 #topic reminders 14:10:27 #link Newton review focus list: https://etherpad.openstack.org/p/newton-nova-priorities-tracking 14:10:33 #help https://wiki.openstack.org/wiki/Nova/BugTriage#Weekly_bug_skimming_duty Volunteers for 1 week of bug skimming duty? 14:10:50 looks like next week is open for bug triage 14:11:00 #topic stable branch status 14:11:06 stable/mitaka 13.1.1 was released this week 14:11:18 we have some good fixes lined up for review in stable/mitaka related to shelve bugs 14:11:38 would be nice to get those in before 9/2 so we can do a small patch release 14:11:55 #topic subteam highlights 14:12:01 alaski: cells v2 meeting highlights? 14:12:16 dansmith and I are going to have a chat about grenade testing after this meeting 14:12:26 otherwise we touched on reviews that are up and ready 14:12:47 that was about it 14:12:54 yup, ok 14:12:59 thanks 14:13:06 edleafe: jaypipes: scheduler subteam meeting highlights? 14:13:09 Short meeting this week 14:13:14 Adding eyes on jaypipes's rt-unit-test series, Yinxin's flavor/capability proposal, and cdent's placement DB stuff 14:13:21 That's it 14:13:28 i had a question, 14:13:37 dynamic resource classes 14:13:42 mriedem: yes? 14:13:43 the spec wasn't approved for that 14:13:47 * jaypipes perks up 14:14:05 is it still a thing being pushed on for newton? or is it just a distraction from the placement API stuff at this point? 14:14:21 mriedem: that is the stretch goal for us in Newton, yeah. 14:14:36 mriedem: it's part of the placement API. CRUD operations on resource classes. 14:15:03 so https://review.openstack.org/#/c/312696/ 14:15:10 i see lots of -1s 14:15:19 mriedem: the integration of Ironic node resource class and dynamic resource classes isn't something we're aiming for in Newton, though. The stretch goal is only to add the REST API CRUD operations for user-defined resource classes. 14:15:23 to the placement API. 14:15:30 so https://review.openstack.org/#/c/312696/ is something else then 14:15:49 mriedem: no.. that is the right spec. 14:16:06 heh, ok :) 14:16:08 mriedem: the -1s are for really minor things or not related to the spec frankly. 14:16:21 mriedem: I'd REALLY appreciate a review on that from you and spec cores. 14:16:37 my concern is if time is being spent on a stretch goal when the main thing isn't done yet and we have 3 weeks left 14:16:48 mriedem: perfection enemy of the good and all that... 14:17:01 but i don't know the scope of the work proposed for newton for the CRUD ops and all 14:17:07 anyway, i'm just bringing it up 14:17:14 mriedem: absolutely time is being spent 100% on the non-dynamic resource classes stuff right now. 14:17:32 and the RT unit test cleanups are paving the way for the RT calling the placement API? 14:17:38 sorry i haven't been through all of that yet 14:17:50 mriedem: it's very small scope (maybe 4 patches) that add simple lookup and add/delete ops for resource class strings. 14:18:23 mriedem: yes, absolutely. the rt-unit-tests line things up (and actually most are already merged, only a few left on top of that stack) 14:18:46 the rt-unit-tests set is really an excuse to pad my stats 14:18:46 ok, thanks for the update, i've been off on other things so needed that 14:18:51 and I commend jaypipes for doing it 14:18:52 mriedem: and the final patch in that stack gets rid of the test_tracker.py/test_resource_tracker.py duplication, which is a nice win for clarity. 14:19:14 we can move on 14:19:19 jaypipes: hah, I read that as "a nice win for charity" 14:19:23 heh 14:19:35 tdurakov: any highlights from the live migration meeting this week? 14:20:19 well, 14:20:32 my recollection is most discussion was around live migration CI 14:20:42 mriedem: https://review.openstack.org/#/c/353002/ - merged, so hope we could add live-migration job to voting list soon 14:21:12 tdurakov: ok and rebase https://review.openstack.org/#/c/329466/ 14:21:16 to enable nfs again 14:21:25 yup 14:21:40 alright, thanks 14:21:53 mdbooth: did you want to mention anything about the image backend series? 14:22:27 starts here https://review.openstack.org/#/c/333242/ 14:22:49 my impression is a lot of test re-write to get to some more meat 14:23:03 we can move on 14:23:08 sdague: alex_xu: api meeting highlights? 14:23:31 mostly the merge of the user_id policy spec 14:24:05 though, we're still figuring out exactly what the code will look like 14:24:11 i also started working on https://review.openstack.org/#/c/347514/ yesterday 14:24:19 and alex_xu has a thread about that here http://lists.openstack.org/pipermail/openstack-dev/2016-August/101402.html 14:24:39 i think we'll need to figure out if we're going to do a novaclient release before https://review.openstack.org/#/c/347514/ lands 14:24:52 because it has some backward incompatible changes in it 14:25:01 at least in it's current form 14:25:07 mriedem: that was always the intent right 14:25:14 we talked about it at the midcycle 14:25:18 i'd have to dig up notes to see 14:25:21 we talked about a lot of things... 14:25:22 given the way releases work, we can do that even after the patch lands 14:25:27 true 14:25:27 mriedem: you mean before or after the release freeze for newton? 14:25:38 johnthetubaguy: i meant before that patch lands, 14:25:46 but we can pick the hash so that matters less 14:25:49 ah, gotcha 14:25:54 yeah 14:26:00 it might be a 6.0.0 is what i mean 14:26:39 but i think it's in pretty good shape, just need to cleanup functional tests and test it against mitaka 14:26:58 lbeliveau: was there an sriov/pci meeting this week? 14:27:31 will move on 14:27:35 gibi: was there a notifications meeting? 14:28:12 mriedem: sriov/pci was skipped this week 14:28:17 ok 14:28:39 #topic stuck reviews 14:28:46 https://review.openstack.org/#/c/200870/19, please give your opinion about comment on PS19 (abhishekk) 14:28:55 abhishekk: jaypipes: dansmith: ^ 14:29:07 mriedem: yeah, so... 14:29:10 mriedem: yes, but was quick 14:29:16 melwitt is working on another angle at this, 14:29:18 from the RT side 14:29:26 I will reserve comment until I see that 14:29:34 mriedem: I understand abhishekk and other's concerns on this. 14:29:39 but I'm definitely not okay with the one that's proposed, 14:29:40 and also don't think it's stuck 14:29:59 mriedem: however, like dansmith I don't like the proposed solution. 14:30:13 dansmith: I'm curious, what is melwitt's proposed solution involving the RT? 14:30:26 i haven't read into it so don't know the scope of the problem, but it sounded like 'resource providers fixes this, but that's not ready, and this bug has been open for 2+ years' 14:30:36 jaypipes: like you suggested I will send the mail to ML 14:30:42 jaypipes: just not adding disk to the total we report as used if the instance is volume-backed, like I suggested in one of my comments 14:30:52 jaypipes: i.e. instead of harming the stored data about instances, just changing what is reported 14:30:56 mriedem: that is a good summary, yes. 14:31:10 mriedem: N+ years.. since the beginning of volume-backed instances.. nothing new 14:31:28 dansmith: that doesn't fix the scheduler filers I guess? 14:31:35 dansmith: you mean in the compute node's local_gb and local_gb_used fields, right? 14:31:41 jaypipes: yeah 14:31:45 dansmith: or at least, we need to do RT and when passing to scheduler I guess 14:31:46 johnthetubaguy: it would fix those, yes. 14:31:58 johnthetubaguy: right, fix what we report that the scheduler users 14:31:59 johnthetubaguy: since the scheduler goes from the compute node objects. 14:32:06 jaypipes: but it would request 50 GB from the scheduler, when it actually needs 0 GB? 14:32:27 agreed the reporting of used would be better, so the scheduler has the correct current state, if we fix the RT 14:32:29 johnthetubaguy: the current patch changes what we request, I'd rather change what we report 14:32:48 because what we report is a point in time, what we request exists for the lifetime of the instance 14:32:51 I just think we should do both, (although not the way that patch does it) 14:33:08 why both? that does't help 14:33:20 so agreed about reporting the usage 14:33:22 dansmith: but in the boot from volume case, the requested disk_gb should actually be 0, no? 14:33:26 just changing the rt means false negatives from the scheduler 14:33:31 jaypipes: right, thats the bit 14:33:55 for when you have a cloud with both bfv and non-bfv living next to each other 14:34:26 I'm missing something 14:34:43 johnthetubaguy: right, and the alternative is essentially to create two different flavors, one for bfv cases and the other, with the increased disk_gb request, for non-bfv cases, right? 14:35:08 jaypipes: well, thats related to that spec we didn't get merged about having flavors that say BFV only 14:35:24 dansmith: the request to the scheduler needs to have disk_gb = 0 or it may filter out computes that can host a BFV instance 14:35:27 dansmith: so the request to the scheduler, current it says 50GB rather than 0GB 14:35:27 there's been a feature requested for a while that we restrict the size of a root volume to what the flavor says you can have 14:35:39 and if we destroy the data we store about the flavor booted, we can't do that 14:35:55 dansmith: if I boot a flavor that has 50GB local_gb in the request (spec), but use a boot-from-volume operation, that 50GB should be 0GB. 14:36:00 alaski: I see 14:36:08 alaski: is that in the reqspec though? 14:36:21 it is, but it's put there from the flavor 14:36:37 if the conversion can happen there that would work 14:36:42 yeah, needs to check for the bfv case 14:37:08 what is requested is zero, but that's not related to the flavor from which it was booted, which might have further requirements 14:37:22 like, how big of a volume it creates or allows, etc 14:37:23 guh, we should just use containers and all our problems would go away. ;P 14:37:35 jaypipes: or rewrite in Go 14:37:39 * cdent gives jaypipes a cookie 14:38:19 dansmith: yeah, that would work. just need that and the proper rt reporting 14:38:21 dansmith: what is requested (in the request spec) is the *flavor's* local_gb, though, which if you boot-from-volume, should actually be ignored since the boot-from-volume case doesn't consume that space on disk. 14:38:39 jaypipes: currently, right 14:38:56 jaypipes: but I'm saying the reqspec _should_ be the things we're asking the scheduler to find for us, 14:38:59 dansmith: so I agree with johnthetubaguy that we need to fix both the request spec and the usage reporting in the RT. 14:39:03 which in the bfv case should be zero 14:39:18 dansmith: agreed. 14:39:30 but storing the flavor on the instance with a zeroed out disk doesn't make sense to me 14:39:40 dansmith: why not? 14:39:47 jaypipes: this is the thing you're -2d for right? 14:40:09 yes, but I -2'd it because in the resource providers world, all this accounting changes dramatically. 14:40:42 dansmith: FWIW, I hate that too, I just didn't see this alternative till after you pointed it out 14:40:49 dansmith: because in the r-p world, there simply won't be a requested amount of DISK_GB resources. 14:40:57 jaypipes: right 14:41:25 dansmith: a non-existing record in the request in the r-p world is different from a request for 0 amount of local_gb, that's all I mean.. 14:41:39 when we create a volume for a nfv request, don't we use the flavor size to create the volume? 14:41:51 dansmith: lol, bfv, not nfv. :) 14:41:58 hah 14:42:05 when we create a volume for a bfv request, don't we use the flavor size to create the volume? 14:42:09 we don't use the flavor size at all in bfv 14:42:10 dansmith: and no, I don't believe we use the flavor size. 14:42:18 what size do we use? 14:42:21 the size is passed in on the block_device_mapping 14:42:23 so I think the request spec conversion just ensures 0GB, and the RT reports 0GB usage when its a BFV thingy, isn't that what we want here? I like not hacking the stored flavor, if possible 14:42:26 it's passed in as part of the request 14:42:35 dansmith: we optionally use the flavor size I thought 14:42:41 like if you don't give a size 14:42:43 root_gb makes zero sense in bfv 14:42:43 johnthetubaguy: yeah, I thought so 14:43:01 I mean I hate it, I just thought thats what we did 14:43:09 alaski: unless you use it to prevent a user from booting a big root disk from a tiny flavor, 14:43:13 which is a request we've had outstanding for a while 14:43:24 alaski: i.e. don't boot an m1.tiny with a 25T root disk volume 14:43:35 I disagree with it for bfv though 14:43:45 but we can take that offline 14:44:20 I disagree with bfv 14:44:22 in general 14:44:29 +1 14:44:54 so, a decision needs to be made in the case of abhishekk's patch. I think I'd be OK letting this patch in -- possibly with an addition from melwitt around the RT accounting of things (if needed). And putting a giant TODO(jaypipes): Remove all of this in r-p world. To indicate it will all be ripped out when resource providers comes around. 14:45:04 well I disagree with shared storage and bfv being a thing, we should have one, but that ship sailed 14:45:07 why do that though? 14:45:18 we'll end up with some instances with different data for the flavor 14:45:21 for like six months, 14:45:26 and then we merge the real fix 14:45:32 this has been this way for YEARS 14:45:34 so we don't need to persisit it 14:45:35 i thought the agreement was (1) fix request spec and (2) fix RT 14:45:49 we could just convert it in request spec and RT, for the short term 14:46:41 i'm not seeing in the compute api where we ues the flavor size for a volume bdm 14:46:43 dansmith: right, that's what we're saying. fix the request spec and fix the RT to report the compute_node's usage "properly" when a BFV instance is on it (i.e. don't use flavor.root_gb in the usage calcs on the RT for BFV instances_) 14:46:53 dansmith: and do NOT change the stored flavor.root_gb value. 14:47:01 i even see 14:47:01 mriedem: me neither 14:47:01 mriedem: if we can just convert it in the reqspec then I'm cool with that, but if that ends up storing zero disk in the flavor I'm unhappy aboutthat 14:47:02 # Target disk is a volume. Don't check flavor disk size because it 14:47:02 # doesn't make sense, and check min_disk against the volume size. 14:47:12 jaypipes: if we can do that then that's fine 14:47:37 i only see flavor size used for ephemeral and swap bdms 14:48:17 mriedem: well, if that's the case, we only need to fix the RT. 14:48:25 BUT 14:48:28 this is bdm code we're talking about 14:48:37 oh fuck. 14:48:40 so there is probably an OR or a default value somewhere 14:48:41 never mind then ;) 14:49:04 we could, you know, just actually test the gd scenario and see what happens :) 14:49:55 bingo 14:49:56 def _volume_size(instance_type, bdm): 14:49:56 size = bdm.get('volume_size') 14:49:56 # NOTE (ndipanov): inherit flavor size only for swap and ephemeral 14:50:40 but let's move on, sounds like there is a path forward, 14:50:50 hi all, could you please add your opinion on the patch as a comment, it will be helpful 14:50:56 with maybe an open question about flavor inherited size for bfv, maybe mdbooth or ftersin knows, but we could also test it 14:51:34 #topic open discussion 14:51:40 there wasn't anything on the agenda 14:51:43 does anyone have anything? 14:52:19 ok let's wrap up then 14:52:22 thanks everyone 14:52:25 #endmeeting