14:00:13 #startmeeting nova
14:00:15 Meeting started Thu Jul 13 14:00:13 2017 UTC and is due to finish in 60 minutes. The chair is mriedem. Information about MeetBot at http://wiki.debian.org/MeetBot.
14:00:17 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
14:00:19 The meeting name has been set to 'nova'
14:00:24 \o
14:00:25 o/
14:00:25 \o
14:00:28 o/
14:00:29 * ildikov is lurking
14:00:36 \o
14:01:00 alright let's get started
14:01:03 #link agenda https://wiki.openstack.org/wiki/Meetings/Nova#Agenda_for_next_meeting
14:01:09 #topic release news
14:01:14 #link Pike release schedule: https://wiki.openstack.org/wiki/Nova/Pike_Release_Schedule
14:01:21 #info July 27 is feature freeze (2 weeks away)
14:01:33 #info July 17 (next week) is the release freeze for non-client libraries (oslo, os-vif, os-traits, os-brick): https://releases.openstack.org/pike/schedule.html#p-final-lib
14:01:50 * gibi sits in a bit late
14:02:07 i haven't gone digging yet, but if we have any changes to os-vif or os-traits that are going to be dependencies for other work within nova, please let me know asap so i can monitor for a final release
14:02:22 none I saw for os-traits
14:03:03 ok
14:03:10 I have a question
14:03:13 shoot
14:03:17 "feature"
14:03:27 meaning things that affect functionality or change APIs or whatnot?
14:03:34 Not just "you can't drop code for a blueprint after this date"
14:03:43 efried: as in "bug fixes still ok"
14:03:51 Cause the service catalog endpoint stuff may be tight.
14:04:05 efried: doesn't that rely on a ksa release?
14:04:16 Yes, we're still waiting on a big pile of stuff from ksa, sta, and the newly-minted os-service-types
14:04:21 ksa is a non-client lib so that would have to get released next week
14:04:33 mordred: ^
14:04:41 efried: which blueprint is that holding up?
14:04:54 service catalog for endpoints - lemme get the name...
14:05:02 https://blueprints.launchpad.net/nova/+spec/use-service-catalog-for-endpoints
14:05:10 yuh
14:05:58 so how about we talk with mordred about the state of things after the meeting?
14:06:18 Sure. Been working closely with mordred, and I know we're close. But yes.
14:06:21 #action mriedem and efried to talk to mordred about the final ksa release for next week
14:06:36 remind me if i don't follow up :)
14:06:41 rgr
14:06:48 #info Blueprints: 65 targeted, 63 approved, 29 completed (+3 from last week)
14:06:55 so we're making some progress,
14:07:20 and there are at least about 5 other bps i'm watching that are very close to done
14:07:41 maybe i'll send those to the ML after the meeting to focus our attention on pushing those over the line
14:07:48 yeah good idea
14:07:59 if some are close to being merged, gtk
14:07:59 mriedem: this one can be closed or after the pythonclient change - https://blueprints.launchpad.net/nova/+spec/fix-quota-classes-api
14:08:01 #action send list of close to done bps to ML after the meeting
14:08:14 gmann_: after the client change
14:08:18 ok
14:08:22 i don't consider the api microversion ones done until the novaclient patch is also merged
14:08:46 ok any other questions about the release?
14:08:56 i know it's tight and we have a lot of work yet to do
14:09:02 so just put a pot of coffee on
14:09:19 #topic bugs
14:09:32 there are no critical bugs
14:09:37 #help Need help with bug triage; there are 115 (+6 from last week) new untriaged bugs as of today (July 13).
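[Editor's note: background on the use-service-catalog-for-endpoints blueprint discussed above. The idea is to have Nova resolve other services' endpoints through keystoneauth1 (ksa) and the service catalog instead of hand-maintained URL configuration. A minimal sketch of that style of lookup, assuming an already-working Keystone; the credentials, service_type, and region values here are illustrative placeholders, not the blueprint's final configuration:

    # Minimal sketch of ksa-based endpoint discovery. All literal values
    # below are placeholders for illustration only.
    from keystoneauth1 import adapter, session
    from keystoneauth1.identity import v3

    auth = v3.Password(auth_url='http://keystone/identity/v3',
                       username='nova', password='secret',
                       project_name='service',
                       user_domain_id='default', project_domain_id='default')
    sess = session.Session(auth=auth)

    # The Adapter resolves the endpoint from the service catalog rather
    # than from a hardcoded URL in nova.conf.
    cinder = adapter.Adapter(session=sess, service_type='volumev3',
                             interface='public', region_name='RegionOne')
    print(cinder.get_endpoint())  # the URL as published in the catalog

The non-client-library freeze matters here because any new ksa behavior Nova depends on has to land in a ksa release before July 17.]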
14:10:17 gate status is mostly ok
14:10:25 there were some things yesterday with zuul but infra sorted it out
14:10:35 no news for 3rd party ci
14:10:51 #topic reminders
14:10:58 #link Pike Review Priorities etherpad: https://etherpad.openstack.org/p/pike-nova-priorities-tracking
14:11:10 i don't know how realistic that is at this point
14:11:22 #link Consider signing up for the Queens PTG which is the week of September 11 in Denver, CO, USA: https://www.openstack.org/ptg
14:11:44 #topic stable branch status
14:11:51 #link stable/ocata: https://review.openstack.org/#/q/status:open+project:openstack/nova+branch:stable/ocata,n,z
14:11:54 is there a ptg planning etherpad yet?
14:11:56 #link stable/newton: https://review.openstack.org/#/q/status:open+project:openstack/nova+branch:stable/newton,n,z
14:11:58 cdent: nope
14:12:04 should we intentionally wait?
14:12:12 (so as not to be distracted)
14:12:14 that's very early
14:12:21 cdent: that's why i haven't created one yet
14:12:24 +1 on too early
14:12:27 ✔
14:12:30 for stable branches, I did a bit of review for ocata
14:12:31 my focus is about hour to hour right now :)
14:12:33 yeah, lots of time
14:12:46 bauzas: yeah i saw that, thanks
14:12:49 do we need some point release soon?
14:12:57 i wasn't planning one
14:13:01 we've done several this release already
14:13:13 we'll do another one around the rc1 i suppose
14:13:13 okay, so maybe after pike-3 I guess
14:13:18 yeah ok
14:13:21 so nothing urgent
14:13:29 i don't really have other news except they are finally killing off mitaka
14:13:35 infra is removing jobs, etc
14:13:45 yeah there was a thread
14:13:47 and the stable/mitaka branch is gone
14:14:08 ok moving on
14:14:14 #topic subteam highlights
14:14:28 i'll cover cells v2 as dan is not around this morning
14:14:36 #help Need reviews on quotas https://review.openstack.org/#/c/416521/ and fixing server external events https://review.openstack.org/#/c/445142/
14:14:52 i've spent a good chunk of 2 days reviewing the bottom quotas change
14:14:56 and talking through things with melwitt
14:15:01 so she has some stuff to update in there
14:15:13 the quota thing is a bit frightening :(
14:15:19 that bottom one is the change to start counting instances, which is the last reservable resource we have left
14:15:25 yes so it's complicated
14:15:35 and melwitt has done an awesome job working through this and being patient
14:16:03 the other patch there is another dependency for multi-cell,
14:16:05 sure, my main question is how we could raise our confidence in that patch
14:16:05 and a bug fix,
14:16:09 looks like that needs to be rebased
14:16:16 bauzas: you could review it for one :)
14:16:22 mriedem: I began
14:16:35 I have some comments but nothing uploaded yet
14:16:35 and get it merged so we have time to bake it in and flush out issues
14:17:03 i would love to see someone do a rally comparison before and after that patch to see if performance is impacted much
14:17:23 dtp was looking for things to help with, maybe he could do that
14:17:30 mriedem, efried (yes to ksa discussion)
14:17:40 #action mriedem to talk to dtp about running rally against the bottom quotas change
14:17:50 mriedem: could we see the quota patch being tested live ?
14:18:00 bauzas: what does 'live' mean?
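[Editor's note: the kind of "live" end-to-end check bauzas asks about here gets spelled out a few lines below: pull the counting-quotas series down and run many resizes in a row under a single tenant, watching for leaked quota. A rough novaclient sketch of such a probe; IMAGE, FLAVOR_A, FLAVOR_B, PROJECT_ID, and sess are placeholders, not anything from the meeting:

    # Hypothetical endurance probe for quota leaks: resize one server
    # back and forth in a single tenant and watch the reported usage.
    import time
    from novaclient import client

    nova = client.Client('2.1', session=sess)  # sess: keystoneauth1 Session

    server = nova.servers.create('quota-probe', IMAGE, FLAVOR_A)
    while nova.servers.get(server).status != 'ACTIVE':
        time.sleep(5)

    for i in range(20):
        nova.servers.resize(server, FLAVOR_B if i % 2 == 0 else FLAVOR_A)
        while nova.servers.get(server).status != 'VERIFY_RESIZE':
            time.sleep(5)
        nova.servers.confirm_resize(server)
        usage = nova.quotas.get(PROJECT_ID, detail=True)
        # in_use for cores/ram should settle back to a steady state
        # each iteration; steady growth here would indicate leaked quota.
        print(i, usage.cores, usage.ram)

Because tempest gives every test its own tenant, this is exactly the leak pattern a normal dsvm run cannot surface, as mriedem explains below.]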
14:18:10 it's tested in our ci like everything else
14:18:11 some end-to-end testing
14:18:14 anyway
14:18:24 let's not derail the meeting
14:18:32 the thing that worries me the most about quotas test coverage,
14:18:47 is that tempest uses a separate tenant for every test
14:19:02 so if we start leaking quota, we won't find that in a normal tempest dsvm run
14:19:10 especially for complicated things like resize
14:19:14 :/
14:19:16 so,
14:19:34 something someone could do is pull that change down and do like 20 resizes in a row or something
14:19:37 and find out if we blow up quota
14:19:59 or, push a DNM patch that does that with tempest
14:20:08 that's the end-to-end testing I had in mind
14:20:10 just run the resize tests 10 times in a row with the same tenant
14:20:12 mriedem: by separate tests you mean separate quota tests?
14:20:23 but I could try to devstack it
14:20:39 gmann_: no, the tenant isolation in tempest
14:20:46 quotas are counted against the project
14:20:52 so each test has its own project and quota
14:21:14 so if we're leaking quota, we won't notice because we aren't using a single tenant through the entire tempest run
14:21:36 gmann_: are you aware of any ci jobs that run tempest without tenant isolation? i know there used to be a periodic job that did that
14:21:54 the novaclient functional tests also do that, fwiw
14:22:08 it used to be a serial job which used a single tenant
14:22:09 so we could point a dnm novaclient change at that series and see what happens
14:22:22 the novaclient functional test is a serial job with a single tenant
14:22:25 so we can start with that
14:22:48 #action run a DNM novaclient patch against dan's fleetify devstack change which depends on the quotas series to see if we're leaking quota at all
14:23:07 i still think we need to do something like resize 20 times in a row
14:23:10 mriedem: but we have deprecated that thing, and we can do it via an accounts file which has pre-defined tenants to run tempest against
14:23:36 but let's move on - if we're worried about this stuff, we have things we can do, like reviews, rally testing and endurance testing with a single tenant
14:23:37 mriedem: writing a novaclient change that would do 20 resizes ?
14:23:53 bauzas: yes maybe
14:23:57 mmm
14:23:57 moving on
14:24:10 i will rebase and address comments on the services/hypervisors API uuid changes
14:24:24 thanks to alex_xu for the thorough reviews there
14:24:30 np
14:24:32 #info Started an etherpad to track remaining TODOs for Pike (like docs gaps): https://etherpad.openstack.org/p/nova-pike-cells-v2-todos
14:24:45 i should throw the test ideas up in ^
14:25:10 ok moving on to scheduler
14:25:11 edleafe:
14:25:15 We briefly discussed the spec amendment regarding picking up the Ironic flavor migration work that jroll was going to do. In that discussion, dtantsur brought up his work to make devstack use custom resource classes in Ironic by default. Patches to use Traits with AllocationCandidates have also been pushed by alex_xu.
14:25:20 We then spent the next half hour discussing jaypipes's patch for claiming in the scheduler, and the implications of those changes. We managed to achieve harmony, then held hands and sang folk songs afterwards.
14:25:24 Finally there was a discussion about only implementing half of the proposed change to the scheduler for claiming, and that until that is made, complex resource providers cannot be supported.
Since that's not happening until Queens, it was agreed that it was fine to change the method signatures twice to reflect that support for complex RPs would not happen in Pike.
14:25:29 EOM
14:26:26 ok
14:26:58 for those wondering,
14:27:07 the current focus for that series is starting here https://review.openstack.org/#/c/482381/
14:27:14 #link claims in scheduler series is here now https://review.openstack.org/#/c/482381/
14:27:39 ok moving on to api
14:27:41 not really, nope ?
14:27:46 alex_xu: anything to bring up here?
14:27:49 Talked about the 'PUT /os-services' API; agreed that the API should be idempotent
14:27:57 #link https://review.openstack.org/#/q/status:open+project:openstack/nova+branch:master+topic:bp/api-no-more-extensions-pike
14:27:58 mriedem: https://review.openstack.org/#/c/482382/ is the bottom one now
14:28:02 AFAIK
14:28:12 anyway
14:28:42 need review help for api-no-more-extensions-pike, they are easy, just need one more core reviewer
14:28:54 that's all
14:28:54 bauzas: the dependencies are mixed up for that series
14:29:04 and then jaypipes almost committed suicide last night attempting to deal with the awfulness of the scheduler.
14:29:34 #action mriedem to create a scheduler hotline for people working on it
14:30:06 ok moving on to notifications
14:30:08 gibi: you're up
14:30:14 * edleafe has an idea for a new market for Valium
14:30:39 We discussed the last pieces of the additional-notification-fields-for-searchlight bp and as a result the mandatory part of the BDM piece has already been merged.
14:30:42 The tag support at instance create is still ongoing and in focus.
14:30:45 Another patch in focus is a bugfix for https://bugs.launchpad.net/nova/+bug/1684860 https://review.openstack.org/#/c/475276/
14:30:46 Launchpad bug 1684860 in OpenStack Compute (nova) "Versioned server notifications don't include updated_at" [Undecided,In progress] - Assigned to Takashi NATSUME (natsume-takashi)
14:30:48 Most of the notification transformation patches need a rebase due to the BDM addition mentioned above.
14:30:53 that is all
14:31:11 takashin: fyi on https://review.openstack.org/#/c/475276/
14:31:35 mriedem: Thank you.
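[Editor's note: on the API subteam point above that 'PUT /os-services' should be idempotent: the requirement is that replaying the identical request converges on the same state and the same response, rather than erroring or changing anything further. A hedged illustration; the /os-services/<uuid> URL form and body shape follow the services API rework then under discussion and are assumptions here, not the merged microversion, and NOVA_API, SERVICE_UUID, and AUTH_HEADERS are placeholders:

    # Hedged illustration of idempotent PUT semantics for os-services.
    import requests

    body = {'status': 'disabled', 'disabled_reason': 'maintenance'}
    responses = [
        requests.put(NOVA_API + '/os-services/' + SERVICE_UUID,
                     json=body, headers=AUTH_HEADERS)
        for _ in range(2)
    ]
    # Idempotent: the second PUT succeeds and reports the same result.
    assert responses[0].status_code == responses[1].status_code == 200
    assert responses[0].json() == responses[1].json()
]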
14:31:50 yeah and https://review.openstack.org/#/c/469800/ is on my short list of bps
14:31:57 that's the tags at server create one
14:32:14 i know alex_xu was +2 on it at one point
14:32:26 ok moving on
14:32:29 cinder stuff
14:32:36 needs reviews: https://review.openstack.org/#/q/status:open+project:openstack/nova+topic:bp/cinder-new-attach-apis+branch:master
14:32:51 specifically the swap volume change has a +2 https://review.openstack.org/#/c/456971/
14:33:00 so it would be cool to see another core get to that one
14:33:11 Cinder dependencies are all merged for the patches in the chain
14:33:18 stvnoyes is working on some grenade testing
14:33:27 hyper-v live migration is passing with the new flow
14:33:39 need testing from the xenapi team at Citrix
14:33:40 XenAPI is confirmed to work as well
14:33:46 ok, yeah i saw they were reviewing
14:33:54 the live migration patch is https://review.openstack.org/#/c/463987/
14:34:03 i just started back on that before the meeting
14:34:31 but, i posted a change last week that runs the new flow through with the live migration changes and with the volume-backed live migration tests in tempest and things were all passing
14:34:53 so i'm fairly confident in the functionality, just need to dig into edge cases
14:35:10 moving on
14:35:13 we will have the Cinder-Nova meeting soon, so we can hopefully sort those out
14:35:13 #topic stuck reviews
14:35:26 nothing on the agenda - is there anything to bring up here?
14:35:41 #topic open discussion
14:35:49 anything anyone wants to bring up?
14:35:50 I've got one
14:35:57 https://review.openstack.org/#/q/topic:doc-migration+project:openstack/nova+status:open
14:36:14 ^ they need +2s if we ever want to have documentation again
14:36:15 #link nova docs migration starts here https://review.openstack.org/#/c/478472/
14:36:52 "It'll be a great day in the parish"
14:36:54 oh come on
14:37:02 your irish catholicism is bleeding through
14:37:09 personally, I think if we ignore stylistic things and merge anything that's not actually broken, we can come back and revisit after p-3
14:37:24 mriedem: Never forget where you came from ;)
14:37:37 Can we discuss the ironic migration? https://review.openstack.org/#/c/481748/
14:37:54 edleafe: sure if jaypipes is around for it
14:37:58 i know dan isn't
14:38:06 I'm still not clear on the order that things will be changing for this
14:38:11 I'm here.
14:38:31 step 1 is getting the custom resource class reported into allocations
14:38:39 we report the inventory, but not the actual allocation
14:39:08 and i believe the vehicle for doing that is putting the resource class info in the flavor extra spec on startup of the driver
14:39:20 mriedem: step 1 is getting the custom resource class added to the flavor so we can tell that it's being requested.
14:39:27 mriedem: yes, bingo.
14:39:28 because then the scheduler report client can pull the resource class data off the extra spec in the update_available_resource periodic
14:39:39 ++
14:39:49 that's the order i have in the work items section anyway
14:40:06 https://review.openstack.org/#/c/481748/1/specs/pike/approved/custom-resource-classes-in-flavors.rst@165
14:40:23 So what about existing ironic nodes? Will the extra_spec change show up in their instance.flavor?
14:40:41 yes
14:40:45 that's the whole point
14:40:56 on startup of nova-compute, the driver gets its existing instances,
14:41:04 right
14:41:07 and for their corresponding ironic node,
14:41:07 ok, just checking whether this would be a new flavor or an update
14:41:10 that's in the spec, no?
14:41:19 get the node.resource_class and shove that into the embedded instance.flavor.extra_specs
14:41:29 bauzas: it was never clear in the original spec
14:41:33 which is why i was amending it
14:41:39 ok, I'm unclear about the problem
14:41:42 ok
14:41:52 edleafe: to be clear, this isn't updating existing global flavors,
14:41:57 it's updating the embedded flavor in the existing instances
14:42:11 right
14:42:16 re: edleafe
14:42:24 because that's what the scheduler report client looks at when putting allocations
14:42:26 so the first step is adding the resource class to existing nodes?
14:42:31 no
14:42:33 's question about whether we should zero out cpu/ram/disk, etc... I think the answer to that is yes.
14:42:57 if the ironic node.resource_class is not None, then do data migration steps
14:42:59 else skip
14:43:03 ok, compute starts up and looks at node.resource_class. Where/how does that get populated?
14:43:12 it's populated in ironic
14:43:17 when the node is created in ironic
14:43:18 edleafe: in the ironic get_inventory()
14:43:24 here:
14:43:39 so all existing ironic nodes will be updated to return that class?
14:43:47 if we're going to step through this, can we do it in -nova rather than hold up the meeting?
14:43:50 https://github.com/openstack/nova/blob/master/nova/virt/ironic/driver.py#L615
14:43:57 mriedem: sure.
14:43:59 edleafe: that's already been done.
14:44:01 ok anything else?
14:44:12 jaypipes: ok, that wasn't clear
14:44:29 alright i'm going to end it, thanks everyone
14:44:31 #endmeeting
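[Editor's note: a postscript on the ironic flavor migration order agreed above, in code form. On nova-compute startup, for each existing instance whose ironic node has a resource_class set, the driver records that class in the instance's embedded flavor extra_specs so the report client's update_available_resource periodic can include it in allocations. A minimal sketch, assuming simplified stand-ins for the real driver plumbing (get_node, instance.save()); the CUSTOM_ normalization and the resources: spec keys mirror placement's conventions but are reconstructed here, not quoted from the patch:

    import re

    def normalize_rc(name):
        # e.g. 'baremetal-gold' -> 'CUSTOM_BAREMETAL_GOLD' (assumed to
        # mirror placement's custom resource class normalization).
        return 'CUSTOM_' + re.sub(r'[^A-Z0-9_]+', '_', name.upper())

    def migrate_embedded_flavors(instances, get_node):
        """One-time startup pass copying ironic node.resource_class into
        each existing instance's embedded flavor."""
        for instance in instances:
            node = get_node(instance.node)
            if not node.resource_class:
                continue  # node not yet migrated in ironic, so skip
            spec = 'resources:%s' % normalize_rc(node.resource_class)
            if spec not in instance.flavor.extra_specs:
                instance.flavor.extra_specs[spec] = '1'
                # Zero out the standard classes so only the custom class
                # is requested, per jaypipes's answer above (exact keys
                # assumed).
                for std in ('VCPU', 'MEMORY_MB', 'DISK_GB'):
                    instance.flavor.extra_specs.setdefault(
                        'resources:%s' % std, '0')
                instance.save()

Global flavors are untouched; only the flavor embedded in each existing instance is updated, which is what the scheduler report client reads when putting allocations.]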