15:00:01 #startmeeting gantt 15:00:02 Meeting started Tue Mar 31 15:00:01 2015 UTC and is due to finish in 60 minutes. The chair is bauzas. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:04 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:06 The meeting name has been set to 'gantt' 15:00:27 aloha, anyone here ? 15:00:34 \o 15:00:41 o/ 15:00:47 n0ano is having a meeting conflict so I'm chairing for this week 15:01:10 okay, waiting a few mins and then we begin 15:01:23 o/ 15:03:41 okay, let's begin and people will come 15:03:46 so 15:03:48 #link https://etherpad.openstack.org/p/nova-scheduler-meeting 15:03:57 sorry I had no time for sending out an email 15:04:06 feel free to add your points in there 15:04:32 #topic meeting name 15:04:49 so, perhaps some of you already saw http://lists.openstack.org/pipermail/openstack-dev/2015-March/060179.html 15:04:52 #link http://lists.openstack.org/pipermail/openstack-dev/2015-March/060179.html 15:05:22 that's bad to not have n0ano today but would like to get your views 15:05:34 soooooo 15:05:42 I always thought of Gantt related to scheduler like Cinder related to nova-volume 15:05:53 I would like to see this meeting the last time I'm calling it Gantt 15:05:59 until we really spin off Gantt 15:06:14 bauzas: +1 on calling this nova-scheduler 15:06:21 edleafe: agreed but we're now in the same state than Cinder :) 15:06:33 *not 15:06:41 dammit, stupid fingers 15:07:13 bauzas: exactly, which is why it is premature to call this Gantt 15:07:17 ok, anyone else ? feel free to give your point at the thread 15:07:21 but we discussion what will look like for gantt at future at here also? 15:07:37 alex_xu: that's not actually bad 15:07:51 alex_xu: we can still discuss on the next features and the target 15:08:04 alex_xu: it will just help newcomers to join us 15:08:29 and my point goes to jay's comments about doubts of the legal issues we could raise 15:08:58 before actually being sure that Gantt is the right codename, we should also ask the Foundation if that's correct for a project name 15:09:13 I know how it's painful to rewrite the namespace... 15:09:31 wasn't gantt french? 15:09:38 (for teasers, just look at Climate/Blazar repo) 15:09:42 Not sure about legal problems. The name 'Gantt' for scheduling is over 100 years old 15:09:59 pretty generic at this point 15:10:04 PaulMurray: everything good is French, don't you know, English man ? 15:10:33 edleafe: yeah, but I would like to be sure we're not wrong 15:11:06 so, back to the topic, feel free to leave your comments on the thread 15:11:14 on the mean time... 15:11:28 #action bauzas to check with Foundation about possible legal issues with the "Gantt" codename 15:12:22 okay, let's continue, I just would like to see all your point before the next meeting 15:13:02 #info Comments appreciated on http://lists.openstack.org/pipermail/openstack-dev/2015-March/060179.html 15:13:14 #topic Patch status 15:13:26 anyone ? I'm passing 15:13:41 I'm quite busy with the cells fixable stuff 15:14:13 Saw that PaulMurray's first patch in the series finally merged. Congrats! 15:14:18 yay ! 15:14:48 see what happens when your not looking ! 15:14:53 at least we're in a better shape than before even if I wanted to have the RT objectified by Kilo :( 15:15:30 Well, we actually got 5 patches in a few days - including lxsli migration patches 15:15:35 PaulMurray: if that's a matter of *not* reviewing your changes, let me know and I'll make you a pleasure with this :) 15:16:00 PaulMurray: indeed, kudos to lxsli too 15:16:02 bauzas, you need to be careful with you language there - that sounded a bit .... wrong 15:16:14 PaulMurray: French joker card 15:16:48 bauzas, it was about core reviews.... 15:17:12 PaulMurray: then I missed your point 15:17:13 :) 15:17:42 no, I mean we lacked core reviews until last minute - your language was still almost rude 15:17:49 :) 15:18:18 lol 15:18:39 ndipanov, thanks for your help with the patches 15:18:54 PaulMurray, I am very sad that stuff didn;t make it for kilo 15:18:58 awesome work 15:19:09 indeed 15:19:16 there are some -1s IIRC 15:19:25 only you 15:19:28 (from me probably) 15:19:43 yay, shall I iterate over those still ? 15:19:45 and jenkins 15:20:00 I will rebase soon 15:20:05 man, I don't know who this Jenkins guy is, but he's quite rude too 15:20:29 anyway 15:20:43 any other scheduler-related stuff that we should discuss ? 15:21:12 bauzas, 2 things I'd like to mention 15:21:19 ndipanov: go on 15:21:20 that became apparent to me only lately 15:21:48 1 is that pci tracker as it stands now is reasonably buggy 15:22:14 well... 15:22:24 ndipanov: you know my opinion on that 15:22:28 I am not sure how easy that is to fix, but I know what the main cause of it is 15:22:30 yeah... 15:22:44 ndipanov: again, it's not a design problem 15:22:54 I think it actually is 15:22:59 ndipanov: it's about the integration with the existing 15:23:03 yes 15:23:08 trye 15:23:10 true 15:23:22 ndipanov: yeah I had a discussion with sean dague and dansmith about the PCI optionnality 15:23:32 so expect work going into that as soon as we are using objects in RT (Paul's patches land) 15:23:57 ndipanov: well, the problem is also that PCI is always tracking stuff while filters are perhaps not enabled 15:24:23 that's a whole other story 15:24:28 true 15:24:34 and too broad for discussing it there 15:25:03 second is that there have been several issues with "resources" such as pci and numa 15:25:03 ndipanov: so yeah, I expect PaulMurray's work on objectifying the RT could help us a lot 15:25:19 that I feel are basically the same class of errors 15:25:32 ndipanov: any precise issues that you could share with us ? 15:25:36 yes 15:25:43 * ndipanov looks for bugs 15:26:14 ndipanov: in the meantime, could we jump onto the next topic and come back to your point for the open discussion ? 15:26:35 the next topic is Vancouver summit talks 15:27:04 okay, guessing it's a yes 15:27:15 https://bugs.launchpad.net/nova/+bug/1417667 15:27:16 Launchpad bug 1417667 in OpenStack Compute (nova) "migration/evacuation/rebuild/resize of instance with dedicated cpus needs to recalculate cpus on destination" [Medium,In progress] - Assigned to Bart Wensley (bartwensley) 15:27:17 #topicVancouver discussion ideas 15:27:20 yes 15:27:24 erm 15:27:26 #topic Vancouver discussion ideas 15:27:28 ping me once open-discussion happens 15:27:32 ndipanov: sure thing 15:27:45 so, any ideas so far ? 15:28:59 just to be clear, I have no precise information about how the Design Summit will be held, only that we'll have a nice sightseeing from the place we'll be :)à 15:29:36 ttx proposed a discussion about the Design Summit format and talks for today's cross-project meeting 15:29:57 #link http://lists.openstack.org/pipermail/openstack-dev/2015-March/060258.html 15:30:21 I shouldn't be able to attend today's meeting (damn DST) but feel free to attend 15:31:14 I'll try to make it 15:31:21 cool 15:31:33 anyway, I propose to defer our discussions until next week 15:31:40 I have another call then, but I can lurk 15:31:43 so that we should get a better picture about that 15:31:46 also 15:32:30 mikal proposed an etherpad for tracking *nova related* discussions http://lists.openstack.org/pipermail/openstack-dev/2015-March/060024.html 15:32:38 #link https://etherpad.openstack.org/p/liberty-nova-summit-ideas 15:32:59 feel free to point out any point about our concerns that are related to Nova 15:33:49 at the moment, the only related point to the scheduler is about the split, but I guess we should have more to discuss 15:34:50 okay, last but not least topic unless someone's screaming in the next minute 15:35:39 let's openly open the open discussion 15:35:58 #topic open discussion 15:36:15 so, ndipanov had some bugs to provide us related to the PCI and NUMA resource tracking 15:36:42 ndipanov: ping 15:39:38 pinged him in openstack-nova 15:40:08 we can wait a few more mins, he's possibly afk 15:40:28 any other subject people are wanting to discuss meanwhile ? 15:40:45 oh 15:40:58 that reminds me one big thing I forgot to mention 15:41:22 #link https://review.openstack.org/#/c/168682/ 15:41:36 #link https://review.openstack.org/#/c/168883/ 15:41:48 both are related to the existing Gantt repo 15:42:02 which is actually just a wrong thing 15:42:13 (speaking of the existing repo)à 15:42:52 bauzas: I wasn't involved in that Gantt work. Is there value in keeping it? 15:43:00 bauzas: or should we start clean? 15:43:08 edleafe: honestly, nothing but a trace 15:43:26 edleafe: the real problem is how painful it is to remove a repo or rename it 15:44:09 bauzas, right 15:44:13 edleafe: so we keep it as a reference but with a big disclaimer saying that the code is just messy 15:44:15 bauzas: ok, but these are simply to get them out of infra, right? 15:44:25 edleafe: not really 15:44:30 https://bugs.launchpad.net/nova/+bug/1417667 15:44:32 Launchpad bug 1417667 in OpenStack Compute (nova) "migration/evacuation/rebuild/resize of instance with dedicated cpus needs to recalculate cpus on destination" [Medium,In progress] - Assigned to Bart Wensley (bartwensley) 15:44:33 is the bug 15:44:42 edleafe: all of the decisions would require a manual intervention 15:44:51 anyway 15:44:56 back to ndipanov's point 15:45:02 right 15:45:06 so these kind of problems 15:45:12 exist with 15:45:12 oh this 15:45:18 what I can only describe as 15:45:26 "stateful" resources 15:45:40 mmm 15:45:44 where next allocation is the function of previous allocations 15:45:49 not only thieir count 15:46:08 two of those that we have are numa and pci stuff 15:46:27 ndipanov: I see, the problem being that's a bucket of instances, and not a single instance boot, right ? 15:46:34 no 15:46:41 okay, then I misunderstood 15:47:01 so the problem is when migrating/resizing 15:47:16 and evacuating (although I am not sure how that should work) 15:47:21 oooooh understood 15:47:27 policies are not enforced 15:47:36 em 15:47:37 no 15:47:39 are we sure that the reporter didn't ask for a destination host ? 15:48:12 by saying "policies", I was meaning that the filters were not taken in account ( not the nova policy thing) 15:48:22 well that's one thing 15:48:40 ndipanov: because the request spec and status after boot was stored in single object? 15:48:44 ndipanov: you know that migrations are able to bypass the scheduler if a dest host is specified ? 15:49:18 so the thing is 15:49:27 the other thing 15:49:29 ndipanov: we totally get rid of the scheduler when we're migrating to a specified dest host 15:49:34 is that we track migrations via flavor 15:49:45 yup 15:49:59 but you can't do that for pci/numa 15:50:03 and similar resources 15:50:11 because it is not only the "count" 15:50:14 that is important 15:50:18 but the actual allocation 15:50:26 at the time of claim 15:50:56 its not the fact that there is a device - its which one 15:50:59 ndipanov: but claims are reported as usage right ? 15:51:04 because it may be different at the time of the RT loop because it is a function of all previous allocations on the host 15:51:10 yes PaulMurray that's exactly it 15:51:47 this likely means that we have to keep a record of said allocations with the migration row 15:51:51 which is fine 15:51:55 I guess 15:52:27 but 1) it needs to be written 2) I wonder if there is a more general API we could have 15:52:32 once we actually have resources 15:52:41 ndipanov: I'm quite blind on the NUMA stuff (with much regret) 15:52:41 so my question is 15:53:14 would it be ok to sprinkle these 2 special cases as columns 15:53:16 ndipanov: but I was thinking that we were tracking that for example pCPU0 was pinned as an usage 15:53:28 we are 15:53:37 but you can't recreate which one from just the flavor 15:53:54 you have to store the actual allocation somewhere 15:54:07 ndipanov: oh I see, that's not persisteed 15:54:08 for boot - we store it on the instance 15:54:13 but for migration... 15:54:22 ndipanov, if claims become a first class object, would that be the place? 15:54:31 PaulMurray, maybe 15:54:39 yes 15:54:41 I tend to say that we definitely miss bp/resource-objects 15:54:51 yes 15:54:53 so my question was 15:55:10 should we hack on fixing it and hope to generalize it later with Claim/resource objects 15:55:15 or the other way around 15:55:38 well, bp/resource-objects was expected to be worked during Kilo 15:55:51 so we should loop back with jaypipes 15:56:07 bauzas, the thing is that it's a bug 15:56:16 but one that seems to be easily generalizable 15:56:22 if there were a place to generalize it 15:56:48 ndipanov: agreed, but I think the generalization has to be done alas bp/resource-objects 15:57:25 right 15:57:41 ok well 15:57:44 ndipanov: that said, I would appreciate your bugfix 15:57:51 there is a patch that proposes this for numa already 15:57:57 ndipanov: because it gives us a path for fixing it 15:58:16 I will look into the pci side of things as well 15:58:17 ndipanov: oh right, the bug is mentioning it 15:58:30 and then hopefully we'll meet in the middle with resource objects 15:58:39 ndipanov: https://review.openstack.org/163440 is the one about NUMA ? 15:58:52 caution : not having yet reviewed the change 15:58:54 yes!! 15:59:03 woaw, fat one then 15:59:14 but that sounds doable 15:59:35 ndipanov: do you think we should target it for RC1 ? 15:59:57 ndipanov: I haven't yet reviewed the change so I don't know if it's landable yet 16:00:17 damn, we're running out of time 16:00:31 bauzas, with db migrations... 16:00:31 let's discuss that offline and free up the lock 16:00:33 I think not 16:00:33 bauzas: it hasobject version bumps 16:00:43 ndipanov: edleafe: ooooh 16:00:46 I don't think it's that critical 16:00:52 then yes, it will be fixed in Liberty 16:00:56 worth a relnote for sure 16:00:59 ok guys, take care 16:01:04 #endmeeting