14:00:16 #startmeeting Nova Live Migration 14:00:17 Meeting started Tue May 24 14:00:16 2016 UTC and is due to finish in 60 minutes. The chair is PaulMurray. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:18 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:21 The meeting name has been set to 'nova_live_migration' 14:00:22 * kashyap waves 14:00:24 o/ 14:00:26 o/ 14:00:26 o/ 14:00:29 hi 14:00:30 hi all 14:00:44 hi 14:00:44 o/ 14:01:02 Agenda: https://wiki.openstack.org/wiki/Meetings/NovaLiveMigration 14:01:25 hi 14:01:33 #topic CI 14:01:42 * PaulMurray straight in today 14:01:58 we had a couple of actions on CI from last week 14:02:08 kashyap to follow up with clarkb about experimental job running xenial fix (presumed cpu model) 14:02:12 PaulMurray: One was one me... 14:02:18 PaulMurray: https://review.openstack.org/#/q/topic:systemctrl 14:02:34 Specifically this should get merged - https://review.openstack.org/#/c/319819/ 14:02:53 But both the patches in that topic need to go in... 14:03:05 Reviews appreciated 14:03:12 kashyap: is it really helps to restart 14:03:34 Then, we've got a dummy Nova change that depends on the above -- https://review.openstack.org/#/c/319934/1 14:03:39 I've tested locally and there is still issues with gate64 model 14:04:04 tdurakov: From testing by clarkb and sdague -- yes, an explicit 'restart' seem to bring in the CPU model 14:04:31 tdurakov: What kind of issues you're seeing locally? 14:04:42 #link this should get merged to help cpu model fix - https://review.openstack.org/#/c/319819/ 14:04:57 tdurakov: Admittedly, I see the opposite effect of what both, Clark and Sean said: https://kashyapc.fedorapeople.org/behavior-of-systemctl-and-service-with-libvirt-bin-on-Xenial.txt 14:05:05 kashyap: actually the same, the only diff is that i'm using service, not systemctl, is it critical? 14:05:53 oh, ok 14:06:15 But I went ahead and made the change as the evidence from two people stacked against me -- and I'm not a regular Ubuntu user either. 14:06:29 So, I want to see what the CI reports with the above two changes, then we can take it from there. 14:06:38 tdurakov: it's still not passing last time I looked, so it is going to require someone to investigate further 14:07:05 #undo 14:07:06 Removing item from minutes: 14:07:40 sdague, what do you mean by not passing? 14:07:58 https://review.openstack.org/#/c/319934/1 14:08:05 the experimental change is still not working 14:08:24 https://www.irccloud.com/pastebin/iWNE3Qc9/gate64 14:08:37 but still got the same issue 14:09:02 so I looks like it's not enough just to restart libvirt 14:09:13 s/i/it 14:09:25 tdurakov: Clark confirmed that it surely worked for him 14:09:44 kashyap: well it's not working in code 14:10:04 could we discuss it just after meeting in #openstack-nova? 14:10:14 Good idea 14:10:22 sounds like this needs more digging 14:10:26 + 14:10:31 sdague: The dummy change failed because there were trivial bugs in the DevStack change. Let's see if it suceeds, otherwise, needs more investigation. 14:10:34 PaulMurray: Sure. 14:10:40 thanks for following up 14:10:47 moving on 14:11:03 next: mriedem to change an existing job to use raw instead of qcow2 14:11:08 he's not here 14:11:17 and I expect he had no time to do anything 14:11:36 PaulMurray: The sky's not falling and I'll ping him in the week if it becomes necessary. 14:11:54 Does anyone else have an update for CI ? 14:12:34 #topic Libvirt Storage Pools 14:12:49 Any updates here ? 14:12:52 mdbooth, paul-carlton 14:13:03 diana_clarke, ? 14:13:30 So, *all* the prepatches are in iirc! 14:13:39 I put 3 of the 5 backends up for review last week, and then went on vacation, so not much news from me ;) 14:14:05 good 14:14:22 * mdbooth is still addressing feedback, so no movement on the bdm stuff, yet 14:14:30 lets move on then 14:14:39 #topic Specs 14:14:40 I hope to finish the remaining backends this week, plus start using the new methods in the driver. 14:14:50 We are coming up to non-priority spec freeze 14:14:57 on June 2nd 14:15:05 https://wiki.openstack.org/wiki/Nova/Newton_Release_Schedule 14:15:31 The review trackin gpage is here: 14:15:32 https://etherpad.openstack.org/p/newton-nova-priorities-tracking 14:16:06 actually, mdbooth your patches should be on there 14:16:16 but I guess you're getting reviews anyway 14:16:34 PaulMurray: Yep. Will ensure they all go up there after this meeting. 14:16:58 I will move some of those specs up to the next section 14:17:11 there are several specs that need approval this week 14:17:33 these are close I think: 14:17:35 https://review.openstack.org/#/c/306561 14:17:47 auto complete ^^ 14:18:02 https://review.openstack.org/#/c/307131 - live migration of rescued 14:18:34 Any otheres need a mention or discussion? 14:18:50 Hi, I have a BP about migration state machine that needs some help. https://blueprints.launchpad.net/nova/+spec/migration-state-machine 14:19:19 I proposed it in mitaka, but I didn't have enough time to finish it. 14:19:54 I'd like you guys help to review it, and approve it if possible. :) 14:19:56 tangchen_, is there a current spec link 14:20:28 No, sorry. In mitaka, we agreed that it needs no spec. 14:20:35 It changes no API. 14:20:51 Just use a state machine to replace the current strings. 14:21:07 tangchen_, ok - I missed that 14:21:09 I can write a spec if necessary. 14:21:16 so its just the patches on the spec ? 14:21:23 ...sorry on the bp 14:21:37 yes 14:21:46 https://review.openstack.org/#/q/topic:bp/migration-state-machine,n,z 14:21:50 2 spec from me: https://review.openstack.org/#/c/292271/ - orchestration to conductor, https://review.openstack.org/#/c/320416/ - instance action to conductor, and checks before live-migration in background 14:22:24 The all have -2 from johnthetubaguy 14:22:34 that needs lifting or no one will look 14:22:55 Try pinging him in openstack-nova 14:23:03 yes. John told me that it is better to raise it up in a meeting. 14:23:06 and put on the spec review tracking page 14:23:13 I have talked with him. 14:23:28 tangchen_, did he mean here or in the nova meeting ? 14:24:10 I asked the same question, and he said live-migrate meeting will be a good idea. 14:24:19 So I came here. :) 14:24:25 ha 14:24:47 ok - I'll take a look and talk to him and you about it later 14:25:12 I think he asked me to discuss with you that if using the automaton is a good idea. :) 14:25:14 I'm not up on it at the moment - unless anyone else has any comments 14:25:45 OK, thank you very much. But it is late in my time zone. I have to get off. 14:25:49 I thought there was general agreement that it was a good idea 14:26:00 Please leave a message on the patch or the BP. 14:26:14 tdurakov, does it clash with what you are looking at ? 14:26:16 OK, thx. :) 14:26:34 i.e. the two specs you linked above? 14:26:56 Remove compute-compute communication in live-migration 14:26:59 ^^ 14:27:06 https://review.openstack.org/#/c/292271/ 14:27:08 PaulMurray: yes it is 14:27:36 why does it clash, I would expect it to be complementary ? 14:28:27 PaulMurray: wait, are you talking about state machines? 14:28:41 yes 14:29:00 https://review.openstack.org/#/c/241476/ - Implement state machine for live-migration 14:29:02 PaulMurray: let me check state machine spec first, will leave comment 14:29:30 ok 14:29:33 tdurakov: Thank you very much. :) 14:29:44 And Paul, thx too. :) 14:30:30 folks, could you take a look on the specs I've mentioned above 14:30:32 #action PaulMurray follow up on status for https://blueprints.launchpad.net/nova/+spec/migration-state-machine (needs no spec) 14:30:57 Any more specs ? 14:31:26 We only have until Thursday next week to get approval 14:31:55 moving on.... 14:31:59 #topic Review request 14:32:06 I think we spilled into this already 14:32:09 anything else here ? 14:32:23 on patches in general 14:32:38 I got 2 patches 14:32:41 hi, need review on this https://review.openstack.org/#/c/215483/ (Set migration status to 'error' on live-migration failure) 14:32:47 PaulMurray: https://review.openstack.org/#/c/310707/ libvirt: Prevent block live migration with tunnelled flag 14:33:40 PaulMurray: this is a regression bug in mitaka, pkoniszewski help to tested alread. 14:33:56 abhishekk, I said I would follow up on that one but I decided not to be pushy yet 14:34:03 I will follow up on it now 14:34:28 PaulMurray: thank you and sorry for making noise 14:35:18 PaulMurray: I can remove the -2, just ping the -2 on IRC once the blueprint is approved, in the usual way 14:35:43 PaulMurray: ah, this is the one from tangchen_, the blueprint needs approving first 14:35:45 johnthetubaguy, who is going to approve ? 14:36:24 do we all like the direction thats taking, as a live-migrate subteam? 14:37:08 We are going to take a look at the code 14:37:15 everyone has got rusty on it 14:37:59 johnthetubaguy, it seems we decided it would not need a spec - is that ok ? 14:38:28 not sure 14:38:33 johnthetubaguy, I have an action to catch up with you later 14:38:41 about it 14:38:41 the problem is actually getting people to understand it, and agree we like the direction 14:38:49 it feels like we need more than the blueprint to do that 14:39:01 yes, that's why I wondered about a spec 14:39:12 for me, I think a spec is likely to be quicker 14:39:18 as odd as that sounds 14:39:38 but I am willing to try the blueprint only approach, if we can agree the direction 14:40:02 anyways, happy to catch up after the meeting, with the interested folks 14:41:12 tangchen_, do you think you could resurect the old spec quickly ? 14:41:30 It would be a useful place to add comments - we may decide we don't need it 14:43:01 PaulMurray: +1 for spec 14:43:35 i've walked quickly over patches, I think it would be useful to read kind of doc 14:43:41 tangchen_, johnthetubaguy my thought is we could review look at the spec and try to decide what we want to do in the next few days ? 14:44:16 I'm particularly interested in what happens if/when we decide to change the state machine 14:45:07 looks like we lost tangchen_ 14:45:13 I'll follow up 14:45:21 cool 14:45:25 #topic Open Discussion 14:45:27 yeah, lets follow up later 14:45:35 so I wanted to rase the auto converge spec 14:45:44 https://review.openstack.org/#/c/306561/18 14:45:48 go ahead 14:45:48 I really like where its going 14:45:56 but I got confused about a contractition 14:46:13 I think you have the auto API to force post copy at any point of the migration (probably after the current memory copy completes?), and if not, we wait till the memory sync is not improving things by 10%? 14:46:17 does that sound correct? 14:46:49 just gave paul-carlton a poke 14:46:54 :) 14:47:19 I think that's right johnthetubaguy 14:47:25 OK, cool 14:47:35 if its just a typo thing, I am basically +2 on the approach 14:47:45 not a fan of config options changing what an API does 14:47:53 the force flag in live-migrate made it agressive about the switch 14:47:57 but I am much less of a fan of exposing implementation details vai the API 14:48:03 I'll check the spec, what I meant was that it would automatically switch if less than 10% progress after first cycle 14:48:14 so the force flag got deleted 14:48:27 paul-carlton2: cool 14:48:36 but the operator could use force-complete to make it switch at any time 14:48:48 yeah, that sounds good 14:49:05 paul-carlton2, so the foce flag in ive-migrate is definitely dropped then ? 14:49:12 except that if they try to force the switch too early, i.e. before a complete memory copy cycle we defer it 14:49:19 Yep 14:49:44 I don't see it having enough value to be worth having at this stage 14:50:01 We can add it later if we find a need 14:50:02 I'm good with that 14:50:20 johnthetubaguy, do you know if danpb is about at all ? 14:50:28 we haven't seen him in a while 14:50:33 mdbooth, ^^ ? 14:50:46 PaulMurray, johnthetubaguy: Hi, I'm here. I can submit a spec for it. :) 14:50:53 PaulMurray: He's definitely away for a few days at least 14:50:53 Will do it tomorrow. 14:50:55 tangchen_, thanks 14:50:59 Don't know exactly how long 14:51:01 :) 14:51:05 I'll mail him and dlist to solicit comments/approaval 14:51:33 paul-carlton2: He's on holiday, so don't expect a quick reply 14:51:48 mdbooth, ack 14:52:20 mdbooth, do you know when he is back a work ? 14:52:32 like next week ? 14:52:32 PaulMurray: Unfortunately not, no 14:52:37 ok 14:52:44 I expect so, but really don't know 14:53:17 Is there anything else to finish with ? 14:53:56 ...then we're done for this week. 14:54:02 thanks for coming 14:54:09 #endmeeting