09:01:53 #startmeeting ha 09:01:54 Meeting started Mon Feb 15 09:01:53 2016 UTC and is due to finish in 60 minutes. The chair is aspiers. Information about MeetBot at http://wiki.debian.org/MeetBot. 09:01:55 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 09:01:58 The meeting name has been set to 'ha' 09:02:05 let's start anyway :) 09:02:47 #info we have apologies from NTT guys for not being here because they are at the European Ops meetup in Manchester 09:02:59 I wish I could have gone to that :-( 09:03:08 I think they might be moderating a session on compute node HA there 09:03:36 #topic Current status (progress, issues, roadblocks, further plans) 09:03:50 I'll go first since I will be quick :) 09:04:00 I got back from holiday last Wed 09:04:06 so I'm still catching up on stuff 09:04:26 but SUSE has made a lot of progress recently on compute node HA - we now have the whole setup totally automated via Crowbar/Chef 09:04:42 so it's a few simple mouse clicks to set up the whole thing, which is pretty cool 09:04:53 also on Friday I met 4 NTT guys including masahito 09:05:05 they came to London before the Ops meetup in Manchester 09:05:34 there were 3 developers and one operator, and we talked for a few hours about compute node HA, masakari etc. 09:05:39 it was a very useful meeting 09:05:58 <_gryf> aspiers, any output in a textual form out this? 09:06:09 _gryf: not yet but I will try to write something up 09:06:14 <_gryf> aspiers, cool 09:06:29 #info aspiers met 4 NTT guys working on compute node HA in London 09:06:47 #info (they work in Japan, but we met in London!) 09:06:59 while we were talking, I had an idea 09:07:14 to draw some kind of strategic map for community compute node HA 09:07:30 which outlines the various failure modes and challenges on one diagram 09:07:49 and shows how the current approaches cover them and where there are gaps 09:07:56 aspiers, and perhaps update the spec as well 09:08:03 bogdando: yes, great idea 09:08:29 currently SUSE is finalising our release 6 of SUSE OpenStack Cloud 09:08:47 it should be coming pretty soon, and will have full support for compute node HA 09:08:52 so that's it from my side 09:09:07 maybe bogdando next? 09:09:10 I updated the spec https://review.openstack.org/#/c/257809/ and reflected the state of things, as I see it. Also , we have to bring more attention to this on the cross projects meetings 09:09:34 my last attempt to do so in the open discussion section seemed futile 09:09:38 great idea 09:09:44 keep trying ;) 09:10:41 #info bogdando is working on the automatic evacuation spec 09:10:59 and please make revies, some things may not be correct 09:11:31 <_gryf> actually, it was Timofey, which bring this topic up - I have a little conversation with him during nova midcycle. 09:11:47 <_gryf> bogdando, did you guys coworking together on this topic? 09:12:23 * _gryf reviewd the bp the other day last week :) 09:12:31 so we now have a user story and a spec - that's cool 09:12:55 _gryf, I'm only trying to help 09:13:02 no code writing, yet 09:13:02 <_gryf> bogdando, that's ok :) 09:13:25 I'll try to contribute to these too 09:13:46 _gryf: anything new from your side? 09:14:04 <_gryf> aspiers, besides the reviews, nope. 09:14:10 ok 09:14:20 ddeja: anything you want to report? 09:14:29 aspiers: yes 09:14:59 I was working on preparing fence agent for running evacuation 09:15:27 I would have finished it last week, but new topic in Mistral occured 09:15:45 #link https://review.openstack.org/#/c/279018/4 09:16:16 Me and _gryf have a mail conversation with Mistral PTL 09:16:25 and this change is an output 09:16:38 cool 09:16:57 they basically decide to not wait for oslo team 09:17:40 to bring ACK after message is processed feature and implement it by themselfs 09:17:52 which is cool for us, since this may fix Mistral HA big problem 09:18:05 and Mistral HA was a big consern on last meeting :) 09:18:10 concern* 09:18:16 nice! 09:18:30 <_gryf> other thing to mention is, that the idea of the own implemented ack system on mistral 09:18:40 nervertheless, I'm trying to test this change to see if it helps and give feedback to Mistral team 09:18:41 <_gryf> is to have oslo.messaging in place 09:19:14 and I'll continue work on fence agent 09:19:17 <_gryf> so that if oslo team would accept the patch for the oslo itself, there would be just a small change to make it use within oslo 09:19:18 #info ddeja working on fence agent for running evacuation 09:19:20 that's all from my side 09:19:46 <_gryf> and it will not blocking mistral to going forward at the same time 09:19:54 #info some progress on mistral HA 09:21:14 is there any doc yet on the mistral compute node HA architecture? 09:21:31 It's under construction 09:21:33 ok 09:21:41 I'm wondering what the fence agent does exactly 09:22:03 is it run by Pacemaker? 09:22:12 for the PoC, it will only send http call to Mistral, so evacuate process is started 09:22:16 aspiers: yes 09:22:20 doh, meeting 09:22:31 <_gryf> o, hi beekhof 09:22:36 hey beekhof 09:22:45 ddeja: so that is what triggers the mistral workflow? 09:22:49 * beekhof has a keyword alert on "pacemaker" :) 09:23:00 * aspiers makes a mental note of that 09:23:05 aspiers: yes, it's the most simple scenario 09:23:37 ddeja: and that fence agent gets run after the normal fencing of the compute node? 09:23:51 aspiers: yup 09:23:58 ddeja: i.e. 2 fencing devices via fencing_topology? 09:24:09 exactly 09:24:15 cool, that makes sense 09:24:27 beekhof: you wanna report any status? 09:24:50 nothing related to here. with the possible exception of getting sbd working in remote nodes 09:25:20 which might be interesting later on 09:28:54 pacemaker pacemaker pacemaker pacemaker 09:28:54 :) 09:28:54 I guess the kids get the highest priority interrupt in the evening :) ok, let's talk about Austin quickly 09:28:54 #topic Austin summit 09:28:54 so we have some HA talks submitted 09:28:54 #link ddeja and aspiers submitted https://www.openstack.org/summit/austin-2016/vote-for-speakers/Presentation/7327 09:28:54 please vote for it! 09:28:54 I also submitted https://www.openstack.org/summit/austin-2016/vote-for-speakers/Presentation/7329 09:28:54 which is about automated deployment of HA in general 09:28:54 the intention is to share our experiences of automation with other vendors 09:29:11 if that's interesting then please vote for that too 09:29:18 anyone else submit any talks? 09:29:28 nope :) 09:29:39 IIRC masahito may have submitted one for masakari but I'm not 100% sure 09:30:14 anything else to mention about Austin? 09:31:32 if not, let's move on to AOB 09:31:39 #topic AOB (Any Other Business) 09:31:54 so one interesting implementation detail which arose during our discussions on Friday ... 09:32:04 I was asking NTT why they didn't use Pacemaker for process monitoring 09:32:14 currently they have their own for monitoring libvirtd etc. 09:32:49 one answer was that if libvirtd fails, it makes sense to do a nova service-disable on that node 09:33:08 to stop the scheduler from starting any new instances on it 09:33:28 extra policies basically 09:33:33 but at the same time, in some clouds the policy would dictate a preferred action of keeping the VMs running 09:34:01 beekhof: yeah 09:34:07 and that gave rise to a follow-on discussion about policies and action matrices etc. 09:34:15 another example is if the admin network fails 09:34:24 but everything else still works 09:34:38 also in that case you'd probably want to keep the VMs running but remove the node from nova-scheduler 09:34:50 it depends on the nature of the VMs on that node of course 09:34:55 and on how many 09:35:12 but a typical policy might be to avoid the disruption of instantly fencing that node 09:35:21 and instead notify the cloud ops and wait for manual evacuation 09:35:58 #info aspiers and NTT discussed failure modes where the preferred course of action might be notifying cloud ops to do *manual* evacuation 09:36:10 beekhof: does Pacemaker support that kind of thing? 09:36:23 it seems a shame not to have Pacemaker doing process monitoring 09:37:04 it's a strange case, because you'd still want Pacemaker to ensure that libvirtd is started before nova-compute 09:37:24 implying a standard order constraint "libvirtd before nova-compute" 09:37:53 if libvirtd dies, it might be OK to shut down nova-compute but not kill the VMs 09:38:15 anyway we don't have to discuss that now, but I just wanted to raise it 09:38:20 we can cover it in the user stories and specs 09:38:28 one question 09:38:40 if libvirt is dead, how we are supossed to kill vms? 09:38:41 i'm not bothered, its very domain specific 09:38:47 ddeja: fencing 09:39:02 fencing the whole node? 09:39:08 yes 09:39:13 oh, ok 09:39:15 we can have a manual fencing target... but then you're blocking everything else until a human shows up 09:39:30 beekhof: that might be desirable in some cases 09:39:30 not cool 09:39:30 <_gryf> right 09:39:47 i guess you could write a new agent that turned off libvirt but didnt power off the node 09:39:57 aspiers: almost never 09:40:14 beekhof: well, the situation I'm talking about doesn't involve fencing 09:40:19 turned the human into a single point of failure 09:40:33 beekhof: it's actually a degradation of service not an outage 09:40:47 well you can write an RA that does anything 09:40:51 I mean, you can no longer control VMs on that compute node, but they are still running OK 09:40:53 and you have on-fail=block 09:41:08 so unrelated resource trees can still be recovered 09:41:20 so the control plane has an outage, but the user workload doesn't 09:41:48 in that case live migration of VMs is more appropriate 09:42:08 but if there's no shared storage then that might cause brief workload outage 09:42:17 so again it comes down to policy and operator's decision 09:42:20 if libvirt is dead, you can't live-migrate instances, AFAICT 09:42:28 well, it might, but you'd never know 09:42:31 ddeja: oh yeah, good point 09:42:39 <_gryf> aspiers, are you talking about live migrate using virsh, or nova facility? 09:43:24 _gryf: that's kind of the point. it's hard for a computer to choose the right one 09:43:27 <_gryf> right, exactly what ddeja said 09:43:27 ok, i gotta go again 09:43:34 bye 09:43:37 <_gryf> beekhof, :) 09:44:08 well, let's include a user story where libvirtd dies, and we can figure out a good spec for how to handle it 09:44:15 like I said, no need to solve it in this meeting ;-) 09:44:20 ok 09:44:28 one other important piece of info 09:44:36 we found some serious stability bugs in Pacemaker's remote code 09:44:49 they are now fixed upstream 09:44:53 #info SUSE found some serious stability bugs in Pacemaker's remote code 09:45:06 #link https://github.com/ClusterLabs/pacemaker/pull/908 09:45:11 #link https://github.com/ClusterLabs/pacemaker/pull/909 09:45:15 * beekhof will look 09:45:20 beekhof: they're merged 09:45:34 beekhof: you probably already saw 10 days ago 09:45:40 versioning will change a bit for remote nodes too 09:45:53 beekhof: oh, what do you mean by versioning? 09:48:15 ok never mind :) anything else from anyone? 09:50:34 I guess not, so let's close for today 09:50:40 thanks all, and bye for now! 09:50:48 see you on #openstack-ha! 09:51:14 <_gryf> cu! 09:51:33 hi 09:51:42 #endmeeting