#openstack-meeting log

09:01:53 <aspiers> #startmeeting ha
09:01:54 <openstack> Meeting started Mon Feb 15 09:01:53 2016 UTC and is due to finish in 60 minutes.  The chair is aspiers. Information about MeetBot at http://wiki.debian.org/MeetBot.
09:01:55 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
09:01:58 <openstack> The meeting name has been set to 'ha'
09:02:05 <aspiers> let's start anyway :)
09:02:47 <aspiers> #info we have apologies from NTT guys for not being here because they are at the European Ops meetup in Manchester
09:02:59 <aspiers> I wish I could have gone to that :-(
09:03:08 <aspiers> I think they might be moderating a session on compute node HA there
09:03:36 <aspiers> #topic Current status (progress, issues, roadblocks, further plans)
09:03:50 <aspiers> I'll go first since I will be quick :)
09:04:00 <aspiers> I got back from holiday last Wed
09:04:06 <aspiers> so I'm still catching up on stuff
09:04:26 <aspiers> but SUSE has made a lot of progress recently on compute node HA - we now have the whole setup totally automated via Crowbar/Chef
09:04:42 <aspiers> so it's a few simple mouse clicks to set up the whole thing, which is pretty cool
09:04:53 <aspiers> also on Friday I met 4 NTT guys including masahito
09:05:05 <aspiers> they came to London before the Ops meetup in Manchester
09:05:34 <aspiers> there were 3 developers and one operator, and we talked for a few hours about compute node HA, masakari etc.
09:05:39 <aspiers> it was a very useful meeting
09:05:58 <_gryf> aspiers, any output in a textual form out this?
09:06:09 <aspiers> _gryf: not yet but I will try to write something up
09:06:14 <_gryf> aspiers, cool
09:06:29 <aspiers> #info aspiers met 4 NTT guys working on compute node HA in London
09:06:47 <aspiers> #info (they work in Japan, but we met in London!)
09:06:59 <aspiers> while we were talking, I had an idea
09:07:14 <aspiers> to draw some kind of strategic map for community compute node HA
09:07:30 <aspiers> which outlines the various failure modes and challenges on one diagram
09:07:49 <aspiers> and shows how the current approaches cover them and where there are gaps
09:07:56 <bogdando> aspiers, and perhaps update the spec as well
09:08:03 <aspiers> bogdando: yes, great idea
09:08:29 <aspiers> currently SUSE is finalising our release 6 of SUSE OpenStack Cloud
09:08:47 <aspiers> it should be coming pretty soon, and will have full support for compute node HA
09:08:52 <aspiers> so that's it from my side
09:09:07 <aspiers> maybe bogdando next?
09:09:10 <bogdando> I updated the spec https://review.openstack.org/#/c/257809/ and reflected the state of things, as I see it. Also , we have to bring more attention to this on the cross projects meetings
09:09:34 <bogdando> my last attempt to do so in the open discussion section seemed futile
09:09:38 <aspiers> great idea
09:09:44 <aspiers> keep trying ;)
09:10:41 <aspiers> #info bogdando is working on the automatic evacuation spec
09:10:59 <bogdando> and please make revies, some things may not be correct
09:11:31 <_gryf> actually, it was Timofey, which bring this topic up - I have a little conversation with him during nova midcycle.
09:11:47 <_gryf> bogdando, did you guys coworking together on this topic?
09:12:23 * _gryf reviewd the bp the other day last week :)
09:12:31 <aspiers> so we now have a user story and a spec - that's cool
09:12:55 <bogdando> _gryf, I'm only trying to help
09:13:02 <bogdando> no code writing, yet
09:13:02 <_gryf> bogdando, that's ok :)
09:13:25 <aspiers> I'll try to contribute to these too
09:13:46 <aspiers> _gryf: anything new from your side?
09:14:04 <_gryf> aspiers, besides the reviews, nope.
09:14:10 <aspiers> ok
09:14:20 <aspiers> ddeja: anything you want to report?
09:14:29 <ddeja> aspiers: yes
09:14:59 <ddeja> I was working on preparing fence agent for running evacuation
09:15:27 <ddeja> I would have finished it last week, but new topic in Mistral occured
09:15:45 <ddeja> #link https://review.openstack.org/#/c/279018/4
09:16:16 <ddeja> Me and _gryf have a mail conversation with Mistral PTL
09:16:25 <ddeja> and this change is an output
09:16:38 <aspiers> cool
09:16:57 <ddeja> they basically decide to not wait for oslo team
09:17:40 <ddeja> to bring ACK after message is processed feature and implement it by themselfs
09:17:52 <ddeja> which is cool for us, since this may fix Mistral HA big problem
09:18:05 <ddeja> and Mistral HA was a big consern on last meeting :)
09:18:10 <ddeja> concern*
09:18:16 <aspiers> nice!
09:18:30 <_gryf> other thing to mention is, that the idea of the own implemented ack system on mistral
09:18:40 <ddeja> nervertheless, I'm trying to test this change to see if it helps and give feedback to Mistral team
09:18:41 <_gryf> is to have oslo.messaging in place
09:19:14 <ddeja> and I'll continue work on fence agent
09:19:17 <_gryf> so that if oslo team would accept the patch for the oslo itself, there would be just a small change to make it use within oslo
09:19:18 <aspiers> #info ddeja working on fence agent for running evacuation
09:19:20 <ddeja> that's all from my side
09:19:46 <_gryf> and it will not blocking mistral to going forward at the same time
09:19:54 <aspiers> #info some progress on mistral HA
09:21:14 <aspiers> is there any doc yet on the mistral compute node HA architecture?
09:21:31 <ddeja> It's under construction
09:21:33 <aspiers> ok
09:21:41 <aspiers> I'm wondering what the fence agent does exactly
09:22:03 <aspiers> is it run by Pacemaker?
09:22:12 <ddeja> for the PoC, it will only send http call to Mistral, so evacuate process is started
09:22:16 <ddeja> aspiers: yes
09:22:20 <beekhof> doh, meeting
09:22:31 <_gryf> o, hi beekhof
09:22:36 <aspiers> hey beekhof
09:22:45 <aspiers> ddeja: so that is what triggers the mistral workflow?
09:22:49 * beekhof has a keyword alert on "pacemaker" :)
09:23:00 * aspiers makes a mental note of that
09:23:05 <ddeja> aspiers: yes, it's the most simple scenario
09:23:37 <aspiers> ddeja: and that fence agent gets run after the normal fencing of the compute node?
09:23:51 <ddeja> aspiers: yup
09:23:58 <aspiers> ddeja: i.e. 2 fencing devices via fencing_topology?
09:24:09 <ddeja> exactly
09:24:15 <aspiers> cool, that makes sense
09:24:27 <aspiers> beekhof: you wanna report any status?
09:24:50 <beekhof> nothing related to here. with the possible exception of getting sbd working in remote nodes
09:25:20 <beekhof> which might be interesting later on
09:28:54 <aspiers> pacemaker pacemaker pacemaker pacemaker
09:28:54 <aspiers> :)
09:28:54 <aspiers> I guess the kids get the highest priority interrupt in the evening :) ok, let's talk about Austin quickly
09:28:54 <aspiers> #topic Austin summit
09:28:54 <aspiers> so we have some HA talks submitted
09:28:54 <aspiers> #link ddeja and aspiers submitted https://www.openstack.org/summit/austin-2016/vote-for-speakers/Presentation/7327
09:28:54 <aspiers> please vote for it!
09:28:54 <aspiers> I also submitted https://www.openstack.org/summit/austin-2016/vote-for-speakers/Presentation/7329
09:28:54 <aspiers> which is about automated deployment of HA in general
09:28:54 <aspiers> the intention is to share our experiences of automation with other vendors
09:29:11 <aspiers> if that's interesting then please vote for that too
09:29:18 <aspiers> anyone else submit any talks?
09:29:28 <beekhof> nope :)
09:29:39 <aspiers> IIRC masahito may have submitted one for masakari but I'm not 100% sure
09:30:14 <aspiers> anything else to mention about Austin?
09:31:32 <aspiers> if not, let's move on to AOB
09:31:39 <aspiers> #topic AOB (Any Other Business)
09:31:54 <aspiers> so one interesting implementation detail which arose during our discussions on Friday ...
09:32:04 <aspiers> I was asking NTT why they didn't use Pacemaker for process monitoring
09:32:14 <aspiers> currently they have their own for monitoring libvirtd etc.
09:32:49 <aspiers> one answer was that if libvirtd fails, it makes sense to do a nova service-disable on that node
09:33:08 <aspiers> to stop the scheduler from starting any new instances on it
09:33:28 <beekhof> extra policies basically
09:33:33 <aspiers> but at the same time, in some clouds the policy would dictate a preferred action of keeping the VMs running
09:34:01 <aspiers> beekhof: yeah
09:34:07 <aspiers> and that gave rise to a follow-on discussion about policies and action matrices etc.
09:34:15 <aspiers> another example is if the admin network fails
09:34:24 <aspiers> but everything else still works
09:34:38 <aspiers> also in that case you'd probably want to keep the VMs running but remove the node from nova-scheduler
09:34:50 <aspiers> it depends on the nature of the VMs on that node of course
09:34:55 <aspiers> and on how many
09:35:12 <aspiers> but a typical policy might be to avoid the disruption of instantly fencing that node
09:35:21 <aspiers> and instead notify the cloud ops and wait for manual evacuation
09:35:58 <aspiers> #info aspiers and NTT discussed failure modes where the preferred course of action might be notifying cloud ops to do *manual* evacuation
09:36:10 <aspiers> beekhof: does Pacemaker support that kind of thing?
09:36:23 <aspiers> it seems a shame not to have Pacemaker doing process monitoring
09:37:04 <aspiers> it's a strange case, because you'd still want Pacemaker to ensure that libvirtd is started before nova-compute
09:37:24 <aspiers> implying a standard order constraint "libvirtd before nova-compute"
09:37:53 <aspiers> if libvirtd dies, it might be OK to shut down nova-compute but not kill the VMs
09:38:15 <aspiers> anyway we don't have to discuss that now, but I just wanted to raise it
09:38:20 <aspiers> we can cover it in the user stories and specs
09:38:28 <ddeja> one question
09:38:40 <ddeja> if libvirt is dead, how we are supossed to kill vms?
09:38:41 <beekhof> i'm not bothered, its very domain specific
09:38:47 <aspiers> ddeja: fencing
09:39:02 <ddeja> fencing the whole node?
09:39:08 <aspiers> yes
09:39:13 <ddeja> oh, ok
09:39:15 <beekhof> we can have a manual fencing target... but then you're blocking everything else until a human shows up
09:39:30 <aspiers> beekhof: that might be desirable in some cases
09:39:30 <ddeja> not cool
09:39:30 <_gryf> right
09:39:47 <beekhof> i guess you could write a new agent that turned off libvirt but didnt power off the node
09:39:57 <beekhof> aspiers: almost never
09:40:14 <aspiers> beekhof: well, the situation I'm talking about doesn't involve fencing
09:40:19 <beekhof> turned the human into a single point of failure
09:40:33 <aspiers> beekhof: it's actually a degradation of service not an outage
09:40:47 <beekhof> well you can write an RA that does anything
09:40:51 <aspiers> I mean, you can no longer control VMs on that compute node, but they are still running OK
09:40:53 <beekhof> and you have on-fail=block
09:41:08 <beekhof> so unrelated resource trees can still be recovered
09:41:20 <aspiers> so the control plane has an outage, but the user workload doesn't
09:41:48 <aspiers> in that case live migration of VMs is more appropriate
09:42:08 <aspiers> but if there's no shared storage then that might cause brief workload outage
09:42:17 <aspiers> so again it comes down to policy and operator's decision
09:42:20 <ddeja> if libvirt is dead, you can't live-migrate instances, AFAICT
09:42:28 <beekhof> well, it might, but you'd never know
09:42:31 <aspiers> ddeja: oh yeah, good point
09:42:39 <_gryf> aspiers, are you talking about live migrate using virsh, or nova facility?
09:43:24 <aspiers> _gryf: that's kind of the point. it's hard for a computer to choose the right one
09:43:27 <_gryf> right, exactly what ddeja said
09:43:27 <beekhof> ok, i gotta go again
09:43:34 <aspiers> bye
09:43:37 <_gryf> beekhof, :)
09:44:08 <aspiers> well, let's include a user story where libvirtd dies, and we can figure out a good spec for how to handle it
09:44:15 <aspiers> like I said, no need to solve it in this meeting ;-)
09:44:20 <ddeja> ok
09:44:28 <aspiers> one other important piece of info
09:44:36 <aspiers> we found some serious stability bugs in Pacemaker's remote code
09:44:49 <aspiers> they are now fixed upstream
09:44:53 <aspiers> #info SUSE found some serious stability bugs in Pacemaker's remote code
09:45:06 <aspiers> #link https://github.com/ClusterLabs/pacemaker/pull/908
09:45:11 <aspiers> #link https://github.com/ClusterLabs/pacemaker/pull/909
09:45:15 * beekhof will look
09:45:20 <aspiers> beekhof: they're merged
09:45:34 <aspiers> beekhof: you probably already saw 10 days ago
09:45:40 <beekhof> versioning will change a bit for remote nodes too
09:45:53 <aspiers> beekhof: oh, what do you mean by versioning?
09:48:15 <aspiers> ok never mind :) anything else from anyone?
09:50:34 <aspiers> I guess not, so let's close for today
09:50:40 <aspiers> thanks all, and bye for now!
09:50:48 <aspiers> see you on #openstack-ha!
09:51:14 <_gryf> cu!
09:51:33 <ddeja> hi
09:51:42 <aspiers> #endmeeting