20:00:06 <johnsom> #startmeeting Octavia 20:00:07 <openstack> Meeting started Wed Oct 25 20:00:06 2017 UTC and is due to finish in 60 minutes. The chair is johnsom. Information about MeetBot at http://wiki.debian.org/MeetBot. 20:00:08 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 20:00:10 <openstack> The meeting name has been set to 'octavia' 20:00:11 <xgerman_> o/ 20:00:24 <johnsom> Hi folks 20:00:56 <nmagnezi> םץ. 20:00:59 <nmagnezi> o/ 20:01:02 <nmagnezi> (sorry :D ) 20:01:02 <longstaff> hi 20:01:10 <jniesz> hi 20:01:17 <johnsom> Ah, there are some people. 20:01:21 <johnsom> #topic Announcements 20:01:27 <rm_work> o/ 20:02:03 <johnsom> The Queens MS1 release has not yet gone out. We are working through gate/release system issues from the zuul v3 changes. I expect it will happen in the next day or two. 20:02:33 <johnsom> The newton release has officially gone EOL and the git branches removed. This happened last night. 20:03:04 <johnsom> Just a reminder if you need to reference that code, there is a tag newton-eol you can pull, but we can't put anymore patches on newton. 20:03:30 <johnsom> And finally, I am still working through all of the zuul v3 changes. 20:04:01 <johnsom> Octavia and neutron-lbaas should no longer have duplicate jobs, but I'm still working on the dashboard repos. 20:04:21 <johnsom> I still have more cleanup to finish and I need to fix the stable branches. 20:04:37 <nmagnezi> for zuul v3 ? 20:04:39 <johnsom> However, I think we have zuul v3 functional at this point. 20:04:46 <johnsom> yes 20:05:18 <johnsom> stable branch gates aren't running at the moment (just failing). 20:06:18 <johnsom> Oh, we do have some TC changes: 20:06:20 <johnsom> #link https://governance.openstack.org/election/results/queens/tc.html 20:06:40 <johnsom> Any other announcements today? 20:07:23 <johnsom> #topic Brief progress reports / bugs needing review 20:07:39 <johnsom> Please continue reviewing and commenting on the provider driver spec: 20:07:47 <johnsom> #link https://review.openstack.org/509957 20:08:06 <rm_work> back to asking folks to consider THIS option for amphora az cacheing: 20:08:08 <rm_work> #link https://review.openstack.org/#/c/511045/ 20:08:29 <johnsom> longstaff Are you planning patchset version 3 soon? 20:08:33 <xgerman_> I added a topic to the the agenda 20:08:36 <xgerman_> for that 20:08:41 <rm_work> oh k 20:09:19 <longstaff> Yes. I plan to commit an update to the provider driver spec tomorrow. 20:09:30 <johnsom> Excellent, thank you! 20:10:17 <johnsom> I have been focused on zuul v3 stuffs and some bug fixes to issues that came up over the last week. 20:10:33 <johnsom> I still have more patches for bug fixes coming. 20:10:51 <johnsom> And much more zuulv3 patches... sigh 20:11:12 <jniesz> #link https://storyboard.openstack.org/#!/story/2001258 20:11:21 <johnsom> Any other progress updates to share or should we jump into the main event: AZs 20:11:24 <xgerman_> johnsom and I put up some patches to improve LBaaS v2 <-> Octavia 20:12:00 <jniesz> johnsom: saw your notes about HM failover after the listener fails . Is there a way to not trigger failover unless the original listener was in a known good state? 20:12:12 <jniesz> since this is new provision 20:12:17 <johnsom> jniesz Yeah, one of those has a patch up for review. The secondary outcome is around the network driver being dumb. I haven't finished that patch yet 20:13:21 <xgerman_> I have taken the eyes of the ball in OpenStackAnsible and so there is some cruft we are tackling: https://review.openstack.org/#/c/514767/ — 20:14:02 <xgerman_> good news I will remain core for the Q cycle over there… 20:14:14 <xgerman_> in Octavia OSA 20:14:38 <johnsom> jniesz Well, the failover we saw was a valid failover. The amp should have a working listener on it, but it didn't, so IMO Octavia did "the right thing' and ended in the right state with the load balancer in error as well because we could not resolve the problem with the amp. (notes for those that haven't read my novel. A bad local jinja template change caused listeners to fail to deploy) 20:15:09 <johnsom> Yeah, Octavia OSA is getting some more attention 20:15:37 <jniesz> so wouldn't we want to not fail over unless the original provisioned ok? 20:15:38 <rm_work> that seems right -- if the listeners fail to deploy, even on an initial create, why wouldn't it be an error? 20:15:55 <rm_work> oh, failover -- ehh... maybe? I mean, it could have been a one-time issue 20:16:02 <rm_work> go to error -- definitely 20:16:06 <rm_work> but was it failover-looping? 20:16:09 <johnsom> jniesz No, it could have failed for reasons related to that host, like the base host ran out of disk 20:16:37 <rm_work> we might want to have something that tracks failover-count (I wanted this anyway) and at some point if it fails too many times quickly detect something is wrong 20:16:58 <johnsom> No, it saw the listener was failed, attempted a failover, which also failed (same template), so it gave up and marked the LB in error too 20:17:07 <rm_work> ah 20:17:10 <rm_work> yeah that seems right 20:17:23 <jniesz> is failover the same has re-create listener? 20:17:26 <jniesz> is that the same code path 20:17:31 <johnsom> LB in ERROR will stop the failover attempts 20:18:16 <johnsom> No, failover is rebuild the whole amphora, which, because there was a listener deployed did attempt to deploy a listener again. 20:18:57 <johnsom> Fundamentally the "task" that deploys a listener is shared between create and failover 20:20:28 <jniesz> and we trigger the failover of the amp because at that point it is in an unknown state I assume 20:20:33 <jniesz> because of error 20:21:52 <johnsom> It triggered failover because the timeout expired for the health checking on the amp. The amp was showing no listeners up but the controller knew there should be a listener deployed. It gives it some time and then starts the failover process. 20:22:17 <jniesz> because the listener create responded back with invalid request 20:23:39 <johnsom> No, this health checking was independent of that error coming back and the listener going into "ERROR". 20:24:06 <johnsom> Two paths, both figuring out there was a problem, with one escalating to a full amp failover 20:24:33 <jniesz> so if listener create goes from PENDING -> ERROR HM will still trigger? 20:24:42 <jniesz> or if it goes from PENDING -> RUNNING -> failure 20:26:41 <johnsom> If the LB goes ACTIVE and then some part of the load balancing engine is not working it will trigger 20:27:52 <jniesz> ok, because from logs octavia.controller.worker.tasks.lifecycle_tasks.ListenersToErrorOnRevertTask went to RUNNING from PENDING 20:28:08 <jniesz> and then it failed on octavia.controller.worker.tasks.amphora_driver_tasks.ListenersUpdate 20:28:10 <rm_work> that task always runs I think? 20:28:30 <jniesz> which is when hit the template issue 20:28:33 <rm_work> it's so when the chain reverts, it hits the revert method in there 20:29:37 <johnsom> Yeah, in o-cw logs you will see ListenersUpdate went to running, then failed on the jinja so went to REVERTED, then reverted ListenersToErrorOnRevertTask which put the listener into ERROR. 20:29:49 <rm_work> yep 20:30:24 <johnsom> independent the o-hm (health manager) was watching the amp and noticed there should be a working listener but there is not. 20:30:46 <johnsom> that is when o-hm would trigger a failover in an attempt to recover the amp 20:31:06 <jniesz> ok, but should ListenersUpdate have ever went to running? 20:31:14 <johnsom> Yes 20:31:45 <johnsom> ListenersUpdate task should have gone: PENDING, RUNNING, REVERTING, REVERTED 20:32:02 <johnsom> I think there is a scheduling in the beginning somewhere too 20:32:19 <johnsom> ListenersUpdate definitely started, but failed due to the bad jinja 20:33:18 <jniesz> so jinja check is not done until post LIstenerUpdate 20:33:24 <johnsom> Ok, we are at the half way point in the meeting time, I want to give the next topic some time. We can continue to discuss the bug after the meeting if you would like. 20:33:35 <jniesz> ok that is fine 20:33:40 <johnsom> jinja check is part of the ListenerUpdate task 20:33:47 <rm_work> yeah basically, i think it went exactly as it should 20:33:55 <rm_work> from what i've heard 20:34:16 <johnsom> Yeah, me too aside from the bug and patch I have already posted. 20:34:31 <johnsom> #topic AZ in Octavia DB (xgerman_) 20:35:05 <rm_work> So, I wonder if there are highlights from our PM discussion on this that would be good to summarize 20:35:14 <johnsom> Sure, 20:35:42 <rm_work> Basically, there are cases where we need to be able to quickly filter by AZ ... and doing a full query to nova is not feasible (at scale) 20:35:43 <johnsom> My summary is rm_work wants to cache the AZ info from an amphora being build by nova 20:36:20 <xgerman_> and my argument is if somebody wants to query by data center, rack — I don’t want to add those columns 20:36:39 <xgerman_> so this feels like meta data and needs a more generalized solution 20:36:39 <rm_work> so we can compromise by storing it clearly labeled as a "cache" that can be used by processes that need quick-mostly-accurate, and we can query nova in addition for cases that need "slow-but-exact" 20:36:52 <rm_work> well, that's not something that's in nova is it? 20:36:57 <rm_work> rack? 20:37:07 <rm_work> and DC would be... well outside the scope of Octavia, since we run in one DC 20:37:11 <rm_work> as does ... the cloud 20:37:20 <rm_work> so it's kinda obvious which DC you're in :P 20:37:23 <johnsom> The use case is getting a list of amps that live in an AZ and to be able to pull an AZ specific amp from the spares pool. 20:37:51 <rm_work> i mean, the scope limit here is "data nova provides us" 20:37:52 <xgerman_> so we are doing scheduling by AZ 20:38:06 <rm_work> I'd like to enable operators to do that if they want to 20:38:11 <johnsom> Well, other deployer have different ideas of what an AZ is.... Many AZ is different DC 20:38:23 <rm_work> right which does make sense 20:38:28 <rm_work> so, that's still fine IMO 20:38:32 <xgerman_> yeah, so if is for scheduling it should be some schesulign hint 20:38:47 <rm_work> yeah, we can do scheduling hints in our driver 20:39:05 <jniesz> yea, I would like the scheduling control as well 20:39:07 <rm_work> but they rely on knowing what AZ stuff is in already 20:39:38 <xgerman_> my worry is to keep it generalzied enought that we can add other hints in the future 20:39:42 <rm_work> and querying from nova for that isn't feasible at scale and in some cases 20:39:52 <rm_work> i'm just not sure what other hints there ARE in nova 20:39:54 <rm_work> maybe HV 20:39:58 <rm_work> err, "host" 20:40:08 <xgerman_> I don’t want to limit it to nova hints 20:40:10 <rm_work> which ... honestly i'm OK with cached_host as well, if anyone every submits it 20:40:14 <johnsom> Really the issue is OpenStack has a poor definition of AZ and nova doesn't give us the tools we want without modifications or sorting through every amphora the service account has. 20:40:17 <rm_work> well, that's what we get back on the create 20:40:38 <rm_work> we're just talking about adding stuff we already get from a compute service 20:40:50 <rm_work> and it's hard to talk about theoretical compute services 20:40:52 <nmagnezi> johnsom, it does not allow to get a filtered list by AZ ? 20:41:03 <nmagnezi> i mean, just the instances in a specific AZ? 20:41:03 <rm_work> but in general, we'd assume some concept that fits generally into the category of "az" 20:41:43 <xgerman_> I am saying we should take a step back and look at scheduling — is adding an az column the best/cleanest way to achieve scheduling hints 20:41:56 <johnsom> nmagnezi Yes, but we would have to take that list, deal with paging it, and then match to our spares list. For example. 20:42:16 <rm_work> nmagnezi: basically, if we want to see what AZs we have in the spares pool, the options are: call nova X times, where X is the number of amps in the spares pool; or: call nova-list and match stuff up, which will cause significant paging issues at scale 20:42:36 <jniesz> or you can pull out of nova db : ) 20:42:39 <johnsom> Don't get me wrong, I'm not a huge fan of this, but thinking through it with rm_work led me to feel this might be the simplest solution 20:42:46 <rm_work> jniesz: lol no we cannot :P 20:42:52 <nmagnezi> yeah sounds problematic.. 20:43:10 <rm_work> xgerman_: well, our driver actually *does* do scheduling on AZ already 20:43:16 * johnsom slaps jniesz's had for even considering getting into the nova DB.... 20:43:18 <rm_work> it's just not reliant on existing AZs 20:43:24 <xgerman_> so if I run different nova flavors and want to pull one out based on flavor and AZ — how would I dod that? 20:43:34 <johnsom> had->hand 20:43:41 <xgerman_> would I add another column? 20:43:54 <rm_work> xgerman_: do we not track the flavor along with the amp? 20:43:59 <rm_work> I guess not? actually we probably should 20:44:18 <rm_work> I even kinda want to track the image_ref we used too... 20:44:47 <rm_work> we can add image_ref and compute_flavor IMO 20:44:53 <rm_work> alongside cached_az 20:44:58 <rm_work> i would find that data useful 20:45:19 <johnsom> I see xgerman_'s point, this is getting a bit tightly coupled to the specific nova compute driver 20:45:38 <jniesz> should we store this in flavor metadata? 20:45:38 <rm_work> (I was also looking at a call to allow failover of amps by image_id so we could force-failover stuff in case of security concerns on a specific image) 20:45:51 <jniesz> that would only be initial though and wouldn't account for changes 20:45:54 <nmagnezi> johnsom, i was just thinking the same. we need to remember we do want to allow future support for other compute drivers 20:45:54 <rm_work> jniesz: i was talking about in the amphora table 20:45:54 <xgerman_> yep, that’s what I was getting at + I don’t know if I want to write python code for different scheduling hints 20:46:12 <rm_work> i think the DB columns should absolutely be agnostic 20:46:18 <rm_work> but we can use them in the compute drivers 20:46:48 <rm_work> I think the concept of "compute_flavor" and "compute_image" and "compute_az" are pretty agnostic though -- won't most things need those? 20:47:09 <rm_work> even amazon and GCE have AZs AFAIU 20:47:17 <rm_work> and flavors and images... 20:47:43 <rm_work> kubernetes does too 20:47:46 <jniesz> what about vmware 20:47:49 <rm_work> so I would be in favor of adding *all three* of those columns to the amphora table 20:47:51 <nmagnezi> aside from flavors (which just don't know if it exists as a concept in other alternatives) I agree 20:48:19 <nmagnezi> i'm not saying i disagree about flavors, just don't really sure 20:48:20 <nmagnezi> :) 20:48:35 <rm_work> i don't know much about vmware -- i would assume they'd NEED something like that, no? 20:49:27 <rm_work> we're all about being generic enough here to support almost any use-case -- and i'm totally on board with that, and in fact that's the reason for this 20:49:30 <xgerman_> we can either use some metadata concept where we go completely configurable or pick sinner and throw them in the DB 20:49:38 <xgerman_> winners 20:49:40 <nmagnezi> rm_work, google says they do have it. 20:49:41 <xgerman_> not sinners 20:49:44 <rm_work> it should be possible to write a custom compute driver that can take advantage of these fields 20:50:05 <rm_work> i think it's pretty clear that those fields are something shared across most compute engines 20:50:08 <johnsom> I mean at a base level, I wish nova server groups (anti-affinity) just "did the right thing" 20:50:15 <xgerman_> +1 20:50:17 <rm_work> yeah, that'd be grand, lol 20:50:40 <jniesz> vmware I thought only had flavors with OpenStack, not native 20:50:49 <jniesz> you clone from template or create vm 20:51:20 <rm_work> jniesz: in that case, flavor would be ~= "template"? 20:51:41 <jniesz> well that included image 20:51:48 <jniesz> and I thin you still specify vcpu, mem 20:51:51 <johnsom> So, some thoughts: 20:51:51 <johnsom> Cache this now 20:51:51 <johnsom> Work on nova to make it do what we need 20:51:51 <johnsom> Setup each compute driver to have it's own scheduling/metadata table 20:51:51 <johnsom> .... 20:51:56 <jniesz> has been awhile for me with VMware 20:52:20 <jniesz> different drivers might care about different metadata, so like that idea 20:52:25 <johnsom> jniesz the vmware integration has a "flavor" concept 20:52:31 <xgerman_> johnsom +1 20:53:03 <xgerman_> I think keeping it flexible and not putting it straight into the amp table gives us space for future comoute drivers /schedulers 20:53:13 <johnsom> Those were options/ideas, not a ordered list BTW 20:53:26 <rm_work> i think it *is* flexible enough in the amphora table to allow for future compute drivers 20:54:16 <johnsom> I just worry if the schema varies between compute drivers, how is the API going to deal with it... 20:54:27 <nmagnezi> i think rm_work has a point here. and for example even if a compute driver does not have the concept of AZs we can just see it as a single AZ for all instances 20:55:04 <johnsom> Yes 20:55:10 <rm_work> yep 20:55:17 <rm_work> and yes, exactly, worried about schema differences 20:55:40 <johnsom> We have five minutes left in the meeting BTW 20:55:50 <xgerman_> I am not too worried about those — they are mostly for scheduling and the API doesnt know about azs right now anyway 20:56:23 <johnsom> We have the amphora API now, which I expect rm_work wants the AZ exposed in 20:56:48 <rm_work> it's been three years and what we have for compute is "nova", haven't even managed to get the containers stuff working (which also would have these fields), and every compute solution we can think of to mention here would also fit into these ... 20:57:00 <rm_work> so blocking something useful like this because of "possible future whatever" seems lame to me 20:57:02 <rm_work> just saying 20:57:10 <xgerman_> so what’s bad about another table? 20:57:18 <johnsom> Well, let's all comment on the patch: 20:57:20 <johnsom> #link https://review.openstack.org/#/c/510225/ 20:57:21 <rm_work> just, why 20:57:30 <rm_work> we could do another table -- but we'd want a concrete schema there too 20:57:43 <rm_work> and the models would just auto-join it anyway 20:57:45 <johnsom> I will mention, AZ is an extension to nova from what I remember and is optional 20:57:48 <xgerman_> yes, but it would be for nova and if we do vmware they would do their table 20:58:00 <rm_work> the models need to do the join 20:58:07 <rm_work> we can't have a variable schema, can we? 20:58:12 <johnsom> +1 20:58:20 <rm_work> +1 to which, lol 20:58:28 <johnsom> I really don't want a variable schema. It has ended poorly for other projects 20:59:24 <jniesz> isn't it hard to have single schema for multiple drivers that would have completely different concepts? 20:59:36 <johnsom> Final minute folks 20:59:43 <rm_work> jniesz: yes, but i don't think they HAVE different concepts 21:00:02 <rm_work> please name a compute system that we couldn't somehow fill az/flavor/image_ref in a meaningful way 21:00:07 <rm_work> i can't think of one 21:00:09 <johnsom> Review, comment early, comment often... grin 21:00:22 <johnsom> #endmeeting