#openstack-meeting log

00:02:55 <thinrichs> #startmeeting CongressTeamMeeting
00:02:56 <openstack> Meeting started Thu Aug 18 00:02:55 2016 UTC and is due to finish in 60 minutes.  The chair is thinrichs. Information about MeetBot at http://wiki.debian.org/MeetBot.
00:02:57 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
00:03:00 <openstack> The meeting name has been set to 'congressteammeeting'
00:03:27 <ramineni_> hi
00:03:29 <masahito> hi
00:03:54 <ekcs> hi
00:04:04 <thinrichs> aimeeu: courtesy ping (she maybe on vacation)
00:04:05 <ekcs> welcome back masahito !
00:04:26 <ekcs> Yes, I think aimeeu is away.
00:04:46 <thinrichs> Agenda for today..
00:04:50 <masahito> ekcs: thanks
00:04:57 <thinrichs> 1. milestones for newton
00:04:59 <thinrichs> 2. status
00:05:01 <thinrichs> Anything else?
00:05:47 <thinrichs> #topic Newton milestones
00:05:53 <thinrichs> Here's the schedule...
00:05:58 <thinrichs> #link http://releases.openstack.org/newton/schedule.html
00:06:04 <thinrichs> Few noteworthy items..
00:06:33 <thinrichs> Aug 29-Sep 02: newton-3 and final release for client libraries
00:06:51 <thinrichs> So that's about 2 weeks away
00:07:29 <thinrichs> masahito: are you still planning to expose lazy polling for datasources thru the aPI?  We'll want client code in place for the same.
00:07:48 <masahito> Yes.
00:08:11 <masahito> I notice that we don't need to modify client codes.
00:08:34 <masahito> because the flag for lazy is in config option of datasource.
00:09:01 <thinrichs> That's good.  So it's set at the time of creating the datasource, and if you want to change it you delete the datasource and create a new one.
00:09:10 <thinrichs> Is that what you had in mind?
00:09:19 <masahito> yes.
00:09:35 <thinrichs> Cool.  I love getting functionality in without changing the API!
00:09:53 <thinrichs> Means we have a solid API.
00:10:32 <masahito> If we want to change config value of datasource on the fly, I think it's different API.
00:10:46 <masahito> but now, this is out of scope.
00:11:04 <thinrichs> masahito: makes sense to me
00:11:21 <thinrichs> Other item on the calendar is that PTL nominations are Sep 12-16.
00:11:33 <thinrichs> If anyone is interested and wants to chat about what's involved, let me know.
00:12:22 <ekcs> got it.
00:12:28 <thinrichs> One thing we should discuss is what we hope to finish by newton-3
00:12:40 <thinrichs> I think we can save that til after we go through status updates
00:12:58 <thinrichs> Moving on.
00:13:01 <thinrichs> #topic status
00:13:13 <thinrichs> ramineni_: want to give your status update?
00:13:27 <ramineni_> thinrichs: ok
00:15:30 <ramineni_> thinrichs: im trying to add a new job in gate for HA , to run existing tempest tests in HA, need to test locally first
00:16:49 <ramineni_> thinrichs: but im thinking of starting only one more PE process not the API one
00:16:58 <thinrichs> Remind me what deployments we're testing now…
00:17:00 <thinrichs> PE+DSD+API in 1 process,
00:17:08 <thinrichs> PE, DSD, API in 3 separate processes
00:17:37 <thinrichs> What's the HA version?
00:18:00 <ramineni_> thinking of API+PE+PE+DSD
00:18:14 <ramineni_> all seperate processes
00:18:52 <ramineni_> because we cant start myultiple API´s on the same node, it will conflict with port
00:19:19 <ramineni_> i think we should be able to test with multiple PE´s
00:19:53 <thinrichs> So for policy API calls we'll let oslo-messaging pick the PE to send the message to
00:20:02 <ramineni_> yes
00:20:13 <thinrichs> As long as we think oslo-messaging won't always pick the same PE, that seems fine
00:20:20 <thinrichs> ekcs: what do you think?
00:20:58 <thinrichs> Another option: is it possible/easy to (a) startup multiple API servers on different ports and (b) write tempest tests to send requests to different ports?
00:21:09 <ekcs> we discussed it last time. That’s what I also thought is best, but we should check to make sure oslo won’t always pick same.
00:21:53 <ramineni_> thinrichs: thats what current HA tempest tests does
00:22:06 <ramineni_> it starts on different port
00:22:21 <ramineni_> but isn´t it the same
00:22:43 <ramineni_> anyway we are sending it to one API only
00:23:05 <thinrichs> ramineni_: Ok.  Sounds like you're doing all the right things then.
00:23:13 <thinrichs> ramineni_: anything you need from us?
00:23:30 <ekcs> if it’s easy to start on diff ports, then I think that would be best.
00:23:52 <thinrichs> and if it's easy to send requests to different ports.  We don't do that today
00:24:12 <ramineni_> ekcs: y?
00:24:42 <ekcs> two reasons: 1. that way we have more control over the test. send to A then to B. more reproducible if there is failure.
00:25:01 <ekcs> 2. that way we test in the recommended deployment config of API+PE in one node.
00:25:28 <ekcs> but I didn’t know whether it was easy or hard to start on diff ports and send requests to diff ports.
00:25:31 <ekcs> just my thoughts.
00:25:33 <thinrichs> ekcs: +1 Lots of nondeterminism in letting oslo-messaging pick the PE
00:26:01 <thinrichs> Seems worth investigating whether we can send API requests to different ports in tempest
00:26:18 <ramineni_> ekcs: thinrichs: i still dont get it, even if we start on different ports PE would be automatically chosen
00:26:45 <ramineni_> may be we could discuss this later
00:27:12 <ekcs> yea maybe I’m confused about whether we can send a request to a specific port.
00:27:21 <thinrichs> If we deployed 3 processes: (i) API+PE, (ii) API + PE, (iii) DSD
00:27:34 <thinrichs> then we can write tests that send API calls to (i) or (ii) explicitly.
00:27:49 <thinrichs> (Assuming we can send API calls to different ports)
00:28:57 <thinrichs> Let's come back to this later if there's time.
00:29:05 <thinrichs> masahito: want to give a status report?
00:29:50 <masahito> thinrichs: sure
00:30:31 <masahito> I'm updating a patch for lazy datasource. I'll push it today.
00:30:48 <masahito> that's from my side. not a lot progress.
00:31:41 <thinrichs> masahito: thanks
00:31:59 <thinrichs> ekcs: status update?
00:32:13 <ekcs> Submitted patches for:
00:32:14 <ekcs> 1. sync datasources by UUID to avoid mix-up due to datasoure name reuse. #link https://review.openstack.org/#/c/356157/
00:32:14 <patchbot> ekcs: patch 356157 - congress - HAHT - datasource synchronizer use UUID not name
00:32:15 <ekcs> 2. resubscribe if sequenced data missing for too long. #link https://review.openstack.org/#/c/356806/
00:32:15 <patchbot> ekcs: patch 356806 - congress - DSE2 - resubscribe if update missing for too long
00:32:16 <ekcs> 3. quick fix for ironic_driver that prevented congress from starting.
00:32:19 <ekcs> That’s all.
00:33:03 <thinrichs> Anything to discuss on those patches here?
00:34:01 <ekcs> not really.
00:34:27 <thinrichs> On the resubscribe if update missing patch, did you run into that case?
00:34:56 <ekcs> no I didn’t, but it was always planned (in my head =p )
00:35:20 <ekcs> without it, if a single update gets lost, no more updates can be processed from that table forever.
00:35:50 <thinrichs> Just wondered if you were doing some serious testing.  Seemed like a tough one to hit by accident.
00:36:25 <thinrichs> I'll take a look tomorrow probably
00:36:30 <thinrichs> I'll do a quick update
00:36:33 <ekcs> no haven’t done heavy testing. the version without resubscribe was basically meant for single-process dse2 use.
00:36:54 <thinrichs> I've been underwater all week.  No real progress.  Just tried to keep up with reviews.
00:37:19 <thinrichs> Let's move to the newton-3 discussion
00:37:22 <thinrichs> #topic newton-3
00:37:39 <thinrichs> We should figure out what we want to finish up in the next 2 weeks.
00:38:09 <ekcs> I think HAHT is on track for newton-3. All major required code changes are merged or in review. Remaining are tests and docs and optional features and I'm sure bug fixes. Still a fair amount of work for tests and docs, but I assume it's not the end of the world if those slip a couple weeks.
00:38:10 <ekcs> Here are the remaining bugs targeting newton-3 #link https://launchpad.net/congress/+milestone/newton-3
00:39:05 <ekcs> slip into RC1 that is.
00:40:03 <ekcs> actually here’s a question about newton-3.
00:40:06 <ekcs> Soft StringFreeze¶
00:40:06 <ekcs> You are no longer allowed to accept proposed changes containing modifications in user-facing strings. Such changes should be rejected by the review team and postponed until the next series development opens (which should happen when RC1 is published).
00:40:28 <ekcs> does that mean no changes to docs or just no changes to program output?
00:40:49 <thinrichs> Pretty sure that's program output
00:41:01 <ekcs> ok good =)
00:41:12 <thinrichs> That's for translation, I'm pretty sure.  It gives the translation team time to internationalize the messages.
00:41:13 <ekcs> “user-facing strings” is a bit ambiguous.
00:41:39 <thinrichs> Docs always lag behind code, so freezing the docs before the code doesn't make sense
00:42:07 <thinrichs> We haven't gotten around to internationalizing, so the string freeze doesn't apply
00:42:33 <thinrichs> Here's my first question: are we stable without HAHT deployments?  I.e. if we deploy in 1 process?
00:43:08 <ekcs> i’ve only done very light testing so far.
00:43:20 <thinrichs> We have tempest tests running, so that gives us some confidence
00:43:40 <thinrichs> I'd say that once we hit newton-3, we should do some of that testing immediately.
00:44:27 <thinrichs> The goal then is to get the HAHT code in by newton-3 and use the remainder of the cycle to find/fix bugs.
00:44:50 <thinrichs> It looks like all the code is underway.
00:45:00 <thinrichs> Any worries about landing the ones in progress?
00:45:24 <thinrichs> The synchronizer update for distributed deployment (ramineni_)?
00:45:37 <thinrichs> datasource add/delete (ekcs)?
00:46:29 <ekcs> no notable concern on my end.
00:46:37 <ramineni_> i have patch for that already
00:47:14 <thinrichs> Awesome!
00:47:37 <ekcs> I hope we can get all tests in by newton-3 too, to expose bugs as early as possible. should be doable.
00:47:55 <thinrichs> ekcs: +1
00:48:07 <thinrichs> Maybe what I can do is save up some Congress time and once those two land start manual testing
00:48:40 <ekcs> new devstack will be interesting.
00:48:57 <ekcs> s/new/fresh
00:49:18 <thinrichs> That's true.  That'll take some time to get set up.
00:49:40 <thinrichs> Is our devstack plugin stable at this point?
00:50:02 <thinrichs> I could start getting that running now and then just upgrade the Congress code once it comes in
00:50:53 <ramineni_> thinrichs: sorry, I have to leave now, will catchup later
00:51:05 <thinrichs> masahito: are you expecting your patches to land in time?
00:51:14 <thinrichs> ramineni_: sure.  Thanks.  See you later
00:51:26 <masahito> thinrichs: yap.
00:52:10 <thinrichs> Great!
00:52:18 <thinrichs> That's all I have for today.
00:52:25 <thinrichs> Anything else to discuss?
00:52:28 <thinrichs> #topic open discussion
00:52:33 <ekcs> i think I realized that ramineni_ meant on ha tempest test
00:53:08 <ekcs> we’re trying to reuse existing tempest tests on new deployment configuration. so we’re not targeting specific nodes anyway.
00:53:34 <thinrichs> ekcs: that makes sense.  Reusing existing tests
00:54:03 <thinrichs> So I guess we'll need new variants of tests for the HA stuff
00:54:28 <thinrichs> I guess those should be focused on testing just the HAHT features.  We can leave the more basic functionality to the existing tests
00:54:35 <ekcs> so the tests themselves won’t target specific nodes. but we could still write a new layer that sends requests to diff ports in a deterministic fashion.
00:55:20 <ekcs> right. and from our discussion last time, i think the conclusion is that most of the detailed HA feature testing should be done in test_congress style unit tests.
00:55:39 <thinrichs> ekcs: makes sense
00:55:56 <ekcs> so it may or may not be worth doing extra work to adapt the existing tempest tests to distribute requests to diff ports.
00:56:34 <thinrichs> Agreed
00:56:39 <ekcs> determinism still nice. but seems like we can go either way.
00:57:12 <thinrichs> I'd say start with something that tests multiple PEs and if nondeterminism becomes a problem address it then
00:57:15 <masahito> agreed.
00:57:21 <ekcs> agreed.
00:57:49 <thinrichs> So it seems we're agreed about the API + PE + PE + DSD setup for running tempest tests
00:58:06 <ekcs> makes sense to me.
00:58:08 <thinrichs> ekcs: how thorough would you say our test-congress tests are for HAHT?
00:58:33 <thinrichs> 2 min left
00:58:34 <ekcs> not thorough at all at the moment.
00:59:02 <ekcs> I was thinking specific failover tests would be in tempest. but now targeting unittests after discussion.
00:59:27 <thinrichs> So one of those bugs is actually targeting test-congress tests, instead of tempest
00:59:29 <ekcs> not thorough in terms of simulating actual failvover scenario.
00:59:48 <ekcs> pretty good for testing specific behavior.
00:59:56 <ekcs> basically their more unit tests than integrationtests.
01:00:01 <thinrichs> ekcs: got it.  Good to know
01:00:15 <ekcs> yes. i’ll change the bug.
01:01:03 <thinrichs> Out of time.  Thanks all!
01:01:05 <thinrichs> #endmeeting