16:01:08 <inc0> #startmeeting kolla
16:01:09 <openstack> Meeting started Wed Oct 12 16:01:08 2016 UTC and is due to finish in 60 minutes. The chair is inc0. Information about MeetBot at http://wiki.debian.org/MeetBot.
16:01:10 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
16:01:13 <openstack> The meeting name has been set to 'kolla'
16:01:14 <akwasnie> o/
16:01:15 <pbourke> woot
16:01:19 <Jeffrey4l_> 0/
16:01:20 <egonzalez90> o/
16:01:25 <duonghq> o/
16:01:25 <qwang> o/
16:01:27 <vhosakot> o/
16:01:28 <berendt> o/
16:01:29 <jascott1> o/
16:01:30 <hrito> o/
16:01:31 <coolsvap> o/
16:01:33 <duonghq> no roll call topic?
16:01:34 <inc0> hold on
16:01:40 <inc0> #topic rollcall - woot for kolla
16:01:42 <sdake> o/
16:01:44 <rhallisey> lul
16:01:49 <Jeffrey4l_> \o/
16:01:54 <inc0> woot guys plz;)
16:01:55 <egonzalez90> woot
16:01:57 <duonghq> woot
16:01:58 <berendt> woot
16:02:04 <sdake> i do not consent ;)
16:02:13 <vhosakot> \o/00T
16:02:14 <pbourke> woot again
16:02:16 <akwasnie> woot
16:02:19 <Jeffrey4l_> woot
16:02:21 <rhallisey> vhosakot, nice
16:02:26 <jascott1> woot
16:02:27 <vhosakot> ;)
16:02:37 <zhubingbing> \o/
16:02:47 <inc0> ok, cores please -2 sdakes changes until he submits to our internal rituals
16:02:58 <sdake> your pain not mine :)
16:03:08 <inc0> (kidding ofc, we'll deal with it phusically in summit if he wont)
16:03:18 <inc0> #topic announcements
16:03:21 <sdake> buy me lots o beers
16:03:21 <sdake> i'm broke ;)
16:03:42 <inc0> 1. RC2 is tagging today, we need to open stable/newton branch
16:03:58 <inc0> we will have rc3 due to critical bugs outstanding
16:04:08 <inc0> it's going to be Oct18
16:04:23 <inc0> 2. remind - summit schedule is up for review
16:04:41 <inc0> any announcements from community>
16:05:18 <inc0> guess not
16:05:25 <inc0> #topic newton release
16:05:27 <inc0> sdake, you're up
16:05:47 <sdake> https://launchpad.net/kolla/+milestone/newton-rc2
16:06:00 <sdake> the summary there is there are 7 critical bugs that make kolla unusable
16:06:13 <sdake> we have a slew of high bugs which make a particular service have some kind of defect
16:06:25 <wirehead_> o/
16:06:32 <sdake> on the plus side, we have fixed 66(!) bugs in rc2
16:06:44 <sdake> we also have a slew of bugs in the "INPROGRESS STATE"
16:06:55 <sdake> if there is a bug in INPROGRESS that can be merged, lets get it merged
16:07:11 <sdake> after we tag rc2, only critical bugs will be backported to stable/newton
16:07:21 <sdake> a few mechanics on the tagging/branching
16:07:41 <sdake> rc2 tags today 23:45 UTC -> the release team tags when they get to it
16:07:51 <sdake> the branch gets created when the release team gets to it
16:07:51 <sdake> they are not slow
16:07:59 <sdake> but don't expect a branch immediately when the tag occurs
16:08:13 <sdake> for rc3, we are carrying 7 bugs
16:08:21 <sdake> these need critical attention
16:08:27 <sdake> there are some other bugs in that list that are in triaged
16:08:32 <sdake> that are critical
16:08:56 <sdake> if someone could get them out of the triaged state, that would be fantastic (by confrming, marking invalid, etc)
16:09:10 <sdake> i will be carrying over triaged bugs that are critical as well as confirmed bugs
16:09:39 <sdake> if they are marked critical - I assume whoever triaged it understands the bug impacts all of kolla
16:09:51 <sdake> now, what to do if you have a high severity bug that you think should be critical?
16:09:54 <sdake> change it to critical
16:10:02 <sdake> (before 23:45)
16:10:21 <sdake> the workflow seems to be going well from last meeting
16:10:42 <sdake> it would be great if we could finish the job on those inprogress bugs rather then having to backport them, so i'd suggest fixing those today :)
16:10:46 <sdake> or reviewing them
16:10:51 <sdake> etc
16:10:58 <sdake> if its critical, just assume it will be fixed starting tomorrow
16:11:04 <sdake> I need 1 more week out of everyone :)
16:11:12 <sdake> and then we have free time for some time :)
16:11:23 <sdake> (tag is tuesday for rc3)
16:11:32 <inc0> 3 days, then summit:)
16:11:46 <sdake> shhhh :)
16:12:03 <sdake> after summit - you will have plenty o time to recover :)
16:12:40 <sdake> inc0 you wanted a status update on each bug?
16:12:52 <sdake> rather each critical
16:12:53 <inc0> yes, let's do this, for critical at least
16:13:04 <sdake> cool take it away - have to do something rq like :)
16:13:08 <inc0> #topic critical bugs review
16:13:26 <inc0> let's spend few moments to discuss each critical bug
16:13:37 <inc0> to help fixing it
16:13:53 <inc0> 1 https://bugs.launchpad.net/kolla/+bug/1564773
16:13:53 <openstack> Launchpad bug 1564773 in kolla mitaka "Continously restarting rabbitmq container for CentOS" [Critical,Confirmed]
16:14:55 <inc0> hmm, anyone have steps to reproduce this one?
16:15:34 <Jeffrey4l_> i followed this bug. but got nothing about it. :(
16:15:58 <inc0> dug
16:15:59 <inc0> duh
16:16:02 <zhubingbing> Bug report time is too long.
16:16:13 <inc0> yes, but comments are fres
16:16:14 <inc0> h
16:16:20 <inc0> so it's still a thing as it seems
16:16:25 <Jeffrey4l_> yes. the error message is different too.
16:16:34 <duonghq> tomorrow I'll try to triaged this bug, today the epel repo is broken for awhile
16:16:37 <berendt> sean-k-mooney noted it happens on maste, can he support?
16:16:42 <sdake> pbourke conirmed it
16:16:47 <sdake> i changed it to critical as a result
16:17:43 <sdake> we need to circle back with pbourke on this issue and see what his defintiion of confirmed is
16:17:51 <sdake> i'll do tha ttomororw morning
16:18:11 <pbourke> one moment I'll check now
16:18:21 <sdake> oops ssorry pbourke didn't know you were here
16:18:51 <pbourke> i confirmed it as there are at least 3 separate people in in the comments saying they see it
16:18:57 <pbourke> as well as our own sean-k-mooney
16:19:15 <sdake> pbourke well sean should know how the software works
16:19:22 <sdake> can a core take on confirming this problem
16:19:33 <inc0> pbourke, can you check with Sean tomorrow to and try to triage it?
16:19:38 <pbourke> will do
16:19:39 <sdake> and fixing it
16:19:42 <inc0> thanks
16:19:57 <inc0> https://bugs.launchpad.net/kolla/+bug/1616268
16:19:57 <openstack> Launchpad bug 1616268 in kolla mitaka "Stale namespace removal causing "RTNETLINK answers: Invalid argument" errors" [Critical,Confirmed] - Assigned to Jeffrey Zhang (jeffrey4l)
16:20:13 <inc0> Jeffrey4l_, I understand you have theory and you're going to fix it?
16:20:27 <sdake> pretty sure Jeffrey4l_ is on the money on it
16:20:32 <Jeffrey4l_> inc0, yes. i am working on it. will push a PS later today.
16:20:39 <inc0> great
16:20:40 <inc0> thanks
16:20:52 <inc0> https://bugs.launchpad.net/bugs/1617334
16:20:53 <openstack> Launchpad bug 1617334 in kolla "reconfigure action fails on [neutron | Restart the neutron_openvswitch_agent container] " [Critical,Triaged]
16:21:12 <sdake> this is marked critical because if reconfigure is busted, thats not good
16:21:21 <sdake> its not marked confirmed because it hasn't been confirmed in the bug log
16:21:27 <sdake> with several core reviewers trying iirc
16:21:51 <sdake> i use openvswitch all the time
16:21:53 <sdake> and it wfm
16:21:56 <sdake> even reconfigure
16:22:01 <berendt> same here
16:22:14 <sdake> i'm surprised linuxbridge works for htis fellow and not ovs
16:22:30 <sdake> a good course of action would be eto request more info and mark it incomplete
16:22:42 <sdake> determine what info to request is difficult ...
16:23:52 <inc0> but it's not triaged as well right?
16:24:01 <sdake> its been triaged
16:24:06 <sdake> it hasn't been confirmed
16:24:25 <sdake> triage = setting milestone and priority
16:24:51 <sdake> Jeffrey4l_ for htis bug, you requested his inventory file
16:25:09 <Jeffrey4l_> sdake, yep. but still get nothing idea now.
16:25:10 <sdake> Jeffrey4l_ so hard for us to mark it incomplete without actually looking at his inventory file
16:25:21 <sdake> Jeffrey4l_ ok - i've got this one
16:25:24 <sdake> go ahead and move on inc0
16:25:25 <inc0> nothing too special there
16:25:33 <inc0> https://bugs.launchpad.net/kolla/+bug/1625648
16:25:33 <openstack> Launchpad bug 1625648 in kolla "horizon login fails with TLS enabled" [Critical,Triaged]
16:25:55 <inc0> bjolo promised to try to reproduce
16:26:30 <zhubingbing> i don't reproduce this bug
16:26:56 <inc0> let's wait for bjolo
16:27:00 <inc0> https://bugs.launchpad.net/kolla/+bug/1631072
16:27:00 <openstack> Launchpad bug 1631072 in kolla "iscsid container: mkdir /sys/kernel/config: operation not permitted" [High,Triaged]
16:27:24 <sdake> are we going throug hthe high bugs too?
16:27:37 <Jeffrey4l_> this is a ubuntu bug.
16:27:38 <inc0> ahh copyfail sorry
16:27:57 <inc0> https://bugs.launchpad.net/kolla/+bug/1623013
16:27:57 <openstack> Launchpad bug 1623013 in kolla "keystone-fernet: rsync: Failed to exec ssh: No such file or directory (2)" [Critical,In progress] - Assigned to Christian Berendt (berendt)
16:28:04 <sdake> this must be fixed
16:28:05 <Jeffrey4l_> see #4 and #6
16:28:21 <sdake> berendt any updates? :)
16:28:33 <Jeffrey4l_> this is under review i think.
16:28:35 <berendt> i thought Jeffrey4l_ took this one
16:28:43 <sdake> i see
16:28:51 <Jeffrey4l_> https://review.openstack.org/#/c/369418
16:28:51 <sdake> well - seems like whos on first to me :)
16:28:59 <Jeffrey4l_> review is welcome ;)
16:29:21 <inc0> ok, make this review priority folks please
16:29:24 <sdake> groan
16:29:32 <sdake> so - one thing i failed to mention in the announcements
16:29:34 <sdake> the gate is busted
16:29:37 <berendt> how to proceed? with or without ssh?
16:29:39 <sdake> because ceph.com is down
16:29:51 <inc0> yeah, but we can still review code
16:29:53 <inc0> without merging it
16:29:53 <sdake> a really dont want to merge anything complex before rc2
16:29:56 <sdake> right
16:30:05 <sdake> so review away - pls dontmerge anythign that isn't totally obvious :)
16:30:07 <Jeffrey4l_> we can not other ceph mirror.
16:30:10 <jascott1> ceph is back i thought
16:30:10 <inc0> so let's not merge until gates gets up, but review it
16:30:15 <Jeffrey4l_> we can use other ceph mirror
16:30:26 <Jeffrey4l_> jascott1, cool.
16:30:32 <berendt> jascott1 no it is still down
16:30:37 <berendt> at least on my site
16:30:56 <sdake> digitalocean has an outage
16:31:03 <sdake> ironically enough caused by ceph :)
16:31:14 <sdake> i think thats the place hosting this
16:31:21 <sdake> or maybe it was somewhere else
16:31:22 <sdake> anyway - its down for me too
16:31:24 <inc0> so last bug
16:31:28 <inc0> https://bugs.launchpad.net/kolla/+bug/1631503
16:31:28 <openstack> Launchpad bug 1631503 in kolla "inconsistent UID in rabbitmq results in inability to upgrade" [Critical,In progress] - Assigned to Steven Dake (sdake)
16:31:33 <jascott1> its dreamhost
16:31:38 <sdake> jascott1 right thanks
16:31:46 <Jeffrey4l_> fix is here: https://review.openstack.org/384598
16:32:03 <Jeffrey4l_> i will update the commit message.
16:32:04 <inc0> for rabbitmq
16:32:15 <Jeffrey4l_> but the fix logical is done.
16:32:15 <inc0> we need to figure out other affected services
16:32:24 <sdake> inc0 anything with a named volume
16:32:38 <inc0> which is almost everything
16:32:51 <Jeffrey4l_> right. so i think we need another bug to describe the detail rather then centos rabbitmq upgrade issue.
16:33:00 <inc0> Jeffrey4l_, can you make you review partial-bug please?
16:33:07 <inc0> or that
16:33:10 <sdake> change th topic of the bug eff
16:33:11 <sdake> Jeffrey4l_
16:33:21 <sdake> click yellow button next to bug topic
16:33:22 <Jeffrey4l_> inc0, np.
16:33:51 <Jeffrey4l_> sdake, yep. change it to ?
16:34:51 <sdake> Jeffrey4l_ make something up? :)
16:35:11 <sdake> "upgrades broken because of inconsistent UIDs" should do it
16:35:30 <Jeffrey4l_> done.
16:35:30 <inc0> I'd rather not create 10 bugs for each service, but once Jeffreys patch is merged we need to do same thing for all the services
16:35:34 <inc0> so we can spread work
16:35:48 <inc0> let's make sure to use same topic in review
16:35:56 <inc0> https://review.openstack.org/#/q/topic:bug/1631503
16:36:04 <inc0> to avoid duplication of work
16:36:15 <inc0> and reviewers, please keep eye on this link
16:36:29 <sdake> inc0 an etherpad would come in handy here i think - to figure out who is doing which part if you intend to spread load
16:36:54 <Jeffrey4l_> we need some thing to trick all the container which need fix. should we use bugs/work items or anything else.
16:37:02 <sdake> 6 days to 3.0.0 - spreading work will be more difficult lI think
16:37:12 <Jeffrey4l_> sdake, agreed to use etherpad.
16:37:13 <inc0> well, one person won't handle it all
16:37:45 <inc0> #link https://etherpad.openstack.org/p/kolla-bug-1631503
16:37:58 <sdake> cool so if spreading work lets use etherpad - Jeffrey4l_ can you link that in the bug so devs ca nfind it
16:38:08 <Jeffrey4l_> no.
16:38:15 <Jeffrey4l_> sorry. np.
16:38:48 <inc0> http://s2.quickmeme.com/img/48/483ef0911f5e27073a015b45aee7a288b9c8d3bfa104f8bfe6625572f97cfa52.jpg sorry, couldn't help myself
16:39:20 <Jeffrey4l_> it is a typo. ;(
16:39:38 <inc0> I know, don't worry;)
16:40:02 <inc0> Jeffrey4l_, I changed topic on your ps to bug/1631503
16:40:18 <Jeffrey4l_> ok.
16:40:29 <inc0> so between etherpad and this gerrit review queue, let's fix this
16:40:53 <Jeffrey4l_> should we update the bug? it do not just fix rabbitmq upgrade in centos now.
16:41:11 <sdake> Jeffrey4l_ we are doing partial-fix on each component i think
16:41:26 <Jeffrey4l_> but the bug title is bad: inconsistent UID in rabbitmq results in inability to upgrade
16:41:32 <inc0> https://bugs.launchpad.net/kolla/+bug/1631503 <= changed name
16:41:32 <openstack> Launchpad bug 1631503 in kolla "inconsistent UID in named volume results in inability to upgrade" [Critical,In progress] - Assigned to Steven Dake (sdake)
16:41:44 <sdake> Jeffrey4l_ i gave you a title in scrollback to use for bug description
16:41:46 <inc0> no mention of centos there
16:41:46 <Jeffrey4l_> actually it is: fix the uid issue during upgrade
16:42:33 <Jeffrey4l_> hmm. drop the "rabbitmq" word?
16:42:38 <inc0> I did:)
16:42:50 <Jeffrey4l_> cool
16:42:51 <inc0> replaced it with named voluem
16:43:04 <Jeffrey4l_> another think is: should we backport this to mitaka?
16:43:13 <sdake> one note on this bug
16:43:23 <inc0> so again:
16:43:23 <sdake> folks - the approach ahs been bikeshedded to death
16:43:27 <inc0> #link https://review.openstack.org/#/q/topic:bug/1631503
16:43:29 <sdake> lets just merge the design as is
16:43:38 <inc0> this link has to be reviewed, keep an eye on it
16:44:03 <sdake> by bikeshed to death - i've played telephone for atleast 8 hours on this specific issue
16:44:34 <inc0> ok, I think we stressed the importance enough:)
16:45:10 <inc0> so, we have status update on all critical bugs
16:45:16 <inc0> let's get them fixed in following week
16:45:30 <inc0> anything else to add?
16:45:48 <inc0> guess not
16:45:55 <inc0> #topic open discussion
16:46:12 <rhallisey> inc0, can we re order the summit sessions
16:46:12 <inc0> so one thing, you guys think we need meeting next week?
16:46:33 <duonghq> do we have semi-fixed summit sessions?
16:46:35 <inc0> yeah, let me do it now
16:46:35 <sdake> yes meeting next week - last minute summit prep is critical
16:46:43 <inc0> ok
16:46:47 <inc0> meeting stays then
16:46:54 <zhubingbing> About Heka we have any good alternative?
16:47:30 <sdake> zhubingbing we need an honest evaluation of the alternatives
16:47:37 <sdake> of which there are two that I am aware of
16:48:03 <duonghq> iirc there is solution already use in kolla-k8s?
16:48:05 <inc0> zhubingbing, I see 3 now, filebeat and fluentd are there, snap is meaning to grow this feature in near future (we should be ok with timing)
16:48:43 <sdake> snap sounds like heka 2.0 :)
16:49:00 <inc0> rhallisey, sdake what were your rescheduling issues? I'm switching osic review with ci
16:49:01 <zhubingbing> -)
16:49:10 <inc0> sdake, yeah, snap is exactly that
16:49:19 <rhallisey> inc0, can we move the road map later
16:49:20 <inc0> heka was cool, until mozilla dropped it
16:49:29 <rhallisey> I want to change around a bunch of things
16:49:36 <rhallisey> disagree with the current order
16:49:55 <rhallisey> I do have some conflicts, but I think the overall order needs to be changed
16:49:57 <pbourke> should look at logstash again also
16:50:08 <sdake> pbourke logstash-forwarder is EOL bro ;)
16:50:11 <rhallisey> imo
16:50:13 <jascott1> Mozilla mention this yesterday failed to get kk8s up due to ceph.com outage so worked on ansible playbooks, reviewed code. Today create cinder ansible commit. Blockers: ceph.com is still down:(
16:50:23 <berendt> pbourke filebeat.. not logstash
16:50:23 <jascott1> whoa epic fail
16:50:25 <pbourke> sdake: you sure?
16:50:31 <jascott1> https://github.com/trink/hindsight)
16:50:35 <sdake> pbourke yup look at their repo
16:50:48 <duonghq> logstash-forwarder is replaced by Beat iirc
16:50:53 <inc0> rhallisey, whats wrong with it?;)
16:50:57 <duonghq> seem promise?
16:50:57 <sdake> pbourke elastic EOL'ed logstash-forwarder and introduced filebeat instead
16:51:15 <Jeffrey4l_> inc0, could u check this again https://review.openstack.org/378762 i perfer use the same timezone for all containers.
16:51:16 <sdake> why? don't know ;)
16:51:18 <duonghq> sdake: Beats, indeed
16:51:32 <rhallisey> inc0, let's get into it after the meeting
16:51:44 <pbourke> whats the difference between logstash and logstash forwarder :/
16:51:56 <inc0> ok rhallisey
16:51:58 <sdake> logstash collects logs and sends them to elasticsearch
16:52:07 <zhubingbing> iirc what's mean ? i don;t know ;)
16:52:10 <sdake> logstachforwarder sends logs to logstash (or elasticsearch)
16:52:11 <inc0> Jeffrey4l_, ok, I just wanted more context
16:52:13 <sdake> we don't need logstash itself
16:52:27 <duonghq> pbourke: Elastic introdue Beats as protocol for log shipping purpose
16:52:35 <jascott1> is mozilla's own recommeded replacement not suitable? https://github.com/trink/hindsight
16:52:37 <Jeffrey4l_> what happen to logstash? isn't it widely used?
16:53:00 <sdake> jascott1 we need an honest eval of the different solutions
16:53:07 <sdake> rather then picking one randomly :)
16:53:12 <pbourke> jascott1: i would think twice before using something from mozilla again
16:53:21 <jascott1> oh sure but no one had mentioned it. got it.
16:53:21 <Jeffrey4l_> we'd better talk the alternative after the summit ;)
16:53:22 <wirehead_> heh
16:53:23 <berendt> logstash-forwarder was replaced by filebeat
16:53:24 <inc0> well, it's a 4th option to look at sdake
16:53:28 <berendt> logstash is not deprecated
16:53:40 <jascott1> logstash is 300M JVM iirc
16:53:49 <duonghq> sdake: is there some criterial for evaluate logging engine?
16:54:16 <wirehead_> Filebeat is a tiny little go app.
16:54:19 <inc0> it has to do what heka is doing today and not deprecated;)
16:54:21 <sdake> duonghq not that i know of - an action item
16:54:31 <duonghq> ya
16:54:32 <wirehead_> (Logstash-forwarder was also pretty cute and tiny)
16:54:44 <inc0> if it meets these 2 simple criteria - pick up best one
16:54:44 <sdake> wirehead_ minus the java jvm part...
16:54:59 <sdake> it should also be effecient - 3rd criteria
16:55:01 <wirehead_> You don’t need the Logstash engine actually running unless you are mediating queues and stuff, filebeat can go straight to elasticsearch
16:55:09 <sdake> 25% of our cpu consumption on controller nodes is the logging stack
16:55:25 <wirehead_> Naw, logstash-forwarder is also a go app
16:55:52 <sdake> guess its time to learn go :)
16:56:01 <duonghq> agreed with sdake
16:56:10 <sdake> i was making a joke
16:56:14 <sdake> but ok :)
16:56:14 <wirehead_> <best Marvin the paranoid android voice> It’s awful </best Marvin the paranoid android voice>
16:56:48 <wirehead_> I mean, Logstash / Filebeat / ElasticSearch is sufficently popular that there has to be a clear migration off of it if something changes.
16:56:52 <duonghq> about summit sessions, have we fixed topic about kolla-k8s?
16:57:08 <inc0> duonghq, 2 sessions for kolla-k8s
16:57:12 <sdake> duonghq the schedule is in #opnestack-kolla topic
16:57:13 <inc0> roadmap and arch
16:57:14 <rhallisey> duonghq, ya there are 2
16:57:36 <inc0> anwyay, we're running out of time
16:57:39 <duonghq> okay
16:57:44 <inc0> let's move to our normal channel please
16:57:56 <inc0> unless there are last comments to make?:)
16:57:57 <zhubingbing> bye;)
16:58:04 <duonghq> thank you all
16:58:12 <inc0> ok, thank you guys! see you in summit I hope!
16:58:12 <Jeffrey4l_> hey guys.
16:58:15 <inc0> #endmeeting kolla