14:00:16 <irenab> #startmeeting kuryr
14:00:16 <openstack> Meeting started Mon Feb 19 14:00:16 2018 UTC and is due to finish in 60 minutes.  The chair is irenab. Information about MeetBot at http://wiki.debian.org/MeetBot.
14:00:17 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
14:00:20 <openstack> The meeting name has been set to 'kuryr'
14:00:37 <leyal> Hi
14:00:40 <dulek> Hello!
14:00:54 <maysamacedos> Hi
14:01:27 <irenab> Let's wait few more minutes for others to join
14:01:47 <kaliya> Greetings
14:02:04 <ltomasbo> o/
14:02:44 <irenab> Hi all, thank you for joining. Let's start
14:02:56 <irenab> #topic kuryr-libnetwork
14:02:58 <yboaron> o/
14:03:13 <garyloug> o/
14:04:16 <irenab> there is a patch #link https://review.openstack.org/#/c/544548/ that deals with tags extension changes in neutron
14:05:00 <irenab> the question is if previous extensions, meaning previous neutron versions should be supported
14:05:18 <celebdor> irenab: I'm inclined to say that they should
14:05:27 <celebdor> is it a lot of trouble to make it compatible with both?
14:05:41 <irenab> actually there are 3 options
14:06:02 <dulek> tag and tag_ext are deprecated since pike: https://docs.openstack.org/releasenotes/neutron/pike.html#deprecation-notes
14:06:20 <dulek> And guys planned to remove it in Queens, but left it. It's getting removed in Rocky.
14:06:22 <irenab> It does not look like a big deal
14:06:57 <dulek> So question is do we want Rocky's Kuryr to support Ocata?
14:07:07 <dulek> s/do we want/do we care
14:07:41 <celebdor> dulek: it only depends on the cost
14:07:56 <irenab> I believe the cost is not big
14:08:13 <irenab> #chair celebdor
14:08:14 <openstack> Current chairs: celebdor irenab
14:08:33 <irenab> so shall we vote for keeping both?
14:08:47 <celebdor> if the cost is not big, since we are not using any new features of the new tags
14:08:51 <celebdor> we should support them
14:08:56 <irenab> +1
14:09:02 <irenab> dulek ?
14:09:31 <dulek> Works for me either way. It's not a lot to carry in the code.
14:09:40 <irenab> great
14:09:53 <irenab> any other libnetwork related issues to discuss?
14:10:33 <irenab> ok, moving on
14:10:41 <irenab> #topic kuryr-kubernetes
14:11:03 <irenab> anyone want to update?
14:11:26 <celebdor> man...
14:11:37 <celebdor> Well, I filed two bugs last friday
14:11:46 <irenab> links?
14:11:55 <celebdor> basically with neutron-lbaasv2 and firewall=openvswitch
14:11:58 <celebdor> services are broken
14:13:24 <celebdor> just a sec
14:13:30 <yboaron> #link https://bugs.launchpad.net/kuryr-kubernetes/+bug/1749968
14:13:30 <openstack> Launchpad bug 1749968 in kuryr-kubernetes "services backed by neutron-lbaas do not work with native ovs firewall" [Critical,In progress] - Assigned to Antoni Segura Puimedon (celebdor)
14:13:41 <celebdor> thanks yboaron
14:13:53 <celebdor> the biggest contention point though
14:14:12 <celebdor> is that for east-west, setting the same SG as the pods work
14:14:17 <celebdor> for loadbalancer service type
14:14:42 <celebdor> we either need to dynamically change the ports in a SG per LB depending ont he listeners
14:15:03 <celebdor> or just allow everything in for external traffic when service is loadbalancer type
14:15:10 <irenab> for the issue yboaron raises I was that woith Dragonflow there is a need to add SG for the specific port of the servic itself
14:15:22 <celebdor> since neutron-lbaasv2 is deprected I'm inclined to just allow everything
14:15:39 <celebdor> (specially considering haproxy will only LB the listener ports anyway)
14:15:55 <yboaron> irenab, IIRC it was LoadBalancer service type --> means N-S
14:16:03 <irenab> yes
14:16:52 <irenab> celebdor, the patch you proposed covers what you just explained?
14:17:19 <celebdor> only covers east-west
14:17:26 <celebdor> n-s needs to be added
14:17:30 <celebdor> one of the two options above
14:17:49 <yboaron> #link https://review.openstack.org/#/c/545363/  - should support east-west
14:17:56 <celebdor> (I'd rather not carry the extra SG per LB since it feels like repeating octavia code, but I can be persuaded)
14:18:47 <irenab> the question is if we officially support HAProxy flavor (till it is removed)
14:18:53 <celebdor> irenab: dulek: ltomasbo: yboaron: ^^
14:19:12 <celebdor> irenab: do you have native lb already in df?
14:19:33 <irenab> not yet, speced but not implemented
14:20:11 <dulek> celebdor: I don't have strong opinions here, but with neutron-lbaas being deprecated, I'd say to stick with Octavia's way.
14:20:28 <ltomasbo> neither do I
14:20:49 <ltomasbo> but agree with dulek, give preference to Octavia's way
14:20:57 <irenab> dulek, Octavia is a bit more resource heavy flavor,I won't be surprised if some will choose HAProxy
14:21:23 <celebdor> irenab: so do you use octavia or lbaasv2?
14:21:24 <ltomasbo> irenab, that's true
14:21:39 <irenab> celebdor, use for what?
14:21:48 <irenab> I mean the kuryr users
14:21:49 <celebdor> irenab: in your deployments
14:22:04 <celebdor> I'm inclined to support lbaasv2 until we have either containerized octavia or octavia plugins
14:22:38 <irenab> I agree, then probably need to solve n-s in the Octavia like way
14:22:38 <dulek> celebdor: It's awful that we don't have anything rock-solid in LB spaceā€¦
14:23:02 <dulek> I mean - we wonder if anyone will use Octavia, while LBaaSv2 is already deprecated.
14:23:09 <dulek> This isn't really healthy situation.
14:23:37 <celebdor> irenab: I was afraid you'd say that
14:23:49 <irenab> dulek, as far as I remember there is no specific details about when lbaasv2 will be removed
14:24:02 <dulek> irenab: It was announced recently.
14:24:04 <celebdor> dulek: Octavia, as is, can only be used at scale for N-S
14:24:14 <celebdor> in the kubernetes scenario
14:24:14 <irenab> celebdor, lets finalize it at PTG?
14:24:24 <celebdor> irenab: well, I'd like to merge it by wednesday
14:24:27 <celebdor> I think it is doable
14:24:30 <celebdor> with a bit of help
14:24:47 <celebdor> irenab: the advantage is that we can take "inspiration" from the octavia code
14:24:48 <dulek> celebdor: That's a release blocker for Queens or not?
14:24:57 <celebdor> dulek: for me it is
14:25:01 <dulek> irenab: http://lists.openstack.org/pipermail/openstack-dev/2018-January/126836.html
14:25:43 <dulek> celebdor: Then we need to have it fixed this week.
14:25:44 <irenab> dulek, "We are not announcing the end of the deprecation cycle at this time"
14:26:04 <irenab> when is the deadline for Q. release?
14:26:06 <dulek> irenab: Uh, oh. Did I simply read what I wanted to read? :P
14:26:08 <celebdor> it's like
14:26:20 <celebdor> we don't want you to use it, but we won't screw you if you do
14:26:25 <celebdor> that's how I read it
14:26:37 <dulek> Q is released next Wednesday. We can have another RC before that date.
14:26:46 <irenab> yes, mainly no new features on lbaasv2, but you can keep using it as is
14:26:55 <celebdor> dulek: this wednesday or next week's
14:26:57 <celebdor> ?
14:27:10 <dulek> celebdor: IIRC next, but let me check again.
14:27:40 <celebdor> thanks dulek
14:27:46 <celebdor> Oh, I feel like I'm PTL again
14:27:48 <celebdor> :D
14:27:54 <dulek> Ah, okay. So we have time until Friday to issue another RC.
14:27:58 <dulek> https://releases.openstack.org/queens/schedule.html#q-finalrc
14:28:10 <irenab> well then, lets try to have a fix and merge it by Friday
14:28:15 <celebdor> let's
14:28:44 <celebdor> do we have anything else?
14:28:49 <celebdor> oh, yes
14:29:00 <celebdor> maysamacedos reported that the cni daemon is leaking memory
14:29:09 <maysamacedos> yup
14:29:11 <celebdor> since she is working on a very cool feature
14:29:21 <celebdor> liveness checks for the cni daemon
14:29:32 <celebdor> that mark unhealthy when we go over certain memory
14:29:41 <maysamacedos> the IPDB more precisely
14:29:48 <celebdor> I fixed in pyroute2 upstream one memory leak
14:29:55 <celebdor> but we know there is another in setns
14:29:58 <celebdor> maybe more
14:30:15 <irenab> interesting
14:30:25 <ltomasbo> umm, nice!
14:30:26 <celebdor> this is only a chance to talk about the liveness memory leak health
14:31:01 <maysamacedos> should we maintain this (IPDB) check?
14:31:12 <dulek> maysamacedos: Had you checked the leak with newer pyroute2 version, including celebdor's fix?
14:31:16 <celebdor> I think we should
14:31:24 <maysamacedos> dulek: yes
14:31:26 <celebdor> dulek: she did
14:31:35 <dulek> And that doesn't solve it? :(
14:31:37 <celebdor> we were discussing it last week
14:31:43 <maysamacedos> dulek: no
14:31:48 <celebdor> dulek: as I said, I know there's at least one more
14:31:56 <celebdor> what I proposed was to have a conf option
14:32:15 <celebdor> that says use at max 8GiB of mem
14:32:23 <celebdor> if you use more it means you are leaking
14:32:26 <celebdor> and you should be killed
14:32:35 <maysamacedos> hmm
14:32:48 <irenab> celebdor, so when this check fails, node cannot host any more Pods?
14:32:49 <celebdor> dulek: which reminds me... Do we always evict entries from the registry?
14:32:50 <dulek> celebdor: Let's just have the limit configurable. -1 means no limit, the you can set it in MiBs.
14:33:04 <celebdor> irenab: no, when it fails the health check fails
14:33:14 <irenab> and this leads to what?
14:33:14 <dulek> irenab: No, kuryr-daemon will get restarted.
14:33:15 <celebdor> and k8s should restart the cni daemon
14:33:21 <celebdor> and then it should start working again :-)
14:33:27 <dulek> Magic. ;)
14:33:28 <irenab> ok
14:33:29 <celebdor> the node will go notready -> ready
14:33:38 <celebdor> maysamacedos: it does restart it, right?
14:33:44 <dulek> celebdor: I don't think that, most likely CNI will not notice.
14:33:47 <maysamacedos> tes
14:33:47 <irenab> so existing Pods are not affected
14:33:53 <maysamacedos> yes celebdor
14:34:21 <celebdor> good
14:34:22 <dulek> celebdor: And about deleting stuff from registry. We currently don't do that, don't even watch for DELETEs.
14:34:26 <celebdor> irenab: they are not
14:34:35 <celebdor> dulek: that's leaking
14:34:42 <celebdor> ok
14:34:44 <celebdor> not really
14:34:45 <dulek> celebdor: I think it's easy to be implemented now, as we have locks now.
14:34:48 <celebdor> it's ballooning
14:34:50 <celebdor> xD
14:34:53 <dulek> :)
14:35:01 <dulek> celebdor: I can try to fix that today.
14:35:03 <celebdor> dulek: please, file a bug
14:35:12 <celebdor> and fix it :P
14:35:23 <irenab> nice troubleshooting session :-)
14:35:42 <irenab> any other issues to discuss?
14:35:42 <celebdor> irenab: I feel like I'm forgetting about some release bug
14:35:49 <celebdor> but we can probably address it by the .1
14:35:53 <celebdor> .0 is always dangerous
14:35:55 <celebdor> :P
14:36:04 <dulek> celebdor: You have two critical issues in launchpad.
14:36:15 <dulek> https://bugs.launchpad.net/kuryr-kubernetes
14:36:38 <celebdor> the lbaas ones
14:36:54 <irenab> actually there are 3
14:37:07 <celebdor> dulek: what about this one https://bugs.launchpad.net/kuryr-kubernetes/+bug/1731485
14:37:07 <openstack> Launchpad bug 1731485 in kuryr-kubernetes "Kuryr ignores CNI_CONTAINERID when serving requests" [Critical,In progress] - Assigned to Michal Dulko (michal-dulko-f)
14:37:11 <celebdor> it has your name on it
14:37:13 <celebdor> :-)
14:37:28 <dulek> celebdor: Yup. I don't know how to fix it for case without kuryr-daemon.
14:37:44 <irenab> https://bugs.launchpad.net/kuryr-kubernetes/+bug/1749921
14:37:45 <openstack> Launchpad bug 1749921 in kuryr-kubernetes "Loadbalancer service type fails to create due to subnet access policy" [Critical,In progress] - Assigned to Yossi Boaron (yossi-boaron-1234)
14:38:00 <dulek> And for kuryr-daemon it's fixed. I think we'll get over it once kuryr-daemon will become a default.
14:38:44 <irenab> https://review.openstack.org/#/c/545270/ is almost ready to be merged
14:39:08 <celebdor> dulek: what do you think is missing to make it default
14:39:11 <irenab> We saw some strange issue with unhealthy lb handler, just need to verify it is resolved
14:39:12 <celebdor> (apart from mem leaks)
14:39:26 <yboaron> irenab, Did you manage to check it ?
14:39:39 <celebdor> irenab: I thought yboaron found out what it was
14:39:44 <celebdor> or you mean with df?
14:39:49 <irenab> yboaron, not the latest version, will complete asap
14:39:55 <dulek> celebdor: Nothing really. We gate on that, we know it solves an issue that we don't know how to approach without kuryr-daemon.
14:40:12 <dulek> celebdor: I'd say it's ready to become a default in Rocky.
14:40:17 <yboaron> It worked for me , crossing fingers :-)
14:40:17 <irenab> celebdor, not with DF, it was some kuryr-kubernetes issue
14:40:26 <celebdor> yboaron: what is that ep_subsets thing?
14:40:33 <celebdor> I don't recall putting that there
14:40:35 <celebdor> what did I miss
14:40:54 <celebdor> dulek: alright
14:41:11 <celebdor> you can put a BP to make it default and deprecate haproxy on Rocky
14:41:12 <yboaron> That was the exception I got , triggers the unhealthy of LB
14:41:24 <celebdor> yboaron: what's it about?
14:41:30 <celebdor> do you have a stactrace?
14:41:42 <irenab> celebdor, I posted it on the patch
14:41:50 <yboaron> iterable of None object
14:42:02 <celebdor> will check, thanks
14:42:15 <dulek> celebdor: HAProxy? I've talked about deploying without kuryr-daemon. :P
14:42:42 <irenab> I think celebdor is ulti tasking as usual, abit of context switch slip ...
14:42:45 <celebdor> dulek: and that's why I meant non daemonized cni
14:42:49 <celebdor> I'm just mistyping
14:42:51 <dulek> :)
14:42:59 <yboaron> I suspect that service doesn't work in latest devstack  - I was in the middle of testing that .. and guess what happened to RDO ?
14:43:00 <celebdor> irenab: only 4 conversations at one
14:43:10 <celebdor> yboaron: Kaboom?
14:43:24 <yboaron> celebdor, Yep
14:43:38 <irenab> yboaron, dmellado mentioned maitenance of RDO
14:44:01 <yboaron> irenab, yes disks upgrade ..
14:44:18 <irenab> Shall we move to open discussions?
14:44:22 <yboaron> Did someone check service LB lately on devstack ?
14:44:31 <celebdor> irenab: agreed
14:44:38 <irenab> yboaron, what do you mean?
14:44:59 <yboaron> clean devstack - create LB E-W
14:45:06 <yboaron> clusterIP
14:45:24 <irenab> not recently
14:45:26 <celebdor> yboaron: on BM?
14:45:27 <yboaron> I saw strange things with the health reports
14:45:42 <celebdor> yboaron: you mean the controller heatlh?
14:45:43 <yboaron> celebdor, VM-devstack
14:45:50 <celebdor> ok
14:45:51 <yboaron> celebdor, yes
14:45:54 <celebdor> that's "baremetal"
14:46:01 <celebdor> for poor people without real hardware
14:46:03 <celebdor> xD
14:46:06 <irenab> yboaron, please open a bug if you see some issues
14:46:21 <yboaron> irenab, I will
14:46:22 <irenab> #topic open discussions
14:46:29 <irenab> #chair dmellado
14:46:29 <openstack> Current chairs: celebdor dmellado irenab
14:46:58 <irenab> dmellado asked me to remind about comming PTG, its going to be next week
14:47:14 <dulek> \o/
14:47:38 <irenab> we have an etherpad where anyone who want to suggest session can add it #link https://etherpad.openstack.org/p/kuryr-ptg-rocky
14:47:41 <ltomasbo> yep, he wanted to plan a bit the schedule https://ethercalc.openstack.org/kuryr_ptg
14:48:26 <irenab> correct, so please if you plan to join remotely add the prefered time slot based on the availability at the ethercalc
14:48:36 <dulek> ltomasbo: Why is 15:30-16:30 not available everywhere?
14:48:38 <celebdor> irenab: theoretically the proposal time is closed
14:48:50 <celebdor> only scheduling now, right?
14:49:03 <celebdor> nap time?
14:49:26 <irenab> :-)
14:49:41 <ltomasbo> xD
14:49:45 <ltomasbo> I don't know...
14:50:30 <celebdor> we'll ask dmellado when he's available :-)
14:50:34 <irenab> I think its bug in the schedule, let;s give dmellado a chance to fix it
14:50:53 <celebdor> othewise we'll do bug fixing sessions
14:51:03 <irenab> +1
14:51:09 <irenab> any other topics?
14:51:41 <celebdor> not here
14:51:44 <irenab> then I guess we can close the meeting
14:51:48 <yboaron> Folks , please review #link https://review.openstack.org/#/c/536387/
14:52:05 <kaliya> garyloug: updates?
14:52:05 <irenab> yboaron, will do
14:52:12 <yboaron> irenab, 10x
14:52:31 <kaliya> also Danil told that he fixed his patches for interaction with the CNI-daemon, they can be reviewed
14:52:45 <kaliya> #link https://review.openstack.org/#/c/471012/ https://review.openstack.org/#/c/512280/ https://review.openstack.org/#/c/512281/
14:52:51 <irenab> kaliya, did you try them?
14:53:08 <kaliya> Not yet, but I plan to check on devstack the one which fails in zuul
14:53:20 <irenab> kaliya, great, thanks
14:53:24 <kaliya> This one https://review.openstack.org/#/c/524590/
14:53:31 <kaliya> if anyone has ideas, welcome :)
14:53:51 <garyloug> Hi kaliya, no update, something came up in my work last week that needed my attention. I'm working on the code right now and need to get it through internal review before I push.
14:54:14 <kaliya> we have a meeting tomorrow on this topic with garyloug and dmellado
14:54:21 <kaliya> don't forget! XD
14:54:32 <irenab> We plan to have a session on multivif at PTG also to go though the code, hope it will be the latest wehn we finalize the patches
14:54:39 <celebdor> kaliya: thanks
14:54:46 <kaliya> good to know irenab
14:55:05 <celebdor> dulek: after you finish the registry delete fix, take a look at danil's cni patches ;-)
14:55:07 <irenab> kaliya, worth to share the meeting details if someone wants to join
14:55:29 <celebdor> we should put the bluejeans links soon
14:55:30 <garyloug> Great, irenab has this session been scheduled for any day or time?
14:55:33 <kaliya> dmellado has the calendar event
14:55:42 <celebdor> garyloug: kaliya: right
14:55:47 <irenab> garyloug, I do not know
14:55:49 <dulek> celebdor: Sure, I'm helping daniel debug them. :)
14:55:50 <celebdor> he'll probably update tomorrow or wednesday
14:55:58 <celebdor> dulek: daniel or danil?
14:56:06 <garyloug> No worries, I'll keep an eye on the doc
14:56:07 <celebdor> we have both
14:56:09 <celebdor> :-)
14:56:11 <dulek> celebdor: danil, right.
14:56:14 <celebdor> :-)
14:56:24 <irenab> ok, we have 2 mins left
14:57:15 <irenab> thank you all for joining
14:57:21 <irenab> #endmeeting