Thursday, 2018-04-19

*** janonymous has quit IRC00:26
*** premsankar has quit IRC00:58
*** chenyb4 has joined #openstack-kuryr01:14
*** salv-orl_ has joined #openstack-kuryr01:59
*** salv-orlando has quit IRC02:02
*** maysamacedos has quit IRC02:03
*** janki has joined #openstack-kuryr02:07
*** hongbin_ has joined #openstack-kuryr02:10
*** jchhatbar has joined #openstack-kuryr02:10
*** salv-orl_ has quit IRC02:11
*** janki has quit IRC02:12
*** salv-orlando has joined #openstack-kuryr02:13
*** salv-orlando has quit IRC02:20
*** salv-orlando has joined #openstack-kuryr02:22
*** kiennt2609 has joined #openstack-kuryr02:33
*** kiennt2609 has quit IRC02:34
*** kiennt2637 has joined #openstack-kuryr02:34
*** kiennt2637 has quit IRC02:35
*** kiennt2609 has joined #openstack-kuryr02:35
*** caowei has joined #openstack-kuryr02:53
*** kiennt2609 has quit IRC03:10
*** kiennt2609 has joined #openstack-kuryr04:02
*** kiennt2609 has quit IRC04:03
*** hongbin_ has quit IRC04:11
*** gcheresh has joined #openstack-kuryr04:18
*** jchhatba_ has joined #openstack-kuryr04:18
*** jchhatba_ has quit IRC04:19
*** jchhatba_ has joined #openstack-kuryr04:19
*** jchhatbar has quit IRC04:21
*** premsankar has joined #openstack-kuryr04:21
*** jchhatbar has joined #openstack-kuryr04:50
*** gcheresh has quit IRC04:51
*** jchhatba_ has quit IRC04:53
*** janonymous has joined #openstack-kuryr05:04
*** gcheresh has joined #openstack-kuryr05:42
*** pcaruana has joined #openstack-kuryr06:21
*** premsankar has quit IRC06:49
*** dims has quit IRC06:54
*** dims has joined #openstack-kuryr06:56
*** dims has quit IRC07:01
*** dims has joined #openstack-kuryr07:02
*** celebdor1 has joined #openstack-kuryr07:21
*** celebdor1 is now known as apuimedo07:21
apuimedomorning07:21
*** salv-orlando has quit IRC07:23
*** salv-orlando has joined #openstack-kuryr07:24
*** kiennt2609 has joined #openstack-kuryr07:26
duleko/07:26
*** salv-orlando has quit IRC07:28
apuimedodulek: did you see I fixed https://review.openstack.org/#/c/562067/1 ?07:29
*** pmannidi has quit IRC07:29
apuimedosorry https://review.openstack.org/#/c/562067/207:29
dulek!!!07:29
openstackdulek: Error: "!!" is not a valid command.07:29
dulekI haven't noticed. Wow.07:30
dulekWait, no.07:30
dulekapuimedo: We need to rebase the test disabling skip decorator on top of that to test.07:30
dulekOtherwise without Service test we cannot prove anything.07:30
*** dmellado has joined #openstack-kuryr07:32
*** salv-orlando has joined #openstack-kuryr07:37
apuimedodulek: I meant that I fixed the fake router thing xD07:39
apuimedofrom the patch07:39
apuimedodulek: but yeah, let's rebase the skip test07:39
dulekapuimedo: You can do that from the UI. :)07:40
apuimedodulek: I'll just add a depends-on since it's a different repo07:41
dulekOh, right.07:42
openstackgerritAntoni Segura Puimedon proposed openstack/kuryr-tempest-plugin master: Revert "Skip service test"  https://review.openstack.org/56136407:42
*** pcaruana has quit IRC07:45
*** pcaruana has joined #openstack-kuryr07:46
*** salv-orlando has quit IRC08:00
*** salv-orlando has joined #openstack-kuryr08:01
*** salv-orlando has quit IRC08:05
*** garyloug has joined #openstack-kuryr08:17
apuimedoltomasbo: do you have details on this l2 failure?08:29
ltomasboI'm checking08:29
ltomasboapuimedo, I think we need to create (and use) the security group you set for the kubelet port for the octavia l2 mode08:30
ltomasboI'm testing that08:30
ltomasboapuimedo, if it works, I'll let you know08:31
ltomasbodmellado, dulek, apuimedo: and we should test both ovs-firewall and octavia l2 and l3 on gates08:31
dmelladoyeah08:32
ltomasbootherwise we will break them all the time (as usual) :D08:32
dmelladoltomasbo: could you add that to the enhance gates bp?08:32
dulekAaaah, more deployment options ?!08:32
dmelladodulek: yeah xD08:32
dmellado+ containerized / non containerized08:32
dmelladoapuimedo: what did you say you fixed?08:33
apuimedodmellado: world hunger08:33
ltomasbodulek, perhaps we should just move to ovs-firewall...08:33
ltomasbodulek, nested does not work with ovs-hybrid anyway, and it will be safe to ensure security groups are the right ones08:33
dmelladoapuimedo: with fuet?08:34
apuimedodmellado: I'd rather it be with llangonissa08:35
dulekltomasbo: I'm not against it.08:35
apuimedoltomasbo: I tend to agree08:36
dulekBut we need to start thinking about what's Kuryr issue and what's Kuryr DevStack plugin issue.08:36
apuimedodulek: this is devstack08:36
apuimedoclearly08:36
dmelladodulek: also, we'd need to provide some healthchecks08:36
dmelladoin devstack08:36
dmelladosome timeout and check that the containers are ready08:36
dulekThis one - yes. SGs missing for LBaaS v2 Services was Kuryr's.08:36
dulekBecause putting too much effort in testing DevStack plugin isn't worth it IMO.08:36
ltomasbodulek, agree, but it is better to have sg enforcement so that we know that they are needed and where. That will also help when deploying with other SDNs/Tools08:40
*** salv-orlando has joined #openstack-kuryr08:58
dulekltomasbo: Agreed (sorry, missed this message).09:02
ltomasboxD09:02
ltomasboapuimedo, it (half) worked09:02
ltomasboapuimedo, the cni is now able to connect, but the loadbalancer is actually not working... probably missing some extra SGs09:03
apuimedoltomasbo: do you need help?09:03
ltomasboI'm in a call, but we can share the tmux if you want to dig into it09:03
apuimedoltomasbo: give me the details and I can look into it while you're on the call09:04
apuimedohttps://gist.github.com/celebdor/77f1130eb8763078a2c997a2ebf9149409:05
ltomasboapuimedo,  stack@38.145.33.12909:06
ltomasboapuimedo, I have a tmux session there09:07
ltomasboand I applied some changes on top of your patch09:07
* dmellado sighs09:07
dmelladothe issues that we're facing on the octavia gate seem to be related to upstream infra09:07
dmelladoI just created a dsvm ubuntu based with the qcow amphora and I don't see any issue09:08
dmelladobut it just doesn't make sense09:08
apuimedoltomasbo: ODL?!09:08
ltomasboapuimedo, it is ml2/ovs... don't worry09:08
dmelladoltomasbo: shhh just tell him it's odl so he becomes crazy09:09
ltomasbothat is because I used it for odl at some point09:09
dmelladoxD09:09
apuimedoltomasbo: you scared me09:09
apuimedoxD09:09
ltomasboxD09:09
dmelladofolks any kind of idea of what could be using that ip on the upstream gates?09:10
dmelladohttp://logs.openstack.org/64/561364/2/check/kuryr-kubernetes-tempest-octavia/f028a60/controller/logs/screen-o-api.txt.gz#_Apr_19_08_25_22_23083909:11
apuimedodmellado: didn't you request access to one of those VMs?09:11
dmelladoapuimedo: still waiting for it09:11
dmelladoit just plain works on ubuntu from our side09:11
dmelladoeven on rdo cloud09:12
dmelladoand that's a lot to say09:12
dulekdmellado: If we're debugging this let's not speculate and go straight to frickler to freeze a VM.09:17
dulekdmellado: Infra thinks that this is still better than rechecking stuff like crazy.09:17
dmelladodulek: I'm rechecking in order to actually GET a hold on that vm09:17
dmelladobefore nodepool drops it09:18
dmelladothus the recheck09:18
dmelladoso don't complain :P09:18
dmelladoonce the results are spit I've been just told they can't do it09:18
dulekOkay!09:20
dulekdmellado: And no offense, I was just proposing a solution that worked last time. ;)09:21
apuimedoltomasbo: you've been touching the SGs right and left, huh?09:21
ltomasboapuimedo, I finished with the call09:21
ltomasboapuimedo, only one! I added octavia one to the kubelet09:21
ltomasboother than that, just the devstack modifications09:21
dmelladodulek: I was thinking that it could be that somehow the ip that it tries to allocate to the 2nd amphora is used by something in infra09:22
dmelladoor whatever09:22
dmelladolet's try to check once we get access to the vm09:22
apuimedoltomasbo: :P09:22
ltomasboapuimedo, I only added the df866 one09:22
ltomasboapuimedo, ca503 is added automatically09:22
apuimedoltomasbo: you created and added df866fe7-904a-4340-aa59-3c9047562dee09:23
ltomasboapuimedo, and d7de with the modification to your patch, without that, kuryr-cni is not able to connect to the API09:23
apuimedoright?09:23
ltomasboapuimedo, that is created (and needed) for the l2 mode to work09:23
ltomasboso, it is created by devstack/plugin.sh09:23
apuimedoltomasbo: the api lb is missing a port on the pod subnet!09:29
ltomasboapuimedo, really?09:29
apuimedoltomasbo: yup09:32
apuimedoI'm fixing it now09:32
dulekHA, I've plumbed the K8s API LB for DevStack with OpenShift. \o/09:32
dulekopenshift-master was binding to HOST_IP only. :)09:33
apuimedo:P09:33
dulekNow how do I fix that in DevStack plugin…09:33
dmelladoand now things break for yolanda and she can't gate the node09:33
* dmellado sighs09:33
* dmellado double sighs09:33
dmelladowhy everything just breaks all the time09:34
apuimedoltomasbo: do you see now that the LB can access the api?09:34
apuimedonow we need to see why I can't curl the VIP09:34
dmelladoapuimedo: but *why* does it work on our local installations?09:34
dmelladoit makes no sense09:34
apuimedodmellado: I'm not reading your thread. I'm talking to ltomasbo09:35
apuimedodmellado: what are you talking about?09:35
dmelladoLOL09:35
dmelladonvm I'll just go and deal with my pain alone xD09:35
dmelladothought you were tackling the issue on the ip address already allocated09:36
apuimedodmellado: now that I found one of the issues with ltomasbo env09:36
apuimedoI have a thread open09:37
apuimedodmellado: that also09:37
apuimedowait, the insurance people are here09:37
apuimedoI'll be back09:37
dmelladoinsurance?09:37
apuimedodmellado: broken glass door in the house entrance09:42
apuimedoltomasbo: I summon thee09:42
*** salv-orl_ has joined #openstack-kuryr09:43
ltomasboapuimedo, ??09:45
ltomasboapuimedo, tell me!09:45
*** salv-orlando has quit IRC09:47
apuimedoltomasbo: hey09:48
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Gate with containerized deployment and OpenShift  https://review.openstack.org/55731309:48
apuimedoltomasbo: do you see those chksum errors?09:48
dulekOkay, hopefully that's it. ^09:48
ltomasboapuimedo, yes10:02
apuimedoltomasbo: any idea?10:02
ltomasboyes10:03
ltomasboI think this got fixed on ubuntu (when we hit it last time)10:03
ltomasboand this is a centos amphora, I bet nobody has tested it with l210:03
ltomasboand it is replying from the wrong eth10:04
ltomasboI remember we had to set some kernel flags to fix it10:04
apuimedoltomasbo: can you find out?10:04
ltomasboapuimedo, also, that was working before10:05
ltomasbowhy it is not working now?10:05
ltomasbodid you disable something?10:05
apuimedoltomasbo: what was working?10:07
ltomasbothe lb api10:08
ltomasboapuimedo, ^^10:08
apuimedoltomasbo: nah... it wasn't10:08
apuimedowhen I sshed into your machine it was even missing an interface10:08
ltomasboapuimedo, did you add the missing interface to what? api lbaas? demo lbaas? or both?10:09
apuimedoltomasbo: I'm only touching api lbaas10:10
ltomasboso, that was actually working (at least the kuryr-cni was able to reach it)10:10
ltomasbomy problem was with the default/demo lbaas10:11
ltomasboapuimedo, and the problem is that it reaches the amphora through the wrong nic10:12
apuimedoltomasbo: I tried to curl the API from the host namespace and it was getting EOF10:14
ltomasbothat I tested before, and it was working10:16
ltomasbolet me restack and start from clean deployment10:17
*** maysamacedos has joined #openstack-kuryr10:23
*** kiennt2609 has quit IRC10:24
dmelladodulek: apuimedo ltomasbo10:32
dmelladoready for upstream gate debugging?10:32
dulekAh!10:34
dulek"kuryr-kubernetes-tempest-daemon-containerized-openshift-lbaasv2 success (non-voting)" \o/10:34
dmelladoheh10:35
dulekhttps://github.com/dulek.keys10:35
dmelladodulek: good padawan xD10:35
dulekhttps://github.com/dulek.keys10:35
dulekOkay, alias works. :P10:35
apuimedodmellado: more or less10:35
apuimedohttps://gist.github.com/celebdor/77f1130eb8763078a2c997a2ebf9149410:36
dulekirenab: Can you look again on https://review.openstack.org/#/c/556777 ? I've answered your comment there.10:36
apuimedodulek: it's a public holiday10:37
dulekAh, okay.10:37
apuimedo70th anniversary of the state of Israel IIRC10:37
dmelladoit'll be all week10:37
dmelladoyeah10:37
dulekPoland's going to have 100th this year. :)10:37
dmelladossh root(at)104.239.135.5810:38
dmelladothen su - stack10:38
dmelladodulek: apuimedo10:38
dmelladoltomasbo: and me will be heading for lunch in 15' or so10:38
dmelladoso maybe you can check you can login10:38
dmelladodeploy a tmux10:38
dulekdmellado: I've logged in.10:38
dmelladoand we can go after we come back10:38
dmelladodulek: awesome10:38
*** caowei has quit IRC10:38
dmelladoI'll install vim and tmux10:38
dmelladoxD10:38
dmelladootherwise we won't be able to work10:39
* dulek starts preparing lunch then.10:39
dmelladodulek: tmux a -t gate10:39
dmelladoI've created a tmux session named like that10:39
dmelladounder 'stack' user10:40
ltomasbodmellado, https://github.com/luis5tb.keys10:40
dmelladoltomasbo: added10:41
ltomasbodmellado, I cannot login...10:43
ltomasboapuimedo, btw, I redeployed the stack, and it is getting access to the 10.0.0.129:44310:46
apuimedolet me take a look10:47
apuimedodmellado: is this before or after running tempest?10:51
dmelladoapuimedo: after10:52
dmelladowe can trigger the test and pause it if needed10:52
apuimedodmellado: not necessary10:52
dmelladoso it tries to spin up an amphora with a non-valid ip which is taken by the service?10:52
*** chenyb4 has quit IRC10:54
apuimedowhere the fuck is the 155 port?10:55
apuimedocreation request in the API?10:55
dmelladomaybe we can check neutron?10:55
*** maysamacedos has quit IRC10:56
dmelladowe go for lunch, brb10:58
apuimedoltomasbo: don't you see that in your deployment the api amphora works but is in L3 mode11:02
apuimedoit should be in l2 mode, i.e., have an attachment to the k8s-pod-net11:02
*** maysamacedos has joined #openstack-kuryr11:18
*** gcheresh has quit IRC11:22
*** atoth has joined #openstack-kuryr11:50
*** rh-jelabarre has joined #openstack-kuryr11:58
dulekltomasbo, apuimedo: I'm debating ltomasbo remark on https://review.openstack.org/#/c/562366 .12:03
dulekBasically - which Kuryr version supports which OpenShift?12:03
*** yamamoto_ has quit IRC12:21
dmelladoapuimedo: dulek back,12:22
dmelladoany discovery?12:22
ltomasboapuimedo, should it?12:22
ltomasbofor the kubelet? perhaps that is because we configure the kubelet port ourselves...12:23
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Raise OpenShift version to 3.9.0  https://review.openstack.org/56236612:23
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Gate with containerized deployment and OpenShift  https://review.openstack.org/55731312:23
dulekdmellado: Haven't been looking there, was eating and waiting for you.12:26
dulekBut meanwhile, why don't we start mergefest?12:27
*** yamamoto has joined #openstack-kuryr12:27
dmelladolet's go for mergefest while we have the meeting12:27
dulekdmellado: https://review.openstack.org/#/c/556777 - this only needs second +2 and I've answered irenab comment about making `mkdir -p` conditional.12:28
dmelladoin!12:29
dulekWhooo, it's going!12:30
*** chenyb4 has joined #openstack-kuryr12:47
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Raise OpenShift version to 3.9.0  https://review.openstack.org/56236613:04
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Gate with containerized deployment and OpenShift  https://review.openstack.org/55731313:04
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Raise OpenShift version to 3.9.0  https://review.openstack.org/56236613:05
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Gate with containerized deployment and OpenShift  https://review.openstack.org/55731313:05
ltomasbocelebdor, I think I found out the reason for the api lb not having the subnet port13:08
dmelladodulek: ltomasbo around and seeing my tmux?13:11
*** chenyb4 has quit IRC13:12
dulekdmellado: Not really.13:12
*** jistr is now known as jistr|mtg13:14
openstackgerritMerged openstack/kuryr-kubernetes master: Create CNI bin dir in OpenShift DevStack plugin  https://review.openstack.org/55677713:15
openstackgerritMerged openstack/kuryr-kubernetes master: Add `privileged` SCC to SA in OpenShift DevStack  https://review.openstack.org/55695913:21
openstackgerritMerged openstack/kuryr-kubernetes master: Add HTTPS support to K8s API healthchecks  https://review.openstack.org/55696013:22
*** salv-orl_ has quit IRC13:27
dmelladodulek: could you take over?13:32
dmelladoI've been running the test as otherwise the resources are just destroyed13:32
dmelladoand it seems that we cannot get to the service ip:port13:32
dmelladowe were thinking about it possibly being related to it needing a fip so we added that to the port13:33
dmelladobut still stuck13:33
dulekdmellado: Sorry, I don't get it. I've tried doing `tmux a` and  `tmux a -t gate` but there's no session.13:33
dmelladodulek: can't you get in?13:33
dmelladossh into there13:33
dmelladothen su - stack13:33
dmelladothen tmux a -t gate13:33
dulekI'm on the VM.13:33
dmelladothen I'll show you13:33
dulekdmellado:13:34
dulekroot@ubuntu-xenial-rax-dfw-0003608416:~# tmux a -t gate13:34
dulekno sessions13:34
dmelladodulek: heh13:34
dmelladoread ^^13:34
dmelladosu - stack13:34
dmelladothen tmux a -t gate13:34
dulekAh.13:34
dmelladolet me know when you're in13:35
dulekdmellado: I'm in. So what are you checking there. Don't we just need to do `kubectl expose` and see why Octavia crashes?13:35
dmelladowhat I'm doing is running the test13:35
*** gcheresh has joined #openstack-kuryr13:35
dmelladoand seeing the pods and after that the service13:36
dmelladoi.e.13:36
dulekdmellado: watch kubectl get pods -o wide. :P13:37
dulekdmellado: That's better.13:37
dmelladoI thought it was going to be quicker13:37
dmelladothat's why I thought I'd wait xD13:37
dulekdmellado: Heh, this test creates a pod and waits for it before creating next one?13:38
dulekdmellado: That can be improved. :P13:38
dmelladoyeah, totally13:38
celebdordmellado: found anything?13:38
dmelladoit even uses a for13:38
dmelladosee that now we got that kuryr-service there at 80 port?13:38
dulekOkay, we have a Service. Fine.13:38
dmelladoso let's check tempest logs13:38
dulekdmellado: Can you reach it from the host?13:39
dulekdmellado: I've seen that, nothing to see here. We need to figure out why Octavia explodes.13:39
dulekdmellado: Take a look on Octavia API log.13:39
dulekdmellado: Hey, that's not too bad. Though what's with the restart of the log?13:40
dmelladoIIRC it'll time out eventually as we can't reach it from the host at all13:41
dulekdmellado: Yes, yes, it will.13:41
dmelladohuh, I lost connectivity to the infra vm13:42
dmelladoare you still around in the tmux13:42
dulekdmellado: Why don't we kubectl run a single pod, expose it and start trying to reach the service?13:42
dulekdmellado: I'm in.13:42
dmelladook I can't type there now13:42
dmelladoweird13:42
*** gcheresh has quit IRC13:42
dmelladocould you try that?13:43
dmelladoltomasbo: and we will go into a looong meeting now13:43
dmelladoand I'll sync with you after we're done13:43
dulekdmellado: Yeah, once it gets better connectivity, I have lag as well.13:43
dmelladodulek: see? we just hit that issue13:44
dulekdmellado: Here we see the Octavia failure, right?13:44
dmelladosee the 10.1.0. blah already allocated13:44
dulekReproducing it is easy, hard part is why it's happening.13:44
dulekWHY CAN'T I TYPE?! xD13:45
dmelladoyou can't either? xD xD xD13:45
dulekHm, waaaait…13:45
dmelladodafuq!!!xD13:45
dmelladojust slow?13:45
dulekdmellado: It's tmux fault, we've probably hit something blocking input.13:46
*** hongbin_ has joined #openstack-kuryr13:46
*** jistr|mtg is now known as jistr14:10
dmelladodulek: tmux?14:12
dulekdmellado: I'm not on the tmux, investigating from outside as that works. :P14:13
dmelladoLOOOOL14:13
dmelladoxD14:13
*** janonymous has quit IRC14:13
dulek?14:13
*** kiennt26_ has joined #openstack-kuryr14:16
dmelladodreaded tmux14:16
apuimedodmellado: dulek: I can type just fine in tmux14:17
dulekapuimedo: Maybe it fixed itself.14:17
apuimedomaybe14:17
dulekOkay, what I see from Octavia code and logs is that it sees that LB has an IP.14:17
dulekAnd tries to allocate a port with that IP.14:18
dulekBut that port already exists and for some reason Octavia's unable to notice that.14:18
dulekAccording to code for some reason vip.ip_address gets saved but not vip.port_id.14:19
dulekThough I'm unable to find first POST for that port.14:19
apuimedowhich code?14:19
dmelladogenadi's code14:19
*** kiennt26 has joined #openstack-kuryr14:20
apuimedooh14:20
apuimedoany idea why it only happens at infra14:20
dulekGenadi's?14:21
dulekNo, Octavia code.14:21
dulekOkay, I have first post in Neutron logs… It happens *5 minutes* before the error. Let's see who did that.14:22
dmelladodulek: oh, I thought you meant gena's code on the test14:22
dulekOkay, seems like we did that from DevStack. Let me make sure.14:23
dulekUhm… Interesting, it's not DevStack?14:24
dulekOkay, so it's kuryr-kubernetes who ordered the first conflicting LB…14:27
dmelladodulek: so it was our fault? what did you find out?14:28
dulekNothing yet, still looking.14:28
dulek(I'm reading the logs on the gate, not on the VM, feel free to investigate VM on your own)14:29
dulekapuimedo, dmellado: Okay, here's what happens IMO:14:33
dulek1. Kuryr tries to provision an LB.14:33
dulek2. After a while LB is still not… ACTIVE or whatever, so Kuryr retries.14:33
dulek3. Octavia retries port creation and fails.14:33
dulekThat's why we see those logs.14:33
dulekThey're most likely not related to the fact that Octavia's not passing traffic.14:34
dulekAnyone tried looking what's happening on Amphorae VM?14:34
dulekI'd bet tarball image is malformed or something.14:34
apuimedodulek: shouldn't the retry not happen if the VM and port are created?14:40
dulekapuimedo: Let me show the exact log.14:40
dulekapuimedo: http://logs.openstack.org/64/561364/2/check/kuryr-kubernetes-tempest-octavia/4a08a59/controller/logs/screen-kuryr-kubernetes.txt.gz#_Apr_19_10_17_49_11702014:41
dulekapuimedo: This is the moment when second POST happens on Octavia LB and Octavia tries to create the port for the second time.14:41
dulekNow why is there ResourceNotReady even though LB is created…?14:43
apuimedodulek: my question is whether the LB exists when the second post happends14:43
apuimedo*happens14:43
dulekapuimedo: It definitely does in Octavia API - we can see gets with 200 answers.14:43
dulekMaybe we time out too fast? And Octavia loses track of LB when it gets this duplicated request? Dunno…14:45
dulekYeah, it's still PENDING_CREATE 5 seconds before retry.14:46
dulekBut in the end we can see it created on the env.14:46
dulekJust Kuryr doesn't care about it anymore as test timed out as well and K8s resources are down.14:47
apuimedodulek: mmm14:47
dulekLet me create a Service myself and lets wait a bit.14:47
apuimedook14:47
dulekapuimedo: Hm, I should test ClusterIP, right?14:49
dulekOkay, so LB is PENDING_CREATE14:50
*** salv-orlando has joined #openstack-kuryr14:51
dulekI'll wait until it's up and see if Kuryr created all the other resources.14:51
dulekLike members.14:51
apuimedovery well14:52
dmelladodulek: I've just read all this14:53
dmelladomy laptop crashed14:53
dmellado(as happens with everything as of lately)14:54
dmelladohmmm we can always try with the non-qcow2 version of the amphora14:54
dmelladodunno about the current status of dib, though14:54
dulekdmellado: DIB is still failing on stable/queens even though release with the fix was released.14:56
dulekdmellado: :(14:56
dulekdmellado: So this is another issue.14:56
apuimedodulek: where was it fixed?14:56
dulekdmellado: I've looked on the Amphorae VMs and it doesn't look too bad - qemu reports no panic and they answer pings.14:56
dulekapuimedo: This was supposed to help: https://review.openstack.org/#/c/561479/14:57
dulekapuimedo: But it isn't. Might be good to ping Octavia folks again.14:57
dmelladodulek: maybe they won't reply14:58
dmelladoxD14:58
* dmellado hides14:58
dmelladoseriously, it might be totally worth pinging them again14:58
apuimedodulek: doesn't help even with your libs_from_git cheat?14:58
dmelladoapuimedo: dulek at least today's a happy day14:58
dulekapuimedo: Even. Now as it's released LIBS_FROM_GIT addition is not needed.14:59
dmelladoNicholas Cage has retired14:59
dmelladoxD14:59
dmelladofinally xD14:59
dulekdmellado: I'm probably the only person in the universe that really liked some of his movies.14:59
dmelladodulek: really? which one? If you say Ghost Rider I'll be really sad14:59
dmelladoxD14:59
apuimedodulek: name one good movie where he leads14:59
dulekdmellado: I liked Bad Lieutenant. And Gone In 60 Seconds.15:00
dulekI'm not really sophisticated cinema person.15:00
dmelladodidn't see that first one15:00
dmelladobut I can't but think on this one https://www.imdb.com/title/tt0117420/15:00
dmelladoxD15:00
dulekIt's quite okay. Probably the only movie where Cage acts not like Cage. :P15:01
*** kiennt26_ has quit IRC15:01
dulekOh, Lord of War was good. Not Cage's acting though, but film was nice.15:02
dulekAaaand I have my LB ACTIVE!15:02
dmelladoman it took a while15:02
apuimedodulek: how long was it?15:03
dulekMore than 10 minutes it seems.15:03
dulekBut let me check the connectivity. :P15:03
apuimedogood lord15:04
apuimedodulek: you created it via cli?15:04
dmelladono wonder it fails15:04
dulekapuimedo: I did `kubectl expose` and waited.15:05
dulekHeh, there's still no connectivity to that LB, so let's dig it further…15:05
apuimedodulek: and kuryr didn't mash it?15:05
apuimedodue to timeout15:05
dulekI'm checking.15:05
dulekYep, Kuryr mashed it. No pool created.15:05
dulekSo… Kuryr-Kubernetes logs!15:06
dmelladodulek: does it complain about us developers? xD15:08
dulekdmellado: No. I wonder though who's not going to complain when he needs to wait 10 minutes until his Service is exposed…15:09
dulekSomeone's still working on mainframes, I guess?15:09
dmelladoI wonder if this is related to nested kvm performance15:09
dmelladoas it's *not* on the upstream infra15:09
dulekdmellado: No nested? Cool.15:10
dmelladoyeah, 'cool'15:10
dmelladothere are plans to enable this but not yet15:10
dulekdmellado: Might be that, VM was ACTIVE fast, but stuff done on the VM… That's other story.15:10
dulekOkay, so how about I increase Kuryr's timeout and retry?15:10
dmelladobut this was working at some point15:10
dmelladodulek: yep15:10
dulekDefault timeout is pathetic 180 seconds?15:13
dmelladodulek: put 600s at least, given what we saw15:13
*** yamamoto has quit IRC15:13
dulekSure thing, I just wonder where do I put it.15:14
dmelladoapuimedo: any hint on that?15:14
*** yamamoto has joined #openstack-kuryr15:14
dulekI see line number, okay.15:14
dmelladoyou saw it? awesome15:14
apuimedodulek: hey... Now that I think of it15:14
apuimedodidn't eunsoo report this issue and send a patch to have the timeout be configurable?15:15
dmelladohmm could be, kinda rings a bell15:15
apuimedohttps://review.openstack.org/#/c/549945/15:15
dulekCurrently it's hardcoded on, surprise, surprise… 300 seconds.15:15
apuimedodulek: ^^15:15
dulekHeh, exactly. xD15:16
apuimedoso... It's ltomasbo's fault for -115:16
dulekIt's always him!15:16
dulekI'll just change the value in the code.15:17
ltomasbo:/15:17
dmelladodulek: apuimedo I was just blaming him now xD15:17
dmelladohe deserves it15:17
dmelladoxD15:17
ltomasbolol15:17
openstackgerritLuis Tomas Bolivar proposed openstack/kuryr-kubernetes master: Add namespace subnet driver for namespace creation  https://review.openstack.org/56224715:17
dmelladoapuimedo: bring back that picture of him with the new manager15:17
dmelladoxD15:17
apuimedoxD15:18
dulekOkay, 1000 seconds timeout, lets restart and try again.15:18
apuimedoltomasbo: ok15:18
apuimedodulek: ok15:18
*** yamamoto has quit IRC15:19
dulekOkay, it's going.15:20
dmelladolet's see15:20
apuimedodulek: it was 300 by default, right?15:23
dulekBTW - we do have an issue that on timeout we don't clean up the LB.15:23
dulekAnd then get a conflict and fail to process the event.15:23
apuimedodulek: I know15:23
dulekWho's going to file a bug? ;)15:23
dulekapuimedo: It was 300.15:23
apuimedoI wonder how well it works to start deleting something that either didn't finish or errored in provisioning15:23
apuimedodulek: I'd make it related to bug https://review.openstack.org/#/c/549945/615:24
apuimedoit fits quite well15:24
dulekapuimedo: Sure thing. I'll confirm that this is the issue and will handle all the patches and paperwork.15:24
dulekUnless I'll not make it in 2 hours as that's a hard stop for me today. :P15:25
apuimedodulek: very well15:27
apuimedolet me know if I need to take over from you then15:28
dulekapuimedo: Sure.15:28
dulekBTW - anyone on fixing stable/queens? We need to figure out what to do with https://review.openstack.org/#/c/561974/ fast…15:28
dmelladodulek: remaining lbaas could've been there from interrupted tempest runs15:28
dulekdmellado: It is, but still Tempest would let it clean up.15:29
dulekdmellado: The issue is Kuryr doesn't clean up on timeout.15:29
dmelladonope if you interrupt it15:29
dmelladodulek: worst case we'll get to skip the test there too15:29
dulekdmellado: Service and Pods were deleted, Kuryr should handle deletion of its resources.15:29
dulekdmellado: It's not "skip a test". It's skip whole CI.15:30
duleks/CI/gate15:30
dmelladowe can make octavia non voting for there until it gets solved15:30
apuimedodulek: we're stuck on dib, right?15:30
dmelladonot a pretty solution but the best that I can think on15:30
dulekapuimedo: Right15:30
dulekdmnRight.15:30
dmelladoyeah15:30
dulekdmellado: Right.15:30
dulekdmellado: And here's the orphaned resources issue - waiting for LB times out, RetryHandler restarts the LB Handler, Handler sees that LB was not provisioned completely, so it tries to create it.15:31
dmelladohmm I see15:32
dulekdmellado: It fails due to conflict, HTTP conflict is not on a list for RetryHandler, so event gets lost.15:32
dulekThat's it.15:32
dmelladoall awesome and probably due to vm slowness15:32
dulekdmellado: On timeout we need to either cleanup existing LB or make sure we're able to detect it and restart *waiting*, not creating.15:32
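The failure sequence dulek describes, and the second of the two fixes proposed (detect the existing LB and resume waiting), can be sketched roughly as below. This is a toy model under stated assumptions, not kuryr-kubernetes or Octavia code: `FakeLBaaS`, `Conflict`, `ensure_naive` and `ensure_resuming` are hypothetical names, and a poll budget stands in for the wall-clock timeout.

```python
class Conflict(Exception):
    """Stand-in for the HTTP conflict returned for a duplicate create."""


class FakeLBaaS:
    """Hypothetical in-memory load balancer API; not the Octavia client."""

    def __init__(self, polls_until_active):
        self._lbs = {}  # name -> status polls remaining before ACTIVE
        self._polls_until_active = polls_until_active

    def create(self, name):
        if name in self._lbs:
            raise Conflict(name)
        self._lbs[name] = self._polls_until_active

    def exists(self, name):
        return name in self._lbs

    def status(self, name):
        remaining = self._lbs[name]
        if remaining > 0:
            self._lbs[name] = remaining - 1
            return "PENDING_CREATE"
        return "ACTIVE"


def wait_active(api, name, max_polls):
    """Poll until ACTIVE; return False once the poll budget is spent."""
    for _ in range(max_polls):
        if api.status(name) == "ACTIVE":
            return True
    return False


def ensure_naive(api, name, max_polls):
    """Always create first, so a retry after a timeout raises Conflict."""
    api.create(name)
    return wait_active(api, name, max_polls)


def ensure_resuming(api, name, max_polls):
    """Create only when the LB is missing; otherwise resume waiting."""
    if not api.exists(name):
        api.create(name)
    return wait_active(api, name, max_polls)
```

With a slow backend, the first `ensure_naive` call times out and the retry raises `Conflict`, which mirrors the lost event; `ensure_resuming` times out once as well, but the retry finds the existing LB and only waits for it to become ACTIVE.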
ltomasboapuimedo, I found out why the k8s-pod-net was not on the amphora, but adding it does not seem to help...15:32
apuimedoltomasbo: tell me you were not on this long15:33
apuimedocause I already solved that in the morning when you first showed it to me15:34
ltomasboI was doing other stuff15:34
apuimedoif you are talking about the api lb15:34
apuimedook15:34
ltomasbowell, it is not really solved15:34
ltomasboit does not work for octavia l215:34
apuimedoltomasbo: I had it working15:34
ltomasbowhen? in another setup?15:34
apuimedoah no15:34
*** yamamoto has joined #openstack-kuryr15:34
apuimedoright15:34
apuimedothere's the centos issue15:34
apuimedowith checksum15:34
ltomasboin your patch (from yesterday) the problem was that we need to add the member in a different way when using l2 and l3 mode15:35
apuimedoyou should report a bug on that to Octavia and assign it to cafarelli15:35
apuimedoltomasbo: I know15:35
apuimedoxD15:35
ltomasboapuimedo, so I modified devstack to include that15:35
ltomasboand apply the right SGs15:35
ltomasbonow that is right, but I see the same problem as before with the demo loadbalancer (when the api was using l3 instead of l2)15:36
ltomasboapuimedo, I believe this is the problem that we hit with the ubuntu amphora long ago15:37
ltomasbothat we are hitting it now with the centos one15:37
openstackgerritDaniel Mellado proposed openstack/kuryr-kubernetes master: Remove LIBS_FROM_GIT as a ver  https://review.openstack.org/56271915:38
dulek14 minutes until it works.15:38
openstackgerritDaniel Mellado proposed openstack/kuryr-kubernetes master: Remove LIBS_FROM_GIT as a var in zuul.yaml  https://review.openstack.org/56271915:38
dulekI've put 16,6 minutes there, so it's still too close. :P15:38
dmelladohmmm that's a LONG time15:39
dmelladoif this is happening like that maybe we'd need to add a new label like 'slow' tag to these tests15:39
dmelladoand add a new tox env15:39
dulekAt least it works.15:39
apuimedoltomasbo: the demo loadbalancer issue is unrelated to that, yes15:39
apuimedoI saw that the requests get to the amphora15:39
apuimedoand from the amphora you can send requests to the member15:39
apuimedobut the haproxy is not taking the requests from what I can see15:39
*** apuimedo has quit IRC15:39
*** celebdor1 has joined #openstack-kuryr15:39
*** celebdor1 is now known as apuimedo15:39
apuimedoltomasbo: which is the last message you saw from me?15:40
dulekOkay, I'll clean up the patches and will ping you once ready.15:40
apuimedodulek: I may have missed some message15:40
dulekBTW - where should I put the timeout option? neutron_defaults sound wrong…15:40
apuimedomy daughter disabled the wifi15:40
dmelladoapuimedo: lol15:40
ltomasboapuimedo, yes, and that is the same issue we hit for ubuntu15:40
dmelladodulek: let me know if you need to take me over lately15:40
dulekapuimedo: Put the router on the ceiling!15:40
dmelladolater15:40
ltomasboapuimedo, https://review.openstack.org/#/c/501915/15:40
dulekdmellado, apuimedo: Where should I put lbaas_activation_timeout option? neutron_defaults section…? Sounds a bit weird to me…15:41
apuimedowell, since we manage it all via neutron (auth and such)15:41
apuimedowe probably should put it there first15:41
apuimedoand then move to a LB section15:41
apuimedo(defined in the handler)15:41
apuimedo(or driver)15:41
dulekHm, okay, I'll leave it in neutron_defaults and we'll think on a cleanup later on.15:42
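[Editor's note] With the option left in the `neutron_defaults` section as agreed above, the corresponding configuration entry might look like the fragment below. Only the option name and section come from the conversation; the file it lives in, the unit (seconds) and the value are assumptions, the value being picked to match the roughly 16.6 minutes dulek mentions earlier.

```ini
[neutron_defaults]
# Assumed to be in seconds; 1000 s is about 16.6 minutes (illustrative value).
lbaas_activation_timeout = 1000
```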
apuimedomy wife went to some travelperk meetup (I wonder if she'll see devvesa there)15:42
dulekMoving options around isn't too bad with oslo.config.15:42
apuimedoso I have both little monsters in my care15:42
apuimedodulek: true15:42
apuimedoltomasbo: yes, it is probably something similar to https://review.openstack.org/#/c/501915/15:43
dmelladoapuimedo: travelperk15:44
dmelladodeff she'll meet him15:44
dmelladogive him regards xD15:44
apuimedoI won't be there15:46
apuimedodmellado: dulek: alright, what do we do about dib?15:47
apuimedoDo we keep octavia disabled in stable/queens for the merge15:47
apuimedoso we can make a release?15:47
* dulek finishes timeout stuff, but has no ideas so go on.15:47
apuimedos/disable/move to non-voting/15:48
apuimedothat is the plan, as you said15:48
apuimedothen in parallel we have to figure out how long we'll have this octavia breakage15:48
apuimedodmellado: did you talk to the openstack-lbaas guys about this?15:49
ltomasboapuimedo, I'm testing with ubuntu amphora to see if that works with L215:52
openstackgerritMichał Dulko proposed openstack/kuryr-kubernetes master: Make Neutron LBaaS Activation Timeout configurable  https://review.openstack.org/54994515:53
openstackgerritMichał Dulko proposed openstack/kuryr-tempest-plugin master: Revert "Skip service test"  https://review.openstack.org/56136415:53
dulekOkay, let's see how those will do.15:53
dulekapuimedo: I'm okay with moving it to non-voting. That's better than using master's Amphorae tarball.15:54
apuimedoltomasbo: thanks15:58
apuimedodulek: probably15:59
*** pcaruana has quit IRC16:03
*** jchhatbar has quit IRC16:35
dulekHm, jobs are queued for 46 minutes now. I guess I'm checking CI results from the pub today. :P16:40
dmelladoapuimedo: let's go non-voting for now16:47
dmelladoapuimedo yeah16:48
dmelladowill try to whack them tomorrow16:48
dmelladodulek: go for the pub16:48
dmelladolet's fetch some beers today16:48
dmelladoI need those xD16:48
dulekdmellado: Each Thursday we're doing PubQuiz with friends. We're now on triple winning streak, so only one beer today to make sure we'll not break it. :D16:50
dmelladolol16:50
dulekMaybe more after the quiz…? Anyway see you tomorrow!16:51
dmelladoenjoy dulek16:51
*** garyloug has quit IRC17:00
*** jermz has joined #openstack-kuryr17:09
*** jerms has quit IRC17:10
*** yamamoto has quit IRC17:20
*** mestery has quit IRC17:29
*** mestery has joined #openstack-kuryr17:31
*** mestery has quit IRC18:07
*** yamamoto has joined #openstack-kuryr18:20
*** yamamoto has quit IRC18:30
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Fix LB member creation on Nested environment  https://review.openstack.org/56280018:58
*** maysamacedos has quit IRC19:11
*** dulek_ has joined #openstack-kuryr19:22
*** premsankar has joined #openstack-kuryr19:35
*** maysamacedos has joined #openstack-kuryr19:56
*** dulek_ has quit IRC20:08
openstackgerritMaysa de Macedo Souza proposed openstack/kuryr-kubernetes master: Fix LB member creation on Nested environment  https://review.openstack.org/56280020:16
*** maysams has joined #openstack-kuryr20:31
*** maysams has quit IRC20:36
*** atoth has quit IRC20:41
*** maysams has joined #openstack-kuryr20:49
*** maysams has quit IRC20:53
*** maysams has joined #openstack-kuryr21:13
*** maysams has quit IRC21:16
*** maysamacedos has quit IRC21:31
*** yamamoto has joined #openstack-kuryr21:49
*** maysams has joined #openstack-kuryr22:09
*** maysams has quit IRC22:11
*** maysamacedos has joined #openstack-kuryr22:17
*** apuimedo has quit IRC22:24
*** hongbin_ has quit IRC22:57
*** salv-orlando has quit IRC23:13
*** salv-orlando has joined #openstack-kuryr23:13
