*** JudeCross has joined #openstack-lbaas | 01:20 | |
*** JudeCross has quit IRC | 01:24 | |
*** kiennt26 has joined #openstack-lbaas | 01:27 | |
*** ducnc has joined #openstack-lbaas | 02:10 | |
*** yamamoto has joined #openstack-lbaas | 02:45 | |
*** JudeCross has joined #openstack-lbaas | 03:21 | |
*** kiennt26 has quit IRC | 03:22 | |
*** JudeCross has quit IRC | 03:26 | |
openstackgerrit | Jacky Hu proposed openstack/octavia-tempest-plugin master: Raise build_timeout from 60 to 300 https://review.openstack.org/606741 | 03:43 |
---|---|---|
*** pcaruana has joined #openstack-lbaas | 04:06 | |
*** JudeCross has joined #openstack-lbaas | 04:06 | |
openstackgerrit | Jacky Hu proposed openstack/octavia master: Make disk image buildable for fedora https://review.openstack.org/606417 | 04:10 |
*** pcaruana has quit IRC | 04:23 | |
openstackgerrit | Jacky Hu proposed openstack/octavia master: Make disk image buildable for fedora https://review.openstack.org/606417 | 04:33 |
*** ramishra has joined #openstack-lbaas | 04:47 | |
*** pcaruana has joined #openstack-lbaas | 05:51 | |
*** sapd1 has quit IRC | 07:26 | |
*** Emine has joined #openstack-lbaas | 07:29 | |
*** velizarx has joined #openstack-lbaas | 07:35 | |
*** velizarx has quit IRC | 07:45 | |
*** zigo has joined #openstack-lbaas | 07:46 | |
*** velizarx has joined #openstack-lbaas | 07:52 | |
*** abaindur has quit IRC | 08:04 | |
*** ducnc has quit IRC | 08:08 | |
*** celebdor has joined #openstack-lbaas | 08:09 | |
*** velizarx has quit IRC | 08:12 | |
*** velizarx has joined #openstack-lbaas | 08:15 | |
*** sapd1 has joined #openstack-lbaas | 08:17 | |
*** JudeCross has quit IRC | 08:20 | |
*** yamamoto has quit IRC | 08:57 | |
*** yamamoto has joined #openstack-lbaas | 08:58 | |
*** yamamoto has quit IRC | 08:58 | |
*** yamamoto has joined #openstack-lbaas | 08:59 | |
*** yamamoto has quit IRC | 09:03 | |
*** sapd1_ has joined #openstack-lbaas | 09:06 | |
*** sapd1 has quit IRC | 09:06 | |
openstackgerrit | Vadim Ponomarev proposed openstack/octavia master: Fix auto setup Barbican's ACL in the legacy driver. https://review.openstack.org/606918 | 09:19 |
openstackgerrit | Vadim Ponomarev proposed openstack/octavia master: Fix auto setup Barbican's ACL in the legacy driver. https://review.openstack.org/606918 | 09:20 |
*** salmankhan has joined #openstack-lbaas | 09:28 | |
*** yamamoto has joined #openstack-lbaas | 09:57 | |
*** yamamoto has quit IRC | 10:28 | |
*** yamamoto has joined #openstack-lbaas | 10:29 | |
*** yamamoto has quit IRC | 10:42 | |
*** salmankhan1 has joined #openstack-lbaas | 10:46 | |
*** salmankhan has quit IRC | 10:48 | |
*** salmankhan1 is now known as salmankhan | 10:48 | |
*** abaindur has joined #openstack-lbaas | 11:06 | |
*** yamamoto has joined #openstack-lbaas | 11:18 | |
*** savvas has joined #openstack-lbaas | 12:19 | |
savvas | GM everyone | 12:19 |
*** salmankhan1 has joined #openstack-lbaas | 12:30 | |
*** salmankhan has quit IRC | 12:34 | |
*** salmankhan1 is now known as salmankhan | 12:34 | |
*** Emine has quit IRC | 12:38 | |
*** ramishra has quit IRC | 12:46 | |
*** velizarx has quit IRC | 12:47 | |
*** Emine has joined #openstack-lbaas | 12:52 | |
*** Emine has quit IRC | 12:59 | |
*** yamamoto has quit IRC | 13:01 | |
*** ccamposr__ has joined #openstack-lbaas | 13:01 | |
*** yamamoto has joined #openstack-lbaas | 13:01 | |
*** celebdor has quit IRC | 13:03 | |
*** yamamoto has quit IRC | 13:17 | |
*** Emine has joined #openstack-lbaas | 13:17 | |
*** velizarx has joined #openstack-lbaas | 13:21 | |
*** celebdor has joined #openstack-lbaas | 13:25 | |
*** yamamoto has joined #openstack-lbaas | 14:00 | |
xgerman_ | o/ | 14:12 |
cgoncalves | xgerman_, https://review.openstack.org/#/c/605264/ | 14:35 |
cgoncalves | once ^ merges, I'd like to propose a rocky maintenance release | 14:36 |
cgoncalves | by my count, that would be the 7th bug fix for rocky | 14:36 |
*** velizarx has quit IRC | 15:05 | |
xgerman_ | k | 15:06 |
*** velizarx has joined #openstack-lbaas | 15:08 | |
*** yamamoto has quit IRC | 15:17 | |
*** yamamoto has joined #openstack-lbaas | 15:18 | |
*** yamamoto has quit IRC | 15:19 | |
*** yamamoto has joined #openstack-lbaas | 15:19 | |
*** yamamoto has quit IRC | 15:19 | |
*** yamamoto has joined #openstack-lbaas | 15:23 | |
*** yamamoto has quit IRC | 15:23 | |
*** pcaruana has quit IRC | 15:30 | |
*** ivve has joined #openstack-lbaas | 15:33 | |
savvas | Hi guys, any thoughts on how I can troubleshoot this? http://paste.openstack.org/show/731180/ | 15:52 |
xgerman_ | this can mean many things — run octavia with debug true in the config… | 15:52 |
johnsom | savvas That says that nova failed to start the service VM | 15:53 |
savvas | Debug is on xgerman_ , this shows up right after SSL keys get installed and between reverting state | 15:53 |
johnsom | We timed out waiting for nova to mark the instance ACTIVE | 15:54 |
xgerman_ | mmh, when I switch on debug I can see the nova calls | 15:54 |
savvas | I should be able to catch what's happening in my Nova logs than | 15:54 |
xgerman_ | yep | 15:55 |
johnsom | Yeah, it looks like an older version of Octavia. I hope you are not using virtual-box.... | 15:55 |
savvas | nop, running OpenStack Ansible on 3-node cluster | 15:55 |
savvas | Queens stable release | 15:55 |
johnsom | Hmm, ok, yeah, not sure why nova isn't starting in a timely way. That exception is pretty clear "Waiting for compute to go active timeout." | 15:57 |
openstackgerrit | Michael Johnson proposed openstack/octavia-tempest-plugin master: Add v2 two-node scenario test https://review.openstack.org/605163 | 15:59 |
savvas | Think I caught it in the nova log now | 15:59 |
savvas | http://paste.openstack.org/show/731183/ | 15:59 |
savvas | checking neutron now | 15:59 |
*** velizarx has quit IRC | 15:59 | |
*** velizarx has joined #openstack-lbaas | 16:00 | |
savvas | I'll circle back in a bit, need to step out for a little, thanks guys | 16:01 |
savvas | http://paste.openstack.org/show/731184/ this is where it stops, sounds to me like I may have made an error setting up the network for Octavia | 16:02 |
johnsom | Yeah, check the boot network setting in the controller_worker section | 16:02 |
*** aojea has joined #openstack-lbaas | 16:03 | |
savvas | ye it does seem to take the right network | 16:04 |
*** aojea has quit IRC | 16:15 | |
*** savvas has quit IRC | 16:17 | |
*** velizarx has quit IRC | 16:31 | |
*** velizarx has joined #openstack-lbaas | 16:35 | |
johnsom | cores: Eyes on this patch would be good as it appears the centos 7 gate is broken without it: https://review.openstack.org/#/c/605894/ | 16:41 |
johnsom | Which is blocking things from merging | 16:41 |
johnsom | Thanks German for already reviewing! | 16:41 |
xgerman_ | :-) | 16:42 |
openstackgerrit | Merged openstack/octavia stable/rocky: Fix health manager performance regression https://review.openstack.org/605264 | 16:45 |
*** evgenyf has joined #openstack-lbaas | 16:46 | |
cgoncalves | stable/rocky 3.0.1: https://review.openstack.org/607004 | 16:48 |
*** ccamposr__ has quit IRC | 16:51 | |
openstackgerrit | Carlos Goncalves proposed openstack/octavia master: Delete zombie amphorae when detected https://review.openstack.org/587505 | 16:53 |
johnsom | I had just started reading that, but got distracted.... | 16:53 |
*** aojea has joined #openstack-lbaas | 16:55 | |
*** pcaruana has joined #openstack-lbaas | 16:56 | |
*** velizarx has quit IRC | 16:59 | |
*** KeithMnemonic has joined #openstack-lbaas | 17:19 | |
*** savvas has joined #openstack-lbaas | 17:19 | |
*** yamamoto has joined #openstack-lbaas | 17:24 | |
savvas | johnsom: can the network be a flat network? | 17:25 |
johnsom | Sure | 17:25 |
savvas | Alright, well I am not sure what to look for at this point, it says port binding failed | 17:25 |
johnsom | Maybe ask in the openstack-neutron channel? | 17:26 |
savvas | Good point, the problem limits itself to the amphora instances though, my other interfaces and instances seem to be fine. I'll ask around , thanks | 17:26 |
*** sapd1 has joined #openstack-lbaas | 17:30 | |
*** JudeCross has joined #openstack-lbaas | 17:31 | |
*** salmankhan has quit IRC | 17:31 | |
*** salmankhan has joined #openstack-lbaas | 17:31 | |
*** salmankhan has quit IRC | 17:36 | |
openstackgerrit | Carlos Goncalves proposed openstack/octavia master: Delete zombie amphorae when detected https://review.openstack.org/587505 | 17:48 |
rm_work | johnsom: i'm not sure why that would affect the centos gate | 17:55 |
rm_work | i thought about it, but | 17:56 |
rm_work | it should only have affected mismatches | 17:56 |
rm_work | *version mismatches | 17:56 |
rm_work | was trying to figure out what the centos issue was but i came to the conclusion that it must be an upstream package/server issue that would hopefully resolve itself | 17:56 |
rm_work | but i wasn't 100% sure | 17:57 |
johnsom | If the amp has 1.5 haproxy in it the cfg verify is going to fail with the http-reuse line | 17:57 |
rm_work | right, but it shouldn't ever in that gate | 17:57 |
rm_work | current centos amps have 1.8 | 17:57 |
*** blake has joined #openstack-lbaas | 17:57 | |
rm_work | if that was actually being tested by that gate, it would have caught it when we first tried to merge the patch that broke it | 17:58 |
johnsom | I wonder if that is the case as the centos gates started dying right after that merged. | 17:58 |
johnsom | I checked, my patch landed before the centos gate was there | 17:58 |
sapd1 | johnsom: Could you review my patch https://review.openstack.org/#/c/601086/? | 18:01 |
rm_work | ah hmmm | 18:02 |
rm_work | doesn't make sense tho | 18:02 |
rm_work | how would it use such an old amp that we have 1.5? | 18:02 |
johnsom | rm_work I don't see haproxy18 anywhere in the devstack log, I don't think it's installing it | 18:02 |
rm_work | errr | 18:02 |
rm_work | it's just part of the DIB build | 18:02 |
rm_work | like | 18:02 |
rm_work | how would it NOT install it? | 18:02 |
rm_work | unless it is accidentally pinned on a VERY old version for the amps? | 18:02 |
johnsom | I see it calling the "cat" to install the repo, I just don't see a 1.8 haproxy install unless I'm totally missing it. | 18:07 |
rm_work | that would be problematic in its own respect | 18:09 |
rm_work | because it absolutely should be | 18:09 |
rm_work | so THAT would also be a bug | 18:09 |
rm_work | O_o | 18:09 |
johnsom | Oh, the run I was looking at failed with a bad mirror | 18:10 |
rm_work | yes | 18:10 |
rm_work | that was my conclusion, some server issue was causing package stuff to fail for centos | 18:11 |
cgoncalves | xgerman_, you made it! you're officially a zombie hunter :) | 18:11 |
xgerman_ | yeah!!! | 18:11 |
rm_work | :P | 18:12 |
xgerman_ | I knew when I let you fix my silly mistakes it will all happen | 18:12 |
rm_work | lol | 18:12 |
johnsom | rm_work Ok, so I see the "Gate" job finished, and has 1.8, looking at why it failed | 18:12 |
johnsom | Yeah, failed inside the amp | 18:13 |
johnsom | http://logs.openstack.org/24/604924/1/gate/octavia-v2-dsvm-scenario-centos-7/4db1414/controller/logs/screen-o-cw.txt.gz?level=ERROR#_Sep_26_15_48_56_948909 | 18:13 |
rm_work | hm | 18:14 |
johnsom | It's bombing out in octavia-create-l7policy-flow | 18:16 |
johnsom | sapd1 Doesn't look like you needed me | 18:17 |
sapd1 | ^^ | 18:17 |
sapd1 | I think my patch for octavia-client need review as well? | 18:20 |
sapd1 | https://review.openstack.org/#/c/605914/1 | 18:20 |
openstackgerrit | sapd proposed openstack/python-octaviaclient master: Support REDIRECT_PREFIX for openstack client https://review.openstack.org/605914 | 18:31 |
*** aojea has quit IRC | 18:32 | |
*** aojea has joined #openstack-lbaas | 18:32 | |
*** sapd1 has quit IRC | 18:38 | |
*** abaindur has quit IRC | 18:40 | |
*** abaindur has joined #openstack-lbaas | 18:40 | |
*** abaindur has quit IRC | 18:41 | |
openstackgerrit | Michael Johnson proposed openstack/octavia-tempest-plugin master: DNM: Testing bionic nodes https://review.openstack.org/600539 | 18:41 |
*** abaindur has joined #openstack-lbaas | 18:41 | |
*** savvas has quit IRC | 18:48 | |
*** blake has quit IRC | 19:04 | |
openstackgerrit | Merged openstack/octavia master: Fix an upgrade issue for CentOS 7 amphora https://review.openstack.org/605894 | 19:08 |
rm_work | johnsom: so i don't THINK it was related still to that ^^ | 19:21 |
rm_work | but that did PASS | 19:21 |
rm_work | so | 19:21 |
rm_work | O_o | 19:21 |
rm_work | let's see if any of the others do? | 19:21 |
rm_work | ugh one failed on a v1 scenario? :/ | 19:22 |
openstackgerrit | Merged openstack/octavia master: Separate the thread pool for health and stats update https://review.openstack.org/581585 | 19:41 |
rm_work | johnsom: so err.... if someone does an update to a listener and it fails, the listener goes to error... and then our workflow is: the user has to delete the listener and recreate it? | 19:43 |
rm_work | you can't *update* an ERROR listener, right? | 19:43 |
openstackgerrit | Carlos Goncalves proposed openstack/octavia stable/rocky: Separate the thread pool for health and stats update https://review.openstack.org/607033 | 19:44 |
openstackgerrit | Carlos Goncalves proposed openstack/octavia stable/queens: Separate the thread pool for health and stats update https://review.openstack.org/607034 | 19:44 |
johnsom | rm_work Right, it is delete/recreate at the moment | 19:45 |
rm_work | :( | 19:45 |
rm_work | that sucks when you've spent a bunch of effort setting up L7 rules and stuff on it | 19:45 |
rm_work | and then you do one update to like, tweak something, and it fails | 19:45 |
rm_work | lol | 19:45 |
rm_work | i think I may just trigger a failover to re-issue configs, and reset them to ACTIVE in the DB <_< | 19:46 |
rm_work | this is kinda why I wanted a "SYNC" API call | 19:46 |
rm_work | for admins | 19:46 |
johnsom | You have the power! | 19:47 |
rm_work | well | 19:48 |
rm_work | i was GOING to | 19:48 |
rm_work | but everyone said they didn't want that | 19:48 |
*** fnaval has joined #openstack-lbaas | 20:14 | |
rm_work | johnsom: so i haven't been able to deploy that fix yet from Friday... still seeing an abnormal number of LBs going to ERROR | 20:37 |
rm_work | in my testing | 20:37 |
rm_work | not sure if just unlucky or related | 20:37 |
johnsom | Hmm, this is the haproxy fix? | 20:38 |
johnsom | version fix? | 20:38 |
rm_work | yeah | 20:41 |
rm_work | ugh | 20:41 |
rm_work | johnsom: did you lie to me | 20:41 |
rm_work | CalledProcessError: Command '['rpm', '-qi', 'haproxy']' returned non-zero exit status 1 | 20:41 |
rm_work | seeing that in the amp agent log | 20:41 |
rm_work | looking into the code | 20:42 |
rm_work | maybe the old amps were actually broken | 20:42 |
rm_work | if you ran a status on them <_< | 20:42 |
rm_work | i may have not updated that command in the same patch that updated the version to haproxy18 <_< | 20:42 |
johnsom | It could be something else is broken | 20:43 |
rm_work | ... | 20:43 |
rm_work | i mean, it is very clear | 20:43 |
rm_work | the status call runs | 20:43 |
rm_work | and it breaks in the amp | 20:43 |
*** pcaruana has quit IRC | 20:43 | |
rm_work | because it's looking up "haproxy" | 20:44 |
rm_work | not "haproxy18" | 20:44 |
johnsom | I was going off this that they have it handled: https://github.com/openstack/octavia/blob/master/octavia/amphorae/backends/agent/api_server/osutils.py#L523 | 20:44 |
rm_work | maybe at the end of rocky | 20:44 |
rm_work | but not when my amps were built | 20:44 |
rm_work | that class doesn't even exist in the version of the amp agent here, lol | 20:46 |
rm_work | yep | 20:46 |
rm_work | carlos fixed it in https://github.com/openstack/octavia/commit/1c4004c156684340406659535534abde7c6ad0e5 | 20:46 |
rm_work | which is fine for most people (because they built amps for a release) | 20:47 |
rm_work | but i build amps constantly | 20:47 |
rm_work | and mine were after the change to haproxy18 but before that | 20:47 |
rm_work | so basically, this is a "me" problem, and I'm pretty f'd | 20:47 |
rm_work | I need to failover everything old | 20:47 |
rm_work | that's just how it has to be | 20:47 |
*** ivve has quit IRC | 21:14 | |
cgoncalves | johnsom, re: https://review.openstack.org/#/c/606142/ I thought about adding a release note, too, so I could do so sure. my question to you is if you don't agree with the warning msg for better visibility | 21:15 |
cgoncalves | http://logs.openstack.org/42/606142/1/check/openstack-tox-docs/d2b1279/html/contributor/guides/dev-quick-start.html | 21:15 |
johnsom | I just think it's out of place in the quick-start guide given it is release specific, but I guess that is only on the queens branch so... | 21:16 |
openstackgerrit | Michael Johnson proposed openstack/octavia-tempest-plugin master: Add v2 two-node scenario test https://review.openstack.org/605163 | 21:28 |
openstackgerrit | Carlos Goncalves proposed openstack/octavia stable/queens: Add note to lower constraints for Jinja and pyOpenSSL https://review.openstack.org/606142 | 21:29 |
*** aojea has quit IRC | 21:41 | |
*** fnaval has quit IRC | 21:47 | |
cgoncalves | johnsom, re: https://review.openstack.org/#/c/605163/ do you want 2 controller nodes or 1x controller+compute and 1x compute? | 21:55 |
cgoncalves | asking because "controller2" seems to be a compute node only | 21:56 |
cgoncalves | why can't you use openstack-two-node from http://git.openstack.org/cgit/openstack-dev/devstack/tree/.zuul.yaml#n61 instead? | 21:56 |
cgoncalves | it is xenial | 21:57 |
openstackgerrit | Adam Harwell proposed openstack/octavia master: DNM: two dumb downstream things to fix, IGNORE ME https://review.openstack.org/593986 | 21:58 |
*** yamamoto has quit IRC | 22:00 | |
*** yamamoto has joined #openstack-lbaas | 22:01 | |
rm_work | johnsom: figured out a solution to my problem -- for now I am setting it so that function always returns 1.5 instead of making the call (as that should not impact anything else?) in my environment, until i can failover everything onto new amps | 22:03 |
*** yamamoto has quit IRC | 22:03 | |
*** savvas_ has joined #openstack-lbaas | 22:05 | |
openstackgerrit | Adam Harwell proposed openstack/octavia master: DNM: 3 dumb downstream things to fix, IGNORE ME https://review.openstack.org/593986 | 22:06 |
johnsom | Yeah, that would work | 22:09 |
openstackgerrit | Adam Harwell proposed openstack/octavia master: Experimental multi-az support https://review.openstack.org/558962 | 22:10 |
openstackgerrit | Adam Harwell proposed openstack/octavia master: WIP: AZ Evacuation resource https://review.openstack.org/559873 | 22:10 |
*** yamamoto has joined #openstack-lbaas | 22:11 | |
openstackgerrit | Adam Harwell proposed openstack/octavia master: WIP: Floating IP Network Driver (spans L3s) https://review.openstack.org/435612 | 22:14 |
savvas_ | johnsom: I managed to get around my networking problem. Changed my playbook a bit and recreated the lbaas interface, that did the trick. My instances boot now, but they terminate right a way | 22:17 |
johnsom | Nice | 22:18 |
savvas_ | http://paste.openstack.org/show/731205/ this is what I grab from the logs. Nova doesn't spit out any errors until it starts terminating the instance. Right before the qemu errors I catch this: | 22:18 |
savvas_ | http://paste.openstack.org/show/731207/ about the flavor,but when I check the Octavia config and the flavor list, it seems to match ids | 22:19 |
savvas_ | Any thoughts? | 22:19 |
openstackgerrit | Adam Harwell proposed openstack/octavia master: DNM: two dumb downstream things to fix, IGNORE ME https://review.openstack.org/593986 | 22:20 |
johnsom | savvas_ I have not seen that before. | 22:23 |
savvas_ | Great, leave it to me to find the good ones huh ;p | 22:24 |
johnsom | Take a look at your qemu and libvirt logs | 22:24 |
johnsom | Yeah, you are definitively winning today. | 22:24 |
savvas_ | http://paste.openstack.org/show/731208/ just this | 22:28 |
johnsom | savvas_ You are looking for a qemu log like this one: http://logs.openstack.org/64/605264/1/check/octavia-v2-dsvm-scenario/7aa9e70/controller/logs/libvirt/qemu/instance-0000000a_log.txt.gz | 22:31 |
savvas_ | Ye browsed through those, lots of debug but no errors | 22:32 |
savvas_ | I may have an idea though, I see the default image that gets pulled is qcow2 | 22:32 |
savvas_ | going to build a custom image now | 22:33 |
*** threestrands has joined #openstack-lbaas | 22:41 | |
*** celebdor has quit IRC | 22:44 | |
*** rcernin has joined #openstack-lbaas | 22:49 | |
savvas_ | Fixed johnsom | 22:56 |
johnsom | Oh good. Bad image somehow? | 22:58 |
savvas_ | it would be good to have a contingency in the playbook for os-octavia that checks whether or not qcow2 is supported in someone's environment | 22:58 |
savvas_ | well in my case I should've just paid better attention, kept going with the test image which is qcow2, but my environment runs on ceph storage | 22:58 |
johnsom | You don't support qcow2? What do you use? | 22:58 |
rm_work | RAW like RAX? :P | 23:00 |
johnsom | public cloud. Private uses qcow2 | 23:01 |
rm_work | heh | 23:01 |
*** yamamoto_ has joined #openstack-lbaas | 23:05 | |
*** yamamoto has quit IRC | 23:05 | |
*** yamamoto_ has quit IRC | 23:08 | |
*** yamamoto has joined #openstack-lbaas | 23:08 | |
johnsom | Oh this is going to be a fun bug to fix: Keepalived[1259]: pid 4324 exited due to segmentation fault (SIGSEGV). | 23:08 |
rm_work | O_o | 23:12 |
rm_work | ugh johnsom my failures from earlier testing were because one of our hypervisors was missing some vlan trunking | 23:26 |
rm_work | so it just had no net, so nothing was coming up <_< | 23:26 |
rm_work | and stuff kept hitting it | 23:27 |
rm_work | the patches all look good now | 23:27 |
johnsom | lol, ok | 23:27 |
rm_work | so throwing the new stuff into prod with my temp-fix, failing everything over today/tomorrow | 23:28 |
rm_work | and then getting rid of that hack | 23:28 |
rm_work | i hope no one else runs into this <_< | 23:28 |
rm_work | only people who used centos images generated mid-cycle | 23:28 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!