*** yamamoto has quit IRC | 00:09 | |
*** Swami has quit IRC | 00:18 | |
*** abaindur has quit IRC | 00:43 | |
*** abaindur has joined #openstack-lbaas | 00:44 | |
*** abaindur_ has joined #openstack-lbaas | 00:47 | |
*** abaindur has quit IRC | 00:48 | |
*** sanfern has joined #openstack-lbaas | 00:51 | |
*** abaindur_ has quit IRC | 01:09 | |
sapd1 | rm_work: I fixed unit test. So It passes all test now. | 01:40 |
---|---|---|
*** yamamoto has joined #openstack-lbaas | 01:51 | |
openstackgerrit | sapd proposed openstack/octavia master: Support REDIRECT_PREFIX action for L7Policy https://review.openstack.org/601086 | 01:52 |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add quota support to octavia's l7policy and l7rule https://review.openstack.org/590620 | 02:15 |
*** dmellado has quit IRC | 02:18 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: [WIP] Refactor 'check_quota_met' and 'decrement_quota' https://review.openstack.org/596665 | 02:39 |
*** rcernin has quit IRC | 02:47 | |
*** rcernin has joined #openstack-lbaas | 02:48 | |
*** rcernin has quit IRC | 02:48 | |
*** rcernin has joined #openstack-lbaas | 02:49 | |
*** sanfern has quit IRC | 02:53 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: [WIP] Refactor 'check_quota_met' and 'decrement_quota' https://review.openstack.org/596665 | 02:55 |
openstackgerrit | Carlos Goncalves proposed openstack/octavia master: Make health checks resilient to DB outages https://review.openstack.org/600876 | 03:01 |
cgoncalves | rm_work, FYI: http://paste.openstack.org/show/729841/ | 03:05 |
cgoncalves | 29 JOINs | 03:05 |
*** phuoc has joined #openstack-lbaas | 03:06 | |
*** phuoc_ has quit IRC | 03:08 | |
*** phuoc_ has joined #openstack-lbaas | 03:08 | |
*** phuoc has quit IRC | 03:11 | |
*** reedipb has quit IRC | 04:06 | |
*** sanfern has joined #openstack-lbaas | 04:15 | |
*** ramishra has joined #openstack-lbaas | 04:22 | |
*** fnaval has quit IRC | 04:34 | |
*** threestrands has joined #openstack-lbaas | 04:34 | |
*** threestrands has quit IRC | 04:34 | |
*** threestrands has joined #openstack-lbaas | 04:34 | |
johnsom | Just a reminder, most of the core team is at the PTG this week and are making the most of the time by collaborating with other project teams. Answers to questions and reviews may be delayed. | 04:38 |
johnsom | abaindur: 1 yes, 2 yes via the housekeeping process, 3 yes - following a normal renewal process that should be true, 4 there is a discussion of this in the maintenance guide in our docs. I think again, if you do a proper renewal I think you can continue to operate. However this is renewing the certs issued by the CAs, the CAs themselves may have other implications. | 04:44 |
sapd1 | johnsom: Could you discuss about redirect prefix which I implemented. | 04:59 |
*** pcaruana has joined #openstack-lbaas | 05:00 | |
*** reedipb has joined #openstack-lbaas | 05:09 | |
*** pcaruana has quit IRC | 05:09 | |
*** yamamoto has quit IRC | 05:31 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Refactor 'check_quota_met' and 'decrement_quota' https://review.openstack.org/596665 | 05:37 |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Refactor 'check_quota_met' and 'decrement_quota' https://review.openstack.org/596665 | 05:59 |
*** AlexeyAbashkin has joined #openstack-lbaas | 06:06 | |
*** yamamoto has joined #openstack-lbaas | 06:09 | |
*** AlexeyAbashkin has quit IRC | 06:10 | |
*** yamamoto has quit IRC | 06:26 | |
*** annp has joined #openstack-lbaas | 06:31 | |
*** Emine has quit IRC | 06:32 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add compute_flavor field for amphora api https://review.openstack.org/582914 | 06:38 |
openstackgerrit | Yang JianFeng proposed openstack/python-octaviaclient master: Add l7policy and l7rule to octavia quota https://review.openstack.org/591568 | 06:39 |
*** velizarx has joined #openstack-lbaas | 06:44 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: [WIP] Add listener and pool protocol validation. https://review.openstack.org/594040 | 06:47 |
*** tesseract has joined #openstack-lbaas | 06:51 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: [WIP] Add listener and pool protocol validation. https://review.openstack.org/594040 | 07:00 |
*** rcernin has quit IRC | 07:08 | |
*** pcaruana has joined #openstack-lbaas | 07:13 | |
*** dmellado has joined #openstack-lbaas | 07:16 | |
*** ccamposr has joined #openstack-lbaas | 07:18 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add listener and pool protocol validation. https://review.openstack.org/594040 | 07:27 |
*** velizarx has quit IRC | 07:30 | |
*** yamamoto has joined #openstack-lbaas | 07:40 | |
*** velizarx has joined #openstack-lbaas | 07:42 | |
*** hvhaugwitz has quit IRC | 07:45 | |
*** hvhaugwitz has joined #openstack-lbaas | 07:45 | |
*** velizarx has quit IRC | 07:48 | |
*** velizarx has joined #openstack-lbaas | 07:54 | |
*** luksky has joined #openstack-lbaas | 07:55 | |
*** AlexeyAbashkin has joined #openstack-lbaas | 08:01 | |
*** yamamoto has quit IRC | 08:05 | |
*** velizarx has quit IRC | 08:16 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add listener and pool protocol validation. https://review.openstack.org/594040 | 08:17 |
*** velizarx has joined #openstack-lbaas | 08:20 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add listener and pool protocol validation. https://review.openstack.org/594040 | 08:24 |
*** threestrands has quit IRC | 08:30 | |
*** Emine has joined #openstack-lbaas | 08:31 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add listener and pool protocol validation. https://review.openstack.org/594040 | 08:41 |
*** fnaval has joined #openstack-lbaas | 09:07 | |
*** fnaval has quit IRC | 09:11 | |
*** huseyin has joined #openstack-lbaas | 09:16 | |
*** huseyin has left #openstack-lbaas | 09:17 | |
*** Huseyin_ has joined #openstack-lbaas | 09:19 | |
openstackgerrit | Yang JianFeng proposed openstack/octavia master: Add listener and pool protocol validation. https://review.openstack.org/594040 | 09:19 |
*** huseyin has joined #openstack-lbaas | 09:20 | |
*** Huseyin_ has quit IRC | 09:20 | |
*** crazik has joined #openstack-lbaas | 09:37 | |
*** yamamoto has joined #openstack-lbaas | 09:37 | |
crazik | hello | 09:37 |
crazik | I have a question: how to prepare to the planned network maintenance, to avoid amphora recreation? Can I simply disable some octavia services to avoid such events (in case of missed healthchecks)? | 09:39 |
crazik | I had bad experience on DB connectivity issues | 09:39 |
*** tesseract has quit IRC | 09:46 | |
*** tesseract has joined #openstack-lbaas | 09:50 | |
*** tesseract has quit IRC | 09:50 | |
*** tesseract has joined #openstack-lbaas | 09:52 | |
*** fnaval has joined #openstack-lbaas | 10:07 | |
*** tesseract has quit IRC | 10:11 | |
*** fnaval has quit IRC | 10:11 | |
*** yamamoto has quit IRC | 10:13 | |
*** sanfern has quit IRC | 10:19 | |
*** sanfern has joined #openstack-lbaas | 10:20 | |
*** rpittau has joined #openstack-lbaas | 10:20 | |
*** rpittau has quit IRC | 10:21 | |
*** rpittau has joined #openstack-lbaas | 10:21 | |
*** sanfern has quit IRC | 10:28 | |
*** yamamoto has joined #openstack-lbaas | 11:06 | |
*** velizarx has quit IRC | 11:13 | |
*** yamamoto has quit IRC | 11:13 | |
*** sanfern has joined #openstack-lbaas | 11:23 | |
*** annp has quit IRC | 11:23 | |
*** velizarx has joined #openstack-lbaas | 11:26 | |
*** yamamoto has joined #openstack-lbaas | 11:29 | |
*** yamamoto has quit IRC | 12:06 | |
*** velizarx has quit IRC | 12:07 | |
*** fnaval has joined #openstack-lbaas | 12:07 | |
*** velizarx has joined #openstack-lbaas | 12:10 | |
*** fnaval has quit IRC | 12:11 | |
*** reedipb has quit IRC | 12:20 | |
*** yamamoto has joined #openstack-lbaas | 12:33 | |
*** velizarx has quit IRC | 12:53 | |
*** velizarx has joined #openstack-lbaas | 12:55 | |
*** jlaffaye_ has joined #openstack-lbaas | 13:00 | |
jlaffaye_ | hello, what is the difference between the service_auth and keystone_authtoken sections of the octavia conf file ? | 13:01 |
johnsom | crazik: yes, just stop the health manager processes and resume after your maintenance | 13:04 |
crazik | oh, thank you johnsom :) | 13:05 |
crazik | anyone did octavia upgrade from 2.0.1 to 2.0.2? | 13:05 |
crazik | ;) | 13:05 |
johnsom | jlaffaye_: service_auth is the keystone credential octavia uses when accessing other services, such as nova and neutron. Keystone_authtoken is used to verify user tokens. | 13:06 |
*** fnaval has joined #openstack-lbaas | 13:07 | |
johnsom | crazik: Carlos has a patch posted to address some DB issue scenarios. Please review it for him. | 13:08 |
*** fnaval has quit IRC | 13:11 | |
jlaffaye_ | johnsom: thanks, I guess I was confused because the octavia::keystone::authtoken puppet module takes a user/pass | 13:22 |
crazik | johnsom: any url for this patch? | 13:22 |
*** reedipb has joined #openstack-lbaas | 13:23 | |
jlaffaye_ | so I have a lb stuck in PENDING_DELETE because I messed up with oslo messaging config, what can I do as an operator to fix this king of thing ? | 13:39 |
*** fnaval has joined #openstack-lbaas | 13:44 | |
johnsom | crazik https://review.openstack.org/#/c/600876/ | 14:02 |
johnsom | jlaffaye_ PENDING_* states means a controller has ownership of the object. You can wait for that controller to timeout attempting retries, then it will go to ERROR and be deletable, etc. | 14:03 |
jlaffaye_ | well it appears to be idle after an exception in glanceclient and the state is still PENDING | 14:07 |
crazik | hmm | 14:21 |
crazik | yeah, the same thing with amqp probably is needed. | 14:23 |
*** yamamoto has quit IRC | 14:27 | |
*** yamamoto has joined #openstack-lbaas | 14:28 | |
*** yamamoto has quit IRC | 14:28 | |
*** yamamoto has joined #openstack-lbaas | 14:28 | |
*** yamamoto has quit IRC | 14:33 | |
*** sapd1_ has joined #openstack-lbaas | 14:44 | |
*** pcaruana has quit IRC | 14:47 | |
jiteka | hello, I did some testing today on spare pool feature with our environment with octavia queens and I realise that, | 14:48 |
jiteka | new LB are not using READY amphora but always booting new amphora VMs. | 14:48 |
jiteka | Spare amphora seems to be only used in case of faillover | 14:48 |
jiteka | 1. Is it the expected behaviour ? | 14:48 |
jiteka | 2. Is it behaving the same when not using ACTIVE_STANDBY topology ? | 14:48 |
jiteka | 3. Is that behaviour is changing in Rocky release ? | 14:48 |
jiteka | Note : actually in my last test, even failover was creating new amphora instead of using the spare ones | 14:48 |
jiteka | http://paste.openstack.org/show/729880/ | 14:48 |
johnsom | Yes, if you have anti-affinity enabled, spares pool (READY) amps will be bypassed | 14:51 |
jiteka | johnsom: ok it makes sense with what I read in the operator guide then | 14:53 |
johnsom | Yeah, we can't add existing instances to nova server groups. We could probably get creative, but we also don't want to cross the line into becoming a nova scheduler... sigh | 14:54 |
jiteka | johnsom: was thinking about something creative like a reconciliator service that periodically audit the amphora fleet cluster by cluster and detect the one running on the same compute to trigger some live-migration | 14:56 |
jiteka | but... it's far from a small config change | 14:56 |
jiteka | and not sure it would scale really good on a large number of LB | 14:57 |
huseyin | Hello, does anyone have an idea about the error “ERROR octavia.controller.worker.controller_worker TemplateSyntaxError: expected token 'end of statement block', got '.'” in the worker logs? | 14:58 |
huseyin | According to this bug report : https://bugzilla.redhat.com/show_bug.cgi?id=1551821 it is about Jinja2 version | 14:58 |
openstack | bugzilla.redhat.com bug 1551821 in openstack-octavia "Octavia requires jinja2 2.10" [High,Closed: errata] - Assigned to cgoncalves | 14:58 |
huseyin | Bu after upgrading to Jinja2 2.10 the problem still exists | 14:59 |
huseyin | pip show Jinja2 | 15:00 |
huseyin | Name: Jinja2 | 15:00 |
huseyin | Version: 2.10 | 15:00 |
huseyin | Summary: A small but fast and easy to use stand-alone template engine written in pure python. | 15:00 |
huseyin | Home-page: http://jinja.pocoo.org/ | 15:00 |
huseyin | Author: Armin Ronacher | 15:00 |
huseyin | Author-email: armin.ronacher@active-4.com | 15:00 |
huseyin | License: BSD | 15:00 |
huseyin | Location: /usr/local/lib/python2.7/dist-packages | 15:00 |
huseyin | Requires: MarkupSafe | 15:00 |
huseyin | Required-by: octavia, Flask | 15:00 |
huseyin | Any other idea? | 15:03 |
*** luksky has quit IRC | 15:05 | |
sapd1_ | huseyin: I think because your image is built by source from Master branch. So it will be conflict version with your octavia-worker. So I think you should install octavia-* rocky branch. | 15:06 |
johnsom | Did you customize the haproxy templates or are you running the stock templates? | 15:07 |
huseyin | sapd1_: Actually the amphora image is the one that I created while using ocata | 15:08 |
huseyin | johnsom: No I did not | 15:08 |
huseyin | Is it required to create a new image for queens? | 15:08 |
sapd1_ | huseyin: Yep. Please build a new one. | 15:09 |
johnsom | Ok, the Ocata image version likely has the wrong jinja version in it if you are running a newer control plane. We would have to check the release notes | 15:09 |
huseyin | Oh OK. Thanks a lot. I will create a new one immediately | 15:09 |
*** josephrsandoval has joined #openstack-lbaas | 15:10 | |
jiteka | thanks for the help johnsom :) | 15:10 |
jiteka | after chaning "enable_anti_affinity" to false, it use the spare amps | 15:11 |
*** yamamoto has joined #openstack-lbaas | 15:16 | |
*** fnaval has quit IRC | 15:30 | |
*** phuoc has joined #openstack-lbaas | 15:31 | |
*** phuoc_ has quit IRC | 15:33 | |
*** bcafarel has joined #openstack-lbaas | 15:37 | |
*** fnaval has joined #openstack-lbaas | 15:37 | |
*** Emine has quit IRC | 15:44 | |
openstackgerrit | Carlos Goncalves proposed openstack/octavia master: Make health checks resilient to DB outages https://review.openstack.org/600876 | 15:46 |
huseyin | thanks sapd1_ and johnsom | 15:50 |
huseyin | creating a new amphora image solved the problem | 15:50 |
huseyin | new image uses Jinja 2.10 btw | 15:50 |
*** nmanos has joined #openstack-lbaas | 15:54 | |
sapd1_ | huseyin: are you running octavia for production or test only? | 15:59 |
huseyin | sapd1_: I am testing now and planning to use in production after the tests | 16:01 |
huseyin | sapd1_: I started to test while using ocata but some packages like barbicanclient has problems | 16:02 |
huseyin | sapd1_: so I upgraded to queens last week, and running tests now | 16:03 |
*** huseyin has left #openstack-lbaas | 16:03 | |
*** huseyin has joined #openstack-lbaas | 16:03 | |
sapd1_ | huseyin: have you benchmark your LB yet | 16:04 |
huseyin | sapd1_: no I didn’t | 16:04 |
sapd1_ | I concern about performance. | 16:05 |
huseyin | sapd1_: what do you mean by concern? Could you please explain a bit more? | 16:06 |
*** ramishra has quit IRC | 16:07 | |
huseyin | octavia is the default implementation for the openstack as of pike release, isn’t it? | 16:07 |
huseyin | for lbaas service I mean | 16:07 |
johnsom | Yes it is | 16:07 |
sapd1_ | huseyin: Because currently octavia run only one haproxy process. So if you running on multi core. It does not increase performance . :D | 16:08 |
johnsom | sapd1_ I think you will be surprised at the performance. I have seen the underlying cloud networking performance is more of a limiting factor that Octavia. | 16:08 |
huseyin | what kind of performance problems did you find? | 16:08 |
johnsom | I have been able to do 14.5Gbps with a 1vcpu amp | 16:09 |
sapd1_ | huseyin: maximum new http request. :D | 16:09 |
johnsom | But those numbers are super dependent on your cloud platform and application.... So, your milage will vary | 16:09 |
johnsom | I have seen around 33,000 reqs per second when passing actual traffic. | 16:10 |
sapd1_ | johnsom: Yes. I try with SR-IOV so It's higher than OVS | 16:10 |
sapd1_ | what does req/s mean? How did you monitor it? | 16:10 |
johnsom | Yeah, there wasn't SR-IOV or OVS in that deployment | 16:11 |
johnsom | Using benchmarking tools, such as Tsung, weighttp, etc. | 16:11 |
huseyin | That sounds great | 16:11 |
johnsom | Far surpasses what I see most clouds being able to handle. | 16:12 |
*** velizarx has quit IRC | 16:12 | |
sapd1_ | johnsom: how many members in a pool? How many clients did you use in that test? | 16:12 |
johnsom | Three member servers, 10,000 "users" | 16:13 |
johnsom | But again, please note, this is HIGHLY dependent on your cloud and application. | 16:14 |
sapd1_ | yep. I just use ab and apache2 as web server backend. And "hello world" text only for benchmark | 16:14 |
johnsom | This is why we don't quote performance, many clouds cannot achieve that due to the cloud configuration, etc. | 16:14 |
johnsom | Ah, yeah, that will not really stress Octavia. | 16:15 |
johnsom | You will hit limits in ab and apache2 before you stress Octavia | 16:15 |
huseyin | johnsom: is there any plan to backup amp with keepalived for the redundancy? | 16:15 |
sapd1_ | huseyin: It has already implemented. | 16:16 |
johnsom | huseyin Built in since Mitaka | 16:16 |
sapd1_ | johnsom: Let me try tsung and weighttp tomorrow. You only run 1 vcpu amp to get 33k req/s, aren't you? | 16:17 |
johnsom | Correct | 16:17 |
sapd1_ | unbelievable | 16:17 |
johnsom | sapd_1 Grab my performance patch I posted too | 16:17 |
sapd1_ | johnsom: where I can find that patch? :D | 16:18 |
*** huseyin has quit IRC | 16:18 | |
johnsom | sapd1_ https://review.openstack.org/598379 | 16:19 |
sapd1_ | johnsom: Next week I will run Octavia in production. :| | 16:19 |
johnsom | Nice | 16:20 |
johnsom | Welcome to the club! grin | 16:20 |
sapd1_ | My boss want LB get higher performance I mean higher 6k req/s :D | 16:21 |
johnsom | sapd1_ We do that easily today, out of the box | 16:21 |
sapd1_ | oh. 6k req/s I collect from haproxy socket. (rate - measurements) | 16:21 |
johnsom | Just check your cloud networking to make sure it can handle that by bypassing Octavia. That is the biggest issue I see typically, the underlying cloud can't do it | 16:22 |
*** hongbin has joined #openstack-lbaas | 16:22 | |
sapd1_ | run ab process on three servers client. Each of process can get 2k req/s | 16:22 |
johnsom | Yeah, get a better benchmark tool, and faster web server. | 16:23 |
cgoncalves | johnsom, 14.5Gbps? that's really good. what was the frames size, 1500 bytes? :) | 16:23 |
johnsom | Yes, in TCP mode. Octavia test only, so co-located on the same compute host. | 16:24 |
johnsom | sapd1_ Since you are doing "hello world" consider an in-memory web service, such as using this handler: https://github.com/perusio/nginx-hello-world-module | 16:25 |
sapd1_ | 9Gbps in TCP :D | 16:25 |
johnsom | sapd1_ the splice settings will help that | 16:25 |
*** josephrsandoval has quit IRC | 16:26 | |
johnsom | Also check that your haproxy enables splice, I saw some distro packages that disabled it on the command line. | 16:26 |
sapd1_ | johnsom: I saw your patch. It will enable splice in haproxy. | 16:28 |
sapd1_ | johnsom: My backend server is not overload. Because If I stress test direct to backend server (apache2) result is 4k req/s :D | 16:30 |
johnsom | Yeah, that gave me a reasonable BW speed bump. | 16:30 |
johnsom | sapd_1 Ok, so consider a better benchmark tool as well. But you aren't going to get 6k/s with a web server only doing 4k.... grin | 16:31 |
sapd1_ | johnsom: I have 10 servers member for that test . | 16:32 |
johnsom | Ok | 16:32 |
sapd1_ | each of them can get 4k req/s :D | 16:32 |
sapd1_ | In the theory, I can get 40k req/s if haproxy can handle it. :D | 16:33 |
sapd1_ | johnsom: Do you think we should use a/a with DNS Round robin? We can create A record in designate for all AMP. | 16:35 |
johnsom | No, it adds a huge latency when using DNS | 16:35 |
johnsom | And that breaks a bunch of features as well | 16:36 |
sapd1_ | johnsom: I think ELB/ALB from AWS are using DNS | 16:37 |
johnsom | Yes they are for ELB, ALB is similar to Octavia | 16:37 |
johnsom | Doesn't mean it's the best solution.... | 16:37 |
sapd1_ | I have checked proposal for A/A, which use a director component. | 16:38 |
johnsom | Yes, there are around three proposals for A/A | 16:39 |
sapd1_ | So I have to use vRouter, don't I? | 16:39 |
johnsom | No | 16:39 |
johnsom | The reference A/A spec has no external resource requirements. | 16:39 |
johnsom | But, really do you have a use case that requires a/a? We find it hard to find | 16:40 |
sapd1_ | I have some customer, they require high performance for Loadbalancer :( | 16:41 |
sapd1_ | s/customer/customers/ | 16:41 |
johnsom | I would be interested to hear what their current rates are. | 16:42 |
sapd1_ | They said they need 20k requests/s. | 16:43 |
johnsom | I think with Stein work we can reach that without A/a | 16:43 |
johnsom | a/a target is 100k/s really | 16:43 |
sapd1_ | johnsom: It also depends on my network infrasture which you mentioned before | 16:45 |
sapd1_ | johnsom: I confused with current connections and new connections in haproxy. | 16:46 |
johnsom | sapd1_ are you interested in doing development on a/a? | 16:50 |
sapd1_ | I have a proposal for a/a using RNS for my company. | 16:50 |
johnsom | Ok. The reference A/A is stalled with no developers (I had to shift the priority on that down). It's at the stage where it needs some Ryu/Ken development work. The flows are all defined, it's just doing the library coding work. | 16:52 |
sapd1_ | johnsom: Can I join :D | 16:54 |
sapd1_ | johnsom: I don't understand Ryu/Ken mean. | 16:55 |
johnsom | sapd1_ Of course. If you have time we can sync up next week. | 16:55 |
johnsom | sapd1_ Oh, it's an OpenFlow / SDN controller library for python. | 16:55 |
johnsom | sapd1_ https://osrg.github.io/ryu/ | 16:56 |
johnsom | sapd1_ I thought you were going to work on cinder storage stuff. | 16:56 |
johnsom | sapd1_ Ryu is becoming neutron ken | 16:57 |
sapd1_ | johnsom: Last month I have to work to release Load balancer for public cloud. So I can't follow that patch. ( Do you mention boot from volume? - I guess) | 16:58 |
*** ccamposr has quit IRC | 16:59 | |
johnsom | Right | 16:59 |
sapd1_ | johnsom: I will re-read it (https://docs.openstack.org/octavia/latest/contributor/specs/version0.9/active-active-topology.html) | 17:02 |
*** hongbin_ has joined #openstack-lbaas | 17:06 | |
*** luksky has joined #openstack-lbaas | 17:07 | |
*** hongbin has quit IRC | 17:08 | |
*** Swami has joined #openstack-lbaas | 17:08 | |
*** AlexeyAbashkin has quit IRC | 17:09 | |
*** sapd1_ has quit IRC | 17:13 | |
*** huseyin has joined #openstack-lbaas | 17:15 | |
*** nmanos has quit IRC | 17:15 | |
*** hongbin_ has quit IRC | 17:22 | |
*** hongbin has joined #openstack-lbaas | 17:27 | |
*** pcaruana has joined #openstack-lbaas | 17:32 | |
*** hongbin has quit IRC | 17:46 | |
*** irenab has quit IRC | 18:02 | |
*** irenab has joined #openstack-lbaas | 18:03 | |
*** abaindur has joined #openstack-lbaas | 18:04 | |
abaindur | johnsom: hey, you around? | 18:04 |
abaindur | I needed some clarification about cert expiry behavior | 18:05 |
abaindur | rm_work: or maybe you since i see several commits by you related to cert stuf? | 18:06 |
*** huseyin has left #openstack-lbaas | 18:08 | |
abaindur | 1. If the self-generated amphora server certs expire, octavia takes care of this automatically, injecting it into the amphora using the internal API and req. no intervention on our part, correct? | 18:09 |
abaindur | 2. if the octavia controller's client cert expires, no failover or anything else needs to be done also, right? Besides obviously updating the cert in file on host and restarting services. But no failover or anything else | 18:09 |
johnsom | abaindur: I answered your questions already. See eavesdrop.openstack.org if you don’t have the scroll back | 18:10 |
abaindur | ah sorry, i logged off and didnt see em | 18:10 |
johnsom | Just about to be in another meeting though | 18:10 |
johnsom | That site has the channel logs | 18:11 |
abaindur | hmm i cant find them | 18:12 |
abaindur | http://eavesdrop.openstack.org/irclogs/%23openstack-dns/%23openstack-dns.2018-09-10.log.html | 18:12 |
abaindur | whooops | 18:12 |
abaindur | wrong channel | 18:12 |
abaindur | :) | 18:12 |
rm_work | yep i think johnsom got all the answers you needed hopefully | 18:14 |
rm_work | but essentially: yes | 18:14 |
abaindur | ok still a little confused about #4... "4 there is a discussion of this in the maintenance guide in our docs. I think again, if you do a proper renewal I think you can continue to operate. However this is renewing the certs issued by the CAs, the CAs themselves may have other implications." | 18:14 |
abaindur | at least how i'm doing it now, to "renew" the server CA, i'm just re-running the create_certificates.sh script to generate a new server CA | 18:15 |
abaindur | wouldn't that create a new server CA, thus making the amphora unable to validate, since amp certs were signed by old server CA? | 18:16 |
rm_work | err, yes, i believe so | 18:16 |
rm_work | but i think there is a way to renew a CA cert as well | 18:17 |
rm_work | hmm | 18:17 |
rm_work | it's an interesting question, i haven't been deeply involved enough with that CA stuff :P | 18:17 |
abaindur | "If the amphora CA changed in a way which jeopardizes validation of the amphora certificate an operator can manually upload newly issued amphora certificates by switching off validation of the old amphora certificate. This requires a client certificate which can be validated by the client CA file on the amphora. Refer to Octavia HAProxy Amphora API for more details." | 18:17 |
abaindur | https://docs.openstack.org/octavia/queens/admin/guides/operator-maintenance.html#best-practice | 18:17 |
abaindur | under the Rotating Amphora Certificates section | 18:17 |
abaindur | i cant find that API or how to do that in haproxy api: https://docs.openstack.org/octavia/queens/contributor/api/haproxy-amphora-api.html#upload-ssl-server-certificate-pem-file-for-controller-communication | 18:18 |
abaindur | theres somehow a way to disable cert validation for unencrypted communication, via the amphora API? | 18:18 |
openstackgerrit | Carlos Goncalves proposed openstack/octavia master: Make health checks resilient to DB outages https://review.openstack.org/600876 | 18:21 |
abaindur | sorry for all the questions - trying to figure out best way to automate cert issuance and renewal, with least amount of manual intervention (like failing over amphora) needed or downtime | 18:21 |
johnsom | abaindur: did you get what you need. There is also discussion of this in the maintenance guide. | 19:18 |
johnsom | abaindur: did you get what you need. There is also discussion of this in the maintenance guide. | 19:18 |
abaindur | see above ^^ | 19:18 |
abaindur | Yeah, I read the maintenance guide | 19:18 |
abaindur | We are concerned about what needs to be done when the CA's expire. Seems like this requires triggering a manual failover, thus manual intervention, of every amphora which would be disruptive | 19:19 |
abaindur | and require some kind of planning and donwtime by end customers | 19:19 |
abaindur | johnsom: "switching off validation of the old amphora certificate" - which API is that? i dont see it documented in haproxy-amphora-api doc | 19:21 |
*** luksky11 has joined #openstack-lbaas | 19:22 | |
abaindur | it would be nice if Octavia monitors expiry of the server and client CA's themselves, could turn off validation like mentioned above, then generate new CSRs/certs and upload them to the amphora via API | 19:23 |
abaindur | perhaps if the ca configs are empty, it even generates its own self signed CA's and monitors them for expiry | 19:24 |
*** luksky has quit IRC | 19:25 | |
johnsom | xgerman_: wrote that part, maybe he can clarify | 19:30 |
xgerman_ | abaindur: it says: "Octavia will also monitor those certificates and refresh them before they expire." | 19:53 |
xgerman_ | so you don’t have to worry about amphora certs | 19:53 |
xgerman_ | now, if you need to rotate the certs on the control plane that will require manual work or some automation tool. | 19:55 |
*** pcaruana has quit IRC | 19:55 | |
abaindur | xgerman_: so reissuing both the server CA (which issues amp certs), or the client CA requires failing over every amphora? | 19:56 |
abaindur | either of the two CA's certs themselves | 19:57 |
xgerman_ | Yes, but depnding on your security requirements youc an set them to 100 years | 19:58 |
abaindur | xgerman_: any reason why there isn't a way to upload the client_ca into the amph via the API? as we do with amp cert refresh | 19:59 |
abaindur | in this case octavia should still be able to talk to the amp, and upload the new client CA | 19:59 |
abaindur | If that could be done, then we'd need manual intervention/failover of all amphora only when we reissue the server CA | 20:00 |
xgerman_ | how often are you rotating certs? | 20:01 |
abaindur | currently never :) because we havent deployed yet | 20:02 |
xgerman_ | :-) We assume control plane is “safe” and so you can choose a very long expiration date... | 20:02 |
abaindur | true but here the host has the server CA's priv key and cert and passhrase stored in plaintext on filesysstem | 20:03 |
abaindur | so its located on same DCs or even same hosts as VMs and amphora | 20:05 |
xgerman_ | Yes, we never wanted to be a CA. But last time I asked the OpenStack people they said install dogtag | 20:05 |
xgerman_ | no, certs are on control hosts… if they are on the same hosts as VMs you are not building a big cloud. Most clouds we have seen had dedicated control hosts | 20:06 |
abaindur | this is just in test env, but its needed to be in same DC/rack as the hosts where VMs reside, no? | 20:07 |
abaindur | similar to how OVS and l3 agents in neutron are deployed | 20:07 |
abaindur | at least, because the octavia-worker needs to be ablt to talk to the amphora VM directly over the LB mgmt network | 20:07 |
abaindur | so the controller host's networking needs access to the amphora's fixed IP | 20:08 |
abaindur | octavia-api service is completely elsewhere, but -worker, -housekeeping, and health-manager said they need access to LB mgmt network | 20:09 |
johnsom | Don't forget that lb-mgmt-net is isolated and not accessible from the code/interfaces handling tenant traffic. | 20:10 |
*** hongbin has joined #openstack-lbaas | 20:12 | |
johnsom | I don't disagree that there could be more automation here. There is a long history with this code and "OpenStack CA-as-a-Service" that never materialized, etc. So, yes, given today's environment we probably should make some changes to automate some parts and allow shorter expiration on some of the control plane certs. We are an open volunteer community, so patches are welcome! | 20:13 |
johnsom | Adding stories describing your needs is a start as well. | 20:14 |
abaindur | ive been trying to dig thru the code to see if i can hack up something myself, at least for auto-updating and monitoring client_ca expiry, so if i come up with something i'll def send for a review :) | 20:16 |
xgerman_ | Yeah, as I said in practice most people run with really long expiration dates and at most three octavia-control instances — so we are talking about keeping track of four certs... | 20:18 |
xgerman_ | But agree with johnsom if we can automate make it better — I am all for it | 20:19 |
*** Emine has joined #openstack-lbaas | 21:45 | |
*** Emine has quit IRC | 21:49 | |
*** luksky11 has quit IRC | 21:50 | |
*** hongbin has quit IRC | 21:58 | |
*** hongbin has joined #openstack-lbaas | 22:01 | |
*** josephrsandoval has joined #openstack-lbaas | 22:12 | |
colin- | had a pretty easy time with the queens -> rocky upgrade, thanks for keeping that smooth :) | 22:21 |
xgerman_ | happy dance | 22:22 |
colin- | >UDP protocol support requires an update to the amphora image to support UDP protocol statistics reporting and UDP-CONNECT health monitoring | 22:22 |
colin- | was this the update that happened to my existing amphora automatically after the control plane services were online? | 22:22 |
xgerman_ | No, this is something you will need to failover the amphora for | 22:23 |
colin- | understood | 22:23 |
xgerman_ | but udp should apply to new amps IF you deployed the image | 22:24 |
colin- | is that upgrade note expanded elsewhere in the project? just curious about the nature of the amp image changes | 22:37 |
*** rcernin has joined #openstack-lbaas | 22:43 | |
*** fnaval has quit IRC | 23:02 | |
*** josephrsandoval has quit IRC | 23:11 | |
*** josephrsandoval has joined #openstack-lbaas | 23:12 | |
*** josephrsandoval has quit IRC | 23:16 | |
*** amuller has quit IRC | 23:42 | |
*** Swami has quit IRC | 23:44 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!