*** yamamoto has quit IRC | 00:34 | |
*** rcernin has joined #openstack-lbaas | 01:59 | |
*** yamamoto has joined #openstack-lbaas | 02:16 | |
*** ramishra has joined #openstack-lbaas | 03:08 | |
*** ramishra has quit IRC | 03:08 | |
*** ramishra has joined #openstack-lbaas | 03:08 | |
*** psachin has joined #openstack-lbaas | 03:27 | |
*** psachin has quit IRC | 04:13 | |
*** yamamoto has quit IRC | 05:20 | |
*** tkajinam has quit IRC | 05:45 | |
*** tkajinam has joined #openstack-lbaas | 05:46 | |
*** vishalmanchanda has joined #openstack-lbaas | 06:00 | |
*** gcheresh has joined #openstack-lbaas | 07:25 | |
*** ataraday_ has joined #openstack-lbaas | 07:25 | |
*** dayou has quit IRC | 08:30 | |
*** dayou has joined #openstack-lbaas | 08:32 | |
*** zasherif has joined #openstack-lbaas | 08:33 | |
*** dayou has quit IRC | 08:38 | |
*** dayou has joined #openstack-lbaas | 08:39 | |
*** maciejjozefczyk has joined #openstack-lbaas | 08:41 | |
*** dayou has quit IRC | 08:48 | |
*** dayou has joined #openstack-lbaas | 08:50 | |
*** dayou has quit IRC | 08:50 | |
rm_work | cgoncalves: :) | 08:54 |
---|---|---|
openstackgerrit | Adam Harwell proposed openstack/octavia-tempest-plugin master: Remove old api_v1_enabled cruft from job defs https://review.opendev.org/713879 | 08:56 |
*** gcheresh has quit IRC | 08:57 | |
rm_work | sorrison: you think you might have some time to finish up https://review.opendev.org/#/c/695349/ ? make it actually use a different network such that it proves all the networking code works? | 09:00 |
rm_work | cgoncalves: hoping to get in https://review.opendev.org/#/c/589180/ today | 09:01 |
*** JayLiu has quit IRC | 09:24 | |
*** zasherif has quit IRC | 09:26 | |
openstackgerrit | Adam Harwell proposed openstack/octavia master: WIP: Failover stop threshold https://review.opendev.org/656811 | 09:53 |
*** born2bake has joined #openstack-lbaas | 10:01 | |
rm_work | FYI I'm picking up work on https://review.opendev.org/#/c/656811/ again | 10:14 |
rm_work | I think I should be able to get something up and working soon with that base I had started on earlier | 10:14 |
*** tkajinam has quit IRC | 10:31 | |
rm_work | interesting, just noticed we have amps going to ERROR on cert rotation from housekeeping... it's getting a 500 response from the amp agent on the rotate attempt | 10:54 |
rm_work | all i've got from the amp-agent log is this: | 11:01 |
rm_work | [05/Apr/2020:23:25:20 +0000] "PUT /1.0/certificate HTTP/1.1" 500 209 "-" "Octavia HaProxy Rest Client/0.5 (https://wiki.openstack.org/wiki/Octavia)" | 11:01 |
*** gcheresh has joined #openstack-lbaas | 11:02 | |
openstackgerrit | Merged openstack/octavia-tempest-plugin master: Add devstack plugin support https://review.opendev.org/708451 | 11:04 |
rm_work | sorry, new one: [10/Apr/2020:23:25:24 +0000] "PUT /1.0/certificate HTTP/1.1" 500 209 "-" "Octavia HaProxy Rest Client/0.5 (https://wiki.openstack.org/wiki/Octavia)" | 11:06 |
rm_work | file appears changed at the right time: | 11:06 |
rm_work | -rw-rw----. 1 root root 0 Apr 10 23:25 /etc/octavia/certs/server.pem | 11:06 |
rm_work | but it's empty | 11:06 |
rm_work | so, cert maybe didn't get sent over right? O_o | 11:07 |
rm_work | broke during stream read? | 11:07 |
rm_work | unclear | 11:07 |
rm_work | but this is going to happen to all our amps and break them, I think | 11:08 |
rm_work | I might have to turn off housekeeping until i figure this out <_< | 11:08 |
*** ccamposr has joined #openstack-lbaas | 11:15 | |
openstackgerrit | Adam Harwell proposed openstack/octavia master: Use routed network filter if it exists https://review.opendev.org/706153 | 11:23 |
*** sapd1 has joined #openstack-lbaas | 11:34 | |
*** sapd1 has quit IRC | 11:36 | |
rm_work | johnsom: commented on failover rewrite, one issue with a missing requires, i think. also, would be nice to just temporarily shove back in the logic that deletes spares for this cycle -- shouldn't be TOO hard? realized that'd be a kind of annoying functionality gap to have between cycles | 11:40 |
*** sapd1 has joined #openstack-lbaas | 11:41 | |
*** sapd1 has quit IRC | 11:41 | |
*** sapd1 has joined #openstack-lbaas | 11:57 | |
*** sapd1 has quit IRC | 12:04 | |
*** tkajinam has joined #openstack-lbaas | 12:05 | |
rm_work | WOAH | 12:17 |
rm_work | did SQLite fix transactions?! O_O | 12:17 |
rm_work | octavia.tests.functional.db.test_repositories.AllRepositoriesTest.test_sqlite_transactions_broken | 12:17 |
rm_work | started failing | 12:18 |
rm_work | along with another test that uses rollbacks | 12:18 |
rm_work | guessing in SQLAlchemy 1.3.16 | 12:23 |
rm_work | https://docs.sqlalchemy.org/en/13/changelog/changelog_13.html#change-4786673f9a875e26cf92b01ec92c4975 | 12:23 |
rm_work | maybe? | 12:23 |
rm_work | testing | 12:24 |
rm_work | hmm no | 12:26 |
rm_work | i feel like that REALLY should be it though | 12:36 |
rm_work | not sure why i can't replicate locally | 12:37 |
*** gcheresh has quit IRC | 13:21 | |
*** sapd1 has joined #openstack-lbaas | 13:27 | |
*** ataraday_ has quit IRC | 13:28 | |
*** rcernin has quit IRC | 13:41 | |
*** gcheresh has joined #openstack-lbaas | 14:26 | |
*** maciejjozefczyk has quit IRC | 14:35 | |
*** gcheresh has quit IRC | 14:35 | |
*** servagem has joined #openstack-lbaas | 14:36 | |
johnsom | Morning folks. rm_work are you working on the housekeeping cert issue? | 14:55 |
rm_work | haven't gotten a chance yet | 14:56 |
rm_work | working this sqlite issue | 14:56 |
rm_work | it's blocking gates | 14:56 |
rm_work | and my internal build also | 14:56 |
openstackgerrit | Merged openstack/octavia master: Imported Translations from Zanata https://review.opendev.org/719158 | 15:22 |
rm_work | johnsom: i wonder if it's a py3 thing? | 16:00 |
rm_work | johnsom: i guess it's prolly difficult for you to look into that... but if you could fix your failover per my comments that'd be sweet :) | 16:00 |
johnsom | The foreign key thing? | 16:00 |
rm_work | yes | 16:00 |
rm_work | err no | 16:00 |
rm_work | that's what I am working on right now | 16:01 |
johnsom | Seems unlikely to be py3 related | 16:01 |
rm_work | i meant the amp cert refresh issue | 16:01 |
johnsom | Oh | 16:01 |
rm_work | but since you prolly can't easily replicate | 16:01 |
johnsom | Maybe. or single-proc related | 16:01 |
rm_work | could be | 16:01 |
*** gthiemonge has quit IRC | 16:01 | |
rm_work | but i doubt it | 16:01 |
rm_work | the path is super simple for this cert thing | 16:01 |
johnsom | Well, we can drop the interval to short windows for debug. | 16:01 |
rm_work | pretty much the only thing i can see that'd kill it would be flask not passing the stream through right, or else something with opening the file | 16:02 |
johnsom | But, yeah, as soon as I'm done digging out of e-mail my plan was to wrap up failover today | 16:02 |
rm_work | kk | 16:02 |
rm_work | i commented | 16:02 |
johnsom | Yeah, I still have some good comments from Ann to address as well. | 16:02 |
*** gthiemonge has joined #openstack-lbaas | 16:03 | |
rm_work | but yeah until i fix this gate blocker, we're stuck | 16:14 |
*** armax has joined #openstack-lbaas | 16:28 | |
rm_work | johnsom: ummm wat: https://github.com/openstack/octavia/commit/19d80f11a43d95d93e774b21b1f070f1fdd3f0d9 | 16:38 |
rm_work | i'm super confused | 16:38 |
rm_work | cfg.CONF.register_cli_opts(healthmanager_opts, group='health_manager') | 16:38 |
rm_work | why was that ever a cli opt? | 16:38 |
rm_work | it's making my unit test runs fail | 16:39 |
rm_work | that should be a normal register_opts() shouldn't it? | 16:39 |
johnsom | Yeah, probably | 16:39 |
rm_work | T_T | 16:39 |
rm_work | weird | 16:39 |
rm_work | will need to revert that specific bit prolly in my other gate fix | 16:40 |
johnsom | That wasn't the issue for that patch however, it was needed for the core cli opts | 16:40 |
rm_work | yeah we prolly should have commented that the HM opts weren't actually cli lol | 16:40 |
johnsom | They have been that way a long time | 16:41 |
rm_work | yes but i assume it's a bug | 16:41 |
rm_work | bad copy/paste or something | 16:41 |
rm_work | there's nothing CLI-ey about HMs | 16:41 |
rm_work | unless I'm misunderstanding what CLI means in this case | 16:41 |
johnsom | Yeah, we don't have any options there that really need to be set via cli | 16:42 |
johnsom | It was done in 2015 lol | 16:43 |
rm_work | so yeah ok | 16:43 |
rm_work | will put that back | 16:43 |
rm_work | and un-cli it | 16:43 |
*** maciejjozefczyk has joined #openstack-lbaas | 16:56 | |
rm_work | hmm i am not having great luck replicating this sqlite issue | 17:33 |
rm_work | i installed the newest version and rebuilt my python against it | 17:33 |
johnsom | It repros for me | 17:33 |
rm_work | and i can get it to replicate... sometimes | 17:33 |
rm_work | but not reliably | 17:33 |
rm_work | and it only does it if i run the entire test suite T_T | 17:33 |
johnsom | Yeah, I have only done one functional run, so ... | 17:33 |
rm_work | though the FK issue does seem more reliable, but i don't know what it's about yet, i had assumed it was related | 17:34 |
*** gcheresh has joined #openstack-lbaas | 17:52 | |
*** gthiemonge has quit IRC | 17:56 | |
johnsom | Yeah, something is up, unit test output: Failed to fetch load_balancer 0f5dd3ab-f6cd-4904-8260-f3458b9cf3ea from DB. Retrying for up to 60 seconds. | 17:56 |
johnsom | Failed to fetch load_balancer 0f5dd3ab-f6cd-4904-8260-f3458b9cf3ea from DB. Retrying for up to 60 seconds. | 17:56 |
johnsom | Failed to fetch load_balancer 0f5dd3ab-f6cd-4904-8260-f3458b9cf3ea from DB. Retrying for up to 60 seconds. | 17:56 |
*** gthiemonge has joined #openstack-lbaas | 17:57 | |
*** ccamposr__ has joined #openstack-lbaas | 17:58 | |
rm_work | err which issue is that | 18:00 |
johnsom | Just popped up on a single unit test run | 18:00 |
johnsom | octavia.tests.unit.controller.worker.v1.test_controller_worker.TestControllerWorker.test_create_load_balancer_single | 18:00 |
rm_work | hmm | 18:00 |
*** ccamposr has quit IRC | 18:00 | |
johnsom | I think this new sqlalchemy is broken for sqlite | 18:01 |
rm_work | i think it's actually a new sqlite version? | 18:21 |
rm_work | ... unclear | 18:21 |
johnsom | My money is on sqlalchemy | 18:21 |
rm_work | well, also, by "broken" you mean "fixed" | 18:21 |
rm_work | or something? | 18:21 |
rm_work | <_< | 18:21 |
johnsom | Yeah, it passes functional with 1.3.15 | 18:24 |
johnsom | 1.3.16 bombs | 18:24 |
rm_work | hmm | 18:25 |
rm_work | i was poking zzzeek about it earlier | 18:25 |
rm_work | he said the change in 1.3.16 shouldn't affect this | 18:25 |
rm_work | unless we're doing wonky stuff | 18:25 |
johnsom | There are two suspect changes, one with an order by removed, the other is the autocommit with sqlite | 18:25 |
rm_work | [07:46:03] zzzeek:it's important however if you are doing any tinkering with that connection in your application's setup | 18:26 |
rm_work | [07:46:37] zzzeek:as far as the sqlalchemy change it just added some more options to the set_isolation_level() method and if you aren't giving sqlalchemy any execution_options(isolation_level) settings, that change would not impact you | 18:26 |
rm_work | though i don't think we do that | 18:26 |
rm_work | but yes the timing is so close... | 18:26 |
rm_work | it's POSSIBLE the ubuntu image updated to include a newer sqlite | 18:27 |
rm_work | but ... yeah much more likely it seems that sqlalchemy release 3 days ago caused this | 18:27 |
johnsom | I just went into the tox venv and installed the other version of sqlalchemy, problem went away | 18:27 |
johnsom | We do use autocommit and we use non-autocommit transactions | 18:28 |
rm_work | k | 18:28 |
johnsom | Neither are "wonky" | 18:28 |
rm_work | well, what do you think is the best move here | 18:28 |
rm_work | we can pin it and deal with this after U | 18:28 |
rm_work | or ... actually we can't can we | 18:29 |
rm_work | we don't control our own upper-constraints | 18:29 |
rm_work | hmm | 18:29 |
johnsom | Yeah, I think we can pin it, just not sure if that is the right answer or not | 18:29 |
rm_work | well, we are running out of days to merge stuff | 18:30 |
rm_work | i wanted to merge the UDP thing today | 18:30 |
rm_work | but that may be a pipe dream | 18:30 |
*** gthiemonge has quit IRC | 18:31 | |
*** gthiemonge has joined #openstack-lbaas | 18:31 | |
johnsom | So the test in question does: | 18:39 |
johnsom | Start a non-autocommit transaction | 18:39 |
johnsom | inside that, a subtransaction to create the LB, then another subtransaction to create the pool. The latter can no longer (1.3.16) see the results of the first subtransaction, the LB create | 18:40 |
*** ccamposr__ has quit IRC | 18:41 | |
rm_work | so that test was actually bad? | 18:43 |
rm_work | because that seems like | 18:43 |
johnsom | No, that is the code, not the test really | 18:43 |
rm_work | an issue for a WORKING transaction model? | 18:43 |
rm_work | ermm | 18:43 |
rm_work | hh | 18:43 |
rm_work | *hmm | 18:43 |
johnsom | The LB create one, when the subtransaction closes it should now be visible in the main transaction, but it's not now | 18:44 |
rm_work | may need to poke at zzzeek | 18:45 |
johnsom | Yeah, in that patch I don't know why they are setting the isolation level to "" nor what that means to the stack below sqlalchemy | 18:48 |
*** larsks has joined #openstack-lbaas | 18:53 | |
*** larsks has left #openstack-lbaas | 18:53 | |
rm_work | i asked him to join here | 18:55 |
rm_work | we'll see if he does :D | 18:55 |
rm_work | meanwhile i need to go sleep | 18:55 |
rm_work | first thing i'll be looking at is the cert-refresh housekeeping issue | 18:58 |
johnsom | ok | 18:58 |
rm_work | because i have had to shut down the service in our clouds | 18:58 |
rm_work | it was just going through and breaking all the amps i just fixed | 18:59 |
rm_work | and because of the way it broke them, i need your failover patch to actually restore them | 18:59 |
rm_work | lol | 18:59 |
rm_work | which means i need to be able to actually build a package, which requires tests passing T_T | 19:00 |
rm_work | fff | 19:00 |
*** sapd1 has quit IRC | 19:05 | |
*** ccamposr has joined #openstack-lbaas | 19:08 | |
*** maciejjozefczyk has quit IRC | 19:13 | |
*** vishalmanchanda has quit IRC | 19:39 | |
*** irclogbot_3 has quit IRC | 19:51 | |
*** irclogbot_1 has joined #openstack-lbaas | 19:52 | |
*** gcheresh has quit IRC | 20:13 | |
*** maciejjozefczyk has joined #openstack-lbaas | 20:34 | |
*** servagem has quit IRC | 20:43 | |
*** maciejjozefczyk has quit IRC | 20:56 | |
*** gthiemonge has quit IRC | 21:01 | |
*** gthiemonge has joined #openstack-lbaas | 21:01 | |
*** maciejjozefczyk has joined #openstack-lbaas | 21:26 |
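
Note on the transaction behaviour johnsom describes at 18:39-18:44: the expectation is that work done in a first nested scope (the LB create) is visible to a second nested scope (the pool create) inside the same outer transaction. The sketch below shows that expectation against plain SQLite using only the Python standard library. It is an illustration under assumptions, not Octavia's code: the nested scopes are modelled as SAVEPOINTs, the table names and the function name `savepoint_visibility_demo` are hypothetical, and it does not involve SQLAlchemy at all.

```python
import sqlite3


def savepoint_visibility_demo():
    """Return (rows seen inside the outer transaction, rows left after rollback)."""
    # isolation_level=None stops the sqlite3 module from issuing implicit
    # BEGINs, so every transaction statement below is explicit.
    conn = sqlite3.connect(":memory:", isolation_level=None)
    conn.execute("PRAGMA foreign_keys = ON")
    conn.execute("CREATE TABLE load_balancer (id TEXT PRIMARY KEY)")
    conn.execute(
        "CREATE TABLE pool ("
        "id TEXT PRIMARY KEY, "
        "load_balancer_id TEXT NOT NULL REFERENCES load_balancer(id))"
    )

    # Outer (non-autocommit) transaction.
    conn.execute("BEGIN")

    # First nested scope: create the load balancer.
    conn.execute("SAVEPOINT create_lb")
    conn.execute("INSERT INTO load_balancer (id) VALUES ('lb-1')")
    conn.execute("RELEASE SAVEPOINT create_lb")

    # Second nested scope: it should see the row from the first one,
    # otherwise the foreign key on pool.load_balancer_id would fail.
    conn.execute("SAVEPOINT create_pool")
    seen = conn.execute(
        "SELECT COUNT(*) FROM load_balancer WHERE id = 'lb-1'"
    ).fetchone()[0]
    conn.execute(
        "INSERT INTO pool (id, load_balancer_id) VALUES ('pool-1', 'lb-1')"
    )
    conn.execute("RELEASE SAVEPOINT create_pool")

    # Rolling back the outer transaction discards both nested scopes.
    conn.execute("ROLLBACK")
    left = conn.execute("SELECT COUNT(*) FROM load_balancer").fetchone()[0]
    conn.close()
    return seen, left
```

If a driver or ORM layer changes when it begins or commits on the connection, the second scope can end up running outside the transaction that holds the first scope's row, which would look like the symptom reported above. Whether that is what happened with 1.3.16 is not established in this log.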
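
Note on the empty `server.pem` rm_work reports at 11:06-11:07: one of the hypotheses raised ("broke during stream read?") fits a simple failure mode in which a file is opened for writing, which truncates it, and the incoming stream then fails before any bytes arrive. The sketch below reproduces only that general pattern with the standard library. The function names are hypothetical and this is not the amphora agent's implementation. It is only meant to show why a 0-byte file with a fresh timestamp is consistent with that hypothesis.

```python
import os
import tempfile


def failing_stream():
    """Hypothetical request body that errors before yielding any data."""
    raise OSError("stream read failed")
    yield b""  # unreachable; makes this function a generator


def write_cert(path, chunks):
    # Opening with "wb" truncates the existing file immediately,
    # before the first chunk has been read from the stream.
    with open(path, "wb") as cert_file:
        for chunk in chunks:
            cert_file.write(chunk)


def truncated_cert_size():
    """Return the size of the cert file after a failed rotation attempt."""
    with tempfile.TemporaryDirectory() as tmp_dir:
        path = os.path.join(tmp_dir, "server.pem")
        with open(path, "wb") as cert_file:
            cert_file.write(b"OLD CERT")
        try:
            write_cert(path, failing_stream())
        except OSError:
            pass  # a real handler would surface this as an error response
        return os.path.getsize(path)
```

In this pattern the old certificate is already gone by the time the read fails, so the file is left empty with an updated modification time.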