Friday, 2017-08-11

*** sshank has quit IRC00:03
*** dayou has quit IRC00:07
*** dayou has joined #openstack-lbaas00:08
*** cpusmith has quit IRC00:11
openstackgerritJude Cross proposed openstack/octavia-tempest-plugin master: Create scenario tests for load balancers  https://review.openstack.org/48677500:17
openstackgerritJude Cross proposed openstack/octavia-tempest-plugin master: Create scenario tests for pools  https://review.openstack.org/49267300:17
openstackgerritJude Cross proposed openstack/octavia-tempest-plugin master: Create scenario tests for listeners  https://review.openstack.org/49231100:17
openstackgerritxuchaochao proposed openstack/neutron-lbaas master: Add a compatible check before creating pool  https://review.openstack.org/49235701:36
*** fnaval has joined #openstack-lbaas01:55
*** fnaval_ has joined #openstack-lbaas02:02
*** fnaval has quit IRC02:04
*** sanfern has joined #openstack-lbaas02:07
*** ianychoi_ has joined #openstack-lbaas02:11
*** sanfern has quit IRC02:17
*** gongysh has joined #openstack-lbaas02:17
*** ianychoi has quit IRC02:18
*** dayou has quit IRC02:18
*** openstackgerrit has quit IRC02:21
*** armax has quit IRC02:21
*** m-greene_ has quit IRC02:21
*** armax has joined #openstack-lbaas02:21
*** xingzhang has joined #openstack-lbaas02:22
*** m-greene_ has joined #openstack-lbaas02:24
*** dayou has joined #openstack-lbaas02:32
rm_workxgerman_: want to https://review.openstack.org/#/c/492233/02:48
rm_workjohnsom: ^^ cut release after that merges?02:51
johnsomOk, yeah, I think we now have it covered, just waiting on merges.02:52
johnsomrm_work Thanks!02:52
*** yamamoto has quit IRC02:52
*** yamamoto has joined #openstack-lbaas02:53
johnsom(working on date night to get this in, so far she hasn't figured it out)02:56
johnsomDownside is watching cheesy shows to keep her distracted02:57
*** openstackgerrit has joined #openstack-lbaas02:59
openstackgerritMerged openstack/neutron-lbaas-dashboard master: Imported Translations from Zanata  https://review.openstack.org/49244802:59
rm_worklol03:00
rm_workheading out myself03:00
*** xingzhang_ has joined #openstack-lbaas03:05
*** xingzhang has quit IRC03:08
*** gongysh has quit IRC03:34
*** sanfern has joined #openstack-lbaas03:39
*** aojea has joined #openstack-lbaas03:41
*** aojea has quit IRC03:46
openstackgerritMerged openstack/octavia master: Fix LB creation with VIP port  https://review.openstack.org/49264903:48
*** gans has joined #openstack-lbaas03:52
openstackgerritMerged openstack/octavia master: Update devstack readme.md  https://review.openstack.org/49223303:58
johnsomWahoo, RC1 release patch is up for review04:02
*** xingzhang_ has quit IRC04:23
*** xingzhang has joined #openstack-lbaas04:24
*** gans819 has joined #openstack-lbaas04:25
*** gans has quit IRC04:28
*** sanfern has quit IRC04:56
*** sanfern has joined #openstack-lbaas05:03
*** yamamoto has quit IRC05:03
*** yamamoto has joined #openstack-lbaas05:04
*** yamamoto has quit IRC05:11
*** xingzhang has quit IRC05:30
*** xingzhang has joined #openstack-lbaas05:30
*** gongysh has joined #openstack-lbaas05:43
*** armax has quit IRC05:51
*** armax has joined #openstack-lbaas05:52
*** armax has quit IRC05:52
*** armax has joined #openstack-lbaas05:52
*** armax has quit IRC05:53
*** armax has joined #openstack-lbaas05:53
*** armax has quit IRC05:54
*** armax has joined #openstack-lbaas05:54
*** armax has quit IRC05:54
*** armax has joined #openstack-lbaas05:55
*** armax has quit IRC05:55
*** armax has joined #openstack-lbaas05:56
*** armax has quit IRC05:56
*** armax has joined #openstack-lbaas05:57
*** armax has quit IRC05:57
*** armax has joined #openstack-lbaas05:57
*** armax has quit IRC05:58
*** sanfern has quit IRC05:59
*** tesseract has joined #openstack-lbaas06:16
*** yamamoto has joined #openstack-lbaas06:19
*** rcernin has joined #openstack-lbaas06:22
*** yamamoto has quit IRC06:24
*** rtjure has joined #openstack-lbaas06:32
*** sanfern has joined #openstack-lbaas06:33
*** rajivk has quit IRC06:33
*** sanfern has quit IRC06:39
*** ajo has joined #openstack-lbaas06:44
*** rajivk has joined #openstack-lbaas06:45
*** sanfern has joined #openstack-lbaas06:46
*** yamamoto has joined #openstack-lbaas06:52
*** amotoki has joined #openstack-lbaas06:57
*** yamamoto has quit IRC07:07
*** openstackgerrit has quit IRC08:02
*** yamamoto has joined #openstack-lbaas08:04
*** dayou has quit IRC08:17
*** yamamoto has quit IRC08:17
*** openstackgerrit has joined #openstack-lbaas08:21
openstackgerritZhaoBo proposed openstack/octavia master: Extend api to accept qos_policy_id  https://review.openstack.org/45830808:21
*** Alex_Staf has joined #openstack-lbaas08:25
openstackgerritOpenStack Release Bot proposed openstack/neutron-lbaas master: Update reno for stable/pike  https://review.openstack.org/49287208:28
openstackgerritOpenStack Release Bot proposed openstack/octavia master: Update reno for stable/pike  https://review.openstack.org/49287508:28
*** yamamoto has joined #openstack-lbaas08:44
*** yamamoto has quit IRC08:56
*** sanfern has quit IRC09:01
*** sanfern has joined #openstack-lbaas09:01
*** sanfern has quit IRC09:02
*** sanfern has joined #openstack-lbaas09:02
*** sanfern has quit IRC09:02
*** sanfern has joined #openstack-lbaas09:03
*** sanfern has quit IRC09:03
*** sanfern has joined #openstack-lbaas09:04
*** sanfern has quit IRC09:04
*** sanfern has joined #openstack-lbaas09:04
*** sanfern has quit IRC09:05
*** yamamoto has joined #openstack-lbaas09:24
*** yamamoto has quit IRC09:35
*** ianychoi_ is now known as ianychoi09:36
*** Alex_Staf has quit IRC10:01
*** Alex_Staf has joined #openstack-lbaas10:15
*** yamamoto has joined #openstack-lbaas10:32
*** xingzhang has quit IRC10:46
*** xingzhang has joined #openstack-lbaas10:46
*** yamamoto has quit IRC10:48
*** xingzhang has quit IRC10:59
*** gans819 has quit IRC11:05
*** gongysh has quit IRC11:17
*** ajo has quit IRC11:22
*** m-greene_ has quit IRC11:27
*** m-greene_ has joined #openstack-lbaas11:27
*** yamamoto has joined #openstack-lbaas11:35
*** yamamoto has quit IRC11:48
*** yamamoto has joined #openstack-lbaas11:48
*** yamamoto has quit IRC11:55
*** yamamoto has joined #openstack-lbaas11:57
*** dasanind has quit IRC12:19
*** zioproto has quit IRC12:19
*** amitry has quit IRC12:19
*** amitry has joined #openstack-lbaas12:19
*** zioproto has joined #openstack-lbaas12:19
*** dasanind has joined #openstack-lbaas12:20
*** gongysh has joined #openstack-lbaas12:20
*** gongysh has quit IRC12:20
*** rtjure has quit IRC12:23
*** rtjure has joined #openstack-lbaas12:26
*** catintheroof has joined #openstack-lbaas12:30
*** gongysh has joined #openstack-lbaas12:35
*** sanfern has joined #openstack-lbaas12:35
*** leitan has joined #openstack-lbaas13:12
*** leyal has quit IRC13:18
*** leyal has joined #openstack-lbaas13:18
*** ajo has joined #openstack-lbaas13:19
*** cpusmith has joined #openstack-lbaas13:34
*** cpusmith_ has joined #openstack-lbaas13:36
*** cpusmith has quit IRC13:40
*** Alex_Staf has quit IRC13:51
*** Alex_Staf has joined #openstack-lbaas13:53
*** mdavidson has quit IRC14:26
*** ajo has quit IRC14:30
*** armax has joined #openstack-lbaas14:31
*** mdavidson has joined #openstack-lbaas14:35
*** fnaval_ has quit IRC14:38
openstackgerritMichael Johnson proposed openstack/octavia master: Update reno for stable/pike  https://review.openstack.org/49287514:47
*** rcernin has quit IRC14:57
*** fnaval has joined #openstack-lbaas15:00
*** xingzhang has joined #openstack-lbaas15:04
*** xingzhang has quit IRC15:09
*** armax has quit IRC16:04
*** gongysh has quit IRC16:19
*** amotoki is now known as amotoki__away16:30
*** ajo has joined #openstack-lbaas16:40
*** tesseract has quit IRC16:41
*** ajo has quit IRC17:04
*** sanfern has quit IRC17:06
*** ajo has joined #openstack-lbaas17:09
*** ajo has quit IRC17:11
openstackgerritMerged openstack/octavia master: Update reno for stable/pike  https://review.openstack.org/49287517:38
*** sshank has joined #openstack-lbaas17:39
*** yamamoto has quit IRC17:43
*** sshank has quit IRC17:44
*** ajo has joined #openstack-lbaas17:46
*** sshank has joined #openstack-lbaas17:55
*** Alex_Staf has quit IRC18:00
*** leitan has quit IRC18:12
johnsomInfra is still having problems.  Our release notes are borked18:14
johnsomhttp://logs.openstack.org/69/690ccfd43fb141c26652e119f1d702b65414a194/post/octavia-releasenotes/238e6ab/console.html#_2017-08-11_17_50_10_43934318:14
xgerman_yeah, my OSA thing is stuck, too18:18
openstackgerritMichael Johnson proposed openstack/octavia master: DO NOT MERGE: Testing log retrieval from amphora  https://review.openstack.org/49199718:34
*** yamamoto has joined #openstack-lbaas18:44
*** yamamoto has quit IRC18:49
*** sshank has quit IRC19:08
*** leitan has joined #openstack-lbaas19:10
*** sshank has joined #openstack-lbaas19:22
*** sshank has quit IRC19:26
*** leitan_ has joined #openstack-lbaas19:31
*** leitan has quit IRC19:31
*** leitan_ has quit IRC19:33
*** gcheresh has joined #openstack-lbaas19:41
*** yamamoto has joined #openstack-lbaas19:46
*** leitan has joined #openstack-lbaas19:50
*** yamamoto has quit IRC19:55
*** leitan_ has joined #openstack-lbaas20:00
*** leitan has quit IRC20:03
openstackgerritMerged openstack/python-octaviaclient master: Lowercase vip_Address in return  https://review.openstack.org/49233020:09
*** catintheroof has quit IRC20:15
*** leitan_ has quit IRC20:49
*** sshank has joined #openstack-lbaas21:01
*** gcheresh has quit IRC21:08
*** atoth has quit IRC21:25
rm_workxgerman_ / johnsom: periodically i'm seeing this, either of you see it?21:56
rm_workhttp://paste.openstack.org/show/618231/21:56
rm_workwhen it happens, it seems to happen on every one of the HM processes running21:56
johnsomEvery one?21:57
rm_worki mean, it happens once21:57
xgerman_nope, haven't seen that so far21:57
rm_workand it prints that on all 6 of the HM processes I'm running21:57
johnsomI wonder if it is a side effect of having > 1 o-hm21:57
rm_worklike ... all 6 of them tried to do the same lock21:57
johnsomI wonder if it is a side effect of having > 1 o-hk21:57
rm_workI do have exactly one o-hk actually21:58
rm_workbut 6 o-hm21:58
johnsomOh, sorry, right o-hm21:58
johnsomStill context switching here21:58
rm_workheh21:58
johnsomJust about have cracked the OSC plugin error issue21:58
rm_workah nice21:59
rm_workgodspeed21:59
johnsomMostly fine tuning at this point22:00
johnsomSo, hmmm, that should be a fairly short lived query for update....22:01
johnsomI wonder what your lock timeout is....22:01
rm_workyeah, wonder if the DB being slow for some reason could do it22:01
rm_worki can find out22:01
rm_workit's my own shitty percona-extradb cluster22:02
rm_workrunning on small VMs lol22:02
johnsomWell, that and if you have a large number, we probably don't have proper indexes on that table.22:02
rm_workhmmmmm22:02
*** sshank has quit IRC22:03
*** cpusmith_ has quit IRC22:03
johnsomCan you run a manual SQL query?22:03
rm_workyup yup22:04
johnsomGive me a minute to build a query22:05
johnsomselect * from amphora_health where busy = 0 and last_update < now();22:09
johnsomSee how long that takes22:09
johnsomTechnically it should be select * from amphora_health where busy = 0 and last_update < now() LIMIT 1; but I don't trust sqlalchemy to not pull them all back and THEN give you the first()22:11
johnsomshow variables like 'innodb_lock_wait_timeout';22:13
johnsomWould give you the timeout, mine is 50 which is seconds.22:13
rm_workalso 5022:15
rm_work0.1s22:15
rm_work*shruggie*22:15
rm_workmaybe something with replication taking some time22:16
rm_worknot sure22:16
rm_workit doesn't seem to happen very often22:16
rm_workand i haven't seen anything catastrophic happen because of it22:16
rm_workso22:16
*** sshank has joined #openstack-lbaas22:17
johnsomNo, it shouldn't cause a problem.  It just means the "deadlocked" transactions rolled back.22:18
johnsomSo, it would just go back and try again after the next sleep interval22:19
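[Editor's sketch] The retry behaviour johnsom describes (a rolled-back check is simply attempted again after the next sleep interval) might look roughly like the loop below. This is an illustration only, not Octavia's health manager code: `DeadlockError`, `run_health_checks`, and all of its parameters are hypothetical names standing in for the DBDeadlock error and the check loop discussed in this log.

```python
import time


class DeadlockError(Exception):
    """Hypothetical stand-in for a database deadlock error."""


def run_health_checks(check_once, interval, max_cycles, sleep=time.sleep):
    """Call check_once every `interval` seconds, tolerating deadlocks.

    A deadlock only costs the current cycle: the transaction has already
    been rolled back, so the loop counts it and tries again next cycle.
    Returns the number of cycles that ended in a deadlock.
    """
    deadlocks = 0
    for _ in range(max_cycles):
        try:
            check_once()
        except DeadlockError:
            deadlocks += 1  # nothing was committed; retry on the next pass
        sleep(interval)
    return deadlocks
```

The `sleep` argument is injectable only so the loop can be exercised without real delays.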
johnsomIt doesn't happen often?  Like how often?  I assume failovers are working....22:19
rm_workassuming that didn't immediately cause it to do a failover22:19
johnsomNo, it would at worst *NOT* cause failovers22:20
rm_workah yeah i wonder if that could be trying to do the lock for the busy flag?22:20
rm_workhmmm actually22:20
rm_workwhen i restarted the HMs, it did immediately do one failover <_22:20
rm_worki wonder if that had been pending the whole time22:20
rm_worklet me trigger a failover and see what happens22:21
johnsomIt does a select for update, looking for one amp that hasn't received a heartbeat in the interval, then marks it "busy" for further failover work.22:21
johnsomIt's this https://github.com/openstack/octavia/blob/master/octavia/db/repositories.py#L107222:21
johnsomThis could be more sqlalchemy transaction BS.  We could switch this out of the basic "session" to a lock session with auto commit disabled.22:22
johnsomI bet that is it.  SQLalchemy is being super dumb and NOT encapsulating those two into one transaction like it should.  That would make total sense actually of how the "deadlock" is happening.22:25
johnsomHM A does the select, HM B does select, HM A "autocommits" the select, HM B gets the lock, HM A goes to update the busy flag and sqlalchemy is being dumb and trying to re-lock.22:26
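[Editor's sketch] The select-then-mark-busy step johnsom describes only works if both statements run inside one transaction. The example below shows that shape with Python's built-in sqlite3 module as a stand-in: SQLite has no SELECT ... FOR UPDATE, so BEGIN IMMEDIATE is used here to take the write lock up front. The table and column names follow the query quoted in this log; `make_demo_db`, `claim_stale_amphora`, and the numeric timestamps are assumptions for illustration, not the Octavia repository code.

```python
import sqlite3


def make_demo_db():
    """Create an in-memory amphora_health table with two sample rows."""
    # isolation_level=None disables implicit transactions, so the explicit
    # BEGIN/COMMIT statements below are the only transaction boundaries.
    conn = sqlite3.connect(":memory:", isolation_level=None)
    conn.execute(
        "CREATE TABLE amphora_health ("
        "amphora_id TEXT PRIMARY KEY, last_update REAL, busy INTEGER)"
    )
    conn.executemany(
        "INSERT INTO amphora_health VALUES (?, ?, 0)",
        [("amp-1", 100.0), ("amp-2", 900.0)],
    )
    return conn


def claim_stale_amphora(conn, cutoff):
    """Pick one non-busy row older than cutoff and mark it busy."""
    # Take the write lock before reading, so the SELECT and the UPDATE
    # cannot be interleaved with another writer's claim.
    conn.execute("BEGIN IMMEDIATE")
    try:
        row = conn.execute(
            "SELECT amphora_id FROM amphora_health "
            "WHERE busy = 0 AND last_update < ? LIMIT 1",
            (cutoff,),
        ).fetchone()
        if row is not None:
            conn.execute(
                "UPDATE amphora_health SET busy = 1 WHERE amphora_id = ?",
                (row[0],),
            )
        conn.execute("COMMIT")
        return row[0] if row is not None else None
    except Exception:
        conn.execute("ROLLBACK")
        raise
```

With the sample data, a cutoff of 500.0 claims amp-1 once; a second call with the same cutoff finds nothing left to claim.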
rm_workummmm22:26
johnsomrm_work Want me to push a fix?22:26
rm_worki don't think failovers are happening22:26
johnsomTo try at least?22:26
rm_workah i haven't read what you said22:26
rm_workAH22:27
rm_workone of them printed this too at the end:22:27
rm_work[SQL: u'SELECT amphora_health.amphora_id AS amphora_health_amphora_id, amphora_health.last_update AS amphora_health_last_update, amphora_health.busy AS amphora_health_busy \nFROM amphora_health \nWHERE amphora_health.busy = false AND amphora_health.last_update < %(last_update_1)s \n LIMIT %(param_1)s FOR UPDATE'] [parameters: {u'param_1': 1, u'last_update_1': datetime.datetime(2017, 8, 11, 21, 58, 59, 441736)}]22:27
johnsomWell, don't forget the default timeout before failover triggers is a bit long by default22:27
rm_workjohnsom: i set the time to like 12 hours ago22:27
johnsomCheck interval22:27
rm_workyeah but....22:27
rm_worki've tried it a few times22:27
rm_workit keeps getting updated before it can actually trigger22:27
johnsomYeah22:28
johnsomrm_work So test patch or no?22:29
rm_workhmm maybe22:29
rm_worklet me delete this VM22:29
rm_workand see if i can get it to trigger22:29
johnsomThat will work....22:29
*** leitan has joined #openstack-lbaas22:30
rm_workummm yeah22:31
rm_worki think once that deadlock happens once22:31
rm_workit stops doing failover checks22:31
rm_workand it happens almost instantly22:31
rm_workafter starting the services22:32
rm_work:/22:34
johnsomYeah, I could see how if sqlalchemy is dumb that could happen.  Running tox now22:37
rm_workwhat's the change22:39
rm_worki'm applying it by hand anyway22:39
rm_workto test22:39
rm_workjohnsom:22:43
johnsomJust a sec, making sure this is just a test issue and not something else22:43
*** kbyrne has joined #openstack-lbaas22:44
openstackgerritMichael Johnson proposed openstack/octavia master: Fix health monitor DB locking.  https://review.openstack.org/49325222:44
johnsomThere you go, there is a unit test false positive I need to fix (side effect is using None, which doesn't work with this change)22:45
*** sshank has quit IRC22:45
rm_worknope22:52
rm_workDBDeadlock: (pymysql.err.InternalError) (1213, u'WSREP detected deadlock/conflict and aborted the transaction. Try restarting the transaction') [SQL: u'SELECT amphora_health.amphora_id AS amphora_health_amphora_id, amphora_health.last_update AS amphora_health_last_update, amphora_health.busy AS amphora_health_busy \nFROM amphora_health \nWHERE amphora_health.busy = false AND amphora_health.last_update < %(last_update_1)s \n LIMIT22:52
rm_work %(param_1)s FOR UPDATE'] [parameters: {u'param_1': 1, u'last_update_1': datetime.datetime(2017, 8, 11, 22, 51, 18, 168861)}]22:52
rm_workstill got that22:52
johnsomYou updated all of the o-hm's?22:53
rm_workyes22:53
rm_workhttp://paste.openstack.org/show/618232/22:54
rm_worktried again just now too22:54
rm_workseems to happen every failover <_<22:54
openstackgerritMichael Johnson proposed openstack/octavia master: Fix health monitor DB locking.  https://review.openstack.org/49325222:55
johnsomHmmm, ok, try this,22:55
rm_workoh wait hold on22:55
johnsomI didn't like that subtransaction BS anyway22:55
rm_worki missed one thing22:55
rm_workoh, no i didn't22:55
rm_workk yeah trying the new thing22:56
rm_workworse22:58
rm_workmuch worse22:58
rm_worklol22:58
*** ssmith has joined #openstack-lbaas22:58
rm_workhttp://paste.openstack.org/show/618233/22:58
johnsomAh, yeah, the oslo_db thing...  Just a sec22:59
openstackgerritMichael Johnson proposed openstack/octavia master: Fix health monitor DB locking.  https://review.openstack.org/49325223:02
johnsomForgot that it auto-magically starts the session on your behalf23:02
johnsomrm_work give that a spin23:03
rm_workyeah23:03
rm_workit's spinning currently23:03
johnsom+123:03
rm_workerg23:04
rm_workhttp://paste.openstack.org/show/618234/23:05
rm_workah the line numbers are off by one in health_manager because i didn't use your blank line23:05
johnsomThat is ok23:06
rm_workyeah just letting you know23:06
rm_workin case it was confusing23:06
johnsomWell, maybe it's against the update thread....23:08
johnsomCan you grab SHOW ENGINE INNODB STATUS  from the DB after that happens?23:09
rm_workwhat are you looking for in this23:14
rm_workit's hufe23:14
rm_work*huge23:14
johnsomThere should be a locks and/or deadlocks section.23:14
johnsom(I haven't seen one in a while)23:14
rm_workhmm23:15
*** fnaval has quit IRC23:16
rm_workhttp://paste.openstack.org/show/618235/23:17
rm_workwhelp this is kinda shitty23:27
johnsomDoing another spin23:30
johnsom3 minutes23:30
johnsomThere is another report we can run on mysql, but it dumps to the mysql error log which I assume you don't have access to23:31
rm_worki am (g)root23:31
openstackgerritMichael Johnson proposed openstack/octavia master: Fix health monitor DB locking.  https://review.openstack.org/49325223:31
johnsomOk, so that wraps the only other place that touches that table in a non-autocommit as well23:32
rm_workone advantage of spinning my very own snowflake of a sql cluster23:32
*** sshank has joined #openstack-lbaas23:32
johnsomWell, give that a go, then we will dump some logs23:33
rm_workstill deadlocking23:35
johnsomSoB23:35
johnsomOk, so mysql fun23:36
johnsomSET GLOBAL innodb_print_all_deadlocks = 'ON';23:37
johnsomRe-trigger the deadlock, then in the mysql error log it should have dumped details23:37
rm_workretriggered23:40
rm_workwhere's the error log... *looks*23:40
johnsom /var/log/mysql/octavia/error23:41
johnsom /var/log/mysql/octavia/error.log23:41
rm_workerrr23:41
rm_workthe mysql server?23:41
rm_workdoes it know about octavia? lol23:41
johnsomoctavia is the database name23:42
johnsomoh, maybe it is just /var/log/mysql23:42
rm_workuhh23:42
rm_workhmmm23:42
johnsomit's under /var/lib/mysql/octavia for the data files23:42
rm_workmaybe percona is different23:42
johnsomYeah, check /var/lib/mysql/octavia23:42
johnsomor just /var/lib/mysql23:43
rm_workah some stuff here23:43
rm_workunrelated but this is interesting23:44
rm_work2017-08-11T23:43:49.406278Z 80453 [Warning] InnoDB: Cannot add field `l7rule_3_value_273` in table `tmp`.`#sql_2a5a_0` because after adding it, the row size is 8127 which is greater than maximum allowed size (8126) for a record on index leaf page.23:44
johnsomYeah, I have that too23:44
johnsomhttps://bugs.mysql.com/bug.php?id=7739823:44
johnsomNo good answer there however23:45
rm_workhmmm23:46
rm_worknot seeing it23:46
rm_workmaybe need to set that on every server23:46
rm_workfuntimes23:47
xgerman_if we only stuck to postgres…23:47
johnsomHahahaha23:47
