Thursday, 2017-05-25

*** r-daneel has quit IRC00:20
*** zhangguoqing has joined #openstack-telemetry01:02
*** links has joined #openstack-telemetry01:30
*** lhx__ has joined #openstack-telemetry01:39
*** thorst has joined #openstack-telemetry01:46
*** tovin07_ has joined #openstack-telemetry01:51
*** vint_bra has joined #openstack-telemetry01:59
*** thorst has quit IRC02:02
*** zhurong has joined #openstack-telemetry02:05
*** thorst has joined #openstack-telemetry02:14
*** donghao has joined #openstack-telemetry02:34
*** thorst has quit IRC02:37
*** donghao has quit IRC02:38
*** toddnni has quit IRC03:10
*** vint_bra has quit IRC03:21
*** daidv_ has joined #openstack-telemetry03:55
*** joadavis has quit IRC04:02
*** zhurong has quit IRC04:08
*** thorst has joined #openstack-telemetry04:16
*** zhurong has joined #openstack-telemetry04:16
*** thorst has quit IRC04:18
*** joadavis has joined #openstack-telemetry04:22
*** thorst has joined #openstack-telemetry04:26
*** Tamayo has quit IRC04:37
*** thorst has quit IRC04:38
*** rwsu has joined #openstack-telemetry05:27
*** donghao has joined #openstack-telemetry05:37
*** thorst has joined #openstack-telemetry05:38
*** donghao has quit IRC05:41
*** thorst has quit IRC05:42
*** joadavis has quit IRC05:45
*** joadavis has joined #openstack-telemetry05:57
*** yprokule has joined #openstack-telemetry06:05
*** thorst has joined #openstack-telemetry06:09
*** aagate has quit IRC06:17
*** rcernin has joined #openstack-telemetry06:18
*** pcaruana has joined #openstack-telemetry06:20
*** thorst has quit IRC06:27
*** lhx__ has quit IRC06:59
*** lhx__ has joined #openstack-telemetry07:00
*** shardy has joined #openstack-telemetry07:10
*** dixiaoli has joined #openstack-telemetry07:13
*** yprokule_ has joined #openstack-telemetry07:20
*** yprokule has quit IRC07:22
*** yprokule_ is now known as yprokule07:22
*** thorst has joined #openstack-telemetry07:24
*** thorst has quit IRC07:29
*** zhurong has quit IRC07:30
*** donghao has joined #openstack-telemetry07:38
*** donghao has quit IRC07:43
*** zhurong has joined #openstack-telemetry07:45
*** daidv_ has quit IRC08:03
*** lhx__ has quit IRC08:07
*** lhx__ has joined #openstack-telemetry08:08
*** sheel has joined #openstack-telemetry08:13
*** lhx__ has quit IRC08:20
*** lhx__ has joined #openstack-telemetry08:20
*** thorst has joined #openstack-telemetry08:25
*** efoley__ has joined #openstack-telemetry08:26
*** efoley_ has joined #openstack-telemetry08:28
*** thorst has quit IRC08:29
*** efoley__ has quit IRC08:31
*** yprokule_ has joined #openstack-telemetry08:35
*** yprokule has quit IRC08:38
*** yprokule_ is now known as yprokule08:38
*** zhurong has quit IRC08:54
*** zhangguoqing has quit IRC09:04
*** zhurong has joined #openstack-telemetry09:04
*** thorst has joined #openstack-telemetry09:26
*** thorst has quit IRC09:30
*** zhurong has quit IRC09:35
*** zhangguoqing has joined #openstack-telemetry09:36
*** cdent has joined #openstack-telemetry09:46
*** adriant has quit IRC09:58
*** tovin07_ has quit IRC10:00
*** sheel has quit IRC10:22
*** thorst has joined #openstack-telemetry10:27
*** thorst has quit IRC10:31
*** dixiaoli has quit IRC10:39
*** lhx__ has quit IRC11:02
*** lhx__ has joined #openstack-telemetry11:03
*** pradk has joined #openstack-telemetry11:48
*** thorst has joined #openstack-telemetry11:58
*** lhx__ has quit IRC12:21
*** efoley__ has joined #openstack-telemetry12:35
*** efoley_ has quit IRC12:39
*** links has quit IRC12:54
*** fguillot has joined #openstack-telemetry12:55
*** chlong has quit IRC13:06
*** efoley_ has joined #openstack-telemetry13:19
*** dr_gogeta86 has joined #openstack-telemetry13:22
dr_gogeta86hi13:22
*** efoley__ has quit IRC13:23
dr_gogeta86does someone here know why my gnocchi metrics are 10 minutes behind?13:23
*** dixiaoli has joined #openstack-telemetry13:26
*** dixiaoli has quit IRC13:28
*** gordc has joined #openstack-telemetry13:35
*** donghao has joined #openstack-telemetry13:46
*** donghao has quit IRC13:51
*** lhx__ has joined #openstack-telemetry14:15
*** r-daneel has joined #openstack-telemetry14:25
*** chlong has joined #openstack-telemetry14:27
*** vint_bra has joined #openstack-telemetry14:27
openstackgerritgordon chung proposed openstack/panko master: support uwsgi  https://review.openstack.org/46779614:40
*** thorst_ has joined #openstack-telemetry14:49
*** thorst has quit IRC14:51
*** dave-mccowan has joined #openstack-telemetry15:00
akrzosdr_gogeta86: have you tried passing --refresh?15:03
akrzosalso what's your polling interval?15:03
dr_gogeta86akrzos, I'm pretty much a noob15:04
*** zhangguoqing has quit IRC15:04
dr_gogeta86so where do I put --refresh?15:04
akrzoswhat is grabbing your metrics?15:07
akrzosare you determining this from the cli?15:07
*** shardy has quit IRC15:09
gordcdr_gogeta86: what do you mean 10mins behind? gnocchi aggregates/'rolls-up' data based on what archive policy you choose/create... maybe that's why?15:14
gordcor do you mean something with backlog?15:15
dr_gogeta86gordc, i think there are some issues15:15
dr_gogeta86http://paste.openstack.org/show/610652/15:17
gordcwhat version are you using?15:18
dr_gogeta86newton15:19
gordcgnocchi doesn't follow openstack release cycle... did you install using rpm?15:21
gordcassuming it's gnocchi3, it has some scheduling issues (especially for ceph driver)15:22
gordcif you're using gnocchi3+ceph, you should probably increase: https://github.com/openstack/gnocchi/blob/stable/3.1/gnocchi/cli.py#L15415:22
gordcif you're using gnocchi2, don't15:22
gordcif you're using gnocchi4 (unreleased), you'll need to add more metricd workers.15:23
gordcin all cases, you can add more workers.15:23
dr_gogeta86i think it is gnocchi315:28
dr_gogeta86and I have ceph driver15:28
gordcdr_gogeta86: kk, i would start by 'hacking' code and increasing the value (try maybe 128 instead of 16)15:29
gordcyou'll need to restart all your metricd agents after that15:29
pradkHi gordc , quick question , when i create a network(in ocata with collector), i'm not seeing the network resources show up in gnocchi .. https://paste.fedoraproject.org/paste/22j3kwP6ZNKSiyXhyyjljl5M1UNdIGYhyRLivL9gydE=15:32
pradkgordc, anything else i need to do? other resources show up fine15:32
gordcpradk: um... do those meters get generated? i think only bandwidth is a non 'volume=1' metric15:35
gordcand your pipeline is set to ignore bandwidth15:35
pradki see we removed the disable_non_metric_meters in conf, i didn't set that15:35
gordcit'll be disabled by default by newton i think (i can't remember when we removed them completely)15:36
pradkhmm lemme check pipeline15:36
gordcseems like it's still there in newton: https://github.com/openstack/ceilometer/blob/stable/newton/ceilometer/network/notifications.py but they'd be disabled by default.15:37
gordcand yeah, your pipeline would only allow network.* meters and block everything else.15:37
pradkok so we dont generate network.create etc anymore15:37
pradkthis is ocata15:37
pradki dont see that file15:37
gordcin ocata. no.15:37
pradkin ocata15:37
pradkk that explains it15:38
pradkthx15:38
gordcoh sorry, read newton for some reason15:38
*** pcaruana has quit IRC15:38
gordcso yeah, i think only bandwidth exists... and it's blocked in your pipeline def.15:38
*** shardy has joined #openstack-telemetry15:39
*** pcaruana has joined #openstack-telemetry15:39
*** thorst_ has quit IRC15:39
dr_gogeta86gordc i think 128 is too low15:41
gordcdr_gogeta86: maybe? it's really dependent on your measurements:metric ratio in your backlog (only for ceph driver)15:42
*** jmlowe has quit IRC15:42
dr_gogeta86now it's lower ... and going down15:44
dr_gogeta86is it possible to go under 5 minutes?15:45
gordcthe ceph driver in gnocchi3 has a flaw depending on how bad that ratio gets... for the ceph driver specifically, you should aim to keep that ratio low.15:45
gordcdr_gogeta86: what are you referring to when you say 'go under 5 minutes'?15:45
dr_gogeta86http://paste.openstack.org/show/610655/15:47
dr_gogeta86this15:47
dr_gogeta86now is better15:47
gordcdr_gogeta86: yeah, you need to choose a new archive policy or create your own. remember, gnocchi does not store raw data, but aggregated data so it saves space.15:50
gordcdr_gogeta86: http://gnocchi.xyz/running.html#how-to-define-archive-policies15:50
dr_gogeta86i know15:51
dr_gogeta86it's a thing i love15:51
gordchttp://gnocchi.xyz/3.1.5/rest.html#archive-policy here's how to create your own15:52
*** efoley_ has quit IRC15:52
gordcdr_gogeta86: you'll love gnocchi4 even more... it solves the issues you have right now ;)15:53
gordcyou can modify the existing archive policy you're using to maintain current metrics... or you can create new metrics with a new archive policy15:54
dr_gogeta86gordc, I hope15:56
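gordc's point about archive policies can be illustrated with a small sketch. It assumes the "5 minutes" above refers to a 300-second granularity, and it is not Gnocchi's own code: `bucket_start` is a hypothetical helper that only shows how a measure is attributed to the start of its aggregation bucket, so a coarser granularity makes the newest visible point look older.

```python
from datetime import datetime, timedelta, timezone


def bucket_start(ts: datetime, granularity: timedelta) -> datetime:
    """Floor a timestamp to the start of its aggregation bucket.

    Hypothetical helper for illustration; buckets are aligned on the
    Unix epoch, which is an assumption of this sketch.
    """
    epoch = datetime(1970, 1, 1, tzinfo=timezone.utc)
    buckets = (ts - epoch) // granularity  # timedelta // timedelta gives an int
    return epoch + buckets * granularity


measure_time = datetime(2017, 5, 25, 15, 47, 30, tzinfo=timezone.utc)

# With a 5-minute granularity the measure is reported at 15:45:00.
print(bucket_start(measure_time, timedelta(minutes=5)))

# With a 1-minute granularity the same measure is reported at 15:47:00.
print(bucket_start(measure_time, timedelta(minutes=1)))
```

A finer granularity keeps the reported timestamps closer to the raw measures, at the cost of storing more points per metric.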
*** Tamayo has joined #openstack-telemetry15:59
*** thorst has joined #openstack-telemetry16:00
*** thorst has quit IRC16:02
*** thorst has joined #openstack-telemetry16:04
*** cdent has quit IRC16:22
*** shardy has quit IRC16:28
*** pcaruana has quit IRC16:30
*** pcaruana has joined #openstack-telemetry16:30
*** yprokule has quit IRC16:39
*** rcernin has quit IRC16:40
*** lhx__ has quit IRC16:50
*** jmlowe has joined #openstack-telemetry17:16
*** jmlowe has quit IRC17:16
*** jmlowe has joined #openstack-telemetry17:17
*** aagate has joined #openstack-telemetry17:40
*** donghao has joined #openstack-telemetry18:08
*** donghao has quit IRC18:13
openstackgerritgordon chung proposed openstack/panko master: let requirements handle pbr version  https://review.openstack.org/46812118:16
*** jmlowe has quit IRC18:28
*** jmlowe has joined #openstack-telemetry18:35
*** Tamayo has quit IRC18:39
*** jmlowe has quit IRC18:48
*** jmlowe has joined #openstack-telemetry18:51
*** jobewan has joined #openstack-telemetry18:56
openstackgerritgordon chung proposed openstack/panko master: pass only database options  https://review.openstack.org/46813318:58
openstackgerritgordon chung proposed openstack/panko master: storage: Ensure pymysql is prefered  https://review.openstack.org/46813519:02
*** pcaruana has quit IRC19:09
*** cdent has joined #openstack-telemetry19:32
*** fguillot has quit IRC20:00
*** cdent has quit IRC20:02
*** chlong has quit IRC20:15
*** jmlowe has quit IRC20:18
*** chlong has joined #openstack-telemetry20:33
*** chlong has quit IRC20:37
*** chlong has joined #openstack-telemetry20:38
pradkgordc, seen this error? ERROR gnocchi.storage.incoming._carbonara [-] Unable to detect the number of storage sacks. Ensure gnocchi-upgrade has been executed: int() argument must be a string or a number, not 'NoneType'20:45
pradkgordc, gnocchi-upgrade is run from what i can tell from the logs20:45
gordcpradk: seems you didn't?20:47
gordci can't recall if it happens in any other scenario. although you did remind me of another bug20:47
gordcbut that's related to running gnocchi-sack-changer before gnocchi-upgrade20:48
pradkgordc, hmm so now we need to run gnocchi-sack-changer ?20:49
gordcpradk: no. that's optional20:49
pradki'm seeing this in app.log http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/app.log.txt.gz20:49
gordcyou can run gnocchi and say gnocchi-upgrade --num-storage-sacks <blah> to set it on upgrade or it will just put 128 sacks by default if you don't20:50
gordc...20:51
pradkhttp://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/gnocchi-upgrade.log.txt.gz20:51
pradkis the gnocchi-upgrade log20:51
gordcpradk: i was going to ask if it was swift. lol, i didn't try swift or s320:51
pradkyea its configured to use swift20:51
gordcweird, seems it didn't create the setting object20:53
gordcdamn. don't have swift enabled in my devstack20:56
gordcpradk: that is pretty strange that you still get a report http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/metricd.log.txt.gz#_2017-05-24_23_01_31_21020:58
pradkyea its just the api is down20:58
gordcbut the report needs to be able to detect number of storage sacks too20:59
gordcso not sure how metricd is detecting it but not api20:59
gordcpradk: also, it's strange your upgrade logs end long after app.py logs have already started21:01
pradkso storage sack is created when gnocchi-upgrade initializes storage? swift or ceph?21:01
*** vint_bra has quit IRC21:01
gordcpradk: gnocchi(v4)-upgrade will create a settings object with storage sack value21:02
pradkso even if the api is starting before, it seems to retry? eventually it should have found it?21:02
*** vint_bra has joined #openstack-telemetry21:02
gordcerrr... well your metricd found it21:02
pradksince metricd is obviously21:02
pradkright21:03
gordcbut your metricd is started after gnocchi-upgrade21:03
gordcyour api was started way before metricd according to logs21:03
gordcand gnocchi-upgrade was run multiple times as well it seems.21:03
gordci'm not sure api is smart enough to recover if it doesn't find settings file on startup21:04
gordcprobably should create a bug for that.21:04
pradkright so we do this in multiple steps .. gnocchi-upgrade is first run with skip-storage as swift might not be up yet .. then api starts in the same step .. we rerun upgrade with storage in step 5, after which metricd and statsd come up21:05
pradkso this whole sack thing is new looks like as this works fine in ocata21:06
gordcwhy would you start api if swift isn't up yet and you skip-storage upgrade?21:07
gordcthat means you're enabling ability to write to something that isn't up and not upgraded21:07
pradksince we start gnocchi api in apache, it just comes up earlier with other apache services21:08
pradkthis is going to be tricky.. i need to check if swift is up at this stage, if so, maybe we can remove skip-storage now21:08
pradklemme poke around.. but that makes sense to you on why we're seeing this error? due to skipping storage during upgrade?21:09
gordcyeah, you shouldn't be enabling api to accept writes to something that is down and/or not upgraded.21:09
gordcpradk: that repeated error is actually it trying to resolve itself which means it's not recovering for some reason21:09
gordchttps://github.com/gnocchixyz/gnocchi/blob/master/gnocchi/storage/incoming/_carbonara.py#L44-L4921:09
pradkright thats puzzling.. if its trying to recover it should have found the sack after step521:10
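The retry behaviour gordc describes can be sketched as a lazily resolved property. This is a minimal illustration modelled on the error message quoted earlier, not a copy of the linked `_carbonara.py` code: `IncomingStorage`, `read_setting`, and `num_sacks` are hypothetical names, and the settings object is simulated with a plain dict.

```python
import logging

LOG = logging.getLogger(__name__)


class SackDetectionError(Exception):
    """Raised while the number of storage sacks cannot be determined."""


class IncomingStorage:
    """Hypothetical stand-in for an incoming-measures storage driver."""

    def __init__(self, read_setting):
        # read_setting: callable returning the stored sack count, or None
        # when the settings object has not been created yet (assumption).
        self._read_setting = read_setting
        self._num_sacks = None

    @property
    def num_sacks(self):
        # Nothing is cached on failure, so every access tries again.
        if self._num_sacks is None:
            try:
                self._num_sacks = int(self._read_setting())
            except Exception as e:
                LOG.error("Unable to detect the number of storage sacks. "
                          "Ensure gnocchi-upgrade has been executed: %s", e)
                raise SackDetectionError() from e
        return self._num_sacks


settings = {}  # simulated backend: the settings object does not exist yet
storage = IncomingStorage(lambda: settings.get("sacks"))

try:
    storage.num_sacks
except SackDetectionError:
    print("not detected yet")

settings["sacks"] = "128"  # simulates the storage upgrade creating the object
print(storage.num_sacks)
```

Under these assumptions the error clears on the first access after the settings object appears, which is why a repeated error after the storage upgrade step is surprising.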
*** chlong has quit IRC21:11
*** toddnni has joined #openstack-telemetry21:11
gordcyea21:11
*** donghao has joined #openstack-telemetry21:11
gordcit does seem like your swift storage never gets upgraded successfully http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/gnocchi-upgrade.log.txt.gz#_2017-05-24_22_58_30_15821:13
gordcalthough i vaguely recall swiftclient has a default retry built in21:13
*** donghao has quit IRC21:15
pradkgordc, this used to work fine until recently.. so is there some new logic having it check the sacks now?21:20
gordcpradk: well it worked because you had nothing to upgrade21:21
pradkgordc, i shouldnt have anything to upgrade now either :)21:21
gordcthe sacks never existed until recently.21:21
pradkgordc, at least this is a fresh deploy job21:21
pradkah right ok, so its trying to upgrade sacks now21:22
gordcwell it's trying to create the framework to support sacks21:22
*** thorst has quit IRC21:22
gordcpreviously, it was just 1. start, 2. write measures to file21:22
gordcnow it's 1. start 2. load configuration (not from conf) 3. write measures to obj21:23
pradkok understood21:25
pradki guess if the retry worked as expected this would still be ok.. else i'll need to rework the ordering to get gnocchi api to start later21:25
gordci'm still not sure why the api can't find the setting object over time... it exists according to metricd21:25
gordcbut your api error says it doesn't.21:26
pradkyea21:26
gordcalthough, now that i think about it more, it also needs that storage_sacks configured to even know how to write. which it is (according to your metricd)21:27
gordcso some api does know about storage sacks... or your backlog would be empty as well21:27
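Why a writer needs the sack count before it can store anything can be shown with a short sketch. The modulo mapping and the name `sack_for_metric` are assumptions made for illustration; only the default of 128 sacks comes from the discussion above.

```python
import uuid
from typing import Optional

DEFAULT_NUM_SACKS = 128  # default mentioned in the discussion above


def sack_for_metric(metric_id: uuid.UUID, num_sacks: Optional[int]) -> int:
    """Hypothetical mapping: spread metrics over sacks by UUID modulo."""
    if num_sacks is None:
        # Without the stored sack count there is no way to pick a sack,
        # so nothing could be written to the backlog.
        raise ValueError("number of sacks unknown; has the upgrade run?")
    return metric_id.int % num_sacks


metric = uuid.UUID("00000000-0000-0000-0000-000000000081")  # integer value 129
print(sack_for_metric(metric, DEFAULT_NUM_SACKS))  # 129 % 128 -> 1
```

In this sketch a process that cannot read the sack count fails before writing, which matches the reasoning that a non-empty backlog implies at least one API did read it.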
pradkstorage_sacks configured is done as part of gnocchi-upgrade?21:28
*** ddyer has quit IRC21:29
gordcyes, initial value is set during upgrade21:29
gordcsome gnocchi-api knows about storage sacks... since you have a backlog.21:29
gordcjust for some reason, you have other gnocchi-api which don't know about it and are spewing those errors.21:30
pradkhmm there should be only one gnocchi api .. which is down21:31
gordc... then you got to wonder what is pushing http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/metricd.log.txt.gz#_2017-05-24_23_01_31_210 lol21:32
pradkhmm wait lemme check other controllers21:33
*** dave-mccowan has quit IRC21:33
pradkhttp://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-1/var/log/gnocchi/app.log.txt.gz21:33
pradkso its up on controller 121:33
gordcah21:34
pradkok so the api is up and running on controller-1 and 221:34
pradkjust 0 has this issue21:34
pradkthat explains the metricd backlog21:34
gordcmaybe controller-0 is pointing to a different swift?21:35
gordci mean in theory, i think if api does nothing, it won't show that original error21:36
gordcso it could be those apis haven't done anything21:37
gordcnm. httpd says it's doing stuff21:37
*** ddyer has joined #openstack-telemetry21:39
pradkgordc, yea i'm a bit confused why other nodes are starting fine but not the first21:45
gordcseems your conf files are all the same so it should be same swift21:45
pradkif storage sack is needed shouldnt all fail?21:45
gordcyep.lol21:45
gordcunless you have 3 different swifts and your upgrade is just upgrading node1/2 and not node0.lol21:46
pradki don't think so, because metricd on the first node finds it eventually .. i think it's just the ordering on the bootstrap node, which is the first one. at least i see the error consistently on controller 022:02
pradki'll investigate this further tomorrow.. i have a wip patch i'm testing22:13
pradkthx gordc22:13
gordcpradk: kk, feel free to ping tomorrow. pretty strange22:15
*** thorst has joined #openstack-telemetry22:20
*** thorst has quit IRC22:23
*** vint_bra has quit IRC22:36
openstackgerritgordon chung proposed openstack/panko master: add noauth api pipeline  https://review.openstack.org/46819122:42
*** thorst has joined #openstack-telemetry22:53
*** jobewan has quit IRC22:55
*** gordc has quit IRC22:57
*** thorst has quit IRC23:08
*** jmlowe has joined #openstack-telemetry23:16
