*** r-daneel has quit IRC | 00:20 | |
*** zhangguoqing has joined #openstack-telemetry | 01:02 | |
*** links has joined #openstack-telemetry | 01:30 | |
*** lhx__ has joined #openstack-telemetry | 01:39 | |
*** thorst has joined #openstack-telemetry | 01:46 | |
*** tovin07_ has joined #openstack-telemetry | 01:51 | |
*** vint_bra has joined #openstack-telemetry | 01:59 | |
*** thorst has quit IRC | 02:02 | |
*** zhurong has joined #openstack-telemetry | 02:05 | |
*** thorst has joined #openstack-telemetry | 02:14 | |
*** donghao has joined #openstack-telemetry | 02:34 | |
*** thorst has quit IRC | 02:37 | |
*** donghao has quit IRC | 02:38 | |
*** toddnni has quit IRC | 03:10 | |
*** vint_bra has quit IRC | 03:21 | |
*** daidv_ has joined #openstack-telemetry | 03:55 | |
*** joadavis has quit IRC | 04:02 | |
*** zhurong has quit IRC | 04:08 | |
*** thorst has joined #openstack-telemetry | 04:16 | |
*** zhurong has joined #openstack-telemetry | 04:16 | |
*** thorst has quit IRC | 04:18 | |
*** joadavis has joined #openstack-telemetry | 04:22 | |
*** thorst has joined #openstack-telemetry | 04:26 | |
*** Tamayo has quit IRC | 04:37 | |
*** thorst has quit IRC | 04:38 | |
*** rwsu has joined #openstack-telemetry | 05:27 | |
*** donghao has joined #openstack-telemetry | 05:37 | |
*** thorst has joined #openstack-telemetry | 05:38 | |
*** donghao has quit IRC | 05:41 | |
*** thorst has quit IRC | 05:42 | |
*** joadavis has quit IRC | 05:45 | |
*** joadavis has joined #openstack-telemetry | 05:57 | |
*** yprokule has joined #openstack-telemetry | 06:05 | |
*** thorst has joined #openstack-telemetry | 06:09 | |
*** aagate has quit IRC | 06:17 | |
*** rcernin has joined #openstack-telemetry | 06:18 | |
*** pcaruana has joined #openstack-telemetry | 06:20 | |
*** thorst has quit IRC | 06:27 | |
*** lhx__ has quit IRC | 06:59 | |
*** lhx__ has joined #openstack-telemetry | 07:00 | |
*** shardy has joined #openstack-telemetry | 07:10 | |
*** dixiaoli has joined #openstack-telemetry | 07:13 | |
*** yprokule_ has joined #openstack-telemetry | 07:20 | |
*** yprokule has quit IRC | 07:22 | |
*** yprokule_ is now known as yprokule | 07:22 | |
*** thorst has joined #openstack-telemetry | 07:24 | |
*** thorst has quit IRC | 07:29 | |
*** zhurong has quit IRC | 07:30 | |
*** donghao has joined #openstack-telemetry | 07:38 | |
*** donghao has quit IRC | 07:43 | |
*** zhurong has joined #openstack-telemetry | 07:45 | |
*** daidv_ has quit IRC | 08:03 | |
*** lhx__ has quit IRC | 08:07 | |
*** lhx__ has joined #openstack-telemetry | 08:08 | |
*** sheel has joined #openstack-telemetry | 08:13 | |
*** lhx__ has quit IRC | 08:20 | |
*** lhx__ has joined #openstack-telemetry | 08:20 | |
*** thorst has joined #openstack-telemetry | 08:25 | |
*** efoley__ has joined #openstack-telemetry | 08:26 | |
*** efoley_ has joined #openstack-telemetry | 08:28 | |
*** thorst has quit IRC | 08:29 | |
*** efoley__ has quit IRC | 08:31 | |
*** yprokule_ has joined #openstack-telemetry | 08:35 | |
*** yprokule has quit IRC | 08:38 | |
*** yprokule_ is now known as yprokule | 08:38 | |
*** zhurong has quit IRC | 08:54 | |
*** zhangguoqing has quit IRC | 09:04 | |
*** zhurong has joined #openstack-telemetry | 09:04 | |
*** thorst has joined #openstack-telemetry | 09:26 | |
*** thorst has quit IRC | 09:30 | |
*** zhurong has quit IRC | 09:35 | |
*** zhangguoqing has joined #openstack-telemetry | 09:36 | |
*** cdent has joined #openstack-telemetry | 09:46 | |
*** adriant has quit IRC | 09:58 | |
*** tovin07_ has quit IRC | 10:00 | |
*** sheel has quit IRC | 10:22 | |
*** thorst has joined #openstack-telemetry | 10:27 | |
*** thorst has quit IRC | 10:31 | |
*** dixiaoli has quit IRC | 10:39 | |
*** lhx__ has quit IRC | 11:02 | |
*** lhx__ has joined #openstack-telemetry | 11:03 | |
*** pradk has joined #openstack-telemetry | 11:48 | |
*** thorst has joined #openstack-telemetry | 11:58 | |
*** lhx__ has quit IRC | 12:21 | |
*** efoley__ has joined #openstack-telemetry | 12:35 | |
*** efoley_ has quit IRC | 12:39 | |
*** links has quit IRC | 12:54 | |
*** fguillot has joined #openstack-telemetry | 12:55 | |
*** chlong has quit IRC | 13:06 | |
*** efoley_ has joined #openstack-telemetry | 13:19 | |
*** dr_gogeta86 has joined #openstack-telemetry | 13:22 | |
dr_gogeta86 | hi | 13:22 |
---|---|---|
*** efoley__ has quit IRC | 13:23 | |
dr_gogeta86 | some one here knows why my gnocchi metrics are 10 minutes behind | 13:23 |
*** dixiaoli has joined #openstack-telemetry | 13:26 | |
*** dixiaoli has quit IRC | 13:28 | |
*** gordc has joined #openstack-telemetry | 13:35 | |
*** donghao has joined #openstack-telemetry | 13:46 | |
*** donghao has quit IRC | 13:51 | |
*** lhx__ has joined #openstack-telemetry | 14:15 | |
*** r-daneel has joined #openstack-telemetry | 14:25 | |
*** chlong has joined #openstack-telemetry | 14:27 | |
*** vint_bra has joined #openstack-telemetry | 14:27 | |
openstackgerrit | gordon chung proposed openstack/panko master: support uwsgi https://review.openstack.org/467796 | 14:40 |
*** thorst_ has joined #openstack-telemetry | 14:49 | |
*** thorst has quit IRC | 14:51 | |
*** dave-mccowan has joined #openstack-telemetry | 15:00 | |
akrzos | dr_gogeta86: have you tried passing --refresh? | 15:03 |
akrzos | also whats your polling interval? | 15:03 |
dr_gogeta86 | akrzos, I'm a pretty noob | 15:04 |
dr_gogeta86 | so where put --refres | 15:04 |
*** zhangguoqing has quit IRC | 15:04 | |
dr_gogeta86 | so where put --refresh | 15:04 |
akrzos | what is grabbing your metrics? | 15:07 |
akrzos | are you determining this from the cli? | 15:07 |
*** shardy has quit IRC | 15:09 | |
gordc | dr_gogeta86: what do you mean 10mins behind? gnocchi aggregates/'rolls-up' data based on what archival policy you choose/create... maybe that's why? | 15:14 |
gordc | or do you mean something with backlog? | 15:15 |
dr_gogeta86 | gordc, i think there is some issues | 15:15 |
dr_gogeta86 | http://paste.openstack.org/show/610652/ | 15:17 |
gordc | what version are you using? | 15:18 |
dr_gogeta86 | newton | 15:19 |
gordc | gnocchi doesn't follow openstack release cycle... did you install using rpm? | 15:21 |
gordc | assuming it's gnocchi3, it has some scheduling issues (especially for ceph driver) | 15:22 |
gordc | if you're using gnocchi3+ceph, you should probably increase: https://github.com/openstack/gnocchi/blob/stable/3.1/gnocchi/cli.py#L154 | 15:22 |
gordc | if you're using gnocchi2, don't | 15:22 |
gordc | if you're using gnocchi4 (unreleased), you'll need to add more metricd workers. | 15:23 |
gordc | in all cases, you can add more workers. | 15:23 |
dr_gogeta86 | i think is gnocchi3 | 15:28 |
dr_gogeta86 | and I have ceph driver | 15:28 |
gordc | dr_gogeta86: kk, i would start by 'hacking' code and increasing the value (try maybe 128 instead of 16) | 15:29 |
gordc | you'll need to restart all your metricd agents after that | 15:29 |
pradk | Hi gordc , quick question , when i create a network(in ocata with collector), i'm not seeing the network resources show up in gnocchi .. https://paste.fedoraproject.org/paste/22j3kwP6ZNKSiyXhyyjljl5M1UNdIGYhyRLivL9gydE= | 15:32 |
pradk | gordc, anything else i need to do? other resources show up fine | 15:32 |
gordc | pradk: um... do those meters get generated? i think only bandwidth is a non 'volume=1' metric | 15:35 |
gordc | and your pipeline is set to ignore bandwidt | 15:35 |
pradk | i see we removed the disable_non_metric_meters in conf, i dint set that | 15:35 |
gordc | it'll be disabled by default by newton i think (i can't remember when we removed them completely) | 15:36 |
pradk | hmm lemme check pipeline | 15:36 |
gordc | seems like it's still there in newton: https://github.com/openstack/ceilometer/blob/stable/newton/ceilometer/network/notifications.py but they'd be disable by default. | 15:37 |
gordc | and yeah, your pipeline would only allow network.* meters and block everything else. | 15:37 |
pradk | ok so we dont generate network.create etc anymore | 15:37 |
pradk | this is ocata | 15:37 |
pradk | i dont see that file | 15:37 |
gordc | in ocata. no. | 15:37 |
pradk | in ocata | 15:37 |
pradk | k that explains it | 15:38 |
pradk | thx | 15:38 |
gordc | oh sorry, read newton for some reason | 15:38 |
*** pcaruana has quit IRC | 15:38 | |
gordc | so yeah, i think only bandwidth exists... and it's blocked in your pipeline def. | 15:38 |
*** shardy has joined #openstack-telemetry | 15:39 | |
*** pcaruana has joined #openstack-telemetry | 15:39 | |
*** thorst_ has quit IRC | 15:39 | |
dr_gogeta86 | gordc i think 128 is too low | 15:41 |
gordc | dr_gogeta86: maybe? it's really dependent on your measurements:metric ratio in your backlog (only for ceph driver) | 15:42 |
*** jmlowe has quit IRC | 15:42 | |
dr_gogeta86 | now is lower ... and go down | 15:44 |
dr_gogeta86 | is possible is impossible to go under 5 minutes ? | 15:45 |
gordc | the ceph driver in gnocchi3 has a flaw depending on how bad that ratio gets... for the ceph drier specifically, you should aim to keep that ratio low. | 15:45 |
gordc | dr_gogeta86: what are you referring to when you say 'go under 5 minutes'? | 15:45 |
dr_gogeta86 | http://paste.openstack.org/show/610655/ | 15:47 |
dr_gogeta86 | this | 15:47 |
dr_gogeta86 | now is better | 15:47 |
gordc | dr_gogeta86: yeah, you need to choose a new archive policy or create your own. remember, gnocchi does not store raw data, but aggregated data so it saves space. | 15:50 |
gordc | dr_gogeta86: http://gnocchi.xyz/running.html#how-to-define-archive-policies | 15:50 |
dr_gogeta86 | i know | 15:51 |
dr_gogeta86 | is a thing i love | 15:51 |
gordc | http://gnocchi.xyz/3.1.5/rest.html#archive-policy here's how to create your own | 15:52 |
*** efoley_ has quit IRC | 15:52 | |
gordc | dr_gogeta86: you'll love gnocchi4 better than... it solves the issues you have right now ;) | 15:53 |
gordc | you'll can modify the existing archive policy you're using to maintain current metrics... or you can creat new metrics with new archive policy | 15:54 |
dr_gogeta86 | gordc, I hope | 15:56 |
*** Tamayo has joined #openstack-telemetry | 15:59 | |
*** thorst has joined #openstack-telemetry | 16:00 | |
*** thorst has quit IRC | 16:02 | |
*** thorst has joined #openstack-telemetry | 16:04 | |
*** cdent has quit IRC | 16:22 | |
*** shardy has quit IRC | 16:28 | |
*** pcaruana has quit IRC | 16:30 | |
*** pcaruana has joined #openstack-telemetry | 16:30 | |
*** yprokule has quit IRC | 16:39 | |
*** rcernin has quit IRC | 16:40 | |
*** lhx__ has quit IRC | 16:50 | |
*** jmlowe has joined #openstack-telemetry | 17:16 | |
*** jmlowe has quit IRC | 17:16 | |
*** jmlowe has joined #openstack-telemetry | 17:17 | |
*** aagate has joined #openstack-telemetry | 17:40 | |
*** donghao has joined #openstack-telemetry | 18:08 | |
*** donghao has quit IRC | 18:13 | |
openstackgerrit | gordon chung proposed openstack/panko master: let requirements handle pbr version https://review.openstack.org/468121 | 18:16 |
*** jmlowe has quit IRC | 18:28 | |
*** jmlowe has joined #openstack-telemetry | 18:35 | |
*** Tamayo has quit IRC | 18:39 | |
*** jmlowe has quit IRC | 18:48 | |
*** jmlowe has joined #openstack-telemetry | 18:51 | |
*** jobewan has joined #openstack-telemetry | 18:56 | |
openstackgerrit | gordon chung proposed openstack/panko master: pass only database options https://review.openstack.org/468133 | 18:58 |
openstackgerrit | gordon chung proposed openstack/panko master: storage: Ensure pymysql is prefered https://review.openstack.org/468135 | 19:02 |
*** pcaruana has quit IRC | 19:09 | |
*** cdent has joined #openstack-telemetry | 19:32 | |
*** fguillot has quit IRC | 20:00 | |
*** cdent has quit IRC | 20:02 | |
*** chlong has quit IRC | 20:15 | |
*** jmlowe has quit IRC | 20:18 | |
*** chlong has joined #openstack-telemetry | 20:33 | |
*** chlong has quit IRC | 20:37 | |
*** chlong has joined #openstack-telemetry | 20:38 | |
pradk | gordc, seen this error? ERROR gnocchi.storage.incoming._carbonara [-] Unable to detect the number of storage sacks. Ensure gnocchi-upgrade has been executed: int() argument must be a string or a number, not 'NoneType' | 20:45 |
pradk | gordc, gnocchi-upgrade is run from what i can tell from the logs | 20:45 |
gordc | pradk: seems you didn't? | 20:47 |
gordc | i can't recall if it happens in any other scenario. although you did remind me of another bug | 20:47 |
gordc | but that's related to running gnocchi-sack-changer before gnocchi-upgrade | 20:48 |
pradk | gordc, hmm so now we need to run gnocchi-sack-changer ? | 20:49 |
gordc | pradk: no. that's optional | 20:49 |
pradk | i'm seeing this in app.log http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/app.log.txt.gz | 20:49 |
gordc | you can run gnocchi and say gnocchi-upgrade --num-storage-sacks <blah> to set it on upgrade or it will just put 128 sacks by default if you don't | 20:50 |
gordc | ... | 20:51 |
pradk | http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/gnocchi-upgrade.log.txt.gz | 20:51 |
pradk | is the gnocchi-upgrade log | 20:51 |
gordc | pradk: i was going to ask if it was swift.lol i didn't try swift or s3 | 20:51 |
pradk | yea its configured to use swift | 20:51 |
gordc | weird, seems it didn't create the setting object | 20:53 |
gordc | damn. don't have swift enabled in my devstack | 20:56 |
gordc | pradk: that is pretty strange that you get report stil http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/metricd.log.txt.gz#_2017-05-24_23_01_31_210 | 20:58 |
pradk | yea its just the api is down | 20:58 |
gordc | but the report needs to be able to detect number of storage sacks too | 20:59 |
gordc | so not sure how metricd is detecting it but not api | 20:59 |
gordc | pradk: also, it's strange your upgrade logs end long after app.py logs have already started | 21:01 |
pradk | so storage sack is created when gnochci-upgrade initializes storage? swift or ceph/ | 21:01 |
*** vint_bra has quit IRC | 21:01 | |
gordc | pradk: gnocchi(v4)-upgrade will create a settings object with storage sack value | 21:02 |
pradk | so even if the api is starting before, it seems to retry? eventually it should have found it? | 21:02 |
*** vint_bra has joined #openstack-telemetry | 21:02 | |
gordc | errr... well your metricd found it | 21:02 |
pradk | since metricd is obviously | 21:02 |
pradk | right | 21:03 |
gordc | but your metricd is started after gnocchi-upgrade | 21:03 |
gordc | yout api was started way before metricd according to logs | 21:03 |
gordc | and gnocchi-upgrade was run multiple times as well it seems. | 21:03 |
gordc | i'm not sure api is smart enough to recover if it doesn't find settings file on startup | 21:04 |
gordc | probably should create a but for that. | 21:04 |
pradk | right so we do this in multiple steps .. gnochci-upgrade is first run with skip-storage as swift might not be up yet .. then api starts in the same step .. we rerun upgrade with stoarge ins tep 5after which metricd and statsd come up | 21:05 |
pradk | so this whole sack thing is new looks like as this works fine in ocata | 21:06 |
gordc | why would you start api if swift isn't up yet and you skip-storage upgrade? | 21:07 |
gordc | that means you're enabling ability to write to something that isn't up and not upgraded | 21:07 |
pradk | since we start gnocchi api in apache, it just comes up earlier with other apache services | 21:08 |
pradk | this is going to be trick.. i need to check if swift is up at this stage, if so, may be we can remove skip-storage now | 21:08 |
pradk | lemme poke around.. but that makes sense to you on why we;'re seeing this error? due to skipping storage during upgrade? | 21:09 |
gordc | yeah, you shouldn't be enabling api to accept writes to something that is down and/or not upgraded. | 21:09 |
gordc | pradk: that repeated error is actually it trying to resolve itself which means it's not recovering for some reason | 21:09 |
gordc | https://github.com/gnocchixyz/gnocchi/blob/master/gnocchi/storage/incoming/_carbonara.py#L44-L49 | 21:09 |
pradk | right thats puzzling.. if its trying to recover it should have found the sack after step5 | 21:10 |
*** chlong has quit IRC | 21:11 | |
*** toddnni has joined #openstack-telemetry | 21:11 | |
gordc | yea | 21:11 |
*** donghao has joined #openstack-telemetry | 21:11 | |
gordc | it does seem like your swift storage never gets upgraded successfully http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/gnocchi-upgrade.log.txt.gz#_2017-05-24_22_58_30_158 | 21:13 |
gordc | although i vaguely recall swiftclient has a default rety built in | 21:13 |
*** donghao has quit IRC | 21:15 | |
pradk | gordc, this use to work fine until recently.. so is there some new logic having it check the sacks now? | 21:20 |
gordc | pradk: well it worked because you had nothing to upgrade | 21:21 |
pradk | gordc, i shouldnt have anything to upgrade now either :) | 21:21 |
gordc | the sacks never existed until recently. | 21:21 |
pradk | gordc, atleast this is a fresh deploy job | 21:21 |
pradk | ah right ok, so its trying to upgrade sacks now | 21:22 |
gordc | well it's trying to create the framework to support sacks | 21:22 |
*** thorst has quit IRC | 21:22 | |
gordc | previously, it was just 1. start, 2. write to measure to file | 21:22 |
gordc | now it's 1.start 2. load configuration (not from conf) 3. write measures to obj | 21:23 |
pradk | ok understood | 21:25 |
pradk | i guess if the retry worked as expected this would still be ok.. else i'll need to rework the ordering to get gnocchi api start later | 21:25 |
gordc | i'm still not sure why the api can't find the setting object over time... it exists according to metricd | 21:25 |
gordc | but you're api error says it doesn't. | 21:26 |
pradk | yea | 21:26 |
gordc | although, now that i think about it more, it also needs that storage_sacks configured to even know how to write. which it is (according to your metricd) | 21:27 |
gordc | so some api does know that about storage sacks... or your backlog should be empty as well | 21:27 |
pradk | storage_sacks configured is done as part of gnocchi-upgrade? | 21:28 |
*** ddyer has quit IRC | 21:29 | |
gordc | yes, inital value is set during upgrade | 21:29 |
gordc | some gnocchi-api knows about storage sacks... since you have a backlog. | 21:29 |
gordc | jsut for some reason, you have other gnocchi-api which don't know about it and are spewing those errors. | 21:30 |
pradk | hmm there should be only one be gnocchi api .. which is down | 21:31 |
gordc | ... then you got to wonder what is pushing http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-0/var/log/gnocchi/metricd.log.txt.gz#_2017-05-24_23_01_31_210 lol | 21:32 |
pradk | hmm wait lemme check other controllers | 21:33 |
*** dave-mccowan has quit IRC | 21:33 | |
pradk | http://logs.openstack.org/50/467750/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-oooq/b456c2a/logs/overcloud-controller-1/var/log/gnocchi/app.log.txt.gz | 21:33 |
pradk | so its up on controller 1 | 21:33 |
gordc | ah | 21:34 |
pradk | ok so the api is up and running on controller-1 and 2 | 21:34 |
pradk | just 0 has this issue | 21:34 |
pradk | that explains the metricd backlog | 21:34 |
gordc | maybe controller-0 is pointing to a different swift? | 21:35 |
gordc | i mean in theory, i think if api does nothing, it won't show that original error | 21:36 |
gordc | so it could be those apis haven't done anything | 21:37 |
gordc | nm. httpd says it's doing stuff | 21:37 |
*** ddyer has joined #openstack-telemetry | 21:39 | |
pradk | gordc, yea i'm a bit confused why other nodes are starting fine but not the first | 21:45 |
gordc | seems your conf files are all the same so it should be same swift | 21:45 |
pradk | if storage sack is needed shouldnt all fail? | 21:45 |
gordc | yep.lol | 21:45 |
gordc | unless you have 3 different swifts and your upgrade is just upgrading node1/2 and not node0.lol | 21:46 |
pradk | i dont think so, because metricsd on first finds it eventually .. i think its just the ordering on bootstrap node which is the first one. atleast i see the error consistenly on controlelr 0 | 22:02 |
pradk | i'll investigate this further tomorrow.. i have a wip patch i'm testing | 22:13 |
pradk | thx gordc | 22:13 |
gordc | pradk: kk, feel free to ping tomorrow. pretty strange | 22:15 |
*** thorst has joined #openstack-telemetry | 22:20 | |
*** thorst has quit IRC | 22:23 | |
*** vint_bra has quit IRC | 22:36 | |
openstackgerrit | gordon chung proposed openstack/panko master: add noauth api pipeline https://review.openstack.org/468191 | 22:42 |
*** thorst has joined #openstack-telemetry | 22:53 | |
*** jobewan has quit IRC | 22:55 | |
*** gordc has quit IRC | 22:57 | |
*** thorst has quit IRC | 23:08 | |
*** jmlowe has joined #openstack-telemetry | 23:16 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!