Wednesday, 2016-08-31

*** liuhanxi has joined #openstack-telemetry00:02
*** zhurong has quit IRC00:06
*** chlong has quit IRC00:16
*** thorst has joined #openstack-telemetry00:21
*** thorst_ has joined #openstack-telemetry00:37
*** thorst has quit IRC00:39
*** liuhanxi has quit IRC00:40
openstackgerritxiaozhuangqing proposed openstack/ceilometer: Remove unnecessary confirm prompt  https://review.openstack.org/36282900:58
openstackgerritfengchaoyang proposed openstack/gnocchi: Check whether the resource type attribute min is less than or equal to max  https://review.openstack.org/36160401:00
*** zhurong has joined #openstack-telemetry01:13
*** donghao has joined #openstack-telemetry01:17
*** donghao has quit IRC01:20
*** donghao has joined #openstack-telemetry01:24
*** fguillot has quit IRC01:27
*** oneapm has joined #openstack-telemetry01:28
*** fguillot has joined #openstack-telemetry01:30
*** liuhanxi has joined #openstack-telemetry01:35
*** thorst_ has quit IRC01:38
*** thorst has joined #openstack-telemetry01:38
*** chlong has joined #openstack-telemetry01:40
*** thorst has quit IRC01:46
*** oneapm has quit IRC01:54
*** zhangguoqing has joined #openstack-telemetry01:57
*** vishwanathj has quit IRC01:59
*** zhurong_ has joined #openstack-telemetry02:00
*** zhurong has quit IRC02:02
*** Kevin_Zheng has joined #openstack-telemetry02:13
*** diogogmt has quit IRC02:19
*** fguillot has quit IRC02:30
*** thorst has joined #openstack-telemetry02:45
openstackgerritgecong proposed openstack/panko: Add missing %s in print message  https://review.openstack.org/36336102:47
*** thorst has quit IRC02:52
*** donghao has quit IRC03:12
*** donghao has joined #openstack-telemetry03:14
*** dduvall has quit IRC03:23
*** dduvall has joined #openstack-telemetry03:23
*** adriant has quit IRC03:41
*** diogogmt has joined #openstack-telemetry03:44
*** thorst has joined #openstack-telemetry03:50
*** thorst has quit IRC03:57
*** donghao has quit IRC03:58
*** achatterjee has joined #openstack-telemetry04:30
*** links has joined #openstack-telemetry04:36
*** chlong has quit IRC04:38
*** chlong has joined #openstack-telemetry04:39
*** oneapm has joined #openstack-telemetry04:40
*** chlong has quit IRC04:45
*** thorst has joined #openstack-telemetry04:54
*** chlong has joined #openstack-telemetry04:57
*** donghao has joined #openstack-telemetry04:59
*** thorst has quit IRC05:01
*** donghao has quit IRC05:04
*** yprokule has joined #openstack-telemetry05:08
*** diogogmt has quit IRC05:17
*** _nadya_ has joined #openstack-telemetry05:54
*** donghao has joined #openstack-telemetry05:56
*** _nadya_ has quit IRC05:59
*** thorst has joined #openstack-telemetry06:00
*** donghao has quit IRC06:05
*** thorst has quit IRC06:07
*** pcaruana has joined #openstack-telemetry06:15
*** rcernin has joined #openstack-telemetry06:21
*** sheel has joined #openstack-telemetry06:24
openstackgerritNguyen Phuong An proposed openstack/ceilometer: Config logABug feature for Ceilometer api-ref  https://review.openstack.org/36343006:57
openstackgerritXueFeng Liu proposed openstack/ceilometer: Fix string interpolation in log  https://review.openstack.org/35542206:57
*** thorst has joined #openstack-telemetry07:05
*** tesseract- has joined #openstack-telemetry07:08
*** zhangguoqing has quit IRC07:10
*** thorst has quit IRC07:12
*** shardy has joined #openstack-telemetry07:12
*** zhangguoqing has joined #openstack-telemetry07:17
openstackgerritMerged openstack/ceilometer: remove needless brackets  https://review.openstack.org/36269707:27
*** ljxiash has joined #openstack-telemetry07:30
*** shardy has quit IRC07:39
*** shardy has joined #openstack-telemetry07:40
*** _nadya_ has joined #openstack-telemetry07:43
openstackgerritMerged openstack/ceilometer: The debug that network.incoming/outgoing.packets print is not right  https://review.openstack.org/35633307:45
*** zhurong_ has quit IRC07:45
*** zhurong has joined #openstack-telemetry07:48
openstackgerritWenyan Zhang proposed openstack/ceilometer: correct input params in get_samples  https://review.openstack.org/36346007:53
*** cdent has joined #openstack-telemetry07:56
openstackgerritJulien Danjou proposed openstack/gnocchi: storage: re-store/write read-only splits  https://review.openstack.org/36289207:58
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: optimize uncompressed serialization  https://review.openstack.org/35831407:58
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: compress non padded timeseries  https://review.openstack.org/35831307:58
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: use calcsize rather than hard coded value  https://review.openstack.org/35831207:58
*** r-mibu has quit IRC08:02
*** chlong has quit IRC08:06
*** thorst has joined #openstack-telemetry08:09
*** kelv has quit IRC08:10
*** ljxiash has quit IRC08:12
*** thorst has quit IRC08:17
*** ljxiash has joined #openstack-telemetry08:21
*** kong has quit IRC08:22
*** kong has joined #openstack-telemetry08:23
*** liuhanxi has quit IRC08:24
*** r-mibu has joined #openstack-telemetry08:33
*** ossifrath has quit IRC08:33
*** ossifrath has joined #openstack-telemetry08:36
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: optimize uncompressed serialization  https://review.openstack.org/35831408:52
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: compress non padded timeseries  https://review.openstack.org/35831308:52
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: Timeserie.aggregate  https://review.openstack.org/36349208:52
openstackgerritfengchaoyang proposed openstack/gnocchi: Check whether the resource type attribute min is less than or equal to max  https://review.openstack.org/36160408:52
*** nijaba has quit IRC09:05
*** nijaba has joined #openstack-telemetry09:07
*** nijaba has quit IRC09:07
*** nijaba has joined #openstack-telemetry09:07
*** liuhanxi has joined #openstack-telemetry09:13
*** thorst has joined #openstack-telemetry09:14
*** thorst has quit IRC09:22
*** ljxiash has quit IRC09:44
*** zhurong has quit IRC10:01
*** oneapm has quit IRC10:12
*** thorst has joined #openstack-telemetry10:20
*** thorst has quit IRC10:26
*** shardy is now known as shardy_lunch11:24
*** zhangguoqing has quit IRC11:37
*** thorst has joined #openstack-telemetry11:43
*** vishwanathj has joined #openstack-telemetry11:44
*** larainema has quit IRC11:58
*** Kevin_Zheng has quit IRC11:59
*** larainema has joined #openstack-telemetry12:02
*** Kevin_Zheng has joined #openstack-telemetry12:04
*** shardy_lunch is now known as shardy12:15
*** sheel has quit IRC12:16
*** pradk has joined #openstack-telemetry12:20
*** gordc has joined #openstack-telemetry12:27
openstackgerritxiaozhuangqing proposed openstack/ceilometer: agentbase: miss a blank in info  https://review.openstack.org/36361912:29
*** openstackgerrit has quit IRC12:34
*** openstackgerrit has joined #openstack-telemetry12:35
*** eglynn has joined #openstack-telemetry12:42
*** zhurong has joined #openstack-telemetry12:44
*** jefrite has quit IRC12:51
*** jefrite has joined #openstack-telemetry12:56
*** pradk has quit IRC13:01
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: Timeserie.aggregate  https://review.openstack.org/36349213:05
openstackgerritJulien Danjou proposed openstack/gnocchi: storage: re-store/write read-only splits  https://review.openstack.org/36289213:05
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: optimize uncompressed serialization  https://review.openstack.org/35831413:05
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: compress non padded timeseries  https://review.openstack.org/35831313:05
openstackgerritJulien Danjou proposed openstack/gnocchi: carbonara: use calcsize rather than hard coded value  https://review.openstack.org/35831213:05
*** fguillot has joined #openstack-telemetry13:11
*** liuhanxi has quit IRC13:23
*** paw has quit IRC13:28
*** ljxiash has joined #openstack-telemetry13:29
gordcjd__: i had no idea you could just define a granularity and no timespan/points.13:31
gordci'll start testing out the branch now.13:31
*** eglynn has quit IRC13:31
*** ljxiash has quit IRC13:34
*** liuhanxi has joined #openstack-telemetry13:47
*** liuhanxi has quit IRC13:47
*** liuhanxi has joined #openstack-telemetry13:47
*** paw has joined #openstack-telemetry13:51
*** ljxiash has joined #openstack-telemetry13:53
*** c64cosmin_ has joined #openstack-telemetry13:57
*** links has quit IRC14:00
*** pradk has joined #openstack-telemetry14:02
*** c64cosmin_ has quit IRC14:02
*** zhurong has quit IRC14:03
*** zhurong has joined #openstack-telemetry14:03
*** cdent_ has joined #openstack-telemetry14:05
*** cdent has quit IRC14:06
*** cdent_ has quit IRC14:07
*** cdent has joined #openstack-telemetry14:07
*** liamji has joined #openstack-telemetry14:21
*** rbak has joined #openstack-telemetry14:23
*** _mbelanger has joined #openstack-telemetry14:25
*** xiaozhuangqing has quit IRC14:25
*** xiaozhuangqing has joined #openstack-telemetry14:26
*** donghao has joined #openstack-telemetry14:29
*** nicodemus_ has joined #openstack-telemetry14:32
*** KrishR has joined #openstack-telemetry14:45
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD type attributes  https://review.openstack.org/36373715:03
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373715:09
*** pshedimb has joined #openstack-telemetry15:10
*** KrishR1 has joined #openstack-telemetry15:10
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373715:10
*** zhurong has quit IRC15:11
*** KrishR has quit IRC15:11
*** KrishR1 has quit IRC15:12
*** diogogmt has joined #openstack-telemetry15:14
*** KrishR has joined #openstack-telemetry15:16
*** pshedimb has quit IRC15:19
*** pshedimb has joined #openstack-telemetry15:21
*** pcaruana has quit IRC15:22
*** rcernin has quit IRC15:22
*** liuhanxi has quit IRC15:22
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373715:24
*** ljxiash has quit IRC15:29
*** tesseract- has quit IRC15:36
*** nicodemus_ has quit IRC15:40
*** nicodemus_ has joined #openstack-telemetry15:40
*** _nadya_ has quit IRC15:46
*** rcernin has joined #openstack-telemetry15:54
*** pshedimb has quit IRC15:56
*** sheel has joined #openstack-telemetry15:57
*** pshedimb has joined #openstack-telemetry15:57
gordcjd__: i think the compression stuff isn't getting triggered. i have the exact same size in disk space.15:58
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373715:58
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (remove)  https://review.openstack.org/36377015:58
jd__gordc: I have unit test that prove that work :(15:58
jd__gordc: any hint?15:59
jd__gordc: are you time serie big enough, do they have enough splits?15:59
jd__because if they are all of 1 split it's not goint to change anything15:59
gordcno idea, taking a look right now.15:59
EmilienMso "it worked in unit tests" now replaces "it worked in devstack" ?15:59
EmilienM:-) /me runs15:59
jd__EmilienM: yes :D15:59
gordci have one series that is up to 144000opints15:59
jd__it's better it's reproducible16:00
gordcand i'm putting in 7200 points16:00
gordci'm assuming there should be at least one split16:00
jd__gordc: 7200 consecutively aggregated points?16:00
jd__yep16:00
gordcyeah16:00
jd__but not sure 2 splits is going to make a huge diff16:00
gordcprobably not. but it should be something? looknig at ceph logs, it's the exact same footprint over entire set16:01
jd__are your data values completely random?16:02
jd__IIRC worst case is 8 bytes vs 9 bytes without compression so…16:02
jd__if it's random… it might be avery small diff16:02
gordcnot really. i'm actually just posting 720 real points and then offsetting the same 720 points over and over16:03
jd__if you can read one of the old split directly you'll be fixed though :)16:04
*** ddaskal has joined #openstack-telemetry16:05
gordclooking now16:06
*** ildikov has quit IRC16:06
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373716:10
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (remove)  https://review.openstack.org/36377016:10
*** links has joined #openstack-telemetry16:11
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373716:16
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (remove)  https://review.openstack.org/36377016:16
gordcjd__: it has compressed format... i still don't know how it's the exact same size in ceph though.16:17
gordci'll keep looking.16:17
jd__ah, that's interesting lol16:18
gordcfrom performance pov, at quick glance it looks like it's running at roughly the same speed so doesn't seem like compression is negative impact.16:18
gordci didn't really dig into it though.16:18
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373716:22
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (remove)  https://review.openstack.org/36377016:22
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (add)  https://review.openstack.org/36373716:22
openstackgerritMehdi Abaakouk (sileht) proposed openstack/gnocchi: CRUD resource type attributes (remove)  https://review.openstack.org/36377016:22
*** donghao has quit IRC16:27
*** zenpac has joined #openstack-telemetry16:28
zenpacI'm on Mikata and we've created a dispatcher to send events over to a monitoring system. It was working on older versions of OopenStack but now its not sending data.16:29
openstackgerritMilan Potdar proposed openstack/ceilometer: Nit: Aligning the content.  https://review.openstack.org/36379416:34
gordczenpac: um... i don't recall anything changing.16:36
gordczenpac: we added batch notificaitons16:36
gordczenpac: can't really help other than that since it's not opensource :P16:36
zenpacWe're trying to send messages via AMQP.. What driver in neutron.conf should i list?16:37
*** pshedimb has quit IRC16:37
zenpacceilometer is proprietary?16:37
gordczenpac: there already is a notifier publisher.16:37
*** pshedimb has joined #openstack-telemetry16:38
gordczenpac: no, but i have no idea what this dispatcher you created is16:38
gordcthe notifier publisher just dumps msg to queue for something to pick up16:38
zenpacI got a warning from neutron-server saying that "Option "notification_driver" from group "DEFAULT" is deprecated. Use option "driver" from group "oslo_messaging_notifications"16:39
zenpacBut its not set, not sure what it should be.16:39
gordchttp://docs.openstack.org/developer/ceilometer/install/manual.html#neutron16:40
zenpachttps://github.com/zenoss/ceilometer_zenoss16:42
zenpacgordc: Can you see that repo?16:43
*** _nadya_ has joined #openstack-telemetry16:43
gordcyes16:43
*** fguillot_ has joined #openstack-telemetry16:44
*** _nadya_ has quit IRC16:44
*** _nadya_ has joined #openstack-telemetry16:52
gordcjd__: did you try compress with just lz4 and no double delta encoding?16:59
jd__gordc: you mean the current serialization formaet?17:00
gordcright17:01
jd__yes I did17:01
jd__why?17:01
*** yprokule has quit IRC17:01
gordcthe compress object i have is 32400B and there should only be 2900 points in it.17:01
jd__that sounds weird17:03
jd__is this the actual file size or what Ceph reports?17:03
gordcsize when i get file from ceph to local17:03
gordc32400 is perfectly 3600points (9B each)17:04
jd__so it's not compressed?17:04
jd__it's 2900 points with padding I imagine17:04
jd__the compressed format has 'c' has the first byte of the file if you want to check17:05
jd__s/has/as/17:05
gordcright. i think it should be 2900 because the uncompress obj is 2880 points long17:05
gordcyeah, my compressed object starts with 'c'17:05
jd__and it's 32400 bytes long17:05
jd__?17:05
gordcand i'm sending 5780 points so i assume the remaining 2900 is in the compress obj17:05
gordcyeah, compressed obj is 32400B17:06
gordcuncmpressed is 25920B17:06
jd__I think I'm lost17:06
jd__32400 for a compressed object sounds impossible17:06
gordcagreed17:06
jd__send me the file I'll take a look17:07
gordcsure17:07
silehtIs the gnocchi gate broken by the pymysql thing ?17:09
jd__yes sileht17:09
gordcjd__: sent you the file.17:10
marcin1234guys do you know how to speed up a creation of a "resource" in Gnocchi?17:10
jd__marcin1234: how slow it is?17:11
marcin1234I have interval of 5 minutes on computes, it takes up to 20 minutes17:11
marcin1234this leads to messages that I am collecting measures without predecesor17:12
jd__gordc: lol this is not even a valid data17:14
jd__gordc: I think we miss something here17:14
gordcgoing to lunch. marcin1234 the actual resource creation should take very little time. less than a second. something is wrong.17:14
jd__there might be a subtility and a different between real Ceph and our fake code17:14
gordcyeah, that's what i have. all i see is the 'c' so i assume it's compressed.lol17:15
jd__marcin1234: so it seems the problem is between Nova and Ceilometer, not Gnocchi17:15
jd__gordc: yeah, it looks like it has been rewritten but the rest of the data stayed17:15
jd__gordc: like… maybe write() with offset= does not truncate… sileht?17:16
gordctruncate only writes from offset to however long your data is17:17
gordcs/truncate/write/17:17
gordcso it leavse stuff...17:17
gordcbbl17:17
marcin1234ok17:17
jd__gordc: if that's the case, that' the source of the problem – our fake driver does not do that17:17
jd__I would just like to be sure it's the case as it's a difference with a standard file system17:18
jd__which is ok but surprising somehow - maybe I don't know17:18
*** fguillot_ has quit IRC17:18
jd__it's not a file system17:18
* jd__ looks at sileht 17:18
*** nicodemus_ has quit IRC17:20
*** nicodemus_ has joined #openstack-telemetry17:21
openstackgerritMilan Potdar proposed openstack/ceilometer: Nit: Aligning the content.  https://review.openstack.org/36379417:22
*** _nadya_ has quit IRC17:26
silehtjd__, indead that doesn't truncate17:34
*** sergio_nubeliu has joined #openstack-telemetry17:38
nicodemus_hello17:38
*** pcaruana has joined #openstack-telemetry17:39
nicodemus_using current python-gnocchiclient, is it possible to list all current resources? That is, resources that do not have an ended_at date?17:39
*** pshedimb has quit IRC17:39
marcin1234jd: so I measured exactly by looking at time stamps, from the time my VM was created (nova.instances) to time resource with corresponding original_resource_id was created (gnocchi.resource), it has 15 minutes14 seconds17:44
marcin1234delay17:44
*** rwsu has quit IRC17:44
silehtnicodemus_, you can use the keyword null as value I think, that's works on server side, perhaps the client does the same17:54
*** _nadya_ has joined #openstack-telemetry17:55
nicodemus_sileht, would that be with the --sort option from the client? Or doing a resource search for ended_at not null?17:57
silehtthe search17:57
nicodemus_sileht, got it. Thanks!17:58
silehtnicodemus_, I don't remenber how but yiu can sort with on top or on bottom by passing a special sort direction17:59
sileht*sort the null values*18:00
jd__sileht: haha, ok18:00
jd__sileht: any trick to truncate?18:00
jd__sileht: have to remove the object?18:00
silehtjd__, ioctx.trunc(name, len) or writeop.truncate(len)18:01
*** liamji has quit IRC18:02
*** diogogmt has quit IRC18:02
*** thumpba has joined #openstack-telemetry18:05
*** Kevin_Zheng has quit IRC18:06
*** larainema has quit IRC18:08
silehtnicodemus_, for example: sort=revision_end:asc-nullslast18:10
jd__sileht: awesome, thanks18:11
jd__i'll fix that tomorrow18:11
*** Kevin_Zheng has joined #openstack-telemetry18:11
*** larainema has joined #openstack-telemetry18:11
zenpacI get this type of traceback: http://paste.debian.net/798638/ when we use our dispatcher.18:16
gordcmarcin1234: is the data coming from ceilometer? do you have a massive backlog on metering.sample topic of MQ?18:17
gordczenpac: sorry, we don't actually use eventlet in ceilometer. we probably dropped it in Liberty?18:18
gordci would suggest you try modifying/contributing to notifier publisher, you shouldn't need collector/dispatcher if you are just publishing to rabbit.18:18
marcin1234I do not have backlog on metering.sample18:19
marcin1234what I do have is in ceilometer log "dropping sample with no predecessor"18:19
marcin1234during that 15+ minutes of delay18:20
nicodemus_marcin1234, in my case I have configured 1200 seconds for each source in /etc/ceilometer/pipeline.yaml. Sometimes I launch an instance and, since it's going to take up to 20 minutes for the next ceilometer-agent-compute to post to gnocchi... the resource takes a while to actually appear18:22
nicodemus_could that be the case?18:22
*** shardy is now known as shardy_afk18:24
gordcmarcin1234: that log is unrelated. it will happen on new instances and if you restart your notification agent since our transformations are done in memory18:24
*** shardy_afk has quit IRC18:25
gordcnicodemus_: is there a reason you poll every 20min?18:25
gordctoo much load on Nova api?18:25
nicodemus_gordc, that 20min is for billing purposes and is using a little old gnocchi deploy, a more aggressive polling interval resulted in too many measures for metricd to cope with18:26
nicodemus_using current code I'm polling every 5 mins18:27
*** _nadya_ has quit IRC18:28
gordcnicodemus_: kk, if you have free envirnoment i would actually try https://review.openstack.org/#/c/275741/18:29
gordcin theory, it improves the batching (if you dispatch straight from notification agent rather than use collector18:29
gordcit doesn't improve metricd though. that is only done through v318:30
nicodemus_gordc, that looks promising... I intend to move from collector shortly18:32
nicodemus_I'll give it a try18:32
gordccool cool18:33
marcin1234nicodemus: what does it meant it takes a while? it take 21 minutes to appear in gnocchi table or more like an hour?18:33
nicodemus_marcin1234, maybe it appears one minute after the instance was created, maybe it appears 19 minutes after18:34
marcin1234I need to speed up resource creation on gnocchi, somehow18:34
nicodemus_never took more than 20 mins though18:35
*** _nadya_ has joined #openstack-telemetry18:35
gordcmarcin1234: have you tried creating a dummy resource? like just use gnocchi resource create abcxyz'18:35
gordcthat command takes 15mins?18:35
marcin1234gordc: no that works immediately18:36
*** pshedimb has joined #openstack-telemetry18:36
gordcmarcin1234: oh. then it's probably either your pipeline period is very long.18:38
gordcmarcin1234: or notification agent is backed up like crazy18:38
marcin1234gordc: it is 5 minutes18:38
gordcmarcin1234: if your metering.sample queue is empty and that resource create command is instant than it's something up with ceilometer.18:38
marcin1234I can see those "dropping sample with no predecessor" because there is data coming from ceompute, through collector, so I am collecting measures, for resource that does not exist yet18:39
gordcif you do rabbitmqctl list_queues are there queues that are very large18:39
marcin1234gordc: how can it be with ceilometer? Ceilometer collector is pushing the data to Gnocchi18:39
marcin1234it is just Gnocchi that is not creating resource from some reason18:40
gordcthe "dropping sample with no predecessor" should not be on collector end. it's a notification agent message relating to transfomrations18:40
EmilienMgordc: you know rabbit right? ;)18:40
gordcit has nothing to do with gnocchi.18:40
gordcEmilienM: i eat rabbit?18:40
gordcso do you. you're french :P18:40
EmilienMgordc: I remember your awesome talk about rabbitmq in austin18:41
gordchahahah!18:41
marcin1234queues are all "0"18:41
gordcoh yeah, google it18:41
marcin1234gordc: so based on what Gnocchi decides to create resource in its mysql table?18:41
gordcmarcin1234: gnocchi creates a resource if you tell it to create a resource.18:42
gordcmarcin1234: from ceilometer+gnocchi pov, ceilometer will try to send measures to gnocchi, and if gnocchi returns saying, 'i don't know this resource', ceilometer will tell it to create resource and send measrues again18:43
gordcthis is the message you are getting: https://github.com/openstack/ceilometer/blob/67739c2278dba6197e141715faa3d6eb047170a7/ceilometer/transformer/conversions.py#L9818:43
marcin1234gordc: ok, in case when I build new VM, when Gnocchi is "asked" to create resource?18:43
gordcit has no relation to storage or gnocchi.18:43
*** _nadya_ has quit IRC18:44
nicodemus_marcin1234, perhaps you could try to do a packet capture on the gnocchi API, and see exactly when the POST to create the resource arrives to gnocchi18:44
gordcmarcin1234: it depends. basically when ceilometer gets a datapoint eitehr from notification or polling, it will end up at collector and then it interacts with gnocchi18:44
marcin1234gordc: got it, so I gues my ceilometer does not ask to create resource, or it does but after long ass time18:44
nicodemus_if when the POST arrives the resource appears on gnocchi, then gnocchi is not taking time to create it18:45
gordcwhen you say gnocchi doesn't have resource, do you mean it doesn't have any measures or it doesn't even know about resource at all?18:45
gordcif the latter, than it means ceilometer is getting no data at all18:46
gordcconsidering you have 0 items in your queues18:46
gordcthat basically means ceilometer is idle18:46
marcin1234gordc: it does not have resource in gnocchi.resource with original_resource_id from my nova uuid18:46
*** dduvall has quit IRC18:46
marcin1234gordc: ceilometer is getting data, it is just those queses are very quickly consumed18:47
*** diogogmt has joined #openstack-telemetry18:48
marcin1234I can see on graph in metering.sample18:48
gordcmarcin1234: if it's not in ceilometer it's gotta be in gnocchi. and if it's not in either, than you should have errors everywhere.18:48
marcin1234I can see measure being collected from compute, going to rabbit, being consumed by ceilometer collector18:48
*** thumpba has quit IRC18:49
marcin1234gordc: there are no errors18:49
*** thumpba has joined #openstack-telemetry18:49
gordcmarcin1234: maybe change your dispatcher from gnocchi to something else temporarily and validate the data is actually getting that far?18:50
marcin1234gordc: okay I will test, and tcpdump this18:50
marcin1234I just did another test, and again 22 minutes....18:50
gordci don't know. from data flow pov, your data appears to be straight up gone. i can't explain this 15min delay, with no backlog and no errors18:50
gordcmarcin1234: alternatively, just add logs to gnocchi dispatcher in ceilometer. and see how when it gets data18:52
gordcadd log here: https://github.com/openstack/ceilometer/blob/master/ceilometer/dispatcher/gnocchi.py#L32618:53
gordci guess another check is how long is it between instance create and when you see 'dropping sample...' log in notification agent?18:55
*** sheel has quit IRC18:56
*** diogogmt has quit IRC18:57
*** pshedimb has quit IRC18:59
*** ildikov has joined #openstack-telemetry19:01
nicodemus_I have a question regarding ceilometer-agent-notification: I'm using a rabbitmq vhost for each service (nova, neutron, etc). If I want to use https://review.openstack.org/#/c/217017/ I guess I'd need to use a ceilometer.conf with the vhost & credentials for Nova, since it needs to listen on the notifications.info queue, is that correct?19:05
gordcnicodemus_: how are you doing this currently? you define multiple urls to listen to. https://github.com/openstack/ceilometer/blob/master/ceilometer/notification.py#L64-L7419:08
nicodemus_gordc, cool. I didn't know that configuration option. I'll give it a try, thanks!19:10
*** links has quit IRC19:13
marcin1234gordc: dropping sample shows up 2-3 minutes after VM is created19:22
marcin1234what does not make sense, is once resource is created, it works great with no delay19:23
gordcmarcin1234: 2-3minutes seems pretty long still. i would add long to dispatcher.19:26
gordcit doesn't make any sense there's a 20 min delay between notification agent and collector.19:27
gordconly other thing i can think of is you have batching enabled and the timeout is extremely high.19:27
gordcs/long/log19:32
*** _nadya_ has joined #openstack-telemetry19:34
marcin1234ok19:35
marcin1234thx gordc: I apprecite your insight19:35
gordcnp. just for reference, the data flow is polling agent -> notification agent -> collector -> gnocchi. (assuming you have liberty+19:38
gordci'd add logging in those services to see where the delay is. your empty queues are very curiuos though. they should be exploding if it's a ceilometer issue.19:39
zenpacgordc: Does that mean that our use of eventlet in our dispatcher is the problem?19:44
zenpacor part or it?19:45
gordczenpac: oh, i didn't really dive into your code. sorry. was just pointing out we don't use it so i don't know how they interact.19:50
zenpacok.. Thanks.19:50
openstackgerritMerged openstack/ceilometer: Nit: Aligning the content.  https://review.openstack.org/36379419:53
*** KrishR has quit IRC19:56
*** KrishR has joined #openstack-telemetry19:59
*** _nadya_ has quit IRC20:03
pawgordc: Hi there, sorry for the pestering, for the gerrit review on the IP version ( https://review.openstack.org/363737), I was wondering if you had any additional comments? thanks in advance!20:06
gordcpaw: is that the right patch?20:09
gordcnm. found it.20:09
marcin1234gordc: you are on the money: VM created: 19:49:33, gnocchi.py#L326 executed at 20:05:17, matching with resource created at date20:11
gordcpaw: done. thanks for following up20:11
pawgordc: thanks for the review!20:11
*** kelv has joined #openstack-telemetry20:12
gordcmarcin1234: what's the time diff between dispatcher log and the 'dropping sample...' log in notification agent.20:12
gordcif there's a massive gap, then something is up with MQ20:13
*** pshedimb has joined #openstack-telemetry20:14
*** diogogmt has joined #openstack-telemetry20:14
marcin1234so there is a big gap. 15 minutes20:18
*** pshedimb has quit IRC20:18
marcin1234so here are all timings: VM created: 19:49:33, first "Dropping sample" 19:50:18, gnocchi.py#L326 executed at 20:05:17,20:18
marcin1234gordc: yes I am on mitaka20:20
*** paw has quit IRC20:22
gordci have no idea why you have that delay tbh. literally the only step between dropping sample is to publish sample to queue, and collector picks it up and you get that L326 log20:22
gordcmy only other suggestion is to turn off collector and see how long it takes you to see it on queue.20:22
gordcthere is no reason that there's a delay between notification agent and collector when polling agent -> notification agent uses the same service.20:23
marcin1234gordc: which queue we are talking about metering.sample or notifications.info ?20:23
gordcmetering.sample20:23
marcin1234gordc: what is default "batch_timeout" for collector? I am thinking this has something to do20:25
*** cdent has quit IRC20:27
gordcindefinite20:27
gordcmarcin1234: but batch_size is 1. so it should trigger right away20:28
gordcunless you changed batch_size20:28
marcin1234changed it to 120:29
marcin1234from unset20:29
gordcit should default to 1 if you don't have it set20:32
openstackgerritMerged openstack/ceilometermiddleware: Add release note for background thread notifier  https://review.openstack.org/36316820:44
*** _mbelanger has quit IRC20:56
*** ddaskal_ has joined #openstack-telemetry20:56
*** ddaskal__ has joined #openstack-telemetry20:57
*** pshedimb has joined #openstack-telemetry20:58
*** ddaskal has quit IRC20:59
*** rwsu has joined #openstack-telemetry21:00
*** ddaskal_ has quit IRC21:01
*** sergio_nubeliu has quit IRC21:03
*** paw has joined #openstack-telemetry21:08
marcin1234gordc: what process is consuming notifications.info and publishing them to metering.sample? It looks like it is "agent-notification" correct?21:20
gordcright21:20
marcin1234w21:21
marcin1234sorry, hit enter by an accident21:22
*** pshedimb has quit IRC21:22
*** pshedimb has joined #openstack-telemetry21:24
*** fguillot_ has joined #openstack-telemetry21:27
*** ddaskal has joined #openstack-telemetry21:27
*** ddaskal__ has quit IRC21:30
marcin1234gordc: so is it possible that agent-notification is consuming this message, and it takes 15-20 minutes to publish it to metering.sample ?21:30
gordcnope. especially considering the 'dropping sample...' message is one of the last steps in processing.21:33
gordcthe other step after is to push onto queue.21:33
*** fguillot_ has quit IRC21:33
gordchttps://github.com/openstack/ceilometer/blob/master/ceilometer/publisher/messaging.py#L128 this is actually last step21:36
*** thorst has quit IRC21:38
*** rwsu has quit IRC21:38
*** rwsu has joined #openstack-telemetry21:39
*** nicodemus_ has quit IRC21:40
*** thumpba has quit IRC21:41
marcin1234gordc: this is my understanding how it works: 1. Vm is created 2. ceilometer.compute.pollsters.cpu measures in my example cpu utilization and publishes it to RMQ notifications.info 3. notification-agent consumes it and publishes it to metering.sample 4. ceilometer-collector consumes it and dispatches it to Gnocchi21:47
*** diogogmt has quit IRC21:48
gordccorrect.21:59
gordcnova will also send notifications (if enabled) and will go to step 322:00
*** cdent has joined #openstack-telemetry22:00
gordchttp://docs.openstack.org/developer/ceilometer/architecture.html#high-level-architecture22:01
*** cdent has quit IRC22:05
*** fguillot has quit IRC22:07
*** fguillot has joined #openstack-telemetry22:09
*** adriant has joined #openstack-telemetry22:11
*** gordc has quit IRC22:14
*** KrishR has quit IRC22:42
*** zhurong has joined #openstack-telemetry22:47
*** zhurong has quit IRC22:52
*** rbak has quit IRC22:52
*** pradk has quit IRC23:03
*** ddaskal has quit IRC23:05
*** ddaskal has joined #openstack-telemetry23:06
*** ddaskal has quit IRC23:10
*** pshedimb has quit IRC23:11
*** cdent has joined #openstack-telemetry23:22
*** cdent has quit IRC23:29
*** fguillot has quit IRC23:31
openstackgerritMerged openstack/ceilometer: Config logABug feature for Ceilometer api-ref  https://review.openstack.org/36343023:43
*** diogogmt has joined #openstack-telemetry23:46
*** fguillot has joined #openstack-telemetry23:48
*** donghao has joined #openstack-telemetry23:54
marcin1234gordc: so... I found the solution, it works properly if I have only 1 notification-agent running on one controller, and it will only create a resource, during second time it collects measure from compute23:56
*** rwsu has quit IRC23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!