Tuesday, 2022-10-25

mkarpiarz	Thank you, leifmadsen and mrunge!	12:41
mrunge	mkarpiarz, my pleasure	12:42
mkarpiarz	So with Panko no longer being an option and support for MongoDB long gone, how are people storing/processing events these days?	12:42
mrunge	elasticsearch/loki	12:43
mrunge	err, I don't know how many people store..., but my understanding is, they are using different tools	12:43
mrunge	like elasticsearch or Grafana Loki	12:44
mkarpiarz	Ah, interesting! I see neither of them on the list of supported publishers here: https://docs.openstack.org/ceilometer/zed/admin/telemetry-data-pipelines.html#publishers. Do you mean there is a way to convert to ES/Loki using, for example, the "http" publisher?	12:45
mrunge	yes, you could be using the http publisher	12:47
mkarpiarz	OK, so is the basic idea to treat events like logs, send them to Logstash for parsing and then on to Elasticsearch?	12:52
mrunge	in that case, you'd send them directly to ES, if you want to deal with elastic	12:54
mkarpiarz	So just send events as payload of requests to ElasticSearch's APIs?	12:57
mrunge	exactly	12:58
mrunge	however, you don't get an integration in the openstack CLI with this	12:59
mrunge	and I am not sure what you'd get out of events in this case anyways	12:59
mkarpiarz	Ceilometer appears to be sending lists of JSON-formatted objects through http.	13:01
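
A minimal sketch of how a list of JSON event objects, like the one described above, could be reshaped into a request body for Elasticsearch's Bulk API. The index name `ceilometer-events`, the `message_id` and `event_type` fields, and the sample payload are illustrative assumptions, not taken from the conversation.

```python
import json


def to_bulk_body(events, index="ceilometer-events"):
    """Build an Elasticsearch Bulk API body (NDJSON) from a list of events.

    The index name and the event field names are assumptions for illustration.
    """
    if not events:
        return ""
    lines = []
    for event in events:
        action = {"index": {"_index": index}}
        # Reuse the event's message_id (assumed field) as the document id,
        # so a re-delivered event overwrites instead of duplicating.
        if "message_id" in event:
            action["index"]["_id"] = event["message_id"]
        lines.append(json.dumps(action))
        lines.append(json.dumps(event))
    # The Bulk API requires the body to end with a newline.
    return "\n".join(lines) + "\n"


# Hypothetical payload: a JSON list of event objects.
sample_payload = json.dumps([
    {"message_id": "a1", "event_type": "compute.instance.create.end"},
    {"message_id": "b2", "event_type": "compute.instance.delete.end"},
])

bulk_body = to_bulk_body(json.loads(sample_payload))
print(bulk_body, end="")
```

The resulting body would be sent as a POST to the `/_bulk` endpoint with the `Content-Type: application/x-ndjson` header. Where this reshaping would run relative to the http publisher is left open here.
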
mkarpiarz	Is there a better solution then?	13:01
mrunge	let me ask you the other way around: what do you want to get out of it?	13:02
mrunge	... which events are important that you don't want to miss	13:02
mkarpiarz	Ah, a good question! All I need (at least for now) is events relating to instance creation and deletion.	13:04
mrunge	they would be sent when a user creates or deletes an instance	13:04
mrunge	iirc, they are not sent when an instance died e.g	13:04
mkarpiarz	That's fine. All we need is to be able to tell that this many VMs have been running at any given time.	13:06
mkarpiarz	We also want to know their flavours.	13:06
mrunge	that is something you should be able to get out of nova	13:07
mrunge	without any events	13:07
mkarpiarz	Yeah, just tap into the notification bus, right?	13:07
mrunge	uhm, it's been a while since I looked into that. iirc you can directly ask for the number of running vms	13:08
mkarpiarz	Sure, I can poll APIs for this.	13:09
mkarpiarz	Also, the Prometheus exporter for OpenStack keeps a tally of VMs in different states.	13:10
mrunge	is there an exporter?	13:10
mrunge	the one I've seen was not maintained for a long time	13:11
mkarpiarz	https://github.com/openstack-exporter/openstack-exporter	13:11
mrunge	oh, nice!	13:13
mkarpiarz	The `openstack_nova_server_status` metric is what I've been using.	13:13
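
A rough sketch of how samples of the `openstack_nova_server_status` metric could be tallied per status or per flavour from an exporter's text output. The label names (`id`, `status`, `flavor_id`) and the sample lines are assumptions for illustration; the exporter's real label set should be checked before relying on this.

```python
import re
from collections import Counter

# One sample line of the Prometheus text format: name{labels} value [timestamp]
SAMPLE_RE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)\{(?P<labels>.*)\}\s+\S+(?:\s+-?\d+)?$'
)
LABEL_RE = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def tally_by_label(exposition_text, metric, label):
    """Count the samples of `metric`, grouped by the value of `label`."""
    counts = Counter()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and HELP/TYPE comments.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None or match.group("name") != metric:
            continue
        labels = dict(LABEL_RE.findall(match.group("labels")))
        counts[labels.get(label, "unknown")] += 1
    return counts


# Hypothetical scrape output; label names and values are made up.
SAMPLE = """\
# TYPE openstack_nova_server_status gauge
openstack_nova_server_status{id="vm-1",status="ACTIVE",flavor_id="m1.small"} 0
openstack_nova_server_status{id="vm-2",status="ACTIVE",flavor_id="m1.large"} 0
openstack_nova_server_status{id="vm-3",status="SHUTOFF",flavor_id="m1.small"} 12
some_other_metric 3
"""

by_status = tally_by_label(SAMPLE, "openstack_nova_server_status", "status")
by_flavor = tally_by_label(SAMPLE, "openstack_nova_server_status", "flavor_id")
print(dict(by_status))
print(dict(by_flavor))
```

Because the counts come from whatever the last scrape returned, they are only as fresh as the scrape interval.
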
mkarpiarz	The problem with a metrics-based approach is that the state of VMs doesn't update in real time.	13:15
mkarpiarz	But I guess for such a simple use case it might not make sense to maintain all the components required for processing/storing events.	13:17
mrunge	hmm, so how much real time do you need?	13:17
mrunge	otherwise you'd be using a lot of resources to keep track of the changes in a different data store	13:18
mkarpiarz	Very true.	13:25
mkarpiarz	However, from my experience a question about measurements in real time or with very fine resolution pops up every time sooner or later.	13:27
mkarpiarz	Also, personally I like the idea of having an alternative for polling APIs all the time just to get information which rarely changes.	13:29
mrunge	the question of getting metrics in a high resolution comes up. and then you'll discover that you rarely have a monitor to display the collected data	13:30
mrunge	we had the request to collect (and display) data at sub-second intervals. And suddenly 10 minutes of data will require a larger screen, or you realize that you could also sample at a lower rate without changing anything on the screen	13:32
mkarpiarz	Haha, that's a good one! In the end all people care about is aggregates. xD	13:32
mrunge	yes. I agree, if you want events and reactions to events in real time, then you don't want metrics	13:32
mrunge	"my network connection died" -> bring up a new one	13:33
mrunge	that's something you want instantly	13:33
mrunge	cpu count on server machines: maybe interesting, but not likely to change at a fast pace	13:33
mkarpiarz	Awesome! Thank you again for your time and all the food for thought!	13:35
mrunge	thank you for the questions and the link to the exporter. I really thought it was dead.	13:36
mrunge	but apparently it's not	13:36
mkarpiarz	You are welcome! :)	13:40
