#openstack-monasca log

13:00:26 <witek> #startmeeting monasca
13:00:27 <openstack> Meeting started Tue Jan 21 13:00:26 2020 UTC and is due to finish in 60 minutes.  The chair is witek. Information about MeetBot at http://wiki.debian.org/MeetBot.
13:00:28 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
13:00:30 <openstack> The meeting name has been set to 'monasca'
13:00:43 <witek> hello everyone
13:00:47 <chaconpiza> hi
13:00:54 <witek> hi Martin
13:01:28 <witek> the agenda for today consists of two points as of now:
13:01:33 <witek> https://etherpad.openstack.org/p/monasca-team-meeting-agenda
13:01:47 <witek> feel free to add new items
13:02:07 <witek> #topic persister performance optimization
13:02:31 <dougsz> hi all
13:02:50 <witek> hi Doug
13:02:54 <witek> I've stumbled upon a blog article about InfluxDB Python client
13:03:03 <witek> https://www.influxdata.com/blog/writing-data-to-influxdb-with-python/
13:03:48 <witek> one interesting thing they mention, is the recommended batch size for sending datapoints to InfluxDB
13:04:36 <witek> the recommended value is 5000 to 10000 data points per request
13:05:36 <witek> the client offers an argument to split larger requests to batches of a given size
13:06:23 <witek> and so I proposed a patch to allow configuring this value in monasca-persister as well
13:06:30 <witek> https://review.opendev.org/703563
13:08:05 <witek> I have to admit I haven't measured performance impact
13:09:14 <witek> any questions or comments on that?
13:09:59 <witek> if not, we can move on
13:10:07 <chaconpiza> We have in devstack:
13:10:09 <chaconpiza> [kafka_metrics]
13:10:09 <chaconpiza> batch_size = 30
13:10:14 <bandorf> We can execute some system tests, but it will take some time
13:10:26 <chaconpiza> then it would change to 10000?
13:10:46 * witek checking Kafka settings
13:11:16 <chaconpiza> I found it in /etc/monasca-persister/monasca-persister.conf
13:11:55 <bandorf> I'm a bit confused:
13:12:35 <bandorf> According to Martin, this value has been configured already now. witek: You wrote that you did a change to allow config. of this vale
13:13:45 <witek> right, the value which Martin mentions configures the batch size consumed from Kafka
13:14:04 <witek> so, effectively, my change won't have any impact
13:14:13 <dougsz> `# Maximum number of metrics to buffer before writing to database (integer value)
13:14:13 <dougsz> #batch_size = 20000
13:14:13 <dougsz> `
13:14:32 <dougsz> That makes sense - I recall seeing that limit applied in the logs
13:16:18 <witek> I think adding InfluxDB option doesn't hurt though
13:16:44 <witek> I should probably document that also Kafka config affects the actual batch size
13:17:29 <witek> thanks for spotting this chaconpiza
13:19:21 <witek> batch_size = 30 seems somewhat low though, in terms of InfluxDB performance
13:19:53 <chaconpiza> and for other topics the batch_size is = 1
13:20:25 <witek> but only metrics topic gets persisted to InfluxDB
13:21:51 <chaconpiza> I am wondering if [kafka_alarm_history] goes to influxDB as well
13:22:32 <witek> it does, but is more or less negligible in terms of performance
13:22:32 <chaconpiza> with topic = alarm-state-transitions
13:22:39 <chaconpiza> ok
13:23:30 <bandorf> Regarding the batch size for metrics: We can execute some system tests, with different values for the batch size
13:24:03 <witek> that would be nice in any case
13:25:09 <witek> I think the reason for choosing low Kafka batch size was to reduce delays when system is under low load
13:26:04 <dougsz> Doesn't it do a periodic flush as well?
13:26:13 <dougsz> At least it does for metrics
13:26:27 <bandorf> Yes, that's what I have in mind as well.
13:28:29 <witek> you're right again, we have `max_wait_time_seconds`
13:28:36 <dougsz> I'm just thinking about the Kafka batch_size setting for metrics -  I suppose that controls how much data is pulled from typically a spinning disk into RAM from Kafka, and generally you might want that to be a large chunk
13:29:51 <witek> yes, and having an option to control this independently to what is written to InfluxDB offers more flexibility for performance optimization
13:30:14 <dougsz> Yeah - perhaps 20000 is on the small side for that. Benchmarking will be interesting
13:31:18 <bandorf> I assume, currently the batch size "read from kafka" is the batch size used for sending to influxdb?
13:31:32 <dougsz> yeah, I believe so
13:31:36 <witek> bandorf: correct
13:32:10 <witek> playing with these two parameters would be really interesting
13:32:45 <bandorf> Then, we can relatively easy perform a test with different values for that batch size.
13:33:04 <witek> bandorf: that would be great, thanks
13:33:05 <bandorf> I need to check with our system test team when we can execute such tests.
13:33:20 <dougsz> yeah - I've found influxdb to very much not be the bottle neck until I have 10s of cores dedicated to persister processes
13:33:49 <dougsz> thanks bandorf
13:34:15 <witek> nice, thanks for this interesting discussion
13:34:21 <witek> I think we can move on
13:34:45 <witek> #topic monasca-log-api repo maintanance
13:35:35 <witek> I'm wondering how we should proceed with maintaining the monasca-log-api repo after we've merged all the code to monasca-api
13:36:03 <witek> theoretically we could still have cases when we want to provide bugfixes
13:37:03 <witek> should we drop DevStack as currently proposed and propose bugfixes to stable branches if needed?
13:37:17 <dougsz> When do we stop supporting it? This cycle?
13:38:01 <witek> might be to quick, right? I think we can mark it as deprecated
13:38:37 <dougsz> yeah, and remove in Victoria perhaps?
13:39:26 <dougsz> So this cycle we still need to target bug fix patches at master
13:40:01 <dougsz> If we drop devstack support, will that mean we can no longer run tempest against it?
13:41:05 <witek> the new job is configured with monasca-api DevStack plugin, so it doesn't test monasca-log-api repo
13:43:19 <witek> yes, I think it's the right way to keep all the old content and CI setup and remove in Victoria or following cycle
13:43:35 <witek> for now, just mark the repo as deprecated
13:43:56 <witek> chaconpiza: adriancz: what do you think?
13:44:38 <adriancz> yes, i think we should mark this repo as deprecated
13:44:58 <chaconpiza> I agree
13:45:01 <adriancz> and remove this in next cycle
13:45:11 <witek> agreed, thanks
13:45:31 <dougsz> yeah + 1 to remove asap, we don't want the burden
13:45:43 <bandorf> I agree as well
13:45:44 <dougsz> I will switch Kolla this cycle
13:45:57 <witek> #topic AOB
13:46:17 <witek> anything else for today?
13:47:31 <witek> just a notice about self-healing and auto-scaling SIGs joint meeting in 12 minutes
13:47:48 <witek> in #openstack-meeting
13:48:39 <dougsz> thanks - I will lurk
13:48:41 <witek> thanks for joining today and for interesting discussions
13:48:46 <dougsz> +1
13:48:50 <witek> see you next week
13:48:53 <witek> bye bye
13:48:56 <chaconpiza> thanks bye
13:48:56 <bandorf> Thanks and good-bye
13:49:01 <dougsz> bye
13:49:03 <witek> #endmeeting