14:00:22 #startmeeting monasca 14:00:23 Meeting started Wed Aug 16 14:00:22 2017 UTC and is due to finish in 60 minutes. The chair is rhochmuth. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:24 0/ 14:00:24 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:26 The meeting name has been set to 'monasca' 14:00:27 o/ 14:01:31 is anyone here for the weekly monasca meeting? 14:01:47 Yep 14:02:22 hi sgrasley1 14:02:51 well, it looks like a pretty small group this week 14:02:57 just you and i 14:03:15 Hi Roland! 14:03:16 Well a short meeting then. 14:03:30 hi Shristian_ 14:03:34 Christian_ 14:03:41 Hi Roland, Yavor here from OP5, thanks for the invite 14:03:47 so, i t hink some folks are out this week on vacation 14:03:51 i know witek is out 14:04:03 hi yavor_OP5 14:04:16 glad you guys made it 14:05:14 trip down memory lane, this... must be 20 yrs since I last used irc ;) 14:05:39 lol 14:05:52 you'll love it 14:07:04 ok, the agenda doesn't have anything in it 14:07:10 https://etherpad.openstack.org/p/monasca-team-meeting-agenda 14:08:02 i think we can go with an open floor 14:08:13 so do folks have any topics to discss this week 14:08:55 we have a nice backlog on reviews 14:08:58 https://review.openstack.org/#/q/status:open+monasca,n,z 14:10:20 OP5 folks, have you had a chance to review some of the areas of development and come to any direction? 14:10:32 or conclusion 14:10:50 we've started investigating "poor man's clustering" on influxdb on our side... 14:11:24 we had a number of strategies that we had considered that would involve some changes to monasca 14:11:52 one idea is that if an influxdb node/container goes down 14:12:30 when it is brought back on-line it would have to catch-up 14:12:42 yup, we thought about that 14:13:12 therefore, queries from the monasca-api to that influxdb node would not occur until the consumer offset of the persister for that node was close to the producer offset 14:13:43 this assumes that persister storage of some type is being used, such as EBS in an AWS environment 14:13:57 or that the influxdb node is the same node as before with the same disks available 14:14:32 if the node is down for a while, then it could take a while to catach-up via kafka and the persister 14:14:53 so another option is to copy the data from a working node to the new node 14:14:57 that is more invovled 14:15:19 and at some point i think data to both nodes would need to stop 14:16:00 would it be OK to assume that the kafka cluster would provide storage needed for catch-up? 14:16:15 yes 14:16:25 since it is already a part of the reference architecture 14:16:29 you could store data in kafka for several days or a week 14:16:42 otherwise we put constraints on the influx implementation 14:16:51 which may vary from instance to instance 14:17:08 the data is uncompressed in kafka, so the amount of storage is possibly the biggest issue 14:17:18 What are options for persisting to 2 influxdb instances 14:17:22 you would need to size capacity appropriately to handle your injest rate 14:18:10 sgrasley1: you can replicate influxdb using the open-source code 14:18:16 what do you believe is a reasonable expectation of time to catch up with no loss? 2hrs? 2days? We have to have an idea about some sort of duration 14:18:23 or you can use infludb's proprietary clustering solution 14:19:21 i think the catch-up time is related to your amount of downtime, injest rate of metrics, and the throughput to the infludb node 14:19:54 if your injest rate is 50K metrics/sec and you can only write 50K metrics/sec to influxdb, you would never catch-up 14:20:07 sure.. 14:20:27 so, you would have to measure that as it is dependent on systems, number of persisters 14:20:57 but I still think that one needs to have a reasonable expectation of how long an influx outage can last without losing data... 14:21:44 in a kubernetes environment recovery times can be very quick 14:22:09 to exemplify: from our company view if we can sustain an influx outage for a particular ingestion rate of 6 hrs x ingestion rate and then catchup then this is a perfectly acceptable clustering solution 14:22:33 which obviously leads to a question of how you monitor your Monasca instance ;)' 14:23:36 rhochmuth: how would you like us to structure our findings? 14:24:13 good question 14:24:38 i think an overview of the various alternatives, pros/cons of each one 14:24:49 the usual place where this ends-up in openstack is an etherpad 14:25:16 For example, the monasca team meeting is at, https://etherpad.openstack.org/p/monasca-team-meeting-agenda 14:25:32 but you could easily add an etherpad for influxdb 14:25:46 monasca-influxdb 14:25:46 ok, will look into it 14:26:10 and, unless you are done earlier than the midcycle, we could review at one of the weekly meetings 14:26:23 but covering this at the mid-cycle woukld be best as that is just a few weeks away 14:26:32 witek will be organizing that 14:26:52 but, due to his vacation, it will take him another week to cover that 14:28:22 so, should we wrap-up a little early this week? 14:28:37 unless there are other topics/questions to cover 14:28:51 I'm good 14:29:02 thx Christian_ 14:29:09 Yavor was the active one this time :-) 14:29:41 thanks for getting invovled in the project, we are looking forward to the involvement 14:29:49 and sorry about the small group today 14:30:06 thanks for the update! 14:30:07 but it is that time of year when folks are taking vacations 14:30:18 yeah, anything for you? 14:30:31 in terms of vacation that is 14:30:49 i have a backlog, but probably in september 14:31:19 OK everyone, thanks for the meeting today 14:31:23 see you all next week 14:31:29 #endmeeting