15:00:32 <witek> #startmeeting monasca 15:00:32 <openstack> Meeting started Wed Jun 6 15:00:32 2018 UTC and is due to finish in 60 minutes. The chair is witek. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:00:34 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:00:36 <openstack> The meeting name has been set to 'monasca' 15:00:42 <dougsz> hey all 15:00:45 <jgrassler> Hello 15:00:46 <witek> hello 15:00:49 <koji> hi 15:00:53 <joadavis> Good day 15:01:10 <witek> congratulations joadavis ! :) 15:01:38 <joadavis> ah, thanks, you must have heard about my new family addition. :) 15:01:57 <witek> yes 15:02:18 <witek> we have few items in agenda today 15:02:23 <witek> https://etherpad.openstack.org/p/monasca-team-meeting-agenda 15:02:27 <witek> let's start 15:02:35 <witek> #topic reviews 15:02:44 <witek> https://review.openstack.org/#/c/562393/ 15:03:31 <witek> I quickly went through it 15:03:43 <witek> have just given +1 15:04:21 <witek> more reviewers very welcome 15:04:47 <witek> and if not, joadavis feel free to merge if you think it's ready 15:05:08 <joadavis> I've been through it with Ashwin and think it is ready to merge, but it is big enough that any additional eyes are welcome. Thanks for looking at it 15:06:07 <witek> https://review.openstack.org/#/c/563957/ 15:06:39 <witek> that's pretty straight forward 15:07:33 <joadavis> yes, I'd meant to throw https://review.openstack.org/#/c/565162/ and https://review.openstack.org/#/c/565165/ on the list as well. All seemed reasonable to me 15:08:11 <witek> thanks joadavis 15:08:46 <witek> in general we have some backlog of changes in review 15:08:50 <joadavis> I attempted to tox test 565165 but it failed to set up correctly on my devstack, and didn't have time to debug why 15:09:00 <pandy_> Hi All 15:09:12 <mayankkapoor> Hi everyone 15:09:14 <witek> it would be great, if we could invest some more time in reviews and try to get them merged 15:09:30 <witek> pandy_: mayankkapoor: hello 15:09:54 <dougsz> hmm, i also did the joadavis, and it ran all the expected tests 15:10:16 <joadavis> I suspected it was my system, so didn't see it as a blocker 15:10:17 <dougsz> *same 15:10:51 <joadavis> if it had passed, I would have +2 15:10:55 <joadavis> 'd :) 15:11:44 <pandy_> would like to know suggestions for deploying monasca on large scale production setup, mainly looking forward handling kafka 15:12:10 <joadavis> greetings, pandy_. Yeah, that's on the agenda 15:12:11 <witek> pandy_: please hold on, we're on the first agenda item 15:12:42 <witek> to end up with review topic: I've sent a nomination for dougsz to join the core team 15:12:50 <pandy_> witek, sure, waiting for my turn 15:13:02 <joadavis> +1 for adding dougsz :) 15:13:10 <witek> he actively contributes to log-api and kolla 15:13:24 <witek> and reviews actively 15:13:39 <jgrassler> +1 :-) 15:14:36 <witek> I see no objections; congratulations dougsz! I'll add you to the list after the meeting 15:14:56 <witek> #topic monasca-ceilometer publisher 15:15:10 <dougsz> thank you all :) it's privilege to join. I look forward to helping to keep things moving along 15:15:24 <joadavis> Just a quick update on that. The publisher contribution has stalled in ceilometer. 15:15:58 <witek> Yes, I've noticed 15:16:11 <joadavis> I chatted with two of the Telemetry guys, but was told effectively that ceilometer is in maintenance mode and I could take it up with the tc or take over ceilometer myself 15:16:39 <witek> with whom have you talked? 15:16:41 <joadavis> chat history in #openstack-telemetry last week if anyone is curious 15:16:46 <joadavis> julien and gordon 15:16:59 <joadavis> So I wanted to see if you witek had any sway. :) 15:17:20 <witek> does TC know that? 15:17:45 <witek> I mean they're adding new publishers theirselves 15:17:53 <joadavis> I don't know if tc or openstack at large is aware of the lack of developers for telemetry projects 15:18:04 <witek> prometheus being the recent example 15:18:15 <joadavis> (and yes, the fact that a new publisher was added two months ago did invalidate that argument) 15:19:08 <joadavis> I'm not sure if i just asked the wrong question, or if having the name "monasca" on it automatically gets a negative reaction from the telemetry team. :P 15:19:38 <witek> I don't know 15:19:43 <witek> I'll go through the logs 15:19:54 <witek> could you point me to them? 15:19:56 <joadavis> MAy 28 and 29th 15:19:59 <witek> thanks 15:20:17 <witek> for now, I have left you some cosmetic comments in review 15:20:34 <witek> I though about writing to them on openstack-dev mailing list 15:20:42 <witek> but want to read the logs first 15:21:03 <witek> I think we can sync on this offline joadavis 15:21:03 <joadavis> thanks. I have been pondering other long term options, but all would take some work involved that I don't think we have resources to sign up to do 15:21:38 <witek> any other votes on this? 15:22:27 <witek> #topic log-api 15:22:45 <witek> dougsz: your turn 15:22:54 <dougsz> So this is pretty close. I've made a monster patch here: https://review.openstack.org/#/c/570599/ 15:23:19 <dougsz> I can split that into two patches if people would prefer - one to add the ElasticSearch repo, and the other for the API 15:23:33 <witek> #link https://review.openstack.org/#/c/570599/ 15:24:02 <dougsz> My major question is around common code. I started off by trying to factor of common methods from the monasca API into monasca common so that I could use them in the log-API 15:24:27 <dougsz> But then, after adding a voluptuous schema, I had to change these methods a bit 15:24:54 <dougsz> So I wonder if I should confine the log-api change to log-api repo, and address the common code in later commits 15:25:09 <dougsz> To keep things simpler 15:25:19 <jgrassler> I'd vote for that (take care of the monasca-common adjustments later). 15:25:46 <dougsz> Going one step further, I seriously wonder if combining the monasca-api and monasca-log-api might make it easier to maintain 15:26:06 <jgrassler> I'm not a huge fan of "let's add this and let's also include that" - it's got a way of never ending :-) 15:26:29 <dougsz> Yeah - the change is already pretty big. 15:26:41 <witek> I understand dougsz meant it for longer term 15:28:24 <witek> I basically like the idea of joining these two repos 15:28:44 <witek> which would also be valid for events-api 15:28:59 <witek> but would prefer to plan it for the next release 15:29:17 <dougsz> sounds good 15:29:20 <witek> I think it could reduce the maintenance effort 15:29:40 <witek> other thoughts? 15:29:41 <jgrassler> Yes. From a packaging point of view, I'm all for it... 15:30:04 <joadavis> sounds like a good plan 15:30:09 <dougsz> ok, cool 15:30:27 <dougsz> So i will make the log-query-api all in one place, and leave the refactoring for later 15:31:22 <witek> we should take a closer look though what issues could occur 15:31:33 <witek> I mean, when joining 15:31:57 <jgrassler> Yes. This definitely warrants a spec. 15:32:13 <witek> agree 15:32:22 <dougsz> Maybe I can start by throwing some observations into one 15:32:30 <witek> good idea 15:32:58 <dougsz> ok, thanks all, that's it from me 15:33:10 <witek> folks, please review :) 15:33:26 <jgrassler> For one, operators will no longer be able to separate log and metrics APIs (and given the amount of traffic that goes through them that might be desirable :-) 15:35:01 <dougsz> it'd certainly be nice to collect use cases. From our point of view, we always deploy them together. 15:35:31 <jgrassler> I guess that could be taken care of through the service catalog. 15:36:05 <jgrassler> You just run multiple instances of the unified API services and designate one as "this is for logs" and the other as "this is for metrics" 15:36:29 <dougsz> yeah 15:36:45 <dougsz> that makes sense to me 15:37:19 <witek> let's write up a spec for this, so we can go through all this in review 15:37:41 <dougsz> sounds good, i'll start one 15:37:44 <witek> thanks dougsz 15:37:56 <witek> #topic large deployments 15:38:11 <witek> pandy_: sorry, that you had to wait 15:39:14 <witek> pandy_: so you wanted to have some advice on Kafka configuration, right? 15:39:44 <pandy_> witek, thanks for adding our topic in agenda, mayankkapoor_ will explain 15:39:54 <mayankkapoor> Some brief context. pandy_ and I work together in Reliance Jio. Jio is a telecom startup, and we have our own Openstack private cloud. 15:39:59 <mayankkapoor> In Inida 15:40:03 <mayankkapoor> *India 15:40:24 <witek> nice, have we met in Vancouver? 15:40:49 <mayankkapoor> Unfortunately both pandy_ and I couldn't attend. Harshit from our team was there. 15:41:05 <mayankkapoor> He has attended the Monasca update session I think 15:41:32 <mayankkapoor> So long story short, we have about 2000 bare-metals running Openstack, and about 200 running Kubernetes now 15:42:03 <mayankkapoor> Our openstack deployments are 500 nodes each (400 computes, 100 odd storage nodes) 15:42:29 <mayankkapoor> Till now we haven't enabled ceilometer for VM level monitoring due to scaling issues. We currently run Ocata. 15:43:18 <mayankkapoor> We reviewed Monasca architecture and liked it a lot. So the question is: Would you help point us to configuration and deployment settings meant for large scale clouds? 15:44:04 <witek> how do you want to deploy? 15:44:20 <mayankkapoor> Our current plan is to deploy on bare-metals, running docker swarm 15:44:47 <mayankkapoor> Some issues with Kafka running on docker swarm have led us to separate out the Kafka cluster and run in on bare-metals directly. 15:45:00 <mayankkapoor> We have earmarked 3 bare-metal nodes in our control plane for Monasca 15:45:34 <witek> monasca-helm deploys Kafka together with other components in Kubernetes cluster 15:45:50 <witek> HPE and OP5 use it 15:46:24 <dougsz> mayankkapoor: From my side (StackHPC) we haven't scaled up to anything that size yet, but we're working on a 600 node deployment on the Darwin cluster at Cambridge uni, so we should have some experience in a few months. 15:47:00 <witek> dougsz: how do you deploy? 15:47:02 <mayankkapoor> Yeah saw the helm chart. Ok we will defn try the helm install and report back. 15:47:33 <dougsz> At the moment we have a hybrid deployment of Kolla Ansible + some legacy LXC containers 15:47:59 <dougsz> The Kolla-Ansible deployment should be finished in a few weeks at which point we'll switch over to that entirely. 15:48:15 <joadavis> At SUSE we are still deploying to VMs using ansible or crowbar (containers will come soon :) ) 15:48:28 <witek> dougsz: with Monasca included in Kolla? 15:48:31 <jgrassler> Actually Crowbar uses Ansible, too :-) 15:48:43 <dougsz> Yeah that's right 15:50:39 <mayankkapoor> We'll go ahead first with our bare-metal install, and then try the helm install. Thanks for the advice. 15:50:45 <witek> mayankkapoor: it could make sense to dedicate own node(s) for TSDB 15:51:34 <mayankkapoor> @witek: ok we can try that. We'll test first and see if there are any bottlenecks. 15:51:58 <witek> how many measurements do you plan to collect? 15:52:49 <mayankkapoor> Very few: CPU, RAM, Storage are the critical ones. Currently we're using the default ones enabled, but will steadily turn unnece metrics off 15:52:56 <mayankkapoor> *unnecessary metrics off 15:54:24 <mayankkapoor> No further questions for now, thanks for listening 15:55:00 <pandy_> as mayankkapoor will try helm, also looking for advise to handle kafka with better performance 15:55:08 <dougsz> mayankkapoor: It'll be interesting to hear how you get on 15:55:21 <mayankkapoor> @dougz: sure 15:55:50 <witek> I'll ping OP5 folks, if they can give some advice from their deployment 15:56:00 <mayankkapoor> thanks 15:56:31 <witek> I'm also very interested in your progress 15:57:12 <pandy_> witek, thanks, looking forward op5 docs for deployment 15:57:25 <witek> pandy_: mayankkapoor: thanks 15:57:41 <witek> #topic READMEs convertion 15:57:59 <witek> it seems we have to convert our main READMEs to rst 15:58:17 <witek> otherwise we won't be able to create new releases 15:58:50 <witek> I just noticed I have linked wrong story 15:59:03 <witek> the right one: 15:59:06 <witek> https://storyboard.openstack.org/#!/story/2002163 15:59:35 <witek> https://review.openstack.org/572764 15:59:51 <witek> here attempt to create one 16:00:26 <witek> if you have some time please pick up a repo, add a task and convert 16:00:43 <witek> I have to wrap up now 16:00:46 <witek> thanks everyone 16:00:51 <witek> see you next week 16:00:57 <koji> thanks 16:00:57 <joadavis> thanks! 16:01:01 <witek> #endmeeting