#openstack-meeting log

14:00:25 <rhochmuth> #startmeeting monasca
14:00:26 <openstack> Meeting started Wed Jun  7 14:00:25 2017 UTC and is due to finish in 60 minutes.  The chair is rhochmuth. Information about MeetBot at http://wiki.debian.org/MeetBot.
14:00:27 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
14:00:29 <openstack> The meeting name has been set to 'monasca'
14:01:10 <rhochmuth> uh oh, no agenda for today's meeting
14:01:19 <rhochmuth> that's a first
14:01:21 <rhochmuth> o/
14:01:33 <rhochmuth> anyone around today?
14:01:36 <koji> o/
14:01:38 <hosanai> o/
14:01:49 <Haruki> o/
14:02:01 <rhochmuth> hi koji and hosanai and haruki
14:02:07 <Fdaisuke> o/
14:02:11 <hosanai> hello
14:02:24 <Haruki> hi
14:02:40 <koji> hello
14:02:43 <rhochmuth> i guess the new time-slot works out a little better for everyone in japan
14:02:51 <rhochmuth> but, it is still a little late
14:03:22 <hosanai> yeah, it works for me :-)
14:03:27 <koji> 11:00pm on our time zone now
14:03:34 <rhochmuth> does anyone have any topics or reviews
14:03:39 <rhochmuth> my wife is in japan
14:03:47 <rhochmuth> she called me at 4:00 AM
14:04:05 <hosanai> hahaha. she tries to kill you :-)
14:04:14 <rhochmuth> she does her best
14:04:34 <hosanai> I would like to share info of analytics status.
14:04:35 <rhochmuth> torture is her specialty though
14:04:50 <rhochmuth> hosanai: you have the floor
14:04:55 <rhochmuth> #topic monasca-analytics
14:05:00 <hosanai> thanks!
14:05:12 <hosanai> brad gave us data for analytics
14:05:12 <Haruki> interesting!
14:05:27 <rhochmuth> brad klein?
14:05:37 <hosanai> yeah
14:05:54 <rhochmuth> very nice
14:06:02 <hosanai> and Fdaisuke checked the data but information is not enough for analytics
14:06:22 <rhochmuth> what data did you get?
14:06:35 <hosanai> We got it.
14:06:50 <rhochmuth> i meant type of data
14:06:55 <rhochmuth> was it metrics, logs?
14:07:20 <hosanai> sorry, i miss understood. Fdaisuke, could you explain it?
14:08:28 <Fdaisuke> ok
14:09:41 <Fdaisuke> there is no metorisc and logs, just only "timestamps" and "value" (100% or not)
14:10:21 <hosanai> my rough understanding is we got timestamp, value and value_meta for each services such as swift, nova and so on
14:10:52 <rhochmuth> do you have the metric name and dimensions?
14:10:55 <Fdaisuke> thanks, hosanai
14:11:30 <koji> isn't the "value" a metric?
14:12:52 <Haruki> memory usage or like that?
14:13:33 <hosanai> in dimensions, there is service and in measurements we have timestamp, value and value_meta. i think.
14:14:13 <hosanai> it looks status of services.
14:14:25 <rhochmuth> and are there any points in which incidents have been labeled?
14:15:10 <rhochmuth> for example, do you have a set of timestamps and label, for an incident and/or type of incident
14:15:22 <hosanai> yeah, he explained us unexpected behavior and the time.
14:16:12 <rhochmuth> so, you have a bunch of metrics, and then a set of incidents and the timestamps for when they occured it sounds like
14:17:04 <hosanai> we don't have metric like memory in the data.
14:17:16 <hosanai> only service status
14:18:08 <hosanai> s/memory/memory usage/
14:18:49 <rhochmuth> oh, so basically if swift goes down, the value of some metric, such as status with dimension service=swift is 1.0
14:19:45 <Haruki> only 1 or 0?
14:20:56 <hosanai> Fdaisuke, could you give us an example? you said the data looks like 100%, 90%...
14:22:31 <Fdaisuke> i saw 100%, 90%, 40%, sometimes 0%
14:23:04 <rhochmuth> ok, so other than the status metric, is there anything else of interest?
14:23:28 <rhochmuth> if that is all you have, then i agree, the dataset is not going to offer a lot of insight
14:23:45 <hosanai> yeah so Fdaisuke sent email to brad to get additional data.
14:23:48 <rhochmuth> or data analysis opportunites
14:24:19 <rhochmuth> i think what you need is a lot of metrics
14:24:36 <rhochmuth> the status metric is already informing you about incidents
14:24:54 <hosanai> i think so. I hope brad keep the data :-)
14:25:37 <rhochmuth> if you had more data, you could potentially identity other metrics that represent the incident
14:25:52 <rhochmuth> or possibly RCA
14:26:15 <hosanai> hopefully :-)
14:26:45 <rhochmuth> it would also be interesting to have the set of alarms that occurred
14:27:19 <rhochmuth> then you could train your data set using the alarms as the response/labels
14:27:49 <rhochmuth> and then try and predict incidents when you don't have the alarms
14:28:39 <hosanai> i see the alarms are useful to create labels.
14:28:43 <rhochmuth> in other works can you develop a model for predicting alarms
14:29:03 <rhochmuth> yeah, i think that would be an interesting problem
14:29:37 <rhochmuth> given a set X of all the metrics and a set y that consists of all the alarms
14:30:10 <hosanai> brad explained us if the status is not 100%, it's a problem from operator point of view.
14:30:35 <rhochmuth> so, status can be used too
14:30:50 <rhochmuth> i guess 0 is no problem
14:31:15 <rhochmuth> 25% could be low status
14:31:23 <rhochmuth> 100% could be there is a problem
14:31:34 <rhochmuth> > 75% could be there is a high probability there is a problem
14:31:46 <rhochmuth> so, that could be useful if you had the other metrics
14:31:55 <rhochmuth> you could use status as the response/label set
14:32:19 <rhochmuth> and then train on that using the metrics as predictors
14:32:30 <hosanai> thanks for the info/advise:-)
14:32:33 <rhochmuth> similar to how alarms would be used
14:32:44 <rhochmuth> your welcom
14:32:50 <hosanai> that's all I would like to share
14:32:51 <rhochmuth> i think you need a lot more data
14:33:06 <hosanai> yeah that's right
14:33:24 <rhochmuth> you could also take the status variables, and treat it as binary
14:33:35 <rhochmuth> for example, anything above 50% is a problem
14:33:54 <rhochmuth> then your problem turns into a binary classification problem
14:34:03 <rhochmuth> but, i'm not sure that is what you want to do
14:34:28 <hosanai> a binary classification means 2 values classification?
14:34:34 <rhochmuth> correct
14:35:06 <rhochmuth> but, you can also treat the predictor as continous, which is what the status is
14:36:14 <hosanai> it's a good way to handle the problem as a binary classification to get good result
14:36:15 <rhochmuth> so, if brad can get you more data, even without the alarms, i think you might have something you can start with
14:37:13 <hosanai> yeah, we can push analytics ahead with brad's help
14:37:23 <rhochmuth> it would also be very interesting to know what metrics correlate well with problems
14:38:03 <hosanai> i think so it's kind of RC
14:38:38 <rhochmuth> that would be good for RCA
14:40:42 <hosanai> yeah, and witec introduced Fujitsu's colleagues who created original algorithm (i forgot the approach) for analytics
14:41:10 <rhochmuth> do you know the names?
14:42:05 <hosanai> i forgot it I got a paper but did't read it yet :(
14:44:02 <hosanai> names for the guys?
14:44:10 <hosanai> you mean
14:44:32 <rhochmuth> well, i was wondering if they were hpe or fujitsu folks
14:46:09 <hosanai> I will send you the names later. I can't access company email now.
14:46:26 <rhochmuth> np
14:46:49 <hosanai> at least 9 hours later :-)
14:47:29 <rhochmuth> i thought you were still at work
14:47:33 <rhochmuth> i'm dissapointed
14:47:38 <rhochmuth> :-)
14:47:50 <hosanai> hahaha
14:49:45 <rhochmuth> so, is that it for today?
14:50:46 <hosanai> i don't have anything now
14:51:02 <rhochmuth> thx hosanai
14:51:13 <rhochmuth> i guess i'll end the meetign a little early
14:51:29 <rhochmuth> #endmeeting