08:00:46 <licanwei> #startmeeting Watcher 08:00:47 <openstack> Meeting started Wed Feb 26 08:00:46 2020 UTC and is due to finish in 60 minutes. The chair is licanwei. Information about MeetBot at http://wiki.debian.org/MeetBot. 08:00:48 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 08:00:51 <openstack> The meeting name has been set to 'watcher' 08:00:52 <Dantalio-> Hello 08:01:05 <chenke> hello 08:01:07 <licanwei> Dantalion: hi~ 08:02:36 <licanwei> Dantalion: are you still on holiday? 08:03:34 <licanwei> #topic Announcements 08:03:35 <Dantalio-> No holiday atm 08:04:19 <Dantalio-> Universities only have holidays during winter and summer in the Netherlands 08:05:07 <licanwei> Dantalion: how about the Watcher helm charts? 08:05:50 <Dantalio-> Progress is a bit slow, mainly due to the openstack docker images (loci) being broken due to the transistion from python 2 to 3 08:06:25 <Dantalio-> I have patches ready after loci I will go to openstack-helm-charts, the charts themselves are ready on a downstream repository 08:08:16 <licanwei> Dantalion: how many Watcher charts? 08:08:56 <Dantalio-> I still need to figure out a good way to test them I am hoping someone at CERN can help me as I know someone who has experience on running OpenStack with helm 08:09:05 <licanwei> api, decsion-engine and applier? 08:09:56 <chenke> If someone can be found, I believe it will be fast. 08:10:17 <Dantalio-> licanwei: yes all services will be setup with the helm chart but it should be configurable through the chart itself 08:11:10 <Dantalio-> Helm charts also allow to easily say things like: I want two decision-engine with a load balancer in front 08:11:37 <licanwei> great~ 08:13:27 <licanwei> if no more infomation, lets move on 08:13:44 <Dantalio-> No more information 08:13:56 <licanwei> #topic Review 08:14:14 <licanwei> https://review.opendev.org/#/c/709914/ 08:14:34 <licanwei> Rocky will be Extended Maintenance 08:15:21 <Dantalio-> So we need to backport all necessities to make Rocky work? 08:15:54 <licanwei> I submit Watcherclient and watcher final release 08:16:12 <licanwei> https://review.opendev.org/#/c/709939/ 08:16:25 <licanwei> https://review.opendev.org/#/c/709938/ 08:17:45 <licanwei> i add you all as reviewer 08:18:43 <Dantalio-> Yes pls, so far looks good but maybe we are missing the patch that fixed fetching some metric for ceilometer or it could be that it wasn't broken yet with Rocky 08:18:49 <Dantalio-> I will check this 08:20:10 <licanwei> we still can patch Rocky after EM 08:20:40 <licanwei> https://review.opendev.org/#/c/691806/ 08:21:12 <licanwei> This bp is for rollback actionplan 08:22:18 <Dantalio-> I will have time to review it this week my main concern is what will happen if data model has changed 08:22:52 <Dantalio-> In my opinion it should fail immediately and not do any rollback otherwise it could happen that it migrates some machines and fails halfway through 08:24:03 <licanwei> it just reverts actionplan that specify its actionplan uuid 08:25:26 <licanwei> i think it should check the actions before send message to applier, if the DM has changed, it should fail 08:25:46 <Dantalio-> Agreed 08:26:59 <chenke> Ye. DM changed, The actionplan is out-of-date 08:26:59 <licanwei> We can check it when coding review 08:27:52 <licanwei> https://review.opendev.org/#/c/707779 08:28:11 <licanwei> This is for webhook api authentication 08:28:38 <Dantalio-> I can review this todo 08:28:42 <licanwei> I added a config option enable_webhooks_auth 08:28:57 <licanwei> Dantalion: thanks 08:28:58 <Dantalio-> today*, lgtm 08:30:12 <licanwei> #topic discussions 08:30:49 <licanwei> about Watcher event based audit authentication 08:31:12 <licanwei> as https://review.opendev.org/#/c/707779 08:31:46 <Dantalio-> I discovered webhooks provide token context if required, as your patch shows we can also use acl's 08:32:12 <Dantalio-> Combined with having to create a special audit type in Watcher the webhooks now have good authentication 08:34:57 <licanwei> Dantalion: can you give a reference about acl? 08:37:56 <licanwei> move on 08:38:02 <Dantalio-> Yes lets 08:38:13 <licanwei> Time series analysis for strategies 08:39:21 <Dantalio-> Currently watcher works by taking a picture of current infrastructure state and performing optimisation based on that. However, many metrics are very time sensitive. 08:40:41 <Dantalio-> I want to work on a small time series framework for Watcher that allows strategies to perform time series analysis and discover patterns such as: Every wednesday between 15:00 and 16:00 this hypervisor has contention due two these 2 vm's fighting for resources 08:41:04 <Dantalio-> It would allow for much more intelligent strategies 08:42:46 <chenke> If there is time, I suggest writing a spec 08:43:03 <Dantalio-> Ofcourse this will come with a spec 08:43:16 <chenke> Maybe this is a good improvement 08:43:19 <chenke> Cool 08:43:23 <licanwei> Dantalion: It's similar AI or ML 08:44:21 <Dantalio-> licanwei: It is similiar but ML is smarter it will learn to recognize these patterns, time series analysis can only discover patterns we program to look for (every week, every month, etc) 08:45:28 <chenke> Looking forward to your spec detailed documentation. 08:46:10 <licanwei> Dantalion: ok, if AI or ML, i don't think it's should be inside Watcher 08:48:26 <Dantalio-> Agree, ML is better to have service outside of Watcher that uses maybe uses parts of Watcher to do ML based optimization strategies (applier can be reused for instance) 08:49:34 <Dantalio-> Time series analysis is just smart statistics 08:49:48 <licanwei> Dantalion: yes. Watcher have a scoring module 08:49:51 <licanwei> https://specs.openstack.org/openstack/watcher-specs/specs/newton/implemented/scoring-module.html 08:50:09 <licanwei> but no strategy use it now 08:51:07 <licanwei> suzhengwei_: hi~ 08:51:10 <suzhengwei_> hi 08:51:45 <licanwei> suzhengwei_: are you still have time for your spec? 08:52:03 <suzhengwei_> I have update it. 08:52:06 <licanwei> suzhengwei_: coding? 08:53:18 <suzhengwei_> I plan to implent it in next release. If you have time, you can take it over. 08:54:16 <licanwei> suzhengwei_: ok, maybe next release:( 08:54:28 <suzhengwei_> yes, V release. 08:55:27 <licanwei> suzhengwei_: We think if the datamodel has changed , the actionplan rollback should fail asap 08:56:36 <licanwei> you can check the meeting log for more information 08:57:08 <suzhengwei_> Ok. 08:57:13 <licanwei> If no more questions, I'll end the meeting 08:57:20 <Dantalio-> No more questions from me 08:57:44 <licanwei> Thank you all for today's meeting 08:57:59 <licanwei> bye~ 08:58:04 <Dantalio-> Yes thank you and see you next time 08:58:06 <suzhengwei_> bye 08:58:07 <chenke> bye 08:58:20 <licanwei> #endmeeting