Wednesday, 2019-06-05

*** sapd1_x has joined #openstack-self-healing00:05
*** sapd1_x has quit IRC00:35
*** mrunge has quit IRC01:07
*** mrunge has joined #openstack-self-healing03:43
*** ricolin has joined #openstack-self-healing03:58
*** witek has joined #openstack-self-healing06:57
*** rakhmerov has joined #openstack-self-healing08:01
aspiersmorning! anyone got topics they want to discuss today?09:04
witekhi aspiers, I could shortly report from the billing initiative meeting09:05
aspierssure09:05
aspiers#startmeeting self-healing09:05
openstackMeeting started Wed Jun  5 09:05:20 2019 UTC and is due to finish in 60 minutes.  The chair is aspiers. Information about MeetBot at http://wiki.debian.org/MeetBot.09:05
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.09:05
*** openstack changes topic to " (Meeting topic: self-healing)"09:05
openstackThe meeting name has been set to 'self_healing'09:05
aspiers#topic report from billing initiative meeting09:05
*** openstack changes topic to "report from billing initiative meeting (Meeting topic: self-healing)"09:05
aspiersgo for it :)09:05
witek:)09:05
witekPublic Cloud SIG has started the "Billing Initiative"09:06
witekI have attended the last two meetings09:06
witekhttp://eavesdrop.openstack.org/meetings/publiccloud_wg/2019/publiccloud_wg.2019-05-23-14.00.log.html09:06
witekhttp://eavesdrop.openstack.org/meetings/publiccloud_wg/2019/publiccloud_wg.2019-05-28-14.05.log.html09:06
witekthe discussions so far focused on collecting and storing the required measurements09:07
witekI was promoting the idea of instrumenting the code of OpenStack services09:08
aspiersOK09:08
witekI think it's in a long term the best approach both for billing and monitoring in general09:08
witekguys have not been very enthusiastic about it though09:09
aspiersDoes that tie in with what we discussed in Denver with Dirk / Ben / others about exposing internal service metrics via endpoints?09:09
witekyes, that's the same approach09:10
aspiersOK09:10
aspiersI just had a crazy idea for the future :)09:10
aspiersBilling could offer refunds if monitoring sees that users are impacted by outages09:11
aspiersthat kind of ties billing with self-healing09:11
witeknice use case :)09:11
witekanyway, they pointed out that the implementation will take long time and were not optimistic if it can be finished at all09:13
witekMohamed Nasser suggested writing Prometheus exporters instead09:13
aspiersOK09:13
witekI think, instrumenting the code is not much effort, it just requires coordination with other projects09:15
aspiersYeah09:15
witekand will pay off in long term09:15
aspiersMaybe write a spec?09:15
witekbecause instrumentation will live with the code09:15
witekand is able to collect more data than `black box` monitoring with exporter09:15
aspiersYeah09:16
aspiersA spec would be a good way to help people understand your vision09:16
aspiersYou could give some example code to show how it would work09:16
witekyes, I think I should prioritize it on my list09:17
witekthat's all from me I guess09:17
aspiersNot sure where the best place for the spec is09:18
witekif we want to provide oslo library, probably there, but I'm not sure if we need one09:18
witekself-healing could be alternatively09:19
aspiersyup09:20
aspierswell you can draft the spec and submit it somewhere09:20
aspiersif it's the wrong place it's easy to move :)09:20
witekcorrect09:20
aspiersalright, thanks for reporting about that09:23
aspiers#topic AOB09:23
*** openstack changes topic to "AOB (Meeting topic: self-healing)"09:23
aspiersI don't have any updates except I still need to send a report about Denver :-(09:23
aspiersanything else from your side?09:23
witekideas for session proposals for Shanghai?09:24
aspiersoh, good point09:24
aspierswe should try the same one we submitted last time with Ifat09:25
witekyes, that might work09:25
witekI don't have anything else09:27
aspiersnor me09:27
aspierswhen's the deadline for Shanghai?09:27
aspiersJuly?09:27
witekearly July09:27
aspiersOK09:27
aspiersWe have time :)09:27
aspiersMaybe we can talk to the new Vitrage PTL about it09:27
witekright09:27
aspiersCool09:28
witeknext two weeks I'm in vacation09:28
aspiersI won't have time this week either09:28
aspiersDo you still have the text of the submission?09:28
witekyes, will find it09:28
aspiersMaybe best to forward to him sooner, so he has time to think about it while you are away09:28
aspiersCool, thanks09:28
aspiersAlright, thanks again and catch you soon!09:29
witekthanks, bye09:29
aspierso/09:29
aspiers#endmeeting09:29
*** openstack changes topic to "https://wiki.openstack.org/wiki/Self_healing_SIG | https://storyboard.openstack.org/#!/project/openstack/self-healing-sig"09:29
openstackMeeting ended Wed Jun  5 09:29:37 2019 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)09:29
openstackMinutes:        http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-09.05.html09:29
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-09.05.txt09:29
openstackLog:            http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-09.05.log.html09:29
*** ricolin has quit IRC10:47
*** tojuvone has quit IRC11:03
*** tojuvone has joined #openstack-self-healing11:04
*** ricolin has joined #openstack-self-healing12:03
*** sapd1_x has joined #openstack-self-healing13:14
joadavisre "instrumenting the code is not much effort, it just requires coordination with other projects" - There have been some similar discussion in the Telemetry side about having each service instrument their code and provide notifications.  So some projects are already on-board with the idea.14:32
aspiersjoadavis: cool14:33
*** akhil_jain has joined #openstack-self-healing14:50
*** sapd1_x has quit IRC14:59
*** witek has quit IRC15:44
aspiersmight be a few mins late for the meeting16:25
aspiersif anybody has anything to discuss, feel free to start without me16:26
*** ekcs has joined #openstack-self-healing16:55
aspiershey ekcs17:03
ekcshello!17:04
ekcshow’s it been?17:04
aspiersblegh :)17:05
aspiersbusy17:05
aspiershow about you?17:05
ekcshaha yea I get it.17:05
aspiersoh, I guess I owe you a pic of my new trackpad17:06
ekcsit’s been pretty good. busy as well. got pulled into a lot more internal work. at least it’s fun work. but does stretch my time that’s for sure.17:06
ekcsyea would love to see how it’s working out!17:06
aspiersnice17:06
aspiersI'm mostly working on nova at the moment17:06
aspiersanyway, do we have anything to discuss today?17:07
* aspiers logs into storyboard17:07
ekcsnot too much from me. I’ve made some progress on documenting the monasca-tacker-congress integration, but slowly.17:08
aspiersoh, just remembered a couple of things17:08
aspiersmight as well minute them17:08
aspiers#startmeeting self-healing17:08
openstackMeeting started Wed Jun  5 17:08:21 2019 UTC and is due to finish in 60 minutes.  The chair is aspiers. Information about MeetBot at http://wiki.debian.org/MeetBot.17:08
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.17:08
*** openstack changes topic to " (Meeting topic: self-healing)"17:08
openstackThe meeting name has been set to 'self_healing'17:08
aspiersSo this morning witek mentioned some ongoing discussions around billing, and the idea that instrumenting service code in order to provide metrics might work better than black-box monitoring for that17:09
aspierswhich ties in with https://storyboard.openstack.org/#!/story/200563217:09
aspiers#topic exporting metrics from services17:09
*** openstack changes topic to "exporting metrics from services (Meeting topic: self-healing)"17:09
aspiersBTW we seem to have a duplicate story in storyboard for this I think?17:10
aspiershttps://storyboard.openstack.org/#!/story/200564017:10
aspiersseem to remember some weirdness with StoryBoard when we were submitting stories recently17:10
ekcsoh weird. yea I may have created a duplicate because of the weirdness.17:11
ekcsI guess we should delete one?17:11
aspiersyeah, https://storyboard.openstack.org/#!/story/2005632 has one fewer task17:11
ekcsok I’ll delete that one.17:11
aspiersthanks17:11
aspiersnot much more to say on that right now except link to this morning's minutes17:12
aspiers#link http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-09.05.html this morning's minutes17:12
aspiers#topic heat + octavia + aodh17:13
*** openstack changes topic to "heat + octavia + aodh (Meeting topic: self-healing)"17:13
ekcsgreat. yea I read up on the morning meeting. sounds like there isn’t great support just yet, but great thing that witek is working on it.17:13
aspiersso this popped up on the mailing list:17:13
aspiers#link http://lists.openstack.org/pipermail/openstack-discuss/2019-May/006582.html demo of app auto-healing via heat+octavia+aodh17:13
aspiersDidn't get a response though17:13
aspiersWe can either keep chasing or try to document at least a skeleton for it ourselves17:14
aspiers#action aspiers to create a story for documenting that use case17:14
ekcsgot it. yea first step maybe simply to link to that video in a skeletal doc. I can take a stab at that.17:15
aspiersI'll finish that after the meeting17:16
aspiersI mean, finish creating the story17:16
aspiersThat would be awesome if you could kick it off17:16
ekcsyup.17:16
aspiersWe can totally merge a skeleton and flesh it out later17:16
aspiersMain thing is promoting the discoverability / awareness17:16
aspiersIf people are aware and they need more details, they'll probably ask for them17:16
aspiers#topic automated testing17:17
*** openstack changes topic to "automated testing (Meeting topic: self-healing)"17:17
ekcssounds good17:17
aspiersThis old chestnut :)17:17
aspiersSo we *may* have an intern doing a masters thesis on this topic17:17
aspiersin which case we could expect to see some progress17:17
aspiersbut nothing guaranteed yet17:17
aspiersfingers crossed!17:17
ekcsoh very nice! I also see that ricolin started some basic tempest setup.17:18
aspiersYup. IIRC it's still marked WIP so not sure if he needs any help with that17:18
ricolinaspiers, ekcs, yes, it's  working already but I'm more working on how to make the test scenario test more stable17:19
aspiersricolin: cool!17:19
aspiers#link https://storyboard.openstack.org/#!/story/2005830 New story for documenting Heat+Octavia+Aodh17:19
aspiersricolin: Let us know if you need any help17:19
ekcsawesomeness17:20
aspiersI think that was all I had for now17:20
aspiers#topic AOB17:20
ricolinthe self-healing scenario is very unstable in https://review.opendev.org/656070 try to figure out why17:20
*** openstack changes topic to "AOB (Meeting topic: self-healing)"17:20
aspiersah OK17:20
aspiersanything else?17:20
* aspiers takes a look at that review17:20
ekcsricolin: are these similar to tests already being run on heat repos?17:21
aspiersheat_tempest_plugin.common.exceptions.TimeoutException: Request timed out17:21
aspiersDetails: Stack SelfHealingTest-243821469/c9e222f4-e0f0-4cbf-ba58-dea30d2d6a08 failed to reach UPDATE_COMPLETE status within the required time (1200 s).17:21
aspiers#topic heat self-healing tests17:22
*** openstack changes topic to "heat self-healing tests (Meeting topic: self-healing)"17:22
ekcsknowing what’s new exsting heat tests may help us diagnose.17:22
aspierstrue17:22
ricolinthe time out is when the healing process didn't start in any reason17:23
aspiersOK17:23
aspiersthat's beyond my familiarity right now17:24
ricolinHeat should play better role during entire process and help to make sure all component works well17:24
ricolinand reduce the unstable cases17:24
aspiersdo you know why it didn't start?17:25
ricolinI think I got some idea17:25
ricolinbut since next week is part of my wedding ceremony, I won't be that available before 6/1517:26
aspiersAh! No problem, enjoy! :-D17:26
ekcsoh wow congrats!17:26
ricolinand the rest part happen in 11/17 so it's going to be a very long years for me!lol17:27
ricolinekcs, aspiers thx!17:27
aspiershaha17:27
aspiersalright17:27
aspiersanything else anyone want to discuss?17:27
ricolinaspiers, in short, I think that test case fail because Heat didn't make sure the Mistral workflow is up and running stable before we assume next step17:27
aspiersahah, I see17:28
ricolinI will look into that and hope I can bring some good knews17:28
ricolinknews/news17:28
aspiersperfect17:28
ekcsgreat!17:28
ricolinOnce that test is stable, the rest gate job setting will be easy17:28
ricolinsince all required patch is already there17:29
aspiersnice17:29
aspiersI guess we need a short doc explaining it too17:29
ekcsnot a discussion topic per se, but I’ve been wavering in my personal priority between identifying and supporting new use cases vs documenting existing use cases. I think I settled on documenting existing as higher priority at this stage of the sig.17:31
aspierspersonally I think either is fine17:31
aspiersWhatever you are more excited about ;)17:31
ekcs= )17:32
aspiersAny small contributions are a lot better than nothing :)17:32
ekcsyup17:32
aspiersWe're all busy with other stuff, so IMO there's no problem at all with being selective and time-boxing SIG work17:33
aspiersAlright, sounds like we're done for today?17:34
ekcsyup17:34
aspierscool17:34
aspiersthanks, and catch you soon!17:34
ekcsyup later guys! have a great week!17:35
aspierso/17:35
aspiers#endmeeting17:35
*** openstack changes topic to "https://wiki.openstack.org/wiki/Self_healing_SIG | https://storyboard.openstack.org/#!/project/openstack/self-healing-sig"17:35
openstackMeeting ended Wed Jun  5 17:35:30 2019 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)17:35
openstackMinutes:        http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-17.08.html17:35
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-17.08.txt17:35
openstackLog:            http://eavesdrop.openstack.org/meetings/self_healing/2019/self_healing.2019-06-05-17.08.log.html17:35
*** akhil_jain has quit IRC17:40
*** ricolin has quit IRC18:41
*** joadavis has quit IRC22:05

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!