Monday, 2020-07-13

*** hongbin has quit IRC00:20
*** hongbin has joined #heat00:22
*** ricolin has joined #heat02:33
*** hongbin has quit IRC03:26
*** ramishra has joined #heat03:44
*** ricolin_ has joined #heat04:56
*** ricolin has quit IRC04:57
*** udesale has joined #heat05:38
*** rcernin has quit IRC06:02
*** rcernin has joined #heat06:11
*** vishalmanchanda has joined #heat06:54
*** rcernin has quit IRC08:06
*** ramishra has quit IRC08:28
*** ramishra has joined #heat08:29
*** k_mouza has joined #heat09:21
*** rcernin has joined #heat09:54
*** ricolin_ has quit IRC10:01
*** rcernin has quit IRC10:03
*** rcernin has joined #heat10:08
*** rcernin has quit IRC10:40
*** ricolin_ has joined #heat11:23
*** tkajinam has quit IRC11:37
*** udesale_ has joined #heat12:09
*** udesale has quit IRC12:12
*** ramishra has quit IRC13:22
*** ramishra has joined #heat13:26
*** udesale_ has quit IRC13:27
*** irclogbot_2 has quit IRC13:27
*** udesale_ has joined #heat13:28
*** irclogbot_2 has joined #heat13:28
*** beekneemech is now known as bnemec14:38
*** udesale_ has quit IRC15:07
*** k_mouza has quit IRC16:33
*** ayoung has joined #heat16:55
*** ayoung has quit IRC17:14
*** ayoung has joined #heat17:17
*** ayoung has quit IRC17:29
*** ayoung has joined #heat17:30
*** ricolin_ has quit IRC17:40
*** ayoung has quit IRC18:03
*** ayoung has joined #heat18:04
zanebmnaser: I think this is the same issue: https://storyboard.openstack.org/#!/story/200784319:04
mnaserzaneb: yep.. we've resorted to running `heat-manage service clean` for now every 1 hour + manually setting `host` to match the system hostname (we don't want to run with `hostNetwork` and this is running inside k8s)19:05
mnaserit pretty much does mean that autoscaling heat-engine inside k8s is not a possibility19:06
zanebwhy not? rabbit gets overwhelmed?19:07
mnaserzaneb: well, what i noticed is on scale-down events (when heat gets SIGTERM), it doesn't clean up its queues behind it, so for a bit, engines are still up19:08
mnaserzaneb: so things queue up in those and then api calls timeout (for example, a stack list)19:09
zanebmnaser: that's weird. when you do a graceful shutdown it should stop pulling new requests off the queue and respond to in-flight API calls before shutting down19:10
mnaserzaneb: but because the engine state is still up (cause i assume it only goes 'down' when it hits $timeout)19:11
mnaserso things still get queue'd to it (i noticed this with engine_worker with a few messages like 13-15)19:11
zanebwe don't unicast API requests to a particular engine afaik, except when we're checking if it's still alive for the purposes of stealing its locks19:12
mnaserzaneb: maybe i'm misinterpreting the real issue, but the artifact is api timeouts and some engine listener queues with 13-15 msgs (or more depending on how busy heat was at the time)19:12
zanebmnaser: looks like the engine-listener ones are purely messages asking the engine if it's still alive. if there's no reply we'll conclude that it's not. so that shouldn't break anything19:31
zanebAPI requests shouldn't time out unless the engine is shutdown ungracefully though19:32
mnaserzaneb: well also engine_worker_xxxx thing19:32
mnaserthose also end up with messages19:32
zanebthat's weird because we literally only ever use cast() with engine_worker queues afaict19:33
mnaserzaneb: but the theory of a cast() from the api going towards engine_worker queues that are no longer being listened to because the worker has shut down?19:34
mnaserthis could also be a by-product of other things, upgrading to ussuri really hurt the performance till enabling cache :x19:34
zanebthere's 3 kinds of topics19:35
zanebengine - uses call() to respond to API calls. no idea how oslo.messaging chooses which engines queue to send to19:36
zanebengine-listener - uses call() to check if a particular engine is alive19:36
zanebengine_worker - uses cast() exclusively19:36
zanebI'd have assumed that for cast() all messages would go in the same queue and any engine could pick them up19:37
mnaserzaneb: so this is what i end up with https://www.irccloud.com/pastebin/CqDCzURw/19:38
zanebthanks. oslo.message docs are non-existent, so we'll have to find an expert to explain how it is supposed to work19:44
zanebbnemec: who is the oslo.messaging expert these days? kgiusti?19:46
bnemecHe's the first person I would point you to, yeah.19:46
zanebasked in #openstack-oslo19:57
*** vishalmanchanda has quit IRC21:08
*** k_mouza has joined #heat22:12
*** tkajinam has joined #heat22:54
*** ayoung has quit IRC23:06
*** ayoung has joined #heat23:08
*** rcernin has joined #heat23:08
*** k_mouza has quit IRC23:17
*** hoonetorg has quit IRC23:24
*** k_mouza has joined #heat23:30
*** k_mouza has quit IRC23:34
*** hoonetorg has joined #heat23:39
*** k_mouza has joined #heat23:40
*** k_mouza has quit IRC23:45
*** k_mouza has joined #heat23:48
*** k_mouza has quit IRC23:53
*** k_mouza has joined #heat23:57

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!