Thursday, 2025-06-12

opendevreviewAlfredo Moralejo proposed openstack/watcher master: Fix instance migrations in zone_migration strategy together with volumes  https://review.opendev.org/c/openstack/watcher/+/95211606:27
opendevreviewDavid proposed openstack/watcher-tempest-plugin master: Add tests for workload_balance with injected data  https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/94972209:11
dviroelsean-k-mooney: hi - pls revote on this spec when you have some time: https://review.opendev.org/c/openstack/watcher-specs/+/94387311:14
sean-k-mooneydone i missed that hey updated it11:17
opendevreviewMerged openstack/watcher-specs master: feat(spec): disable cold/live migration for host maintenance strategy  https://review.opendev.org/c/openstack/watcher-specs/+/94387311:24
opendevreviewMerged openstack/watcher-tempest-plugin master: Update tests for host maintenance strategy.  https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/95085111:54
*** amoralej_ is now known as amoralej11:59
dviroel#startmeeting watcher12:00
opendevmeetMeeting started Thu Jun 12 12:00:43 2025 UTC and is due to finish in 60 minutes.  The chair is dviroel. Information about MeetBot at http://wiki.debian.org/MeetBot.12:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.12:00
opendevmeetThe meeting name has been set to 'watcher'12:00
dviroelhi all, who's is around today?12:00
sean-k-mooneyo/12:01
sean-k-mooneyim doing spec review but im here12:01
dviroelcourtesy ping: amoralej jgilaber chandankumar morenod12:01
jgilabero/12:01
mtemboo/12:01
opendevreviewDavid proposed openstack/watcher-tempest-plugin master: Add tests for workload_balance with injected data  https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/94972212:02
amoralejo/12:02
dviroelalright, let's start with today's meeting agenda12:02
dviroel#link https://etherpad.opendev.org/p/openstack-watcher-irc-meeting#L45 (Meeting agenda)12:02
dviroelfeel free to add your own topics to the agenda12:02
dviroelthere are topics to place your changes that requires attention from reviewers12:03
dviroellets start with the first one12:03
dviroel#topic Eventlet Removal12:03
dviroel#link https://etherpad.opendev.org/p/watcher-eventlet-removal12:03
rlandyo/12:03
dviroelnot too much to add i think12:03
dviroelI updated the patch to move eventlet commands to a different directory12:03
dviroeland propose a WIP patch for decision engine12:04
dviroellink for related patches:12:04
dviroel#link https://etherpad.opendev.org/p/watcher-eventlet-removal#L1112:04
dviroelthe WIP patch for decision engine, which is now running on CI, is failing :P12:04
dviroelthe patch allows to replace usage of green thread pools and test against oslo.service new threading backend12:04
dviroelas for now, there is an issue with cluster data model updates which I'm investigating and working on a fix on my env12:05
dviroeli will bring more updates about that soon12:05
dviroelany question or comments on this topic?12:06
dviroelok, moving to the next topic12:07
amoralejthe plan is to make the change configurable and disabled by default?12:07
amoralejor moving to the new threading backend in oslo.service by default?12:07
sean-k-mooneydifferent project are taking diffent approches12:08
sean-k-mooneynova is making it configurable via a ENV var12:08
sean-k-mooneyironic and neutron just ripped out supprot  i think12:08
dviroelyes amoralej, and have a CI job running with eventlet disabled for the services that we add support for it12:08
dviroeli think that, for now we could go with the ENV var to enable/disable eventlet, like nova12:09
amoraleji ask because i just found that the oslo.service supporting the threading backend is blocked in RDO because it requires some other updates in libs12:09
amoralejcotyledon, iirc12:09
amoralejnothing strategic12:10
sean-k-mooneyyes it does12:10
sean-k-mooneyalhtouhg that is not a new requriement in general12:10
sean-k-mooneycotyledon was already used in some projects in openstack12:10
amoralejyes, but it requires > 2.012:10
sean-k-mooneythat shoudl already be packaged in rdo12:10
amoralejand there is 1.7 or something like that12:10
amoralejnot big thing, but needs to be done12:10
sean-k-mooneyack but that would have been requireed for the other service that nativly use it already12:11
amoralejalso new backend is added as an extras12:11
amoralejso, it needs to provide an extra optional rpm12:11
sean-k-mooneygood to know12:11
dviroelright, good point amoralej 12:11
sean-k-mooneyonce we start merging support in watcher we wil want to make it a required dep for the watcher rpm12:11
amoralejit has automatic deps, so if we add it to requiremets.txt, it should be automatically in the rpm12:12
sean-k-mooneybut that can happen later in the cycle once we actully have it woring12:12
amoralejyeap12:12
dviroelthanks for the heads up amoralej 12:13
sean-k-mooneydviroel: this is not the way to test that by the way https://review.opendev.org/c/openstack/watcher/+/952257/1/watcher/eventlet.py12:13
sean-k-mooneyyou can tweak the job to set the env var for the relevent service12:13
sean-k-mooneybasicly you shoudl not need to comemnt and hard code the behvior12:14
sean-k-mooneyalso returnign FALSE here https://review.opendev.org/c/openstack/watcher/+/952257/1/watcher/eventlet.py#2612:14
sean-k-mooneyis semanticly incorrect12:14
dviroelit checks the MONKEY_PATCHED for that 12:15
dviroelbut we can discuss more about there in the change, for sure12:15
dviroelthe idea is not to hardcoded things12:15
dviroelbut work based on OS_WATCHER_DISABLE_EVENTLET_PATCHING12:16
sean-k-mooneyi know but monkey patch shoudl return None12:16
sean-k-mooneyi dont want to use   if _monkey_patch():12:16
sean-k-mooneyif is_patching_enabled(): is fine12:17
sean-k-mooneybut dont conflate the two "_monkey_patch" shoudl patch if not allreay patched it shoudl not  be used to make desions12:17
sean-k-mooneyanyway lets move on 12:18
dviroelack, we can discuss more about this in the patch on later in the channel12:18
dviroelthanks sean-k-mooney 12:18
sean-k-mooneyalso is_patched is the function that you shoudl be useing to make desciosn outside this module12:18
sean-k-mooneyill leave thos comment on the review12:19
dviroellets move on to the next topic, and feel free to comment on the WIP patch too, maybe I'm missing some details there12:19
sean-k-mooneyis_patching_enabled shoudl be private and internal.12:19
dviroelsean-k-mooney: ty12:19
dviroel#topic Call for reviews12:20
dviroelthre is some doc reviews open12:20
dviroel#link https://review.opendev.org/c/openstack/watcher/+/951002 Update Host Maintenance strategy documentation12:20
dviroelalready has some votes on it ^ 12:21
dviroel#link https://review.opendev.org/c/openstack/watcher/+/951025  Update Workload Balance strategy documentation12:22
dviroelrlandy: any comment on this one? 12:22
rlandynot really - just needs some more votes12:23
rlandythe only ones for further discussion are the tables ones12:23
dviroelack, it was recently update, i will review again12:23
rlandythose two should match12:23
dviroel#link https://review.opendev.org/c/openstack/watcher/+/951440  Add table - level of test/usage per strategy12:23
dviroeland12:23
dviroel#link https://review.opendev.org/c/openstack/watcher/+/951699  Add Integrations doc page with support matrix12:23
dviroelack, i see that you moved the status from Production -> Supported12:25
dviroelto match with the Integration table12:25
dviroelthey look that are in sync now12:25
dviroelgoing to review it again and vote12:26
dviroelpls all, take some time to review these doc updates ^12:26
dviroelthere is one more12:26
dviroel#link https://review.opendev.org/c/openstack/watcher/+/951899 Add doc clarifications for Zone Migration12:26
dviroelty all for helping improving the docs (and reviewing them)12:27
dviroelany other doc patches? 12:27
dviroelnow about specs12:28
sean-k-mooneyill try and review those all after the meeting12:28
dviroelthe spec:  feat(spec): disable cold/live migration for host maintenance strategy merged minutes ago12:28
sean-k-mooneyyep12:29
dviroelwe still have 3 open for review12:29
dviroel#link https://review.opendev.org/c/openstack/watcher-specs/+/947282 (Adds spec for extend compute model attributes)12:29
dviroelwhich I recently updated to reduce the scope ^ - we would end now with 2 specs, one for extending the model, and anothe one for using this new info in strategies12:30
sean-k-mooneyya we likely will be able to complete the first this cycle i think12:30
dviroelgoing to propose another one to cover the strategies part12:30
sean-k-mooneythe second might be next cycle as im not sure we can design it in time for sepc freeze12:30
sean-k-mooneywe can see how things go12:31
dviroelcorrect, there is a lot to discuss in the second one12:31
dviroelty for all for comments/reviews 12:31
dviroelnext spec12:31
dviroel#link https://review.opendev.org/c/openstack/watcher-specs/+/949528 (Add spec for skip actions from action plan executions)12:31
amoralejabout that, i still need to update it with your comments12:31
dviroeli need to revisit the last ps too12:32
amoralejbut there is an opened topic from gmaan , which is the status of the actionplan when one ore more actions are skipped12:32
dviroelit is on my spec review list12:32
amoralejmy initial approach was keeping it as succeeded but he challenged if succeeded is the same as succeeded_with_skipped12:33
amoralejwdyt? should we add a new state to the actionplan?12:33
amoralejor keep as succeeded with some note or something visible?12:33
sean-k-mooneythats an interesting question12:34
amoralejit is :)12:34
sean-k-mooneyi dont htink it shoudl be a uniqe state12:34
sean-k-mooneyit could be but then it calles into question the diffent betwen skiped intentionally and skipped by precondtions12:35
sean-k-mooneyit feels incorrect to skip actions before executing the plann and then to say succeeded_with_skipped 12:35
sean-k-mooneyif not addtional actions were skipped during the execution12:36
sean-k-mooneyto feel missleadign to me as all the actions i asked to executed were executed12:36
amoralejthat's a good point, so skipping manually actions would not affect to the succeeded status, but automatically would12:36
sean-k-mooneyso i woudl only add succeeded_with_skiped if you explcitly exluded ones that were skipped before we started the action plan and that feels overly compelxt IMO12:37
sean-k-mooneycan you capture this in the spec. i can try an loop back12:37
sean-k-mooneyand we can discuss it more with gmaan  if they have time12:37
amoralejso, what would be the case if the action skips automatically based on pre-condution?12:37
sean-k-mooneyi think we could do it based on th reason filed12:38
amoralejyes, we can keep discussing in the review12:38
jgilaberamoralej: I would still consider an succesful actionplan that had actions skipped automatically as success12:38
sean-k-mooneybasiclaly if we set it to "skipped by user" or some knwo string when skipping ebfore the execution12:38
sean-k-mooneythen we could perhaps have successs_with_skipped if it only was reported that way when there is skipped_by_precondtion in the reaons 12:39
sean-k-mooneythat might be the way forward12:39
sean-k-mooneyamoralej: in either case we shoudl capture this in the alternitive section regardles of which path we take12:40
sean-k-mooneyto record this design choice12:40
amoralejack12:40
dviroelack, so for those that want to know more, pls check the spec, there are some discussions in there already12:41
amoraleji will do, but instead of using the text reason field, i wonder if that should be a new one12:41
amoralejanyway, yes, let's keep the discussion in the spec12:41
amoralejany feedback is appreciated12:41
dviroeltks amoralej for the updates12:42
dviroelthre is one more spec open12:42
dviroel#link https://review.opendev.org/c/openstack/watcher-specs/+/949804 (Support multitenancy with Prometheus datasource)12:42
dviroelthis one I still need to review :( 12:42
dviroelbut sean-k-mooney already raise some discussion/suggestions there12:43
dviroel++12:43
sean-k-mooneyya im not partically happy with how the design of this is beign done12:43
sean-k-mooneymetta context as well i think they are assumeign we would backprot this supprot downstream12:44
sean-k-mooneywhich they have not discuss with us and i did not plan to do12:44
sean-k-mooneythat a donwstream dicsusion but12:44
sean-k-mooneyform my perspecitve they are not properly doing open desgins and following upsream convestion which im annoyed by12:45
dviroelack, thanks for raising that and for requesting a spec, which allow us to discuss more about the design12:46
dviroeli will try to get some eyes on this one too12:47
dviroelif someone else have some time, pls also take a look12:47
sean-k-mooneyas it stand the current issue i think is a blocker is changing the obeserablity client to require that we alwasy pass a keystone session12:47
sean-k-mooneythat shoudl only be done if Ateos is used12:48
dviroel+112:48
sean-k-mooneyi condiser it to be a security issue12:48
dviroel#link https://opendev.org/openstack/aetos fyi12:48
sean-k-mooneyto pass a admin toke to prometiosu if it is not required12:48
sean-k-mooneybecause if that is not authenicated viah https12:48
sean-k-mooneythen it can be sniffed on the wire12:48
sean-k-mooneyand now you have full admin rights12:49
sean-k-mooneyso it is not accpable to pass it when it is not used and it need to be opt in for upgrade reasons12:49
dviroelupgrade reasons is a important point to raise, for sure12:50
dviroeland also the security concerns12:50
dviroelthanks for the discussions sean-k-mooney - we should all ready the spec and the comments there12:51
dviroelsince it is a important design change for watcher12:51
dviroelgoing to move forward, and we should bring these specs again in the next meetings..12:52
dviroelthat's all for now in specs12:52
dviroelthere are some review request for bug fixes too12:52
dviroel#link https://review.opendev.org/c/openstack/watcher/+/952212 bug in workload stabilization12:52
amoraleji added a those, which are more urgent and i expect are easy to review12:53
dviroelack, going to take a look12:53
dviroeland 12:53
dviroel#link https://review.opendev.org/c/openstack/watcher/+/952364 Aggregate by label when querying instance cpu usage in prometheus12:53
dviroelthis second one I need some time to process the query:) 12:54
opendevreviewDavid proposed openstack/watcher-tempest-plugin master: Add tests for workload_balance with injected data  https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/94972212:54
amoralejsure :)12:55
dviroelbut it is something important to review 12:55
dviroelany question on these patches? 12:56
amoralejit's finally only changing the "avg by (instance)"  to "avg by (resource)"12:56
amoralejbut i took the oportunity to move the query format to dict which i think it's better12:56
amoralejfrom %s to dict, i mean12:56
morenodI confirm that after that change, the workload_balance under prometheus is consistent12:56
dviroelack, it is a small change, but it is also a dangerous one :)12:57
dviroelso it requires more eyes 12:57
dviroeli will take a look12:58
amoralejyep, we can break it all by chaging prometheus queries, that's right12:58
amoralej:)12:58
dviroelwe are almost out of time12:58
dviroelwe can review the bugs listed after the meeting, or someone want to highlight something?12:59
dviroelthanks mtembo for volunteer to chair next week meeting13:01
dviroelI won't join next week's meeting, since it is a public holiday here13:01
mtembothanks too dviroel13:01
dviroelok, i will move the bugs listed to next week meeting, and close for today13:02
dviroelthank you all for joining13:02
dviroel#endmeeting13:02
opendevmeetMeeting ended Thu Jun 12 13:02:56 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)13:02
opendevmeetMinutes:        https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-06-12-12.00.html13:02
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-06-12-12.00.txt13:02
opendevmeetLog:            https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-06-12-12.00.log.html13:02
amoralejthanks dviroel for chairing!13:04
gmaansean-k-mooney: amoralej: ++ on only add skip for automatic skipped actions. Main idea is user should know if actions are skipped .when they skip they know so nothing needed but if automatic skipped then they should know either from state or some extra field with reason.16:52
gmaanAlso, it does not need to be a new state as such but adding skipped action info in other field can server the same purpose16:53
gmaanamoralej: I kept spec open to reply but got distracted on other tasks. I replied to your comments in spec, please check16:59
sean-k-mooneygmaan: well you can list the action of an action plan and see the status fo each17:01
sean-k-mooneygmaan: so in the current propaly the action plan woudl be succeeded adn the action that were skipped would have the skipped statsu and a populated reaosn field17:02
sean-k-mooneyto say why17:02
sean-k-mooneyi.e. live mgiration was skipped because the vm was deleted17:03
sean-k-mooneysomethign like that17:03
sean-k-mooney<action> was skipped beacue <precondition> was not met17:03
gmaantrue but having a field say "action_skipped" on action plan level will give a easy way to know and if user need to go in action status detail17:03
gmaanit can be None if no action skipped 17:03
sean-k-mooneyso if we look at action plan show https://docs.openstack.org/api-ref/resource-optimization/#id42 that is the respocen today17:04
sean-k-mooneyit does not refenc the actions at all17:05
sean-k-mooneyyour suggestion we asdd a list or somehting to say that some acttions were skipped?17:05
sean-k-mooneywe coudl have a genera "status_message" field rather then "reason" and provide a human readiable message i guess17:06
gmaanyeah, because if we keep state same 'success' which make sense then another field showing detail about 'success' state can be useful 17:06
gmaansean-k-mooney: ++  "status_message" 17:07
sean-k-mooneyok that i can get behind, a detail fild would also work17:07
sean-k-mooneyit woudl be nice ot use the same name for the field on the action and action plan17:07
sean-k-mooneybut i dont have a stong prefernce17:07
sean-k-mooneyi jsut want it to be consditent where reasonable17:07
gmaanmaking it more generic and not just limiting to 'skipped" is good idea 17:07
sean-k-mooneyit woudl be useful to report info on the audit too17:08
gmaanyeah17:08
sean-k-mooneyi thikn "status_message", "message", or "detail" are all good options17:09
sean-k-mooneyamoralej: ^17:09
sean-k-mooneyorginall i was suggesting "reason" for skipped, canceled and failed17:09
sean-k-mooneybut reaons is weired for success17:09
sean-k-mooneyi know nova uses fault for error reproting but im not sure that the right example to follow17:11
opendevreviewDouglas Viroel proposed openstack/watcher master: WIP - Merge decision engine services into a single one  https://review.opendev.org/c/openstack/watcher/+/95249919:06
opendevreviewDouglas Viroel proposed openstack/watcher master: Extend decision engine to support threading mode  https://review.opendev.org/c/openstack/watcher/+/95225719:17

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!