Thursday, 2025-08-07

amoralejdviroel, microversion testing worked fine in https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/955775 it ran it in the master job and skipped it in the stable ones even without https://review.opendev.org/c/openstack/watcher/+/95638006:42
dviroelamoralej: nice10:54
dviroelthe api tests don't have a min require microversion, so they all run in stable branches10:55
dviroelmost of the scenario tests have a min required of 1.3, so they should be skipped in stable branches if we don't backport the devstack change10:55
amoralejbut the one i'm adding in https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/955775/4/watcher_tempest_plugin/tests/api/admin/test_action.py it's beeing properly skipped10:55
amoraleji meant10:56
amoralejin stable branches jobs10:56
dviroelyep10:57
dviroel"setUpClass (watcher_tempest_plugin.tests.api.admin.test_action.TestPatchAction) ... SKIPPED: The microversion range[1.5 - latest] of this test is out of the configuration range[None - None]"10:57
dviroelbut after backporting the devstack change, we expect:10:58
dviroelsetUpClass (watcher_tempest_plugin.tests.api.admin.test_action.TestPatchAction) ... SKIPPED: The microversion range[1.5 - latest] of this test is out of the configuration range[1.0 - 1.4].10:58
dviroelif we configure one of the api test to min_microversion 1.3 for instance, it will also be skipped, which is not correct, but in api don't have any11:00
amoralejah, so now works because it has latest as default11:06
amoralejack, thanks11:06
amoralejit works smoothly, anyway, nice work11:06
dviroel++11:13
dviroelhi folks, a reminder that watcher meeting starts in 10 minutes, here in this channel11:50
dviroelplease add your topics to the meeting agenda: https://etherpad.opendev.org/p/openstack-watcher-irc-meeting#L4011:50
dviroel#startmeeting watcher12:00
opendevmeetMeeting started Thu Aug  7 12:00:19 2025 UTC and is due to finish in 60 minutes.  The chair is dviroel. Information about MeetBot at http://wiki.debian.org/MeetBot.12:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.12:00
opendevmeetThe meeting name has been set to 'watcher'12:00
dviroelwho is around today?12:00
amoralejo/12:00
morenodo/12:01
dviroelcourtesy ping: jgilaber sean-k-mooney chandankumar rlandy12:02
rlandyo/12:02
dviroelalright, let's start with today's meeting agenda12:02
dviroel#link https://etherpad.opendev.org/p/openstack-watcher-irc-meeting#L40 (Meeting agenda)12:02
dviroelfeel free to add your own topics to the agenda12:02
dviroel#topic Announcements12:03
dviroelFlamingo release schedule12:03
dviroel#link  https://releases.openstack.org/flamingo/schedule.html12:03
dviroeladding it here just to mention that we are 3 weeks from feature freeze12:04
dviroeland reminder that no features should be merged after this milestone12:05
dviroelso please add your changes to the review list in our meeting agenda12:05
dviroeltoday I added a section in the etherpad12:05
sean-k-mooneyo/12:05
dviroelto help us track the main topic that we plan to review12:05
dviroel#link https://etherpad.opendev.org/p/openstack-watcher-irc-meeting#L1812:06
dviroeli added 3 proposed features by their topics12:06
dviroelfeel free to comment, place status update and so on12:07
dviroelsorry jwysogla for missing the Aetos topic :( 12:07
dviroelthanks for adding12:08
sean-k-mooneyi guess we never created a wtacher status etehrpad12:08
dviroelsean-k-mooney: yeah, we may create one, to include everything, bugfixes, features, etc..12:08
sean-k-mooneywell we can use the meting one btu this si what nova does https://etherpad.opendev.org/p/nova-2025.2-status12:08
sean-k-mooneyi tought we did this last cycle but maybe not 12:09
sean-k-mooneyim ok using the irc ether pad for now12:09
sean-k-mooneybut it nice to have a per cycle one as well12:09
sean-k-mooneyless noice and its uesful for cycle highlight and the reslease note prelude12:09
dviroelyes it is, since the amount of patche is also growing12:10
dviroelok, any other announcement?12:10
sean-k-mooneylet create one async and start usign it for the remaidner fo the cyel and put a link at the top of the irc etherpad12:11
dviroelsean-k-mooney: +112:11
dviroeli can start one after the meeting12:11
rlandy+112:11
dviroel#action dviroel to start a watcher status etherpad and link at the meeting etherpad12:12
dviroelok so, next topic in the list12:13
dviroel(dviroel): Eventlet Removal Updates12:13
dviroel#topic Eventlet Removal Updates12:13
dviroel#link https://etherpad.opendev.org/p/watcher-eventlet-removal (watcher evenlet removal etherpad)12:14
dviroelthere is a good news12:14
dviroelthe patch that extend decision engine to support threading mode merged12:14
dviroel#link https://review.opendev.org/c/openstack/watcher/+/952257 (Extend decision engine to support threading mode)12:14
dviroelty all for the reviews12:15
dviroelnote that this patch added a new voting job12:15
dviroelwatcher-prometheus-integration-threading runs with decision-engine in threading mode12:15
dviroelwe decided to keep it voting, since was stable enough12:16
amoralejcool12:16
dviroellet us know if your patch start hitting any issue with that specific job, and we can help with debug, or just make it non-voting if required12:16
dviroelthre is another job proposed in12:16
dviroel#link https://review.opendev.org/c/openstack/watcher/+/955097 (Add a new tox environment to run unit tests in threading mode)12:17
dviroelthat adds a new tox environment to run unit tests in threading mode too12:17
dviroelwhich should be merging soon too, i think12:17
sean-k-mooneyi left that unappoved to allow other to reivew12:17
dviroel++12:17
sean-k-mooneybut ya  llikely early next week if there are no objects12:18
dviroelfrom the decision-engine side, I expect to still go through all threadpool default number of threads12:18
dviroeland propose a update if needed12:18
sean-k-mooneyso im not sure if that will be 12:18
sean-k-mooneylooking at the eventlet code they were prvciously limiting the eventlet pool to 4 eventlets i think12:19
sean-k-mooneyi have not checked all code path but if that was infact correct for all pools12:19
sean-k-mooneythen i dont think there will be an issue using 4 real os threads12:19
sean-k-mooneythe concern with using real os thread was mainly for project where that default limit si 10,00012:19
dviroelright, that's the conclusion that I would expect to reach, but still as a in-progress task on my side12:19
sean-k-mooneywhich i think it was in nova12:20
sean-k-mooneyack12:20
dviroelwe may also want to include threadpool stats in debug logs, to help us on debugging12:20
dviroelI think that nova is also adding something similar12:20
sean-k-mooneyya gibi has added a hook for futurists stat mechanium12:21
dviroelbut that is not only for decision-engine, but for all12:21
sean-k-mooneyso you can likely port that12:21
dviroel+112:21
dviroelnext thing would be to continue the work in the applier12:21
dviroelI will provide more udpates as soon as I have something12:21
sean-k-mooneyhttps://review.opendev.org/c/openstack/nova/+/94834012:21
dviroelor the issue that we found in the way12:21
dviroelsean-k-mooney: yeah, this one12:22
sean-k-mooneythe applier will be a littel trickier12:22
sean-k-mooneyso the applie is implementign cancelation fo the task by killing the greenthread12:22
dviroelyes! 12:22
sean-k-mooneythat is not somethign you can do with an OS pthread12:22
sean-k-mooneyso we may have to consier usign a process pool but that has other problems12:23
sean-k-mooneyro we may have to rethinkn how that works in general12:23
dviroelindeed, the applier logic need to be evaluated 12:23
sean-k-mooneyif we did not have time to do that this cycle you have still made a lot fo progress12:23
sean-k-mooneydviroel: have we ensured that there is no eventlet usage in the api?12:24
sean-k-mooneyi know you marked the depercation fo the console script as done12:25
dviroelthere is a usage only with console script12:25
sean-k-mooneybut when we run under uswig have you confirmed we nevver use eventlet in teh api12:25
sean-k-mooneyya the console script is not a problem because we are just going to delete that12:25
sean-k-mooneyits the wsgi applciation taht is the main part we need to validate12:26
dviroelsean-k-mooney: only digging through the code, and I didn't found anything12:26
sean-k-mooneyack12:26
dviroelbut worth revisting yeah12:27
dviroelthanks for the reminder12:27
dviroelany other question for this topic?12:28
amoralejyou are doing a great progress on this12:28
dviroelmoving forward then, since we have things to cover12:28
* dviroel thanks amoralej 12:28
dviroel#topic (dviroel) Fix for bug #2098984 12:28
dviroeli was recently hitting this bug, with a continuous audit12:29
dviroel#link  https://bugs.launchpad.net/watcher/+bug/2098984 (Zone Migration Strategy failing to build a list of instances for migration)12:29
dviroelthe tl;dr; is12:29
dviroelzone migration strategy uses nova and cinder client to retrive instances and volumes, and then checks if they exist in the compute and storage models12:30
dviroelwhen the instance or volume doesn't exist in the model, it raises an exception, which impatch the audit, setting it to Failure12:30
dviroelso the audir doesn't run anymore12:31
dviroelthere is a patch12:31
dviroel#link https://review.opendev.org/c/openstack/watcher/+/956198 12:31
dviroelwhere I try to fix this issue with a simple try/expect to ignore the instance/volume12:32
dviroelwhich is the expected behavior in the current code12:32
dviroelamoralej: raised a question if we should fix the strategy to  instead, only look at the model, and not use the clients12:32
sean-k-mooneyya so there are multiple parts to this. 1 your patch is correct and valid, 2 amoralej  is also correct that the stragey shoudl be using the data form the model not hitting the api directly12:33
dviroelso the idea here is to get a feedback on tha12:33
amoralejit's fine for me to fix in two steps12:33
sean-k-mooneyi think we will want to fix both issue as seperate bugs12:33
amoralejwfm12:33
amoralej1st merge the try/expect from dviroel and fill a new bug with "zone_migration uses nova client instead of the model to retrive instances"12:34
dviroeli also agree that we should be using these client directly in the strategy12:34
sean-k-mooneywell using the clinet to take actions is obvioulsy fine as part of the applciation fo the action plan.12:35
sean-k-mooneyin limited cases it may be ok in the descion engine12:35
amoralejtheoretically, consuming from model should avoid hitting the other issue, but there may be race conditions with model updates, so good to handle exceptions also12:35
sean-k-mooneyto enfrich the obejcts form teh model with extra data12:35
sean-k-mooneybut we should ideally avoid that12:35
dviroelthe 2nd fix change the behavior a little bit12:36
sean-k-mooneyamoralej: well the race can alwasy happen and we need to be tolerate to that12:36
sean-k-mooneywhich si where the skip feature comes in12:36
amoralejyes12:36
amoralejbut in that case is between running the audit and executing the action12:36
dviroelby getting the list of instances/volumes from the api, we also avoid adding already deleted elements, that are still in the model12:36
amoralejwhile this particular bug is while running the audit, not the action12:36
dviroelbut yes, this can be mitigated by the migration action, in the pre_ methods12:37
sean-k-mooneydviroel: that is correct but we have to handel that the elemen might be deleted in teh applier anyway12:37
dviroel++12:37
amoralejyes12:37
sean-k-mooneybecuase between the action plan being created and it being appoved it could happen12:37
dviroeltrue12:37
sean-k-mooneydviroel: so where i ma with his is i think im pretty comfrotabel backprotign yoru inital patch12:37
sean-k-mooneyim less sure about changing to usign the model12:38
amoralejnote, that when i said to use only (unless we can't) the model is in the strategy execution, not in the action execution12:38
dviroelso we agree with the current patch, file a new bug to replace the current implementation with model only, and we can discuss if should be backport afterwards or not12:38
amoralej+112:39
sean-k-mooneythat a bigger change and we need to make sure it does nto result in other bugs before consdiring backproting it. that the ohter reason i would prefer 2 patches and 2 bugs12:39
sean-k-mooney+112:39
dviroel#action dviroel to file a new bug to explaing the current behavior with zone_migration using nova/cinder clients to get info about resources12:40
dviroel#agreed on continue with the fix https://review.opendev.org/c/openstack/watcher/+/95619812:40
dviroelthanks for the feedbacks on that o/12:41
dviroelmoving to next topic then12:41
dviroel#topic (amoralej) Update about skipped actions feature12:41
dviroelamoralej: want to highlight your latest changes?12:41
amoralej#link https://review.opendev.org/q/topic:%22blueprint-add-skip-actions%2212:41
amoralejso, i've reorganized the changes as discussed a couple of meetings back12:42
amoralejmain change is that all the api related changes are in the last one https://review.opendev.org/c/openstack/watcher/+/955753/12:42
amoralejincluding status_message visibility and new actions patch12:42
amoralejboth covered by the same microversion 1.512:43
amoralejinstead of in separate ones, as in my previous patches12:43
amoralejother than that, i think organization now is much more clear12:43
amoralejand easy to review12:43
dviroelamoralej++ 12:44
dviroelalready started to review them, based on the relation chain12:44
amoralejalso, I sent a new tempest test https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/955775 which is using the microversion support12:44
amoralejthat dviroel added12:44
dviroel++12:44
amoralejand worked great12:44
* dviroel needs to update its own tempest patch to use that12:45
amoralejruning in functional master job, skipped in functiona-<stable> ones12:45
amoralejI think we discussed pretty much the implementation details in a previous mtg, so this was just heads-up for reviews12:45
dviroelthanks amoralej12:46
amoralejthe only missing part is the watcherclient and watcher-dashboard one12:46
amoralejI'll work on that too12:46
dviroelso please folks, review these changes when you have a chance12:46
dviroelamoralej: anything else?12:47
sean-k-mooneyi have approved the parent12:47
sean-k-mooneyhttps://review.opendev.org/c/openstack/watcher-tempest-plugin/+/956306/3 so that should merge soon12:47
amoralejnothing else from my side12:47
amoralejthanks12:47
dviroelnice, thanks sean-k-mooney 12:48
dviroelthanks amoralej 12:48
dviroel#topic Reviews12:48
dviroelagain, no specific reviews to mention here12:48
dviroeljust take a look on the m-3 review list12:48
sean-k-mooneyjwysogla: proably want to bring up theres?12:48
dviroelsee what you can help us to move forward12:49
sean-k-mooneyi did a first pass on the aetos change yesterday https://review.opendev.org/c/openstack/watcher/+/95560812:49
sean-k-mooneyits in merge conflict becase of some other stuff we landed12:49
sean-k-mooneybut over all it looks pretty reasonable12:50
dviroelgreat, going to take a look soon too12:50
sean-k-mooneythe min risks here are on teh timeing of releasign the new version fo the observiablity client and geting that into upper constratis12:50
sean-k-mooneyjwysogla: what are the changes fo gettign jaunLarriba to appove teh release patch adn geing that doen thsi week?12:51
sean-k-mooneyoh they already have https://review.opendev.org/c/openstack/releases/+/95665712:51
sean-k-mooneyok so this is just pending the release team to approve12:52
dviroelok, so we need to follow up on that too12:53
dviroelwe can add it under its change in the status etherpad12:53
dviroeljwysogla: pls provide us a feedback on that, once you are back online12:53
jwysoglaSorry, I was in another meeting12:54
dviroelsean-k-mooney: thanks for raising this concern12:54
sean-k-mooneyjwysogla: no worries12:54
sean-k-mooneyonce the release is doen a bot will propose an update to the requirement repo to bump it in upper constraits12:54
jwysoglaI hope the observabilityclient releases soon. Then I can finish up the patch and make the CI passing. I run all the tests locally and they were working, so I hope there is not much work left.12:54
sean-k-mooneyuntil that is also merged the watcher patch will be blocked 12:55
sean-k-mooneybut that does nto eman we cant review in advance of that12:55
jwysoglayes, thanks12:55
dviroelcorrect12:55
sean-k-mooneyjwysogla: there are some followup that we may want to dicsus after or at the ptg12:55
jwysoglaI should also mention, that I'll be on PTO last 2 weeks of August, which is quite unfortunate timing.12:55
jwysoglaBut until then, that patch will be my top priority.12:56
dviroelok, this is importanto to know12:57
sean-k-mooneyjwysogla: basiclly next cycle if we can i woudl like to see if we can evolve our jobs away form sgcore to use the new promtious scape endpoint. i think that woudl be a good topic to dicuss in the telemetyr ptg session or any time before then after we have cut RC112:57
jwysogla+1 to that12:57
sean-k-mooneywe also need to dicsus the future of gnoici supprot in ceilometer12:57
morenod+112:58
sean-k-mooneyis it correct that there is a plan to retire the way ceilometer feed data to gnoccic today12:58
jwysoglaYes, that's also a good topic for the PTG.12:58
sean-k-mooneywe had a related topic that we want to dicuss at the ptg about the future or the gnocci backend and that woudl obviously be a factor12:59
jwysoglaI don't think we have any official "plan". I my personal opinion, gnocchi should go away once in the future. But afaik most users of openstack telemetry outside of Red Hat currently use gnocchi, so we can't just deprecate it.12:59
sean-k-mooneyjwysogla: is this somethign youc can feed back to the rest of the telemetry team (i.e. our interest in disccsign this)12:59
jwysoglayes13:00
sean-k-mooneygnoicci relise on sg_core today is that correct?13:00
sean-k-mooneyis there an intent to deprecate sg_core13:00
sean-k-mooneyor the ceilometer part. actully we can leave that to a future time13:01
sean-k-mooneywe are at the top of the hour13:01
dviroelyeah13:01
jwysoglasg-core is used purely for Prometheus. I think (I'd need to check to be sure) you can currently use the Prometheus exporter just for compute metrics, so ceilometer-central still needs to go through the notification agent and sg-core13:01
dviroelsean-k-mooney: ok to move your topic to the next meeting?13:01
sean-k-mooneyyep13:02
dviroelrlandy: thanks for volunteering to be the next chair13:02
dviroelok, so let's wrap up for today13:02
dviroelwe will meet again next week13:02
dviroelthank you all for participating13:02
dviroel#endmeeting13:03
opendevmeetMeeting ended Thu Aug  7 13:03:04 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)13:03
opendevmeetMinutes:        https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-08-07-12.00.html13:03
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-08-07-12.00.txt13:03
opendevmeetLog:            https://meetings.opendev.org/meetings/watcher/2025/watcher.2025-08-07-12.00.log.html13:03
rlandythanks dviroel 13:03
amoralej_thanks dviroel for chairing13:03
morenodthanks dviroel13:03
opendevreviewMerged openstack/watcher-tempest-plugin master: Add support for microversion testing for api and scenario tests  https://review.opendev.org/c/openstack/watcher-tempest-plugin/+/95630613:16
sean-k-mooneyjwysogla: we are not actully including sg core in our gnoccii job and efoly mentin that it has a diffent way to get the metrics. ceilometer send the metrics to it direclty13:24
sean-k-mooneyso sg-core is only relevnet in the scope of promethous adn aetos jobs13:25
jwysoglacorrect13:25
jwysoglaOriginally ceilometer was designed to push metrics to storage. But Prometheus is designed to pull them on its own from exporters. So sg-core is basically acting as a buffer, it'll receive metrics pushed by ceilometer and expose them for Prometheus to scrape.13:26
sean-k-mooneyjwysogla: i assume there is a plan to evengually extend the scrap endpoitn work to also work for ceilomenter-central?13:32
sean-k-mooneywatcher in theroy shoudl not need to care how the metrics get into aetos/promethus13:33
sean-k-mooneyim more interested if we can have less depencies in our jobs then anything else13:34
jwysoglaI had a short conversation with Juan just now. And sg-core apparently isn't meant to deprecate any time soon and currently the prometheus exporter approach isn't usable for metrics coming from notifications (I don't know if Watcher uses these for anything). But I don't have all the knowledge in this topic, Juan would be better person to ask about the Prometheus exporter, so I think this all is a 13:38
jwysoglapretty good topic for the PTG.13:38
sean-k-mooneyack, lets disucss it then so 13:41
sean-k-mooneyi can reach out to juan directly if it become relevnet before then13:41
opendevreviewJaromír Wysoglad proposed openstack/watcher master: Add Aetos datasource  https://review.opendev.org/c/openstack/watcher/+/95560815:15
opendevreviewMerged openstack/watcher master: Replace dateutils usage with datetime and oslo.utils  https://review.opendev.org/c/openstack/watcher/+/95580920:46
rlandysean-k-mooney: hi wrt https://review.opendev.org/c/openstack/watcher/+/954067 - I checked and didn't find other references to the "Watcher Overload standard deviation algorithm". Where/how do I add the release note to mention the change strategy reference name?21:41

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!