Wednesday, 2025-09-10

opendevreviewGregory Thiemonge proposed openstack/octavia master: WIP Don't wait for the completion of jobs in the consumers  https://review.opendev.org/c/openstack/octavia/+/96037813:44
skraynev@gthiemonge hi. do you know : why this endpoint do not set PENDING status for balancer? 14:40
skraynevhttps://github.com/openstack/octavia/blob/master/octavia/api/v2/controllers/amphora.py#L180-L19814:40
gthiemongeskraynev: hey I knew it, but i don't remember. I know that it's expected, and we have a similar behavior in the cert rotation flow of the housekeeping service14:47
gthiemongejohnsom knows14:49
skraynevinteresting. thanks. I hope, that johnsom will shed of light on this behaviour 14:50
gthiemongehttps://bugs.launchpad.net/octavia/+bug/211185714:54
gthiemongehmm14:54
johnsom@skraynev It's because this API does not impact the load balancer, it only updates the amphora agent config and for the most part does it transparently by mutating the config. 15:00
gthiemongejohnsom++15:01
johnsomIt is also a single API vs the multiple endpoints that affect change to the load balancer config15:01
johnsom#startmeeting Octavia16:00
opendevmeetMeeting started Wed Sep 10 16:00:36 2025 UTC and is due to finish in 60 minutes.  The chair is johnsom. Information about MeetBot at http://wiki.debian.org/MeetBot.16:00
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.16:00
opendevmeetThe meeting name has been set to 'octavia'16:00
gthiemongeo/16:00
johnsomHi everyone, happy RC1 week!16:01
johnsom#topic Announcements16:01
johnsomIt's RC1 week. lol16:02
gthiemonge:)16:02
johnsomI think there are still bug fixes we should get in, but we can do an RC2 as well.16:03
johnsomThoughts?16:03
gthiemongecan you mark those reviews in the etherpad? I'll prioritize them16:04
johnsomYes, I can work on that today.16:04
johnsomI am thinking the GRAP bug for example, the controller IP list update16:05
johnsomJust to name a few off the top of my head16:05
gthiemongeunless there's a regression, i'm not found of RC216:05
gthiemongeack16:05
johnsomYeah, it's a pain16:05
johnsomWe should keep the list as small as possible16:06
gthiemongeyeah16:06
gthiemongesimple bugfixes are backportable too16:07
johnsomYeah, everything is a backport after RC1 lol16:07
gthiemongeright16:08
johnsom#topic Brief progress reports / bugs needing review16:08
whershbergerHey friends, wondering if I could get another review on https://review.opendev.org/c/openstack/octavia/+/958808 since the failure isn't related to any changes proposed against 2025.1/stable. I'll rebase https://review.opendev.org/c/openstack/octavia/+/956930 as soon as that merges and would appreciate a review of the backports after that (we would really like to get these fixes delivered in Ubuntu)16:08
johnsomI have been working on reviews and trying to finish the controller IP list update patch16:08
johnsomwhershberger Ok, thanks for letting us know that is a priority. I will look after the meeting16:10
gthiemongewhershberger: I +2ed the patch that makes grenade non-voting, note that you don't need to rebase your patch (because the gerrit branch is always merged into the target branch in the CI)16:10
whershbergerOh cool, that's good to hear. Thanks johnsom, really appreciate it16:11
gthiemongei've been working on improving the graceful termination of the worker when jobboard is enabled, not that it doesn't work well, but I think we can handle it in a better way16:11
johnsomYes, this underscore vs dot issue with setuptools is impacting a lot of things. There is an open setuptools bug for it, but I have not checked the status yet16:12
whershbergerI think it made lwn some time ago :)16:12
johnsomOh really? lol16:13
johnsomWell, the fun of working in python....16:13
johnsomAny other progress reports this week?16:13
johnsom#topic Open Discussion16:14
johnsomHow about any other topics this week?16:14
gthiemongenothing from me16:15
johnsomI have been tracking the mariadb discussion the TC has been having about the database collation changes. There is an OpenStack discuss thread about it. I need to go do more research on the changes on the mariadb side to fully get my head around it.16:16
johnsomIn Octavia, we do not specify a collation, we rely on the collation used at DB creation time by the deployment tools.16:17
johnsomSo we may/may not be too impacted or their might need to be some kind of upgrade collation migration. To be determined....16:17
johnsomOk, that is all I have this week. Thanks for joining!16:18
gthiemongeinteresting topic16:18
gthiemongethanks johnsom 16:18
johnsom#endmeeting16:18
opendevmeetMeeting ended Wed Sep 10 16:18:44 2025 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)16:18
opendevmeetMinutes:        https://meetings.opendev.org/meetings/octavia/2025/octavia.2025-09-10-16.00.html16:18
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/octavia/2025/octavia.2025-09-10-16.00.txt16:18
opendevmeetLog:            https://meetings.opendev.org/meetings/octavia/2025/octavia.2025-09-10-16.00.log.html16:18
skraynev@johnsom regarding API for update amphora config. I found an interesting bug, when run locally tempest tests. Test creates LB, later it makes several amphora config updates and exactly after it runs delete LB. in one of X runs it could goes to race condition, when Lb be removed, but flow of update amphora config stuck in jobboard. There is here second part of issue: task tries to connect to deleted VM during 30 m16:37
skraynevinutes, but jobboard re-schedule task to another worker after timeout of 30 seconds.16:37
skraynevsetting LB to PENDING_UPDATE on update amphora config will prevent initial error. however the second part with infinite task in jobboard could be still reproduced.16:39
johnsomYeah, there could be a bug in that flow there. Feel free to open a launchpad for it16:39
johnsomYeah, I don't think locking the LB is the right solution. Probably just need better error logic in the amphora flow16:40
skraynev@johnsom ok, got it, I will create a bug. What about the case with stuck task in jobboard. I thought, that octavia will update TTL of task , but when it goes in AmphoraConfigUpdate looks like TTL update skipped. I did not debug why it happens yet. may be you met similar case?16:48
gthiemongeskraynev: ttl should be updated until the task is complete, or the worker that runs it is stopped16:49
gthiemongeskraynev: is it the redis backend?16:50
skraynevyeah. I expected the same, but observed another situation. maybe it happens, because LB was deleted, as I mentioned above. or may be because task retry connect to already deleted amphora.16:51
skraynevyes, it's the redis 16:51
gthiemongewhich release of octavia/openstack do you use?16:52
johnsomI suspect it is the retry to connect running with a long retry timeout.16:52
skraynev@gthiemonge 2023.1 it's not fresh actually, but I did not find significant changes in base_taskflow or in taskflow releasenotes.16:57
skraynev@johnsom timing is the following: 30 minutes - totally try to connect to VM (120 retries with 10 timeout and 5 sleep duration), 30 seconds - expiration time in taskflow. so exactly after 30 seconds task are marked as without parent key and assigned to another worker16:59
gthiemongethe last major fix regarding the ttl or any related issues was included in zed17:00
skraynev@johnsom https://bugs.launchpad.net/octavia/+bug/2122533 , if some necessary details are missed, please let me know.17:08
johnsomThank you!17:08
skraynev@gthiemonge yeah.  I will try to make some debug around it and come back with more information.17:09
skraynevthank you for the help 17:09
opendevreviewMerged openstack/octavia stable/2025.1: zuul: octavia-grenade voting false  https://review.opendev.org/c/openstack/octavia/+/95880818:33
ttHey, trying to setup redis with the jobboard and it doesn't appear to like my redis cluster. A redis cluster should be supported? I'm getting a MOVED 655 error pointing to the master host. Anyone have any tips?18:57
johnsomAre you using Redis Sentinel?18:57
ttI am not, but maybe I need to?18:58
johnsomI think the Redis driver in Taskflow only supports standalone or Sentinel based clustering19:00
ttAh! Thanks for the infor!!19:04
johnsomIt ultimately uses this library: https://pypi.org/project/redis/19:05
johnsomWe test with standalone, but others are using sentinel19:06
johnsomI'm not sure about other clustering mode/models.19:06
ttGotcha, we're currently testing with ot/redis-cluster19:07
ttthey have a sentinel version, going to give that a try!19:07
opendevreviewMichael Johnson proposed openstack/octavia master: Fix Amphora controller IP list update  https://review.opendev.org/c/openstack/octavia/+/95893320:36
opendevreviewMerged openstack/octavia master: Use native String type  https://review.opendev.org/c/openstack/octavia/+/95709721:22
opendevreviewMerged openstack/octavia master: Fixed missing port_id in additional_vips API  https://review.opendev.org/c/openstack/octavia/+/94686621:26
opendevreviewMerged openstack/octavia master: Move log offload files to /var/log/octavia  https://review.opendev.org/c/openstack/octavia/+/95595921:36
opendevreviewMerged openstack/octavia master: Use overridden defaults cors options when generating a config file  https://review.opendev.org/c/openstack/octavia/+/93541021:36
opendevreviewMerged openstack/octavia master: Fix L7Rules with FILE_TYPE and EQUAL_TO  https://review.opendev.org/c/openstack/octavia/+/95785321:39

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!