Wednesday, 2024-08-28

*** mmalchuk_ is now known as mmalchuk07:20
jakeyipI'm aaround if anyone wants to talk magnum08:57
andrewbonneyHi. We wanted to bring up https://bugs.launchpad.net/magnum/+bug/2067345 at the meeting if possible08:58
jakeyipok09:02
jakeyip#startmeeting magnum09:03
opendevmeetMeeting started Wed Aug 28 09:03:03 2024 UTC and is due to finish in 60 minutes.  The chair is jakeyip. Information about MeetBot at http://wiki.debian.org/MeetBot.09:03
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.09:03
opendevmeetThe meeting name has been set to 'magnum'09:03
jakeyip#link https://etherpad.opendev.org/p/magnum-weekly-meeting09:03
jakeyipPlease put your topics into to Agenda09:03
jakeyip#topic Roll Call09:03
jakeyipo/09:03
jakeyipmnasiadka / dalees if you are around09:04
mnasiadkao/09:04
mnasiadkaI'm here09:04
daleeso/09:04
daleesI'm around, sort of.09:04
jakeyipcool let's get on with it :)09:06
jakeyip#topic QueuePool limit bug 09:07
jakeyip#link https://bugs.launchpad.net/magnum/+bug/206734509:07
jakeyipandrewbonney: I believe this is from you09:07
andrewbonneyYeah, I just wanted to raise it again as we've seen the same as others since upgrading to C09:07
andrewbonneyIt's pretty major as we have to restart Magnum services frequently to keep things working09:07
jakeyipdid the patches fix things?09:08
andrewbonneyI haven't applied them personally yet as patching oslo.db is a little involved, but given other services are using oslo.db without issue I was a little surprised that might be required09:08
jakeyipyeah I am not sure where the bug is, as I haven't encountered it in prod (we are still at B).09:11
jakeyipI was planning to get to C then I can debug, but unfortunately I had to chase down a few bugs in other places affecting our deployment of Magnum, so C upgrade got delayed09:12
jakeyiphow about mnasiadka or dalees ? 09:12
daleesLikewise, not running C yet; CAPI driver has got most of my attention for now and Magnum version isn't the limitation anymore.09:14
jakeyipif I was to guess, it may have been something introduced by us trying to bring sqlalchemy up to date09:16
andrewbonneyI did have a look at the code around those changes but nothing jumped out unfortunately09:17
jakeyipandrewbonney: are you able to help us test by rolling back those commits? 09:17
jakeyip#link https://review.opendev.org/c/openstack/magnum/+/91072209:17
jakeyip#link https://review.opendev.org/c/openstack/magnum/+/91051209:17
mnasiadkawe are going to work on upgrades to C - so sooner or later this year we'll probably stumble on the same issue09:18
jakeyipandrewbonney: which driver are you using?09:18
andrewbonneyWe're running the vexxhost CAPI integration09:19
andrewbonneyI'm happy to try rolling stuff back, but that will also involve pinning oslo.db back to ensure compatibility with the autocommit changes09:21
jakeyipwill reverting just the autocommit change https://review.opendev.org/c/openstack/magnum/+/910722 fail ? 09:25
andrewbonneyIf we stick with oslo.db 15 from upper-constraints I believe so yes09:25
jakeyipwhat are the magnum / oslo.db versions you are running now?09:26
andrewbonneyMagnum 18.0.1, oslo.db 15.0.009:27
jakeyipsqlalchemy?09:28
andrewbonney1.4.5109:28
daleesandrewbonney: what is the pattern you see with db connections, how quickly do they rise with approx how many clusters? similar to https://bugs.launchpad.net/magnum/+bug/2067345/comments/12 ?09:29
andrewbonneyI can go away and collect some data. Looking at our logs it takes maybe 3 days from service restart to start seeing errors, but this is with 1-3 clusters present at any one time09:31
andrewbonneyWe're running all this in a staging environment at present09:31
daleesI'll try an upgrade in development env soon, and see if I can reproduce the issues.09:34
jrosserwhat andrewbonney is describing is an environment where we do man create/delete of a small number of clusters09:35
jrosserrather than having a large number of clusters that is long lived09:35
jrosser*many create/delete09:35
jakeyipandrewbonney: another thing you can try is try this patch https://review.opendev.org/c/openstack/magnum/+/92662609:35
andrewbonneyWill do, ta09:36
jakeyipthis switches over the code from the legacy facade to the new one introduced in 2024.1, possibly fixing the issue 09:36
jakeyipno sorry, not introduced in 2024.1, introduced many years ago09:37
jakeyipI think rolling forward to https://review.opendev.org/c/openstack/magnum/+/926626 is prob the best choice09:40
andrewbonneyI'll give that a go and feed back in the issue after I've got some data on connections09:44
jakeyipthanks!09:45
jakeyipanything else? 09:49
jakeyipok thanks everyone for coming 09:54
jakeyip#endmeeting09:54
opendevmeetMeeting ended Wed Aug 28 09:54:07 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)09:54
opendevmeetMinutes:        https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-08-28-09.03.html09:54
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-08-28-09.03.txt09:54
opendevmeetLog:            https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-08-28-09.03.log.html09:54
jakeyipBTW anyone going to OpenInfra Asia next week? 09:54
mnasiadkaYes, I'll be there10:25
opendevreviewAndrew Bonney proposed openstack/magnum-ui master: Fix master_lb_enabled not following template during cluster create  https://review.opendev.org/c/openstack/magnum-ui/+/92738712:55

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!