Wednesday, 2024-09-11

daleeshi jakeyip , are you around this week?08:44
jakeyipI am here09:00
jakeyipdalees: anything you want to discuss?09:00
daleesyeah, I wanted to discuss https://bugs.launchpad.net/magnum/+bug/2067345 and proposed fix to see if it's on the right track.09:01
daleesanyway, I added to meeting notes :)09:02
jakeyipok09:02
jakeyip#startmeeting magnum09:02
opendevmeetMeeting started Wed Sep 11 09:02:53 2024 UTC and is due to finish in 60 minutes.  The chair is jakeyip. Information about MeetBot at http://wiki.debian.org/MeetBot.09:02
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.09:02
opendevmeetThe meeting name has been set to 'magnum'09:02
jakeyip#link https://etherpad.opendev.org/p/magnum-weekly-meeting09:03
jakeyip#link https://etherpad.opendev.org/p/magnum-weekly-meeting09:03
jakeyip#link https://etherpad.opendev.org/p/magnum-weekly-meeting09:03
jakeyipo/09:03
jakeyip#topic Roll Call09:03
jakeyipo/09:03
daleeso/09:03
jakeyip#topic  Cluster create returns before uuid is valid09:05
jakeyip#link https://bugs.launchpad.net/magnum/+bug/2078390 09:05
jakeyipdalees: I just read the bug, haven't reviewed it yet, apologies09:06
daleesno worries, I understand you've been away!09:06
jakeyipyeah just got back Mon. :)09:07
jakeyipseems valid. I wonder if you referred to other services for the pattern? (synch. call to init db entry)09:08
daleesNo, I didn't. I also considered adding a new state 'INITIALIZED', but figured that was more change than it was worth. It would help identify a lost or slow async RPC create message, if that ever occurred.09:10
jakeyipok, I don't think that's necessary 09:14
jakeyipI find the last two lines of the change log confusing - what can be upgraded first?09:14
jakeyipoh sorry I misread, ignore me09:15
daleesconductor first - it will handle both the old and new rpc create messages09:16
dalees(and by 'old' I just mean that initialize hasn't been called yet)09:18
jakeyipfirst read it looks ok, I will look more after this meeting09:19
daleeswell, I will deploy this change and ensure it sorts out our Tempest problems. Maybe there is more to do, but it feels like a reasonable change even though it adds an RPC round-trip.09:19
daleescheers, all good.09:19
jakeyipjust curious, which tempest tests are you running?09:21
daleeswe run ClusterTest.test_create_list_sign_delete_clusters regularly.09:22
jakeyipyeah same. do you use template id or let the tempest test create a template?09:23
daleestemplate id09:24
jakeyipok same09:26
daleesthe symptoms we end up seeing in this race condition is that tempest will end up deleting a cluster as Magnum does the helm install. So some secrets are wiped from CAPI and the cluster in Magnum is gone, but the helm resources are all created and CAPI tries to keep reconciling the cluster.09:28
daleesso keep a eye out in your CAPI management cluster for extra tempest clusters that Magnum doesn't know about.09:28
daleesI don't think it would be limited to the CAPI helm driver, as the conductor is doing all of this async. But maybe it's easier to reproduce with helm.09:30
jakeyipso there will be a k8s cluster but no corresponding magnum cluster? 09:30
daleesyes, but it usually fails to create as Magnum deletes the OpenStack secrets, or deletes the app cred.09:31
jakeyipyeap ok09:31
jakeyipfor the RPC incompatibility, I am looking if there's some recommendations for this kind of change. it's not typical like adding a new object field. have you looked into that?09:38
daleesNot at other prior art, no. I understand some services have RPC versioning they can use, which prevents upgrading until all agents are on the new version. I'm not sure if Magnum registers agents in the same way as Nova/Neutron though.09:41
jakeyipyeah there's a `openstack coe service list`, but no version from what I can tell09:46
jakeyipgetting late, let's do this offline09:46
jakeyipanything else you want to discuss?09:47
daleesnope, that's all from me09:47
jakeyipok let me take some time to review 09:50
jakeyipseeya next week? (no pressure :P )09:50
daleesYes, see you next week09:54
jakeyip#endmeeting09:56
opendevmeetMeeting ended Wed Sep 11 09:56:05 2024 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)09:56
opendevmeetMinutes:        https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-09-11-09.02.html09:56
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-09-11-09.02.txt09:56
opendevmeetLog:            https://meetings.opendev.org/meetings/magnum/2024/magnum.2024-09-11-09.02.log.html09:56
jakeyipdalees: https://github.com/openstack/magnum/blob/master/magnum/common/rpc_service.py#L35 :P 10:09
jakeyipoh well10:09
daleeshaha 201410:23
mnasiadka10 years ago, nice11:17

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!