Thursday, 2025-08-07

opendevreviewJulia Kreger proposed openstack/ironic master: Replace GreenThreadPoolExecutor in conductor  https://review.opendev.org/c/openstack/ironic/+/95293900:15
opendevreviewJulia Kreger proposed openstack/ironic master: Set the backend to threading.  https://review.opendev.org/c/openstack/ironic/+/95368300:15
opendevreviewJulia Kreger proposed openstack/ironic master: Remove direct mapping from API -> DB  https://review.opendev.org/c/openstack/ironic/+/95651200:15
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Optional indirection API use  https://review.opendev.org/c/openstack/ironic/+/95650400:15
TheJuliawell, this will be interesting!00:15
opendevreviewMerged openstack/bifrost master: Move all CS9 jobs to non-voting  https://review.opendev.org/c/openstack/bifrost/+/95566000:16
*** janders8 is now known as janders01:23
opendevreviewOpenStack Proposal Bot proposed openstack/ironic-ui master: Imported Translations from Zanata  https://review.opendev.org/c/openstack/ironic-ui/+/95662403:25
opendevreviewMorten Stephansen proposed openstack/ironic-python-agent master: Fix for motherboards where efibootmgr returns UTF-8.  https://review.opendev.org/c/openstack/ironic-python-agent/+/95606806:58
rpittaugood morning ironic! o/07:08
rpittaupython 3.9 is officially a thing of the past07:26
rpittauhopefully its ghost won't come back hunting for us :)07:26
dtantsurcardoe: I'm very much in favour of one base option for /var/lib/ironic (and one for /run/ironic if needed). I've been thinking about the same when working on the local RPC change.08:52
dtantsurguilhermesp: I really suspect our code does not account for holder partitions in the format of /dev/md/something. I wonder if it's your case. You can confirm by finding the lsblk output in the IPA logs.08:56
rpittauwhen a core has a moment, can you please review https://review.opendev.org/c/openstack/ironic/+/956701 and https://review.opendev.org/c/openstack/ironic/+/956685 ? thanks!10:41
TheJuliaugh metal3-integration didn't actually log anything aside of the console log12:06
TheJulia:(12:06
*** darmach48 is now known as darmach412:15
guilhermespyeah i was trying to look for it but no ipa logs stores anywhere =( 12:18
guilhermespfor now, no way around other then deploy the os in a single disk and try to convert it to software raid at some point 12:19
dtantsurguilhermesp: if I create a patch, will you be able to test it?12:30
guilhermespi think i can! this deployment has some spare nodes i could test this out, but since ppl are in a rush, i have to move forward with the prod nodes and asked them to just get rid of all vrocs lol 12:35
dtantsurI see, let me cook something testable12:36
opendevreviewDmitry Tantsur proposed openstack/ironic-python-agent master: WIP allow holder devices with slashes  https://review.opendev.org/c/openstack/ironic-python-agent/+/95678212:37
dtantsurguilhermesp: try this ^^12:37
guilhermespcool cool -- i have some meetings booked this morning and need to wait ppl to come in to see if they can get vroc on one of the spare nodes to get this in. will keep you posted dtantsur ! thx 12:40
dtantsurnp. if you can enable collecting ramdisk logs, it will help12:40
opendevreviewJulia Kreger proposed openstack/ironic master: Remove direct mapping from API -> DB  https://review.opendev.org/c/openstack/ironic/+/95651213:05
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Optional indirection API use  https://review.opendev.org/c/openstack/ironic/+/95650413:05
TheJuliaon a plus side, it looks like the metal3 got close to actually working with the last patch, which makes me wonder. Would we be okay with disabling the metal3 job temporarily to merge the "this will definitely break" patches and then merge in the fixing patches after, or do we feel the need to stack the order to avoid the CI job config change overall. (That is, at the cost of rebasing the order)13:08
dtantsurTheJulia: it's not just the job though, you're going to break the whole metal3 project, no?13:08
TheJuliaFor the time between the patches, if they are consuming master, yes. If we can keep that time narrow and land in rapid succession, they may never notice13:09
TheJuliaadam-metal3: you may care about this conversation, fyi13:09
TheJulia(as an aside, unrelated to metal3, we need to try and fix grenade in the next week or so)13:10
dtantsurWe don't rebuild images on every commit, so there is a chance it goes unnoticed13:11
rpittauI have a release planned in metal3 for ironic-image, I will have to skip the broken jobs I guess13:12
TheJuliaI guess, at least trying to get things in a happy state, I sort of like the "oh, I broke it and i'm fixing it" approach just from the standpoint that right now the switch to threading change doesn't actually log anything except the console log :(13:12
adam-metal3TheJulia, yeah we only build when we merge to ironic-image repo on our end13:13
TheJuliaregardless, I'm under no illusions, this is sort of a both big and small changes at the same time.13:14
adam-metal3there was an upstream breakage yesterday anyhow I heared the solution was discussed on our community meeting, so folks on our end anyhow very aware the main can break any time :D13:15
adam-metal3I will let the community know if you ping me when it starts13:16
adam-metal3I mean the period of potential instability13:16
TheJuliaThat could definitely work. I think the biggest issue we're going to create is the move from greenthreading to real threads is going to very much be a moment where folks may blink13:17
TheJulia(different runtime process model and related memory footprint (and very different VSS since threads)13:18
TheJuliaerr, vsz13:19
TheJuliaI keep thinking RSS and thinking it should be VSS as a result13:19
TheJuliarpittau: so realistically, I'm sort of hoping to make progress in the next couple of weeks. We're in the sort of time where we're going to need to release the next version of ironic for the end of cycle sometime in the next 4-ish weeks it looks like13:22
dtantsurlet's not break metal3 until rpittau does the release :)13:24
TheJuliaI did talk to JayF and his feeling was that it might be that the ironic community might need to merge the changes around removing eventlet and then spend a weke or two kicking the tires13:24
* dtantsur nods13:24
TheJuliarpittau: when will that be?13:24
rpittauTheJulia: the release should happen a soon as the 2 patches I mentioned before merge :D13:25
TheJuliaENOCONTEXT13:25
rpittauhttps://review.opendev.org/c/openstack/ironic/+/956701 and https://review.opendev.org/c/openstack/ironic/+/95668513:25
rpittauthe release is based on the  new bugfix branch13:25
TheJuliaoh, the two you asked about 2.75 hours ago?13:26
rpittauyeah :D13:26
TheJuliaok13:26
TheJuliadone and done13:26
rpittauthanks13:26
opendevreviewMerged openstack/ironic-ui master: Imported Translations from Zanata  https://review.opendev.org/c/openstack/ironic-ui/+/95662413:28
TheJuliadtantsur: btw, if you didn't notice, I got object indirection to happily work over jsonrpc :)13:31
TheJulia(actually, worked out of the box)13:32
dtantsurwow, sweet!13:32
dtantsurno, I haven't noticed, still overwhelmed by conversations since I'm back13:32
TheJuliano worries, I had to excise the direct database calls out of the API, but I kept it lightweight13:32
TheJuliaso no intermediate object stop, just get dicts and that is the way the api was written to use the responses anyhow.13:33
dtantsurinteresting13:35
TheJuliathe indirection stuffs only works for oslo versioned objects or dicts13:35
TheJulianot, collections, or sqlalchemy objects13:35
TheJuliaand we did both while trying to keep that extra object conversion overhead out of the mix13:36
TheJuliaI've not put a "no, you can't talk to the DB flag" in, and in single process mode it would be super hard to do like Nova did for their components which do indirection, but I do detach api.request.dbapi13:37
TheJuliaso it doesn't become an appealing thing to try and use.13:38
dtantsur:D13:38
opendevreviewMerged openstack/ironic bugfix/31.0: Comment out metal3 integration job  https://review.opendev.org/c/openstack/ironic/+/95668513:39
TheJuliawe could just entirely excise the idea, I don't think any usage of api.request.dbapi even exists after my change13:42
TheJulia(but if folks have patches or mods downstream, that is likely super breaky(13:43
opendevreviewMerged openstack/ironic bugfix/31.0: Fix local RPC IPv6 detection to use socket binding instead of file checks  https://review.opendev.org/c/openstack/ironic/+/95670113:51
TheJuliaoooh ahh, it looks like metal3 works with indirection14:24
TheJuliaerr, failing, but more data I guess14:30
opendevreviewJakub Jelinek proposed openstack/ironic-python-agent master: Fix skip block devices for RAID arrays  https://review.opendev.org/c/openstack/ironic-python-agent/+/93734214:33
kubajjIf anybody had a minute, we are working on fixing the skip_block_devices property for RAIDs. We managed to fix the cleaning for the scenario where the arrays span the whole disk, but still need to fix the scenario where the disks are shared between arrays. Should we add it to the same change or create a follow-up?14:38
dtantsurI think I need more context. Is it much harder to fix the 2nd part as well?14:48
rpittauTheJulia: I started the release procedure for ironic-image but I doubt it will be all done before tomorrow14:50
rpittauI guess we can stay broken as the only PR needed is for the release notes and we don't really care about CI runs, wdyt dtantsur adam-metal3 ?14:50
dtantsurrpittau: I'm less worried about the CI and more about the actual ironic-image working14:50
kubajjdtantsur: we are working on it, hopefully won't take almost a year like the first part, lol14:50
rpittaudtantsur: release branch has been cut from bugfix-31.0 so new changes on master won't impact it14:51
dtantsurcool!14:52
TheJuliaLooks like were close on metal3, just need to resolve one more nested object which can't be serialized over rpc14:59
TheJuliaat least, that is what is front and center issue wise right now15:00
opendevreviewJulia Kreger proposed openstack/ironic master: Remove direct mapping from API -> DB  https://review.opendev.org/c/openstack/ironic/+/95651215:01
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Optional indirection API use  https://review.opendev.org/c/openstack/ironic/+/95650415:01
opendevreviewJulia Kreger proposed openstack/ironic master: ci: grenade: restart neutron-rpc-server  https://review.opendev.org/c/openstack/ironic/+/95680115:36
TheJuliait passed, muahahhaha16:33
TheJuliadoes the metal3-integration job stack patches or only pull the latest patch? Asking because it appears that its trying to run with eventlet which is removed in a prior patch....16:41
TheJuliahttps://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_a1a/openstack/a1a36f27076d4063b50d36c4cc5f61d1/controller/before_pivoting/ironic.log16:42
TheJuliayup, the build process only pulls the ref, not the stack of them :\17:07
TheJuliahttps://github.com/metal3-io/ironic-image/blob/main/ironic-packages-list#L3-L917:08
JayFmakes sense, conceptually in github the ref has the stack. easy thing to overlook :(17:08
TheJuliayup17:08
TheJuliathat makes me super nervious, but also means I think the only way we're going to get a solid result is to go ahead and start moving down the path of merging17:09
JayFI have a mild suggestion, if you think it's worth it17:10
JayFpush a DNM squashed version of the stack to get a CI run17:11
TheJuliawould really be the only way to do it17:11
TheJulialikely also the easiest path to get clarity before we're "ready ready"17:12
JayFor fix ironic-image17:13
* JayF touches his nose17:13
TheJuliait should just be changing the conditional check, I think.  But I'm also not really equipped with the spoons to drag this out when I really need to cycle/loop as close to the issue as possible to get a yay/nay level of confidence17:17
JayF++17:18
TheJuliabut hey, on a plus side, tossing it all over to indirection worked so... \o/17:19
TheJuliaLooking at some other job logs, looks like there *might* be slight unhappienss in everything around allocations, but the tests still pass and the job which uses it still works just fine. Looks like it was a race between threads and one realized 'oh, I've got some stale data'17:39
TheJuliaso, really more threading than anything else17:39
TheJuliaJayF: w/r/t the question your, I think the idea will be to eventually just detach the dbapi out from the api, but that is also more something we need to telegraph: https://review.opendev.org/c/openstack/ironic/+/956504/8/ironic/api/hooks.py - Eventually just remove the dbhook might be the goal.17:46
opendevreviewJulia Kreger proposed openstack/ironic master: DNM - Test change for Metal3 for threading changes  https://review.opendev.org/c/openstack/ironic/+/95682017:54
* TheJulia crosses fingers17:55
alegacyHas anyone had success running bifrost on a CentOS VM with python3.12?  I had been running with python3.9, but NGS now requires >=3.10 so I upgraded my install and now I'm having all sorts of grief with bifrost not finding packages that it needs (one such pkg is 'firewall').18:03
TheJuliaalegacy: I'd assume all ironic related repos have a similar issue, but I've not tried on bifrost with py3.12 specifically since I've been so focused on eventlet stuffs recently18:21
alegacyTheJulia: I'm rebuilding my VM now just in case something went sideways...18:23
TheJuliahttps://review.opendev.org/c/openstack/bifrost/+/955181 merged yesterday, so it might just be timing and the status of your local bifrost branch18:59
JayFTheJulia: [not urgent] I was looking with Adam at CI failures in gophercloud/github actions running Ironic. It's failing on https://github.com/openstack/ironic/blob/master/devstack/lib/ironic#L2583 which was added, I think, to enable vmedia with a "Nexthop has invalid gateway.". If you have any quick insights, it'd be nice to hear em. I suspect it's something environmental and am trying to "hold" a github runner.19:11
JayFMy current hypotheses are that neutron/ovs/whoever didn't do the work properly to setup that subnet, or there's some weirdness (ip overlap? security limitations?) in the environment causing the issue.19:12
JayFalthough I wonder if it's racing with neutron somehow (I guess that'd be adjacent to the first hypothesis)19:13
alegacyTheJulia:  Maybe I'm going about this wrong?  I changed the symlinks for /bin/python3* to point to their python3.12 equivalents.  That seems to move things along, but then firewalld won't start because it can't find the dbus library as there's no dnf package equivalent for 3.12... installing it manually via pip doesn't help since firewalld starts with a "python3 -s" which excludes /usr/local/... from the sys.path19:30
alegacyi think part of the difficulty is that bifrost is hardcoded to build the venv from the system path of python3 19:46
alegacychanging the system interpreter is not trivial and shouldn't necessarily be tied to what you want to use in the venv.19:46
alegacyi'll try changing bifrost to accept an override and see if that works better to decouple it from the system interpreter.  If so I'll propose a patch.19:47
TheJuliais 3.10 available?  I don't have a centos9 machine in front of me19:51
alegacydoesn't appear to be.  3.11 and 3.12 are there.20:00
fricklerfrom what I heard, you can only use system python ie py39 for this kind of things. so centos 9 is out, need to use centos 1020:30
TheJuliarpittau is likely the best person to answer/discuss/engage on this, but he is also likely asleep at the moment20:49
guilhermespdtantsur: btw i still didnt get a chance to test your pr... it was a very busy day, i will try asap and keep you posted o/ 21:08
opendevreviewQueensly Kyerewaa Acheampongmaa proposed openstack/bifrost master: Clarify testenv and install usage order in testenv.rst  https://review.opendev.org/c/openstack/bifrost/+/94611621:19
TheJuliaso good news is that it looks like ironic + metal3 might be fine.... except the bad news, it looks like the job will need more ram. There appears to be zero swap as well so the memory footprint caused it to hit the edge. I remember it being pretty close. Anyhow I'll look at changing the job to use a bigger node tomorrow.22:29
TheJuliahttps://d9d62a197302b635235c-e6ddc7380eb470b6a5615640c6b2397b.ssl.cf5.rackcdn.com/openstack/2289c36ec84e4686882f97aef45672b7/controller/index.html22:29
JayFmaybe a good opportunity to see if we can find a knob to make ram usage profile more similar?22:34
TheJuliaalready went down that path22:35
TheJuliaWe functionally go from 1 -> 3 processes (1 master, 2 workers) and with threads. We already tune the thread sack way down, almost to as low as it can tolerate to still run.22:36
* TheJulia goes to order an espresso martini22:36
JayFack22:38
JayFthis is really going to be an annoying release for operators :( 22:38
JayFespecially container-deployers who limit their resources22:38
TheJuliayup22:39
jandersTheJulia I created the bugs for the servicing issue we discussed: https://bugs.launchpad.net/ironic/+bug/2119989 https://bugs.launchpad.net/ironic/+bug/2119990 - let me know if this is good enough or if I need to refine these further22:40
jandersthank you for looking into this22:41
TheJuliajanders: thanks23:04
TheJuliaJulia was not kidding… https://usercontent.irccloud-cdn.com/file/Ejq4LyHC/1754608383.JPG23:13
jandersTheJulia thank YOU!23:23

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!