Tuesday, 2025-04-29

arne_wiebalckGood morning, Ironic!06:55
queensly[m]Good morning07:03
AmarachiOrdor[m]Good morning Ironic07:50
sylvrGood morning Ironic08:29
abongaleGood Morning Ironic 08:49
freemanboss[m]Good morning 09:37
sylvrironic-inspector is deprecated in favor of ironic which will handle inspection using power_management_interface to start PXE boot and IPA right ?09:41
sylvrI'm looking through the ironic-inspector, and ironic in band inspection documentation, and using the newer solution (Ironic + IPA), launching some node inspection now require ports to be created ? (which wasn't the case for ironic-inspector)09:58
dtantsursylvr: 1) yes, 2) by default - yes. Read on about unmanaged mode (which was the default for inspector): https://docs.openstack.org/ironic/latest/admin/inspection/managed.html#unmanaged-inspection11:04
fricklerfyi if you have something like redfish, you can run oob inspection to generate the ports automatically11:12
dtantsuryes, and furthermore: the in-band inspection implementation will try to do it automatically before starting11:13
iurygregorygood morning ironic11:21
sylvrdtantsur: thanks for your answers ! just to be sure, without ironic-inspector, no TFTP server is supplied to the providing service, you have to setup your own ?11:50
sylvrfrickler: yes I read about that, and it looks awesome! unfortunately, I'm stuck with old hardware that only support IPMI (and I even had some weird boot order behavior which required me to manually intervene a few times)11:52
dtantsursylvr: ironic-inspector also never supplied a TFTP server, nothing has changed in this regard11:58
sylvrdtantsur: ok I should've known that because it's deployed by kayobe/kolla-ansible. Well, I think I have some issue with the PXE filter and that confused me. Thanks for clarifying Ironic :)12:28
opendevreviewMerged openstack/ironic-python-agent master: Remove eventlet from Ironic Python Agent  https://review.opendev.org/c/openstack/ironic-python-agent/+/94609112:59
opendevreviewcid proposed openstack/ironic master: Add an index on ports.node_id  https://review.opendev.org/c/openstack/ironic/+/94843113:04
opendevreviewcid proposed openstack/networking-baremetal master: Add conductor group sharding support  https://review.opendev.org/c/openstack/networking-baremetal/+/94843213:04
opendevreviewcid proposed openstack/networking-baremetal master: Add conductor group sharding support  https://review.opendev.org/c/openstack/networking-baremetal/+/94843214:00
TheJuliagood morning15:16
shermanmwith the caveat that I know 2023.1, and especially ironic-inspector are not supported anymore, does anyone recall an issue where nodes would get stuck in "inspecting", and never leave it until the conductor is restarted? Just looking for a pointer on where to dig around in the codebase15:47
dtantsurshermanm: do they get stuck immediately or after some time in inspect wait?16:07
dtantsurI assume nothing interesting in the logs?16:08
TheJuliaThis feels deja-vu-ey16:08
TheJuliaso I think we've seen this once or twice before, at least if it is the exact same thing. The client which gets cached for interactions breaks, but you'll obviously need to look at the logs16:12
TheJuliaonce it breaks, you have to restart the conductor but the break itself is the oddity if memory serves. Quickly looking at git doesn't yield anything in that regard, so maybe my internal LLM is having a fever dream or something funky16:13
dtantsuryeah, it's either the client or something in the hooks, the former being more likely16:13
TheJuliaI don't remember what exactly happened to the client16:14
shermanmso, luckily my log archiving actually went back far enough to catch this. It looks like it failed "Unable to start managed inspection for node $uuid: Failed to create neutron ports for node's $uuid ports"16:15
shermanmand that was due to a 504 from neutron, stemming from NGS slowness16:15
TheJuliaBut yeah, entirely situational as well so there may be an opporunity to harden/restart the client. We've had a couple weird such bugs where clients break under weird circumstances, in the case which comes to mind is around timeouts in certian states with redfish bmcs, not inspector but yeah...16:15
TheJulia... that shouldn't break it hard but I guess there is no way to really unwedge it from that point16:16
TheJuliaLikely need to invoke the error handler16:16
dtantsurHmm, the error would explain a failure but doesn't really tell us why the process got stuck16:17
dtantsurthis exception should have been handled properly16:17
shermanmactually, it looks like it failed in the exception handler, after handling the rest of that exception properly, due to `oslo_db.exception.DBDataError: (pymysql.err.DataError) (1406, "Data too long for column 'user' at row 1")`16:18
TheJuliaThat would do it16:18
shermanmwhich IIRC was fixed in newer versions for node_history, I didn't realize it could crop up here too16:18
TheJuliashermanm: How long is user in your environment?16:19
TheJuliayou have the federated config don't you16:22
shermanm64 characters, more details in this launchpad bug from back in xena w.r.t. node_history https://bugs.launchpad.net/ironic/+bug/205459416:22
shermanmyup16:22
TheJuliathat is how we foudn aht issue16:22
TheJuliayeah16:22
TheJuliaokay, that explains it!16:22
shermanmthe rubber-ducking session is appreciated :)16:23
shermanmthe million dollar question for me becomes just how messy is it to try and backport the alembic migration to my 2023.1 downstream16:36
JayFI would upgrade ironic only before I did that16:37
JayFalternatively16:37
JayFI wonder if the migration goes boom if you just make the user bigger16:37
shermanmyeah, that's what I'm afraid of. migrations making future upgrades a huge headache has been an issue before, I'll probably just do an ugly workaround on the error handler until we can get ironic upgraded16:39
JayFshermanm: we don't test it, but I will say some users upgrade Ironic at a faster cadence than the rest of the cloud.16:39
JayFshermanm: I would prefer having Ironic version outta sync than backport a DB-related change16:40
shermanmJayF: definitely a good point16:43
TheJuliaI don't think we've ever tried a backport of a db schema change because it woudl also break the upgrade ordering16:47
TheJuliaThe *closest* thing we've ever done is make some of our upgrades around indexes super smart to check the state prior to applying the change16:47
TheJuliaAnd we ended up backporting docs like: hey, you could run these commands to make the DB happier, ironic knows what to do on upgrade16:48
shermanmluckily there seems to be an easy workaround, just setting "node_history=false" until we get to 2024.1, so no crimes against alembic needed17:10
shermanmpretty sure this would affect deployments running 2023.2, dunno if it would make sense to add a note in the docs about disabling node_history if using keystone federation17:13
alegacy_As discussed at yesterday's weekly meeting I have gone ahead and added a rough outline of some of the high-level topics to try and kickstart the conversation on the standalone network topic.  Here: https://etherpad.opendev.org/p/ironic-standalone-networking17:19
TheJuliaalegacy_: I added some comments. We should likely try and have a call to get multiple folks on the same page, but I also know JayF might not be available easily this week.18:50
JayFa call is maybe one of the easier things I could do18:50
JayFwork that can be done while horizontal >>>>>>>> * 18:50
JayFlol18:50
* TheJulia glares ;)18:54
JayFhonestly I have finally found a comfortable position to sit with my laptop without my back screaming so I'm trying to get as much done as I can18:57
JayFnot just for like, productivity sake but for my own sanity lol18:57
opendevreviewIvan Anfimov proposed openstack/ironic master: Remove tags from README  https://review.opendev.org/c/openstack/ironic/+/94847919:49
opendevreviewIvan Anfimov proposed openstack/ironic master: Remove tags from README  https://review.opendev.org/c/openstack/ironic/+/94847919:50
opendevreviewIvan Anfimov proposed openstack/ironic master: Remove tags from README  https://review.opendev.org/c/openstack/ironic/+/94847922:23
opendevreviewIvan Anfimov proposed openstack/ironic master: Remove tags from README  https://review.opendev.org/c/openstack/ironic/+/94847922:23

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!