Thursday, 2026-01-15

opendevreviewMerged openstack/ironic-prometheus-exporter stable/2025.2: Fix handling of unknown metric keys in ironic parser  https://review.opendev.org/c/openstack/ironic-prometheus-exporter/+/97107200:33
opendevreviewSteve Baker proposed openstack/ironic master: OciImageService detect bootc image  https://review.opendev.org/c/openstack/ironic/+/96676002:16
opendevreviewSteve Baker proposed openstack/ironic master: WIP autodetect deploy interface  https://review.opendev.org/c/openstack/ironic/+/97318702:16
*** jroll07 is now known as jroll007:16
rpittaugood morning ironic! o/07:43
opendevreviewMerged openstack/ironic master: Simplify ovn vtep microversion logic  https://review.opendev.org/c/openstack/ironic/+/97149408:31
opendevreviewPierre Riteau proposed openstack/bifrost master: Fix path to bifrost.crt with non-default tls_root  https://review.opendev.org/c/openstack/bifrost/+/97345210:21
opendevreviewPierre Riteau proposed openstack/bifrost master: Fix path to bifrost.crt with non-default tls_root  https://review.opendev.org/c/openstack/bifrost/+/97345210:25
priteauHello Bifrost team. Is there a way to disable the new OCI artifact registry? In kolla-ansible/kayobe land we are running Bifrost inside a Docker container and that is failing to run podman inside for now, breaking our CI10:33
ContinuityTheJulia: amazing:D10:33
ContinuitySorry a little behind on my IRC10:33
opendevreviewPierre Riteau proposed openstack/bifrost master: CI: Fix SLURP upgrade jobs  https://review.opendev.org/c/openstack/bifrost/+/97345810:58
opendevreviewPierre Riteau proposed openstack/bifrost master: CI: Fix previous release for SLURP upgrade jobs  https://review.opendev.org/c/openstack/bifrost/+/97345810:58
opendevreviewBartosz Bezak proposed openstack/networking-generic-switch master: Add Arista bond trunk support  https://review.opendev.org/c/openstack/networking-generic-switch/+/97346111:31
opendevreviewPierre Riteau proposed openstack/bifrost master: CI: Ensure firewalld is unmasked  https://review.opendev.org/c/openstack/bifrost/+/97346312:04
opendevreviewBartosz Bezak proposed openstack/networking-generic-switch stable/2025.1: [DNM] Add Arista bond trunk support  https://review.opendev.org/c/openstack/networking-generic-switch/+/97347013:11
opendevreviewPierre Riteau proposed openstack/bifrost master: CI: Ensure firewalld is unmasked  https://review.opendev.org/c/openstack/bifrost/+/97346313:33
opendevreviewPierre Riteau proposed openstack/bifrost master: Ensure firewalld is unmasked  https://review.opendev.org/c/openstack/bifrost/+/97346313:36
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.2: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348013:56
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.2: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348013:56
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.1: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348113:56
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.1: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348113:56
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.1: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348114:16
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.2: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348014:16
TheJuliaContinuity: no worries14:26
TheJuliapriteau: looks like the option would need to be added to do such14:30
TheJuliaThat also means some of the outstanding patches would likely need to also make its usage optional. :\14:31
TheJuliapriteau: any chance kayobe might be interested in leveraging OCI URLs instead of bringing your own url/artifact ?14:37
opendevreviewMerged openstack/python-ironicclient stable/2025.2: feat: add 'vendor' and 'category' for port object  https://review.opendev.org/c/openstack/python-ironicclient/+/97343914:37
opendevreviewMichal Nasiadka proposed openstack/networking-generic-switch stable/2025.2: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348014:38
priteauTheJulia: What do you mean by OCI URL? Registry URL for an image?14:43
priteauIn Kayobe we already deploy a local Docker registry, so ideally we would reuse it instead of having Bifrost deploy its own14:44
TheJuliapriteau: so the direction bifrost is moving in the use of a image registry to back the need for the webserver for user image artifacts, which allows for more options (and also allows for things like bootc deployed hosts as well, eventually. 14:45
TheJuliaso, conceivably, if we put in knobs on the registry install/use, you could disable the registry from installing in your jobs, and when the other patches merge to use it, you could likely then just offer a patch for an external registry to be used14:48
priteauDo you store qcow2/raw images in this registry?14:48
TheJuliaA patch which has not landed yet does, yes14:49
TheJuliaUsing ORAS14:49
priteauNice stuff14:51
priteauDo you have a timeline for completely removing the options of keeping images on the built-in HTTP server?14:52
TheJuliaonce loaded in, Ironic uses the OCI client we added back in... late 2024... (wow, where has time flown), to retrieve the artifact14:53
TheJuliaA timeline has not yet been set, but I think the consensus has been to just try and move bifrost forward14:53
TheJulianot knowing the kayobe CI details14:53
TheJuliaAnother topic: Who here cares about Dell OS10 (Formerly Force10 OS, version 10) switches and use of VXLAN?14:54
alegacyTheJulia: we care about os10 switches, but not VXLAN (yet)16:08
TheJuliaSo, some analysis points to mostly coupled with vxrail setups which may be on their way out anyway, but those could apparently be ordered with different vendor switches, so no real numbers there. I guess the reason I raise the question is they have a 3 step configuration model for the actual attachment of a VNI to a vlan through an internal range limited value. Just not really sure about demand or relevence at this point 16:12
TheJulia(also, since I've heard they are heading towards being phased out in favor of SONiC (although, I have no idea what might replace that given Dell SONiC requires a license to run sonic at all which expires after some number of years...))16:12
*** gmaan is now known as gmaan_afk16:33
TheJuliaJayF: When you get a chance, please take a look at https://review.opendev.org/c/openstack/ironic/+/973187 I suspect we might need a little more data16:55
JayFhttps://review.opendev.org/c/openstack/ironic/+/973187/3#message-5972ac124953e722a05fc5c145187ed26378040716:59
JayFI suggested to make it detect bootc only for now, and I can do ramdisk as the next step16:59
JayFsince there are questions about how to make ramdisk work I can't answer now and don't have time to tackle likely this week16:59
JayFwho were our outreachy mentors last cycle? rpittau and someone else? Can one of you all DM me?17:07
cardoeJayF: I'd also appreciate https://review.opendev.org/c/openstack/ironic/+/973294 so that I can work on testing clif's stuff.17:16
opendevreviewJulia Kreger proposed openstack/networking-generic-switch master: WIP: l2vni plug case with Cisco NXOS  https://review.opendev.org/c/openstack/networking-generic-switch/+/96837717:37
opendevreviewJulia Kreger proposed openstack/networking-generic-switch master: WIP: Arista EOS and vendor neutral SONiC support for VXLAN attachments  https://review.opendev.org/c/openstack/networking-generic-switch/+/97276317:37
opendevreviewJulia Kreger proposed openstack/networking-generic-switch master: WIP: VXLAN: Add Junos, Cumulus NVUE, and denote Dell OS10 as unsupported  https://review.opendev.org/c/openstack/networking-generic-switch/+/97276417:37
opendevreviewJulia Kreger proposed openstack/networking-generic-switch master: WIP: OVS testing patch for 'vxlan' binding model  https://review.opendev.org/c/openstack/networking-generic-switch/+/97276517:37
opendevreviewMerged openstack/networking-generic-switch stable/2025.2: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348017:38
opendevreviewMerged openstack/networking-generic-switch stable/2025.1: Use upper constraints in pep8 job  https://review.opendev.org/c/openstack/networking-generic-switch/+/97348117:38
*** gmaan_afk is now known as gmaan17:46
opendevreviewJulia Kreger proposed openstack/networking-generic-switch master: Migrate setup configuration to pyproject.toml format  https://review.opendev.org/c/openstack/networking-generic-switch/+/97352617:53
cardoeTheJulia: free pony if you add pre-commit in there too18:02
TheJuliaMy claude assistant may be up for that, need to level set the repos to a better state... really18:02
TheJuliaAwwww... lets see if I remember after 11 am. ;)18:06
cardoeSo if a machine in "inspect failed", what's the right operation to get it back to a point that I can inspect it again. "manage"?18:13
TheJuliaEasy win: https://review.opendev.org/c/openstack/ironic/+/97328318:15
TheJuliacardoe, that is one of yours :)18:15
TheJuliacardoe: yes, that should be18:16
TheJuliaAlthough, I thought we kept the auto roll back pattern, maybe we didn't with inspection from inspector?18:16
cardoewhat's wrong with the patch?18:19
cardoeSo I ask because I'm about to file a bug.18:20
cardoeI'm still using agent inspect in this env. And the agent had an error. So it went to "inspect failed". The baremetal port retained its | internal_info         | {'inspection_vif_port_id': '8013b009-2064-423b-9b99-a38953cef7d3'} even after I did "manage"18:21
cardoeThe port remained in neutron18:21
cardoeAnd when I went to inspect again I got a conflict on the port.18:21
TheJuliacardoe: nothing wrong with the patch18:23
opendevreviewMerged openstack/ironic master: fix: port endpoints did not return vendor and category and fix docs  https://review.opendev.org/c/openstack/ironic/+/97329418:23
opendevreviewMerged openstack/ironic master: Add positive port api category/vendor field test  https://review.opendev.org/c/openstack/ironic/+/97339618:23
TheJuliaoh jeeze, yeah, thats a bug18:23
TheJuliaon inspect error being reached, the port should have been ripped out18:24
TheJuliaalong with the vif18:24
cardoeoh I see what happens.18:24
cardoeWe left it on there and when it went to do cleaning it said "gimme the attached vif" and it got the inspection port18:24
opendevreviewDoug Goldstein proposed openstack/ironic stable/2025.2: fix: port endpoints did not return vendor and category and fix docs  https://review.opendev.org/c/openstack/ironic/+/97353218:25
TheJuliaso we failed to capture/handle/remove the vif/port/attachment with the error?18:25
opendevreviewDoug Goldstein proposed openstack/ironic bugfix/33.0: fix: port endpoints did not return vendor and category and fix docs  https://review.opendev.org/c/openstack/ironic/+/97353318:26
cardoeYeah. I don't know if its suppose to be removed with the error or if when it moves away from the failed state?18:28
cardoeblerp18:33
cardoejanders: so I'll +2 that patch if ya add some constants and make the clean up clear like dtantsur said... what makes me think of this...18:33
cardoehttps://github.com/openstack/ironic/blob/09761d9549286b56561d7c2dee16beb83e62e21f/ironic/conductor/utils.py#L148618:34
cardoehttps://github.com/openstack/ironic/blob/09761d9549286b56561d7c2dee16beb83e62e21f/ironic/drivers/modules/network/common.py#L47818:34
TheJuliaIt should happen in the error handler, not on the move from the error state18:41
TheJuliacardoe: have you had a chance to move on the mech driver locally on the neutron repo?18:42
TheJulia(trying to think through the building blocks here, before I start to piece it together18:42
cardoeto the networking-baremetal? not yet.18:43
TheJuliak18:43
cardoeI will.18:48
cardoeSo this box hit a timeout in the inspection. It didn't get attached physically on the network correctly.18:49
TheJuliaok18:49
TheJulia... what went sideways? delayed bind?18:50
cardoeSome change in our mech driver.18:51
cardoeBut the neutron port from inspection get left around with the MAC address of that port.18:51
cardoeAnd inspection_vif_id stuck around18:52
TheJuliayeah, that definitely needs fixing18:52
cardoeThe box went to cleaning and we clean up ports by MAC address18:52
cardoeAnd due to those links I had above the order of the vif_port_id done differently in two different places.18:52
TheJuliaugh, yeah18:53
TheJuliaso the new inspection code doesn't handle errors like the others18:53
cardoeSo we run remove_ports_from_network() which removes a port based on the MAC from the network. But now there was an inspection port and a cleaning port. When cleaning finished it removed the inspection_vif_id port from neutron. But removed the 'cleaning_vif_id' from the baremetal port.18:53
TheJuliaand it is, very definitely, minimal18:53
cardoeSo now the cleaning port with the MAC address was left behind along with a reference on the baremetal port.18:54
TheJuliaThe root issue is the original inspection failure, right?18:54
cardoeAnd somehow when the box was used again the cleaning port was used and bound down to the box and not the tenant port.18:55
cardoeYeah.18:55
cardoeJust walking up the stack of operations from the logs.18:55
TheJuliacould you just create a quick bug, I see the issue and can hammer a fix out kind of quick18:56
cardoeYeah.18:56
TheJuliaThanks18:56
cardoeSo one idea I've had recently is on the neutron ports. We set 'device_owner' to 'baremetal:none'. Really what matters or what we need is that baremetal: prefix. We've been utilizing for our network node baremetal port 'baremetal:network' in our mech driver.18:57
TheJuliaInteresting18:57
cardoeBut I've been thinking that neutron should probably make the port as 'baremetal:cleaning', 'baremetal:inspecting', 'baremetal:provisioning', 'baremetal:tenant', 'baremetal:rescuing'18:57
TheJuliaThat is more state conveyance and tracking, not sure but maybe could be a case if extra guardrails are needed18:58
cardoeYes but the guardrail I was thinking is that when we go to tell neutron THIS PORT RIGHT HERE ATTACH IT NOW... we can ensure we select the right port.18:59
cardoeCause right now we lookup a port with the MAC address for device == node_id18:59
cardoeSo if a cleaning network port or an inspecting network port didn't get cleaned up by accident....19:00
TheJuliahmm19:00
cardoeJust an idea.19:00
TheJuliaits sort of an issue a few steps removed though19:00
TheJuliaits not really wrong to use the mac since the mac is forced to be "unique"19:00
cardoefor baremetal ports but not neutron ports19:01
cardoeThis issue only happened because of the fix for https://bugs.launchpad.net/ironic/+bug/210607319:01
cardoeBecause we now create the tenant port early19:01
cardoebugs.launchpad.net is down19:02
cardoeout of curiosity what tracks the timeout for inspection?19:03
cardoeoh ironic/conductor/manager.py _check_inspect_wait_timeouts19:05
cardoeNothing in there cleans up19:08
opendevreviewJulia Kreger proposed openstack/networking-baremetal master: Swap networking-baremetal to use pyproject.toml  https://review.opendev.org/c/openstack/networking-baremetal/+/97353919:11
TheJulialaunchpaddddddddd19:12
TheJuliaoh, thats wonderful, because even the inspection error handler fails to do basic cleanup19:12
opendevreviewVerification of a change to openstack/ironic master failed: Add LLDP collect for DRAC Redfish inspection  https://review.opendev.org/c/openstack/ironic/+/97063019:31
cardoeSo I went crazy and started switching from random strings to an enum20:13
cardoeI discovered that servicing_vif_port_id is missed entirely in a couple of places.20:14
cardoeTheJulia: do we make a new vif/port for servicing?20:34
cardoeOr are we recycling the tenant port?20:35
TheJuliamake a new one20:36
TheJuliaI've got patch in progress20:36
cardoeooo well my patch is gonna be a lot of fun20:37
cardoehttps://github.com/openstack/ironic/blob/09761d9549286b56561d7c2dee16beb83e62e21f/ironic/drivers/modules/network/common.py#L478-L48220:37
cardoeWe do not get the "servicing_vif_port_id"20:38
cardoeWe're gonna grab the tenant one20:38
TheJuliaIn a meeting atm, so can't look atm20:45
cardoeHere's the thing... I don't really know what it impacts.21:02
opendevreviewDoug Goldstein proposed openstack/ironic master: fix: refactor VIF network types to be an enum and ensure all are checked  https://review.opendev.org/c/openstack/ironic/+/97355721:08
cardoelaunchpad is still dead.21:10
TheJulialikley just an oversight, I am not sure much code actually calls get_current_vif21:42
TheJuliaClaude is acting... very concernably.21:42
cardoeWell I'll review whatever ya come up with.22:00
JayFhttps://review.opendev.org/c/openstack/ironic/+/964502 metal3-integration is failing on this, it looks like nodes are not coming online from cleaning22:09
JayFbut I can't seem to find any of hte ironic-bm-log equivalents, or even find where the ironic-anything logs are22:10
opendevreviewJulia Kreger proposed openstack/ironic master: WIP: Fix introspection failure handling  https://review.opendev.org/c/openstack/ironic/+/97356022:13
TheJuliacardoe: ^^^ passes tests, but claude was acting a bit wonky, a close eye is likely needed22:14
cardoeJayF: yeah I couldn't figure out how to look at it. They're expecting the nodes to be in a different state is all I could tell.22:16
cardoeThe inspection failure change looks reasonable.22:29
TheJuliagood to know, I'll review it myself in the. morning and give it a cleanup spin22:29
TheJuliaI basically had 1-on-1 and then a team meeting so my brain is in word use mode, not code logic mode22:30
TheJuliabut, my devstack is restacking with the stack of ngs code for vxlan fun22:30
opendevreviewDoug Goldstein proposed openstack/ironic master: fix: refactor VIF network types to be an enum and ensure all are checked  https://review.opendev.org/c/openstack/ironic/+/97355722:31
cardoeI reviewed the ironic standalone networking. I'm the same as everyone else as a +1 to show it.22:32
TheJuliacool cool22:36
cardoeIt's got a metric ton of strings all over the place. The first time we've gotta add support for something else it's gonna be like ^ above.22:36
TheJuliayeah22:37
opendevreviewVerification of a change to openstack/ironic master failed: Add LLDP collect for DRAC Redfish inspection  https://review.opendev.org/c/openstack/ironic/+/97063022:49

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!