Tuesday, 2024-06-18

TheJuliaOut of curiosity, the idea, or via something like SZTP?00:00
cardoeMy wife slapped a treadmill under my standing desk...00:01
TheJuliaShe sounds very wise00:01
cardoeWell need to use SZTP00:01
cardoeFor me it's gonna be a marathon and not a sprint cause there's a LOT of legacy install base.00:02
TheJuliasemi-ouch in that has a very particular model really not geared for "oh, I have servers in a rack"00:02
cardoeokay well lemme walk that back not like a hard requirement00:03
TheJuliaThen again, maybe after the fact once you deal with the qr code and all00:03
TheJuliaoh, okay00:03
TheJuliayes, plan your process :)00:03
TheJuliaone of the former maintainers once told me about a whole workflow he had worked out where they would wheel servers in and they would be consumed/inspected/checked/benched/deployed00:04
cardoeUltimately I need to burn down a LOT of legacy code and a LOT of existing process and such.00:04
TheJuliaoh, those are the best marathons :)00:04
cardoeBut the effort is out in the open from day one, though a lot of convos and such aren't and I'm working to move it all open.00:04
TheJuliaThat is one of the hardest aspects of any transformation project00:05
TheJuliathat and keeping people from going "no! that is my process! (because without it I wouldn't have a job)"00:06
cardoeYou actually know a handful of the folks working with me. I'll let them out themselves.00:06
TheJuliarutro!00:06
TheJulia:)00:06
TheJuliait is all good00:06
cardoeI come from the servers world though so network is newer to me.00:06
TheJuliaThat is a big vendor centric space :(00:06
cardoeSo right now driving down smaller "demos" if you will. Using Nautobot and aiming to do some integrations between it and Neutron via plugins.00:07
cardoeAnyway, happy to contribute and be involved to the conversations and code.00:10
TheJuliavery cool00:10
cardoeSimilar-ish to how the Yaook Cloud folks have the OpenStack in k8s operator and are using Netbox as their IPAM.00:11
cardoeNah. The cool work is from you guys. Just standing on the shoulders of giants.00:13
TheJuliaThe whole thing with networking has kind of uncovered/highlighted some of the challenges before Neutron itself since it wants/really expects to be IPAM00:13
cardoeNeutron thinking it's an IPAM makes me think of the southern saying "bless your heart". As in Neutron saying "I'm a full IPAM!!!".... bless your heart..00:15
TheJuliaoh my00:15
* TheJulia knows this saying very well00:15
* TheJulia feels the need for a mint julep00:16
cardoeMaybe I'm wrong... but every other IPAM out there has quite a bit more in it.00:16
TheJuliaIndeed!00:17
cardoeBut first I'm trying to figure out everything the ironic conductor needs cause the OpenStack Helm project currently starts up the conductor container with the host IPC, network, and PID namespace. Runs as UID 0, mapped to host UID 0. And mounts /dev, /proc, /var, and /sys from the host filesystem. Something tells me that might be a touch overkill.00:39
TheJuliaI could see some of that back in the iscsi deployment interface days00:42
TheJuliabut after the wallaby release, that was... eliminated.00:43
opendevreviewJacob Anders proposed openstack/sushy-tools master: [WIP] Add support for BIOS update emulation  https://review.opendev.org/c/openstack/sushy-tools/+/90950004:56
rpittaugood morning ironic! o/07:06
opendevreviewRiccardo Pittau proposed openstack/virtualpdu master: [DNM] Test timeout  https://review.opendev.org/c/openstack/virtualpdu/+/92215807:14
rpittauJayF, TheJulia, I've created a revert for the time being https://review.opendev.org/c/openstack/requirements/+/92218607:20
rpittauWe'll be more relaxed when working on virtualpdu I guess :)07:20
opendevreviewPierre Riteau proposed openstack/tenks master: Bump minimum Ansible version  https://review.opendev.org/c/openstack/tenks/+/92162007:22
opendevreviewPierre Riteau proposed openstack/tenks master: Bump minimum Ansible version  https://review.opendev.org/c/openstack/tenks/+/92162008:07
mnasiadkaGood morning Ironic09:12
mnasiadkaIs there a way to supply custom dhcp opts to Neutron e.g. ntp server or other things?09:12
opendevreviewPrzemyslaw Szczerbik proposed openstack/ironic master: Fix execution of node servicing steps exposed by IPA's HardwareManager  https://review.opendev.org/c/openstack/ironic/+/92202409:26
sylvrHello ! I'd like to know more about LLDP for autodiscovery, is there other documentation/reference on that apart from this link : https://www.stackhpc.com/ironic-idrac-ztp.html ? Thanks ! :)10:47
sylvroops, this is more about Kayobe than Ironic, should I ask in #openstack-kolla?10:48
Sandzwerg[m]Morning Ironic. I have A question regarding conductors: how can I delete one? We've deleted the compute service, and the running agent but I still see the conductors with "baremetal conductor list" but there is no "conductor delete" So we seem to be missing something to remove the conductor completely.11:02
dtantsurSandzwerg[m]: 'systemctl stop ironic-conductor' :)12:21
Sandzwerg[m]The conductor itself is gone as well (and thus reported as dead)12:22
dtantsurAh, I see. It's probably going to stay there12:23
Sandzwerg[m]So the only possibility to remove it, would be to remove it directly from the DB?12:24
Sandzwerg[m]They don't really hurt but we're now decommissioning more and more old hardware and thus conductors with them so over time it will clutter 12:26
dtantsurI think so, I don't remember this part of the code very well12:29
dtantsur(maybe we need a periodic task to remove really old records)12:30
TheJuliagood morning12:57
TheJuliaWe likely *do* need such a periodic13:48
TheJuliadelete conductors >2 weeks old or something like that13:48
JayFThat would be a pretty low hanging feature too, basically just ape the implementation for node history14:13
samcat116Related to dead agents: it seems like when you delete an ironic node, the ironic-neutron-agent instance for that node isn't deleted and stays around as a dead agent. I see this on it: https://access.redhat.com/solutions/3913411, but I can't access that. Not sure if this is an ironic or a neutron thing.14:20
JayFsamcat116: might be worth documenting that in an ironic+neutron bug14:30
dtantsursamcat116: the solution suggests $ openstack network agent delete UUID14:31
JayFI'm wondering... is there space for a feature for driver migration *inside Ironic*14:40
JayFe.g. if you wanna go from old vendor driver -> generic redfish,having ironic do it for you over time?14:41
rpittauReverbverbverb: I had a first read of the ironic docs analysis, it's brutal, I like it :D14:43
rpittauI left a couple of simple comments to start with14:43
TheJuliasamcat116: I suspect there is room for the agent to have some cleanup/reconcile logic as well14:48
TheJuliaJayF: Maybe, maybe not, dunno. If there is a time to do it, the time to strike the steel is now-ish14:48
JayFthat is basically what I am thinking, too14:49
JayFMy downstream would likely use an ilo->redfish converter thing14:49
JayFI just can't wrap my mind around how to configure it14:49
JayF[ilo] convert_to_redfish = True => runs a periodic task that updates nodes in AVAILABLE / MANAGABLE state to redfish driver?14:49
JayFbut then you'd have little option to configure that move14:50
dtantsurJayF: sounds like an optional (?) online_data_migration14:50
JayFor maybe even, just if you update node.driver with a new enough api microversion, we will automatically migrate creds to the new keys that driver uses14:50
TheJuliamaybe not periodic tasks14:51
JayFI will ponder on this, I am going to point my brain at getting virtualbmc unscrewed (rollback or not it's going to be a problem, we can't ignore it forever unless we wanna just get rid of snmp ci)14:51
JayFI think I might rewrite it to embrace asyncio instead of trying to thread out all the separate snmp server instances14:52
JayFinstead of getting asyncio+threads to play nice14:52
TheJulia*but* there might be room for a general "cleanup/housekeeping periodic" overall, we sort of already have one.14:52
JayFand have configurable things be added to it14:52
JayFlike removing old conductors14:52
JayFconverting from old drivers14:52
JayFetc?14:52
TheJuliaI think some folks will appreciate that a lot since there are folks out there who still use snmp14:52
TheJuliawe always get a couple people who come out of the woodwork when we talk about getting rid of it14:53
JayFYeah, SNMP is a good driver, and I think this is the force-me-to-learn-asyncio issue I've been waiting on14:53
TheJulia++14:53
rpittauJayF: about virtualpdu, I think this https://review.opendev.org/c/openstack/virtualpdu/+/922158 is kind of in the right direction, the unit tests are not failing but it's timing out 14:55
rpittaummm ok there' still one failure14:57
JayFI agree you're close to making it work; but I'm not sure what we'd end up with would be well-structured14:58
JayFversus if we ripped out the threads and went full coroutine given it's already forcing us to use asyncio to use those lower level apis14:58
JayFAFAICT pysnmp only left the sync-style APIs for the high level stuff, I don't think we can operate at the level vbmc does without interacting with it 14:58
JayFbut I'm dealing with MASSIVE amounts of unknown here so it's possible I'm way, way off14:58
opendevreviewDmitry Tantsur proposed openstack/ironic master: WIP migration guide from inspector  https://review.opendev.org/c/openstack/ironic/+/92208915:01
dtantsursince we're on the topic of docs, early feedback welcome ^^15:01
tlegentilHi Everyone !15:02
tlegentilI'm willing to manage baremetal servers with ironic, and I would like to use it within AIO. I've followed the https://docs.openstack.org/ironic/latest/contributor/devstack-guide.html BUT I always end up with the error : "++lib/neutron_plugins/services/l3:create_neutron_initial_network:202  oscwrap --os-cloud devstack --os-region RegionOne network create private -f value -c id15:03
tlegentilError while executing command: HttpException: 503, Unable to create the network. No tenant network is available for allocation.15:03
tlegentil++functions-common:oscwrap:2461             return 115:03
tlegentil+lib/neutron_plugins/services/l3:create_neutron_initial_network:202  NET_ID=15:03
tlegentil"15:03
tlegentilCould you please give me some hint ? Thanks !15:03
JayFWhat configuration from that page are you using? Are you sure you actually ran the bash script instead of copying it into the file (a common mistake)15:17
JayFI know cid has been using configs from that page without issue so we just have to figure out what's up with your environment or process to get ya working15:17
TheJuliafailure to create a tenant network is typically some level of access or there is a base networking misconfiguration somewhere, sort of depends on the actual susituted configuration into any example as well.15:19
tlegentilif I create the local.conf file with the example provided " Ironic with Nova ", then I have the HOST_ID that is not found. So I added the HOST_ID in the local.conf file. 15:21
tlegentilnetwork config is like this : "stack@Openstack-AiO:~/devstack$ ip a15:21
tlegentil1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 100015:21
tlegentil    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:0015:21
tlegentil    inet 127.0.0.1/8 scope host lo15:21
tlegentil       valid_lft forever preferred_lft forever15:21
tlegentil    inet6 ::1/128 scope host 15:21
tlegentil       valid_lft forever preferred_lft forever15:21
tlegentil2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP group default qlen 100015:21
tlegentil    link/ether bc:24:11:c6:eb:a8 brd ff:ff:ff:ff:ff:ff15:21
tlegentil    altname enp0s1815:21
tlegentil    inet 10.210.44.201/24 brd 10.210.44.255 scope global eth015:22
tlegentil       valid_lft forever preferred_lft forever15:22
tlegentil    inet6 fe80::be24:11ff:fec6:eba8/64 scope link 15:22
tlegentil       valid_lft forever preferred_lft forever15:22
tlegentil3: ens19: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 100015:22
tlegentil    link/ether bc:24:11:91:c2:d3 brd ff:ff:ff:ff:ff:ff15:22
tlegentil    altname enp0s1915:22
tlegentil4: ovs-system: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 100015:22
tlegentil    link/ether ae:2f:3b:a9:d8:5d brd ff:ff:ff:ff:ff:ff15:22
tlegentil5: br-int: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 100015:22
tlegentil    link/ether d2:9b:13:8f:07:39 brd ff:ff:ff:ff:ff:ff15:22
tlegentil6: br-ex: <BROADCAST,MULTICAST> mtu 1400 qdisc noop state DOWN group default qlen 100015:22
tlegentil    link/ether 4e:71:3f:fe:a2:40 brd ff:ff:ff:ff:ff:ff15:22
tlegentil7: virbr0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc noqueue state DOWN group default qlen 100015:22
tlegentil    link/ether 52:54:00:82:9b:87 brd ff:ff:ff:ff:ff:ff15:22
tlegentil    inet 192.168.122.1/24 brd 192.168.122.255 scope global virbr015:22
tlegentil       valid_lft forever preferred_lft forever15:22
tlegentil8: brbm: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 100015:22
tlegentil    link/ether fa:f1:9e:66:1f:4f brd ff:ff:ff:ff:ff:ff15:22
tlegentil9: br-tun: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 100015:22
tlegentil"15:22
tlegentiloups sorry for the flood15:22
cidtlegentil: Good afternoon.15:27
cidYou could use this for sharing big chunks like that ^^ : https://paste.opendev.org/ :))15:29
cidI have a level of familiarity with setting up devstack at this point. But it's a bit unclear to me what the problem might be in your case.15:32
cidFIY: I'm a bit of a noob too15:32
JayFYeah tlegentil a couple of things: first of all, if you've had a failed install, it can be really tough to get that machine working. I'd suggest starting from a fresh ubuntu install if possible. Then, if you still have issues, paste the devstack log somewhere and we can look at help15:33
cidMaybe, start by ./unstack[ing] and ./stack[ing] once again. And, probably just remove the current local.conf and get a fresh one from the documentation.15:33
cardoeSo a bit of a follow up to the convo about redfish-https sending an ISO. I realize now that's really poking the BMC and telling the boot interfaces to download the ISO. Not mounting the ISO through the BMC like virtual media. So in this case this would work with my network constraints. That ISO is pre-built and the same for all servers or is it being dynamically generated?15:43
cardoeIt wasn't clear to me from the docs.15:43
TheJuliacardoe: dynamically generated by default15:46
tlegentilThanks @cid for the paste site and the unstack / stack that I will try now15:49
cardoeThanks TheJulia 15:49
TheJuliatlegentil: our CI uses a very similar configuration, I guess a question might be if your trying to use a specific branch15:54
tlegentilTheJulia : I'm on the latest, 'git clone etc'15:57
rpittauJayF, TheJulia, the pysnmp-lextudio patch revert has merged, snmp related things should be functional again16:02
rpittaugood night! o/16:06
JayFo/16:09
TheJuliatlegentil: since we don't often run devstack directly, that does seem a little weird. Typically when we have such issues in CI, there is some sort of preceeding error. your error is originating from Neutron failing to create a network, I guess I'd look at the q-* log files to see what neutron is reporting as to why it couldn't create a network16:14
tlegentilok after unstack / stack I have a different one : https://paste.opendev.org/show/b9UgnEVLAFunMWirpn49/16:15
tlegentilAlso my goal is only to manage baremetal provisionning16:15
cidtlegentil: Your best bet may be doing as JayF suggested.17:03
cid- start from a fresh Ubuntu install, with a fresh configuration17:03
cid- if you still have issues, paste the devstack log somewhere and we can look at help17:03
TheJuliatlegentil: bifrost might be an easier path if you don't need extra openstack parts and pieces17:06
JayFWe do *not* reccomend or support the use of devstack in production. You 100% want a solution like bifrost rather than a devstack testenv.17:28
Sandzwerg[m]<samcat116> "Related to dead agents: it seems..." <- We don't have ironic-neutron-agents so I'm not sure if that is related.17:45
TheJuliaSandzwerg[m]: do you have networking-baremetal installed and configured in neutron?17:55
Sandzwerg[m]Nope17:55
TheJuliahmm17:55
Sandzwerg[m]We use arista/cisco as Top of the Rack switch and for Arista we wrote our own driver, not sure about cisco. But I assume our setup in this regard is not very common17:57
opendevreviewcid proposed openstack/ironic master: [WIP but open to reviews] Allow disabling bios deployments  https://review.opendev.org/c/openstack/ironic/+/92224317:57
opendevreviewcid proposed openstack/ironic master: [WIP but open to reviews] Allow disabling bios deployments  https://review.opendev.org/c/openstack/ironic/+/92224317:59
TheJuliaSandzwerg[m]: but do you have dead agents?17:59
Sandzwerg[m]Not that I'm aware of. I see conductor left overs in the ironic conductor list, nowhere else (so far)18:00
TheJuliaokay, yeah, those are distinctly different pieces of data18:01
TheJuliaand processes18:01
Sandzwerg[m]I filled a bug for this: https://bugs.launchpad.net/ironic/+bug/2069771 can add more details later or tomorrow, I have a private appointment now :)18:01
*** awb_ is now known as awb18:12
opendevreviewcid proposed openstack/ironic master: Provision ARM (aarch64) fake-bare-metal-vms  https://review.opendev.org/c/openstack/ironic/+/91544118:57
opendevreviewJulia Kreger proposed openstack/ironic-python-agent-builder master: Remove centos7 specific logic check  https://review.opendev.org/c/openstack/ironic-python-agent-builder/+/92224819:18
cido/19:32
opendevreviewJulia Kreger proposed openstack/ironic stable/2024.1: fix: Fix class typo for portgroup. Portgroup instead of PortGroup  https://review.opendev.org/c/openstack/ironic/+/92225019:37
opendevreviewJulia Kreger proposed openstack/ironic stable/2023.2: fix: Fix class typo for portgroup. Portgroup instead of PortGroup  https://review.opendev.org/c/openstack/ironic/+/92225119:37
opendevreviewJulia Kreger proposed openstack/ironic stable/2023.1: fix: Fix class typo for portgroup. Portgroup instead of PortGroup  https://review.opendev.org/c/openstack/ironic/+/92225219:38
opendevreviewVerification of a change to openstack/ironic master failed: Update version change log with special treatment of .json removal  https://review.opendev.org/c/openstack/ironic/+/92196620:22
* cid Oops... did a few rechecks after seeing a passing snmp job 😬21:05
JayFI think it should be fixed now, if it's not you might want to dig into what's happening. Just make sure that the new constraints file was used -- you should have, in the logs somewhere, a list of the exact packages installed in the venv with ironic and virtualpdu21:47
tlegentilThanks @cid and @TheJulia for the bifrost recommendation, I will have a look.22:26
opendevreviewJulia Kreger proposed openstack/ironic master: Remove ibmc hardware type  https://review.opendev.org/c/openstack/ironic/+/92225923:10
opendevreviewJulia Kreger proposed openstack/ironic master: Remove ibmc hardware type  https://review.opendev.org/c/openstack/ironic/+/92225923:26
opendevreviewJulia Kreger proposed openstack/ironic master: Remove deprecated xclarity hardware type  https://review.opendev.org/c/openstack/ironic/+/92226023:26
opendevreviewJulia Kreger proposed openstack/ironic master: Remove ibmc hardware type  https://review.opendev.org/c/openstack/ironic/+/92225923:28
opendevreviewMerged openstack/ironic master: Update version change log with special treatment of .json removal  https://review.opendev.org/c/openstack/ironic/+/92196623:29
opendevreviewJulia Kreger proposed openstack/ironic master: Remove deprecated xclarity hardware type  https://review.opendev.org/c/openstack/ironic/+/92226023:34
opendevreviewJulia Kreger proposed openstack/ironic master: Remove ibmc hardware type  https://review.opendev.org/c/openstack/ironic/+/92225923:34

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!