opendevreview | Jacob Anders proposed openstack/ironic master: Add Targets to firmware.update on multi system BMCs https://review.opendev.org/c/openstack/ironic/+/922438 | 05:24 |
---|---|---|
rpittau | good morning ironic! o/ | 06:51 |
rpittau | just realized I left the "weekend goodbye" message in the wrong channel on Friday :D | 06:51 |
maxh[m] | We have here an issue, where the IPA throws a LookupNodeError for a new node if another node, known to ironic, has its redfish address unreachable (for whatever reason, e.g. setting redfish_address in driver_info to an invalid value). Any hints? | 07:11 |
opendevreview | Mohammed Boukhalfa proposed openstack/sushy-tools master: Extend Fake System Driver for Interaction with Fake IPA via REST API https://review.opendev.org/c/openstack/sushy-tools/+/923111 | 08:53 |
opendevreview | Jacob Anders proposed openstack/ironic master: Add Targets to firmware.update on multi system BMCs https://review.opendev.org/c/openstack/ironic/+/922438 | 10:59 |
iurygregory | good morning Ironic | 11:10 |
opendevreview | Bartosz Bezak proposed openstack/ironic-python-agent master: Add LVM based image support to MD scenario https://review.opendev.org/c/openstack/ironic-python-agent/+/816685 | 12:28 |
TheJulia | good morning | 13:58 |
TheJulia | maxh[m]: What generates an error first, the conductor or the agent? with redfish based hosts, I believe we try to lookup the known mac addresses via the BMC and populate data, but if the data is wrong of course that will fail, but even then if the address is wrong how can the agent ever really start? | 14:00 |
TheJulia | maxh[m]: I guess, you've got a chicken/egg issue and I'm not sure how your getting into a running IPA | 14:00 |
opendevreview | Julia Kreger proposed openstack/ironic master: CI: Remove grub http boot from standalone CI test https://review.opendev.org/c/openstack/ironic/+/923600 | 14:28 |
opendevreview | cid proposed openstack/ironic master: WIP: Self-Service via Runbooks [Prototype] https://review.opendev.org/c/openstack/ironic/+/922142 | 14:30 |
opendevreview | cid proposed openstack/ironic master: WIP: Self-Service via Runbooks [Prototype] https://review.opendev.org/c/openstack/ironic/+/922142 | 14:32 |
opendevreview | Julia Kreger proposed openstack/ironic master: grub: directly load linked config file https://review.opendev.org/c/openstack/ironic/+/923593 | 14:45 |
opendevreview | Julia Kreger proposed openstack/ironic master: grub: directly load linked config file https://review.opendev.org/c/openstack/ironic/+/923593 | 14:46 |
opendevreview | Julia Kreger proposed openstack/ironic master: WIP: combine tftp maps https://review.opendev.org/c/openstack/ironic/+/923601 | 14:57 |
rpittau | #startmeeting ironic | 15:00 |
opendevmeet | Meeting started Mon Jul 8 15:00:16 2024 UTC and is due to finish in 60 minutes. The chair is rpittau. Information about MeetBot at http://wiki.debian.org/MeetBot. | 15:00 |
opendevmeet | Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. | 15:00 |
opendevmeet | The meeting name has been set to 'ironic' | 15:00 |
TheJulia | o/ | 15:00 |
dtantsur | o/ | 15:00 |
rpittau | Hello everyone! | 15:00 |
rpittau | Welcome to our weekly meeting! | 15:00 |
rpittau | The meeting agenda can be found here: | 15:00 |
rpittau | #link https://wiki.openstack.org/wiki/Meetings/Ironic#Agenda_for_July_08.2C_2024 | 15:00 |
iurygregory | o/ | 15:00 |
JayF | o/ | 15:00 |
masghar | o/ | 15:00 |
cid | o/ | 15:00 |
rpittau | #topic Announcements/Reminders | 15:00 |
rpittau | #info Standing reminder to review patches tagged ironic-week-prio and to hashtag any patches ready for review with ironic-week-prio | 15:01 |
rpittau | #link https://tinyurl.com/ironic-weekly-prio-dash | 15:01 |
rpittau | #info 2024.2 Dalmatian Release Schedule | 15:01 |
rpittau | #link https://releases.openstack.org/dalmatian/schedule.html | 15:01 |
rpittau | #info the next OpenInfra PTG which will take place October 21-25, 2024 virtually! Registration is now open! | 15:02 |
rpittau | #link https://ptg.openinfra.dev/ | 15:02 |
dtantsur | So, we should make the 2nd set of bugfix releases soon(ish)? | 15:02 |
rpittau | dtantsur: beginning of august | 15:03 |
dtantsur | oh? must have miscalculated | 15:03 |
rpittau | usually between -9 and -8 weeks to go | 15:03 |
rpittau | we're close :) | 15:04 |
rpittau | anything else to remind/announce ? | 15:05 |
rpittau | looks like we're good, moving on! | 15:06 |
rpittau | #topic Review Ironic CI status | 15:06 |
rpittau | any updates on CI? it looked stable last week, but I may have missed something | 15:06 |
TheJulia | So I noticed one of the tempest tests, the standalone ones were not doing exactly what was thought | 15:06 |
TheJulia | I seemingly lost a line in a rebace somewhere along the way, so I'm working to fix that since it has uncovered other issues | 15:07 |
TheJulia | related changes are tagged ironic-week-prio | 15:07 |
rpittau | ok, thanks! | 15:07 |
TheJulia | the only other thing to note, anaconda is taking a bit longer than we expect and is timing out | 15:07 |
rpittau | ah | 15:08 |
rpittau | is this a recent thing? | 15:08 |
TheJulia | yeah, looks like it. Kooks like it might need a little more ram/cpu, but ultimately it is just a slow job to begin with | 15:08 |
rpittau | ok, something to keeps an eye on then | 15:08 |
TheJulia | on a plus side, if I get pxe grub2 running under the ironic-redfish-standalone jobs, we can drop a main job setup which will free resources | 15:08 |
rpittau | good | 15:09 |
TheJulia | end of last week, it was fairly frequent, look like it has been happier over the weekend. Worst comes to worst I'm okay with marking anaconda non-voting until we can sort it out | 15:09 |
rpittau | just wondering what could've changed recently | 15:09 |
rpittau | ok, thanks TheJulia | 15:10 |
rpittau | any more updates on CI ? | 15:10 |
TheJulia | oh, it is already non-voting on ironic | 15:11 |
rpittau | right | 15:11 |
TheJulia | its still voting on ironic-tempest-plugin | 15:11 |
TheJulia | and it has been doing it for about two weeks | 15:11 |
rpittau | that's why we probably haven't noticed much | 15:11 |
TheJulia | I can look at tuning it | 15:11 |
TheJulia | That might speed it up a little bit | 15:11 |
rpittau | thanks :) | 15:11 |
rpittau | alright, moving on | 15:12 |
rpittau | #topic Discussion topics | 15:12 |
rpittau | I have on thing | 15:12 |
rpittau | #info pysnmp update status | 15:12 |
rpittau | I haven't received any answer from the maintainer of pysnmp-lextudio so far | 15:13 |
TheJulia | I think the last comment was thursday/friday of last week? | 15:13 |
rpittau | and dont' really got time to move forward with the changes | 15:13 |
TheJulia | yeah... :( | 15:13 |
rpittau | oh so he replied ? | 15:13 |
TheJulia | I think I saw your last comment was last thursday/friday | 15:14 |
rpittau | I left 2 comments | 15:14 |
rpittau | one 3 weeks ago and another one last week | 15:15 |
rpittau | no answers so far | 15:15 |
TheJulia | oh, hmm | 15:15 |
TheJulia | :( | 15:15 |
JayF | Can we catch up on context here? | 15:15 |
JayF | Was this a migration we were doing out of neccessity? | 15:15 |
rpittau | my tests bring to a problem with how asyncio interact with python threading but I don't think I will have the time to look firther into it in the next weeks | 15:15 |
rpittau | JayF: kind of yeah, python version support mainly | 15:16 |
rpittau | this is the testing patch for virtualpdu | 15:16 |
rpittau | #link https://review.opendev.org/c/openstack/virtualpdu/+/922158 | 15:16 |
rpittau | if anyone has time feel free to hijack it, prbably some methods need to be reworked to be able to work with asyncio | 15:16 |
rpittau | I found a similar issue that was fixed here https://github.com/home-assistant/core/pull/112815 | 15:17 |
TheJulia | so the issue your hitting is in virtualpdu specifically at this time? | 15:18 |
rpittau | yes | 15:18 |
TheJulia | okay, that is good context to have | 15:18 |
TheJulia | hmmm | 15:18 |
TheJulia | I don't know how my availability really looks over the next two weeks, but I might be able to take a look. I'm dealing with folks vacations and then traveling the week after | 15:19 |
rpittau | thanks TheJulia, let me know if you need any more info | 15:20 |
TheJulia | k, I'll try to give it some time over the next couple fo days | 15:20 |
TheJulia | of days | 15:20 |
rpittau | ack, thanks | 15:20 |
rpittau | any more comments/questions ? | 15:21 |
hroy | I have created a proposal - https://bugs.launchpad.net/ironic/+bug/2072307 , for this pr : https://review.opendev.org/c/openstack/ironic/+/921657 | 15:21 |
rpittau | hroy: that is in the RFE discussion already I think | 15:21 |
hroy | This is regarding virtual media GET api | 15:21 |
hroy | ah, okay | 15:22 |
rpittau | or maybe not :D | 15:22 |
rpittau | I forgot to add it | 15:22 |
rpittau | I'll add it now and we can discuss it later | 15:22 |
hroy | no issues | 15:22 |
hroy | got it, thanks! | 15:22 |
JayF | That RFE looks reasonable to me fwiw | 15:22 |
JayF | thanks for filing the bug hroy | 15:22 |
dtantsur | I'm mildly worried about a synchronous action in the API.. | 15:23 |
dtantsur | but I also don't know how to avoid it | 15:23 |
TheJulia | There are simply some cases we can't entirely avoid it without massive engineering efforts. I guess the question I have is "why is it needed" | 15:23 |
TheJulia | and what piece of the puzzle owns the lower layer | 15:24 |
dtantsur | TheJulia: that's the only reliable way to learn if the attachment via API actually succeeded | 15:24 |
TheJulia | fair enough | 15:24 |
dtantsur | otherwise, we're relying on the potentially racy last_error | 15:24 |
TheJulia | yup | 15:24 |
TheJulia | makes sense to expose status | 15:24 |
JayF | This isn't the first sync request that goes back to a node is it? | 15:25 |
TheJulia | no, boot mode does it today | 15:25 |
TheJulia | and we've proposed a couple other light touch aspects in the past which sort of have stalled out on the same basic concern | 15:26 |
rpittau | I don't have any particular concern | 15:26 |
JayF | as long as everything fails/times out properly in the unhappy cases, it's fine I think | 15:26 |
TheJulia | Given it is inherently redfish, i have less concerns, since it is definitely not ipmi | 15:26 |
rpittau | if there's no objection I will just mark it as approved | 15:27 |
TheJulia | none, just ask for a little more verbose rfe for that whole why question should it come up again in the future | 15:27 |
rpittau | yep, makes sense | 15:28 |
rpittau | alright, approved | 15:28 |
rpittau | do we ant to have a look at the other RFE since we're here? | 15:28 |
rpittau | #link https://bugs.launchpad.net/ironic/+bug/2071741 | 15:28 |
dtantsur | no objections | 15:29 |
rpittau | we have already a patch proposed for this one and it's ready AFAICS | 15:30 |
TheJulia | none, it is still possible for operators to run their own, I think the conditional got "slightly" mis-used in the depths of the agent driver as well, but removal of the option will clean that up | 15:30 |
rpittau | alright, approved | 15:30 |
JayF | It's not clear to me that using [agent]manage_agent_boot=false is a valid setup at all at this point | 15:30 |
JayF | and I think the original use case could be served by a custom networkmanager | 15:31 |
dtantsur | yeah, nobody knows if it works at all | 15:31 |
TheJulia | yeah | 15:32 |
rpittau | one mroe reason to deprecate it quickly :) | 15:32 |
* dtantsur git push -f | 15:32 | |
rpittau | :D | 15:32 |
JayF | I already have the patch up that makes it deprecated | 15:32 |
TheJulia | heh | 15:32 |
JayF | we'll have to wait two cycles to remove | 15:32 |
dtantsur | will we? | 15:32 |
rpittau | JayF: yep, thanks, mentioned that before | 15:33 |
rpittau | #link https://review.opendev.org/c/openstack/ironic/+/923328 | 15:33 |
rpittau | we can remove the W-1 | 15:33 |
JayF | we have to have it billboarded+working in a SLURP release | 15:33 |
dtantsur | Okay, let's not have this SLURP thing as a religion | 15:33 |
TheJulia | dtantsur: ++ | 15:33 |
JayF | It's not a religion, it's openstack stable policy, and promises our users and operators rely upon. | 15:33 |
dtantsur | This feature is not used and is very probably broken, and we won't fix any bugs with it | 15:33 |
TheJulia | Marking deprecated at least communicates that | 15:34 |
dtantsur | Please don't make me regret giving up the fight against this SLURP thing... | 15:34 |
JayF | That's a reasonable counterpoint, but we also let it sit there rotting for years, why does one more matter so much? lol | 15:34 |
dtantsur | Complexity is not something we lack in the agent code | 15:34 |
TheJulia | lets just make sure there are plenty of inline comments | 15:34 |
masghar | What is a SLURP release | 15:34 |
TheJulia | so we know what is going away in case we need to touch that code in the next year | 15:35 |
TheJulia | ... who am I kidding, when | 15:35 |
dtantsur | masghar: a very stupid name for longer term support releases that allow upgrading jumping over 1 release | 15:35 |
JayF | masghar: https://docs.openstack.org/project-team-guide/release-cadence-adjustment.html | 15:35 |
masghar | thanks! | 15:35 |
JayF | dtantsur: masghar: the dumb name is because the good name was cratered by lawyers lol | 15:35 |
dtantsur | that lawyers! | 15:35 |
TheJulia | Hey, I like those lawyers | 15:35 |
TheJulia | They make my job easier! :) | 15:36 |
rpittau | gosh :) | 15:36 |
dtantsur | Realistically, we always have a certain freedom in interpreting the policies | 15:36 |
dtantsur | And "we keep a broken feature because the paper says so" is not a strong argument in my book | 15:36 |
JayF | I think the question of both: "does it really work?" and "should a feature that turns off a large part of ironic be subject to deprecation policy?" are both reasonable arguments | 15:36 |
rpittau | let's maybe keep that for open discussion :) | 15:36 |
rpittau | I guess we can move on now | 15:36 |
JayF | but lets just approve the rfe, get in the deprecation warning | 15:36 |
JayF | and argue about the removal patch next cycle when it exists | 15:37 |
dtantsur | Well, true | 15:37 |
JayF | I assume we want to at least wait *one* cycle | 15:37 |
* dtantsur nods | 15:37 | |
TheJulia | if we find out it is horribly broken to begin with, just rip it out. | 15:38 |
TheJulia | but next cycle | 15:38 |
rpittau | alright, let's move forward with the meeting | 15:38 |
rpittau | #topic Bug Deputy Updates | 15:38 |
rpittau | I was bug deputy last week | 15:38 |
rpittau | we got 2 new RFEs (that we've discussed already) | 15:39 |
rpittau | 4 new BUGs, with one in particular that I'd like help to triage, or better to approach to | 15:39 |
rpittau | #link https://bugs.launchpad.net/ironic-python-agent/+bug/2071935 | 15:39 |
JayF | I'll also note: I filed a handful (14? 15?) of bugs that came out of the docs-audit. They are tagged "docs-audit-2024" and Reverbverbverb has a very small amount of time left on contract for reviews and clarifying comments if anyone has any questions about those bugs or wants to work on one. | 15:39 |
JayF | It's my expectation they'll be the foundation for a future outreachy/mlh style project | 15:40 |
dtantsur | ++ | 15:40 |
rpittau | JayF: thanks | 15:40 |
JayF | rpittau: that seems like a valid bug to me, probably medium? | 15:40 |
rpittau | and thanks again to Reverbverbverb | 15:40 |
dtantsur | unicode in serial numbers, absolutely and perfectly cursed | 15:41 |
rpittau | JayF: yeah, just wondering how to approach it in a sane way | 15:41 |
rpittau | dtantsur: exactly | 15:41 |
JayF | well, there are two cases right | 15:42 |
JayF | well wait | 15:42 |
TheJulia | wow | 15:42 |
JayF | this is a serial number, baked into the disk | 15:42 |
rpittau | heh.... :) | 15:42 |
rpittau | yep | 15:42 |
JayF | not like a label or something | 15:42 |
dtantsur | the actual problem is in ironic-lib btw | 15:42 |
rpittau | dtantsur: yeah | 15:42 |
rpittau | it's the execute method | 15:43 |
TheJulia | .... hmmmmmmm | 15:43 |
rpittau | that's why I'm puzzled | 15:43 |
TheJulia | so, I guess there are two things | 15:44 |
dtantsur | I wonder if it's also a broken unicode... | 15:44 |
TheJulia | 1) unicode chars should be valid, and we're running the command as utf8, the data structure has utf-16... we likely need to fix the command invocation to handle utf16 in the data | 15:44 |
rpittau | well I would not expect anyone to put unicode or broken utf8 in a serial number honestly | 15:44 |
TheJulia | 2) That character itself is likely not valid | 15:44 |
JayF | that log is ... podman running IPA?! | 15:44 |
dtantsur | https://www.compart.com/en/unicode/U+DCFF is repeated | 15:45 |
dtantsur | JayF: welcome to OpenShift, we have weird stuffs :D | 15:45 |
rpittau | :D | 15:45 |
masghar | JayF I was wondering the same | 15:45 |
dtantsur | https://github.com/openshift/ironic-agent-image | 15:45 |
JayF | dtantsur: full circle, I used to be the IPA-container-weirdo | 15:45 |
dtantsur | hehe | 15:45 |
TheJulia | I think this would be one of those cases we likely need to verify "what is the actual serial number reported via other means" as well. | 15:46 |
TheJulia | specifically, we get "2HC015KJ0000" before the unicode | 15:46 |
TheJulia | 4 zeros is... suspect | 15:46 |
TheJulia | The odds are against that actually being the case | 15:46 |
rpittau | I was wondering if maybe the serial is not correct for some reason | 15:46 |
dtantsur | Yeah, the unicode part is garbage most likely | 15:46 |
JayF | My curiousity is: a newer version of the tool in questino | 15:46 |
JayF | or at least a cursory look at the changelog | 15:46 |
JayF | wouldn't be the first time an lsblk behavior change benefited or harmed our use cases | 15:47 |
TheJulia | definitely garbage characters, the question is how did it get there and why, but realistically we likely need to expect any command execution can return utf-16 at this point | 15:47 |
TheJulia | and to try and force encoding to 8 is going to have some bad side effects | 15:47 |
rpittau | ok, I think I have enough to move forward on 2 fronts | 15:47 |
TheJulia | we saw similar with efibootmgr | 15:47 |
TheJulia | since it's data is technically utf-16 | 15:47 |
dtantsur | I guess the question is what can we do if an important tool returns junk | 15:48 |
TheJulia | I would truncate the invalid characters if we recognize them as formatting/control chars | 15:48 |
rpittau | I haven't seen any other reports beside this one | 15:48 |
TheJulia | but, running in utf-16 is sort of the first step along with identifying what is actually the correct serial | 15:49 |
JayF | It'd also be nice to get written down the hardware that is doing this | 15:49 |
rpittau | ok | 15:49 |
TheJulia | Validate reality versus our perception of reality | 15:49 |
dtantsur | yeah, I guess we need details | 15:49 |
JayF | e.g. if we get it reported again, maybe knowing it's WD Blacks released in May 2023 or something | 15:49 |
rpittau | JayF: LITEON IT ECE-12 | 15:49 |
dtantsur | wut | 15:49 |
JayF | e.g. if we get it reported again, maybe knowing it's WD Blacks released in May 2023 or something/ | 15:49 |
JayF | ugh | 15:49 |
TheJulia | embedded sata drive | 15:49 |
JayF | is that a cd-rom from 2005? | 15:49 |
rpittau | :D | 15:49 |
JayF | I found an image of that model, the serial number is only ASCII characters confirmed :P | 15:50 |
JayF | time to implement label-OCR-inspection technology in IPA /s | 15:50 |
rpittau | I'll get more details anyway | 15:50 |
rpittau | thanks all | 15:51 |
TheJulia | ... rpittau I can't help but wonder if there is a raid controller in the middle which is doing "stuff" | 15:51 |
rpittau | mmmm | 15:51 |
rpittau | more info to get | 15:51 |
TheJulia | I had a case recently where basically the serial/wwn and friends were all faked out by the raid controller | 15:51 |
TheJulia | if you rebuild the reset, it all changed | 15:51 |
TheJulia | s/reset/raid set/ | 15:51 |
dtantsur | so nice of it | 15:51 |
TheJulia | very much so | 15:51 |
rpittau | oh gosh | 15:52 |
TheJulia | The customer kept insisting on doing 2x raid0s and they got the same WWN reported by the raid controller | 15:52 |
TheJulia | and of course they filed a bug saying it is the software's fault :) | 15:52 |
dtantsur | ah, I think I've seen this discussion | 15:52 |
JayF | yeah, the software on that raid controller :D | 15:52 |
dtantsur | ++ | 15:52 |
TheJulia | yup | 15:52 |
rpittau | yeah | 15:52 |
dtantsur | it's all quite a problem for root device hints.. | 15:53 |
TheJulia | And really, it is actually a valid thing for the raid controller to do | 15:53 |
TheJulia | just yeah, it and human expectations of uniqueness is problematic for root device hints | 15:53 |
dtantsur | well, if they put some thoughts into it, they could do ~ hash of both WWNs and the RAID level or something like that | 15:53 |
TheJulia | Since a SAN can say everything is the same WWN and have unique LUNs or path numbers | 15:53 |
dtantsur | to make it predictable for the same combination of drives | 15:53 |
* dtantsur is dreaming of a world where things make sense | 15:54 | |
TheJulia | heh | 15:54 |
TheJulia | I think my guidance was the hctl based because it was at least consistent | 15:54 |
TheJulia | everything else would change between reboots | 15:54 |
TheJulia | or at least, could change | 15:54 |
* TheJulia notes it is too early for whiskey | 15:55 | |
rpittau | not here :D | 15:55 |
rpittau | btw cid you're the bug deputy for this week, good luck! :) | 15:55 |
TheJulia | Oh question! | 15:56 |
rpittau | sure! | 15:56 |
cid | tks | 15:56 |
JayF | cid: feel free to ping me here or in slack if you have any bugs that have you perplexed | 15:56 |
TheJulia | ... should we consider deprecating the ilo stuffs? specifically thinking if we deprecate removal is in a year | 15:56 |
rpittau | mmmm | 15:56 |
dtantsur | ilo4 is still used in the wild, unfortunately | 15:56 |
rpittau | ilo4 is kind of dead | 15:56 |
JayF | We said pretty clearly we'd keep support for those as long as hardware that used it was in place. | 15:56 |
TheJulia | heh | 15:57 |
cid | JayF: I will | 15:57 |
JayF | I know multiple different Ironic shops that use ilo driver with ilo-driver-exclusive hardware. | 15:57 |
TheJulia | Interesting perception differences | 15:57 |
rpittau | I mean technically | 15:57 |
rpittau | but i npractice we still see usage | 15:57 |
dtantsur | Metal3's quick start literally uses ilo4 as an example :D | 15:57 |
rpittau | ilo4 is dead more than 1 year ago | 15:57 |
JayF | "dead" means what? Unsupported by HP? No longer sold? | 15:58 |
JayF | Would someone on a 5-7 year hardware lifecycle still have them? | 15:58 |
rpittau | means unsupported, latest firmware version is from March 2023 | 15:58 |
TheJulia | Okay, I guess nevermind then, just trying to think out into the future far enough and it sort of made me wonder since we dropped the wsman stuffs, but a forward path and all is the unknown | 15:58 |
TheJulia | rpittau: and that last firwmare drop was like a year after EoS right? | 15:58 |
JayF | TheJulia: if we deprecate in a slurp release (e.g. if we deprecated a driver next release), we can remove in the following | 15:58 |
rpittau | TheJulia: correct | 15:58 |
JayF | TheJulia: it's only "wait a whole year" when you try to deprecate in a between-slurps release | 15:58 |
TheJulia | Okay, well, just something we should be mindful of as time moves forward | 15:59 |
dtantsur | This is the very case where I think "a whole year" may be justified | 15:59 |
dtantsur | because people may not be able to buy new hardware in 6 months | 15:59 |
TheJulia | Yeah, we have almost always treated driver removals as long deprecations | 15:59 |
JayF | I could be +1 to *marking it deprecated* | 15:59 |
JayF | with no timeline set on removal | 15:59 |
JayF | as an informational thing | 15:59 |
TheJulia | I think that is fair | 15:59 |
dtantsur | Right. Just as a heads-up for operators: you're dealing with obsolete technology. | 16:00 |
TheJulia | I think the whole "we still see some active use" | 16:00 |
rpittau | I'm absolutely and totally for deprecate it :) | 16:00 |
TheJulia | it makes it a bit harder for us since the migration is not as clear cut perception wise | 16:00 |
JayF | I think the weird bonus piece of this which we all include in the math indirectly | 16:00 |
dtantsur | Thought keep in mind: the ilo5 driver uses ilo4 bits | 16:00 |
dtantsur | so we don't just get rid of proliantutils | 16:00 |
TheJulia | yup, and proliantutils hides the details | 16:00 |
JayF | is that the question isn't "who uses ilo4 hardware today?" it's "Who will be using ilo4 hardware when they upgrade into Dalmation?" | 16:00 |
JayF | which are different questions | 16:00 |
dtantsur | The question is also what we'll realistically deprecate given that ilo5 is a superset of ilo4 | 16:02 |
dtantsur | ref https://opendev.org/openstack/ironic/src/branch/master/ironic/drivers/ilo.py | 16:02 |
JayF | When TheJulia said "deprecate ilo driver", I assumed she meant *all ilo drivers* and migrate ilo5 users to redfish | 16:02 |
JayF | there is minor if any benefit in doing it piecemeal afaict | 16:02 |
dtantsur | That's a loss of functionality for existing hardware | 16:02 |
rpittau | ilo5 is still actively supported though | 16:03 |
dtantsur | existing as in supported and actively used | 16:03 |
dtantsur | e.g. you lose RAID | 16:03 |
TheJulia | I guess we're sort of stuck until post ilo5 then | 16:03 |
masghar | Is it very difficult for ilo5 users to just switch to redfish? | 16:04 |
TheJulia | we should "deprecate" ilo4 in terms of set expectations | 16:04 |
TheJulia | we're not going to magically fix bugs there in proliantutils | 16:04 |
dtantsur | masghar: ilo5 does not support standard Redfish RAID (I think you were the one who discovered it ;) | 16:04 |
JayF | ++ absolutely, and I could even see that being a good move for ilo5 | 16:04 |
TheJulia | ilo5... that might be in sushy for all we know and we should at least "look" | 16:04 |
TheJulia | And the explain what/why/etc | 16:04 |
masghar | dtantsur: Yes I think so | 16:04 |
dtantsur | we *could* extract what we need from proliantutils and build a new driver on top of redfish | 16:04 |
JayF | I also think we should consider pulling some of proliantutils into Ironic/sushy, like we have considered for sushy-oem-drac | 16:04 |
JayF | bingo! | 16:04 |
dtantsur | possibly. proliantutils is not a simple plugin on top of sushy | 16:05 |
TheJulia | definitely not a simple plugin | 16:05 |
TheJulia | it has lots of logic and makes assumptions | 16:05 |
rpittau | and it stil luses python six :) | 16:05 |
TheJulia | i.e. if you point it at an edgeline machine it acts differently even though it has an "ilo" | 16:05 |
JayF | Yeah, and some of those assumptions are invalid in newer ironics | 16:05 |
TheJulia | yup | 16:05 |
JayF | I fixed a bug in one recently | 16:05 |
dtantsur | there is a possibility that the RAID stuff specifically is possible to extract | 16:05 |
dtantsur | https://opendev.org/x/proliantutils/src/branch/master/proliantutils/redfish/resources/system/storage maybe | 16:06 |
JayF | basically if there's no separate entity maintaining proliantutils | 16:06 |
rpittau | we're 5 past, I'm closing the meeting but we can keep the discussion open | 16:06 |
JayF | but we still need the code inside | 16:06 |
rpittau | #endmeeting | 16:06 |
opendevmeet | Meeting ended Mon Jul 8 16:06:16 2024 UTC. Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4) | 16:06 |
opendevmeet | Minutes: https://meetings.opendev.org/meetings/ironic/2024/ironic.2024-07-08-15.00.html | 16:06 |
opendevmeet | Minutes (text): https://meetings.opendev.org/meetings/ironic/2024/ironic.2024-07-08-15.00.txt | 16:06 |
opendevmeet | Log: https://meetings.opendev.org/meetings/ironic/2024/ironic.2024-07-08-15.00.log.html | 16:06 |
JayF | it's easier to absorb the code, potentially | 16:06 |
dtantsur | could be indeed | 16:06 |
TheJulia | so lets ponder it, we can discuss more next week :) | 16:06 |
rpittau | yes | 16:06 |
rpittau | thanks everyone! | 16:06 |
TheJulia | Thanks! | 16:07 |
opendevreview | Verification of a change to openstack/ironic master failed: Localdev: disable caching and related periodic https://review.opendev.org/c/openstack/ironic/+/923453 | 16:13 |
clarkb | not sure if you're aware but I noticed the metal3 integration jobs are failing with ERROR! Failed to download collection tar from 'default' due to the following unforeseen error: HTTP Error 403: Forbidden. HTTP Error 403: Forbidden | 16:15 |
JayF | 403 errors! That's a lot of errors! | 16:17 |
opendevreview | Verification of a change to openstack/ironic master failed: Require hashed passwords for rescue by default https://review.opendev.org/c/openstack/ironic/+/923452 | 16:18 |
opendevreview | Verification of a change to openstack/ironic master failed: Localdev: disable caching and related periodic https://review.opendev.org/c/openstack/ironic/+/923453 | 16:19 |
JayF | 2024-07-08 15:22:14.714191 | controller | + /opt/metal3-dev-env/venv/bin/ansible-galaxy install -r vm-setup/requirements.yml | 16:22 |
JayF | 2024-07-08 15:22:17.315513 | controller | ERROR! Failed to download collection tar from 'default' due to the following unforeseen error: HTTP Error 403: Forbidden. HTTP Error 403: Forbidden | 16:22 |
JayF | For metal3 folks ^ that's the failure log from https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_dc5/923453/2/gate/metal3-integration/dc525ba/job-output.txt | 16:23 |
TheJulia | I guess, in the short term all we can do is non-vote it | 16:27 |
JayF | ++ | 16:28 |
TheJulia | so, I just saw in another channel in slack someone hit the same issue with galaxy and they believe it now to be respolved | 16:29 |
TheJulia | so maybe this is self correcting | 16:29 |
TheJulia | we shall know shortly | 16:31 |
rpittau | TheJulia JayF the issue is on galaxy download not strictly on the metal3 job | 16:34 |
TheJulia | Yeah, looks like ansible galaxy had a hiccup | 16:35 |
TheJulia | I think we may be past it | 16:37 |
opendevreview | Julia Kreger proposed openstack/ironic master: WIP: combine tftp maps https://review.opendev.org/c/openstack/ironic/+/923601 | 16:47 |
opendevreview | Julia Kreger proposed openstack/ironic master: DNM/WIP: Incease resources to the anaconda job https://review.opendev.org/c/openstack/ironic/+/923667 | 16:50 |
cid | ironic, about the runbooks feature, should I allow, like (PROJECT_SERVICE) to have the same privileges as project admins | 16:50 |
TheJulia | likely | 16:51 |
JayF | I was waffling between if it should be "project admin" or "project manager" level by default | 16:51 |
JayF | but I'm not sure we even have a default | 16:51 |
TheJulia | well, service is a bit different | 16:51 |
JayF | if you did what you propose, it's probably a sane default? | 16:52 |
TheJulia | well, a service account in the service project is different | 16:52 |
TheJulia | essentially it denotes another openstack service which gets elevated rights, but it doesn't necessarilly need to map on to it | 16:52 |
JayF | so service project should maybe == system admin? | 16:53 |
JayF | e.g. letting it do things like set projects on runbooks and set public is probably OK and desired | 16:53 |
TheJulia | yeah, functionally somewhere between system admin and system scoped member | 16:54 |
cid | was thinking the same because current policy definitions, except for node, allows that, it would seem. | 16:54 |
rpittau | good night! o/ | 16:57 |
* TheJulia takes a stick and pokes virtualpdu | 17:13 | |
TheJulia | rpittau: virtualpdu's weirdness is indeed weird | 18:58 |
opendevreview | Verification of a change to openstack/ironic master failed: Localdev: disable caching and related periodic https://review.opendev.org/c/openstack/ironic/+/923453 | 19:05 |
TheJulia | rpittau: tomorrow, lets try and chat for a couple minutes regarding virtualpdu, I have some thoughts from looking today | 20:43 |
cid | o/ | 21:22 |
JayF | TheJulia: rpittau : if timing doesn't work out that's fine, but I've also taken a swing at this and was a little lost so I would very much enjoy being in on that conversation | 21:23 |
opendevreview | Jacob Anders proposed openstack/ironic master: Add Targets to firmware.update on multi system BMCs https://review.opendev.org/c/openstack/ironic/+/922438 | 23:02 |
Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!