Thursday, 2025-01-02

<iurygregory> happy new year Ironic o/ [10:30]
<dtantsur> happy new year folks! [11:30]
<opendevreview> Takashi Kajinami proposed openstack/ironic master: Replace crypt module  https://review.opendev.org/c/openstack/ironic/+/937173 [13:07]
<opendevreview> Takashi Kajinami proposed openstack/ironic-python-agent master: Replace crypt module  https://review.opendev.org/c/openstack/ironic-python-agent/+/937175 [13:07]
<opendevreview> Adam Rozman proposed openstack/ironic master: disable ISO cache image format and safety checks  https://review.opendev.org/c/openstack/ironic/+/938363 [13:51]
<iurygregory> dtantsur, do you have thoughts on https://review.opendev.org/c/openstack/ironic/+/938108 ? [13:55]
<iurygregory> looking for feedback if this would be a valid approach or not [13:55]
<dtantsur> iurygregory: generally. But I'd prefer the retry to be more granular. [13:59]
<dtantsur> As in, I'm not sure if the whole prepare_ramdisk is even re-entrant [13:59]
<dtantsur> I also don't quite understand why it belongs in prepare_ramdisk, not somewhere in the firmware update code [14:00]
<iurygregory> let me find the link with the logs 1min [14:00]
<iurygregory> https://paste.opendev.org/show/bdf8ZjY9DXtJhzYGKTDm/ [14:01]
<iurygregory> here it goes [14:01]
<iurygregory> the error trying to reach the BMC was during prepare_ramdisk, that is why I added there [14:01]
<dtantsur> okay, so File "/opt/stack/ironic/ironic/drivers/modules/redfish/firmware.py", line 184 seems to be the right place to add something like "are we ready to proceed already?" [14:01]
<iurygregory> https://opendev.org/openstack/ironic/src/branch/master/ironic/drivers/modules/redfish/firmware.py#L184 ? [14:02]
<dtantsur> I don't think we should even try to call prepare_ramdisk if the BMC is not working [14:02]
<iurygregory> so probably we would need logic inside reboot_to_finish_step ? [14:02]
<dtantsur> does your logic apply to normal deployment? cleaning? RAID? [14:03]
<dtantsur> if not, why put it in the generic code? [14:03]
<dtantsur> the cause of the failure is firmware update, right? [14:03]
<dtantsur> so this has to be handled by (and constrained within) the firmware update code [14:04]
<iurygregory> yeah, it makes sense, since it's only during firmware update [14:04]
<dtantsur> right. so probably try to poke the system/managers list and retry. [14:05]
<dtantsur> bonus: you will be sure you're not retrying something entirely unrelated [14:05]
<dtantsur> now, I have another concern. Are you sure this condition will get resolved at all until reboot? [14:06]
<iurygregory> ok, I have the feeling it would, like the BMC becomes responsive after sometime, if I trigger a manual redfish simple update we don't need to send reboot [14:07]
<iurygregory> so I'm assuming it would be ok [14:07]
<dtantsur> great [14:07]
<dtantsur> then our plan should be enough [14:08]
<iurygregory> will do manual testing on the HPE and Dell again to see how it goes [14:08]
<iurygregory> Thanks for the help! [14:08]
<dtantsur> sure [14:11]
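
Editor's note: the plan agreed above (probe the BMC from inside the firmware update code, retry until it answers, and only then proceed) could look roughly like the sketch below. It is not taken from the review under discussion. The names `wait_for_bmc`, `BMCNotReady`, and `probe` are hypothetical, and `probe` stands in for whatever call lists the Redfish systems/managers; the sketch assumes that call raises `ConnectionError` while the BMC is unreachable.

```python
import time


class BMCNotReady(Exception):
    """Raised when the BMC does not answer within the allowed attempts."""


def wait_for_bmc(probe, attempts=5, delay=10.0, sleep=time.sleep):
    """Call probe() until it succeeds or the attempts run out.

    probe is a hypothetical zero-argument callable that queries the BMC
    (for example, listing systems and managers) and is assumed to raise
    ConnectionError while the BMC is still unreachable.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            # Success: hand back whatever the probe returned.
            return probe()
        except ConnectionError as exc:
            last_error = exc
            # Wait between attempts, but not after the final one.
            if attempt < attempts:
                sleep(delay)
    raise BMCNotReady(
        f"BMC not reachable after {attempts} attempts"
    ) from last_error
```

Because only the narrow probe is retried, a failure in unrelated code (deployment, cleaning, RAID) is never retried by accident, which is the "bonus" mentioned at 14:05. The `sleep` parameter exists only so the delay can be replaced in tests.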
