Tuesday, 2020-07-21

*** openstack has joined #openstack-ironic07:31
*** ChanServ sets mode: +o openstack07:31
*** rcernin_ has quit IRC07:34
*** bbezak has joined #openstack-ironic07:50
*** dtantsur|afk is now known as dtantsur07:55
dtantsurmorning ironic07:55
*** ociuhandu has joined #openstack-ironic08:08
*** rcernin_ has joined #openstack-ironic08:16
jandershi dtantsur08:21
jandersI've got a couple bifrost questions, if you guys have time08:22
janders(just getting snippets ready)08:23
jandershttp://paste.openstack.org/show/796146/08:24
*** priteau has joined #openstack-ironic08:24
janderswith help from iurygregory I managed to get past the issue I was having yesterday. It seems it's to do with ANSIBLE_COLLECTIONS_PATHS variable not being set by the venv08:24
*** derekh has joined #openstack-ironic08:25
jandersI wonder if  I'm doing something wrong, or am I hitting a bug. Have you guys seen this issue?08:25
*** rcernin_ has quit IRC08:26
* dtantsur is on a German class till the top of the hour08:26
gudrutis0Hi folks08:31
rpittaugood morning ironic! o/08:36
jandershey rpittau08:36
rpittauhey janders :)08:36
jandershow are you going?08:36
*** aedc has joined #openstack-ironic08:37
rpittauall right all considered, lot of things going on at the moment08:41
rpittauhow's it going for you?08:41
jandersgood, thank you. Hitting some bumps getting bifrost set up for VM testing but making steady progress08:42
jandersthank you for your feedback regarding my doco change proposal - I did get a few things wrong there, hopefully all fixed noww08:42
rpittaujanders: no problem, thanks for putting up that :)08:45
jandersno worries, you  were on the money08:46
jandersI seem to have particularly bad luck for venv issues08:46
jandersand sometimes when I hit one it seems to me things are broken while in fact it's just my confusion08:46
*** rcernin_ has joined #openstack-ironic08:46
jandersspeaking of venv issues - I have a fresh one I wanted to ask for assistance with08:47
jandershttp://paste.openstack.org/show/796147/08:47
rpittaueheh shoot :)08:47
jandersI have openstacksdk installed in venv, however if I run a playbook it complains about openstacksdk missing08:47
janderstask is become=no so I'd hope the venv one will work but it doesn't08:48
rpittaummmm08:54
jandersI enabled rdo ussuri repo and installed openstacksdk via dnf as a test, it did make the issue go away08:56
jandersso it is what it claims it is, not some weird cascade error08:56
rpittauI was going to suggest to install that systemeide, ok, so it doesn't really detect the virtualenv08:56
jandersbut I still wonder why the venv openstacksdk doesn't suffice from playbook perspective08:57
jandersI had the same issue with jsonpatch earlier (this is in the main C8 repo so was an easy fix)08:57
jandersansible-playbook does point to /opt/stack/bifrost/bin/ansible-playbook so venv does work other than that08:58
dtantsurjanders, iurygregory, we need ANSIBLE_COLLECTIONS_PATHS handled automatically or documented09:00
dtantsur(ideally the former)09:00
dtantsurjanders: re openstacksdk: did you source the venv? I've seen this problem when venv was not sourced09:01
dtantsur(i.e. it's not enough to call /opt/stack/bifrost/bin/ansible-playbook)09:01
jandersdtantsur yes I did09:01
rpittaudtantsur: it seems it was sourced looking at the paste09:01
rpittaujanders:09:02
rpittaujanders: wait, you install openstacksdk using --user09:02
rpittaunot in the venv09:02
dtantsurRequirement already satisfied: openstacksdk in /opt/stack/bifrost/lib/python3.6/site-packages09:02
dtantsur--user is incorrect, but it should be fine anyway (we install the sdk as part of bifrost)09:03
rpittautrue09:03
janderscorrect, I thought that was the right thing. So if I'm venv, I don't add --user, ACK.09:03
dtantsurjanders: what if you do `-e ansible_python_interpreter=/opt/stack/bifrost/bin/python3`?09:03
janderslet me reproduce it, 1min (gotta remove systemwide rpm)09:04
rpittaumaybe it's ansible not detecting the virtualenv09:04
openstackgerritMerged openstack/ironic master: Add wsme core types, remove WSME  https://review.opendev.org/70449009:05
jandersdtantsur it works if `-e ansible_python_interpreter=/opt/stack/bifrost/bin/python3` is added09:06
jandersthis rings a bell actually09:06
rpittauso yeah, it's ansible not getting the venv09:07
*** rcernin_ has quit IRC09:07
dtantsuransible's handling of venv is a mess :(09:10
jandersI remember having to do this while ansible-ising venv setup for ansible in the past (may sound crazy I know)09:10
janderscan't find the code anymore though, doesn't matter09:11
jandersI wonder if we need to add this to doco (thinking where)09:11
jandersI reckon if I hit this the users might as well09:12
jandershttps://docs.openstack.org/bifrost/latest/user/howto.html#enroll09:12
jandersdoco claims it's a fixed problem, but I am not convinced based on my experience09:13
dtantsurjanders: what if we modify our playbooks/roles to set ansible_python_interpreter if in a venv?09:14
rpittau^ IMHO this is how it should be09:15
* janders is checking if we have ansible config file inside the repo09:16
jandersnope09:17
jandersI think your suggestion is the way to go dtantsur rpittau09:17
janderswould you like me to have a go at fixing this as well as the missing ansible collection paths?09:17
jandershappy to09:17
rpittaujanders: if you got the time, I will gladly review :)09:18
janderscool! I've been looking for stuff to work on last week (and these issues blocked working on the cleaning step ordering RFE) so I don't see why not09:19
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196409:19
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196409:20
*** k_mouza has joined #openstack-ironic09:21
openstackgerritRiccardo Pittau proposed openstack/ironic-python-agent master: More refactoring of the image module  https://review.opendev.org/74211009:27
*** rcernin_ has joined #openstack-ironic09:41
openstackgerritVerification of a change to openstack/bifrost failed: Expand advanced install instructions (path info)  https://review.opendev.org/74161109:42
*** k_mouza has quit IRC09:53
*** rcernin_ has quit IRC09:55
*** k_mouza has joined #openstack-ironic10:01
*** ricolin has quit IRC10:18
janderswhat happened with https://review.opendev.org/741611 - does the Zuul fail mean the document changed in the meantime?10:20
patchbotpatch 741611 - bifrost - Expand advanced install instructions (path info) - 7 patch sets10:20
rpittaujanders: yeah, try a rebase on latest master10:20
jandersok10:20
*** aedc_ has joined #openstack-ironic10:22
*** aedc has quit IRC10:22
*** rcernin_ has joined #openstack-ironic10:28
*** Lucas_Gray has joined #openstack-ironic10:33
*** vmud213 has joined #openstack-ironic10:34
openstackgerritJacob Anders proposed openstack/bifrost master: Expand advanced install instructions  https://review.opendev.org/74161110:45
vmud213Hello Ironic !10:45
vmud213Good morning and Good evening ..10:46
openstackgerritJacob Anders proposed openstack/bifrost master: Expand advanced install instructions  https://review.opendev.org/74161110:47
jandersok I think this revision should be okay10:48
jandersnow back to venv & ansible challenges :)10:49
openstackgerritMerged openstack/ironic-python-agent-builder master: Add Tripleo IPA build image job in zuul layout  https://review.opendev.org/74191710:53
jandersthinking how to add setting Ansible collections path to bifrost venv11:01
jandersI see we do it here:11:01
jandershttps://github.com/openstack/bifrost/blob/master/scripts/install-deps.sh#L9611:01
janderswould it be okay if I just append the export to the end of activate script?11:01
janderskind of braindead but should just work...11:02
janderswhat do you think?11:02
iurygregoryhello everyone o/11:14
jandersg'day iurygregory11:14
jandersthanks heaps for your help with ansible collections yesterday, your hints got me out of a rut11:15
jandersfixing a related bug now11:15
iurygregorynp o/11:15
dtantsurI'm not sure how I feel about updating the activate script(s)11:16
janderswhat would be an alternative?11:16
iurygregorydtantsur, it's documented and it was handle automatic O.o11:16
dtantsurMaybe we should set the variable for our roles ourself (inside the roles) and document what should be done11:17
dtantsurjanders: the problem is, not everyone will necessary use 'activate'11:17
jandersok happy to try that11:17
iurygregorythe CI has no problem with the collections and when I tested locally it was working11:17
dtantsurnot everyone will even use a shell, imagine someone calling ansible-playbook from a python/go/whatever program11:17
janderstrue11:17
dtantsurwe need to tell them what to do in the very base case11:17
jandersiurygregory was the variable being automatically set in your setup or was it working using some different mechanism?11:19
iurygregoryjanders, for me it was automatically set11:19
iurygregoryhttps://docs.openstack.org/bifrost/latest/install/index.html#dependencies11:19
jandersright! I wonder if it has to do with ansible.cfg contents11:20
* janders is checking his ansible.cfg11:21
dtantsurstevebaker: did some local testing, the following are wrongly authenticated: / /v1 /v1/continue in ironic-inspector, /v1/heartbeat in ironic: http://paste.openstack.org/show/796150/11:22
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196411:24
openstackgerritvinay kumar muddu proposed openstack/ironic master: Decouple the ISO creation logic from redfish  https://review.opendev.org/73917411:32
*** vmud213 has quit IRC11:40
*** sshnaidm|afk is now known as sshnaidm11:41
*** Lucas_Gray has quit IRC11:41
dtantsurif anyone has a minute: https://review.opendev.org/#/c/740246/11:43
patchbotpatch 740246 - bifrost - Support extra-vars in bifrost-cli - 6 patch sets11:43
*** Lucas_Gray has joined #openstack-ironic11:44
*** iurygregory has quit IRC11:49
*** tkajinam has quit IRC11:56
*** iurygregory has joined #openstack-ironic11:59
*** ociuhandu_ has joined #openstack-ironic12:04
openstackgerritMerged openstack/bifrost master: Update to cirros 0.5.1  https://review.opendev.org/74192312:05
*** ociuhandu has quit IRC12:07
*** rh-jelabarre has joined #openstack-ironic12:09
rpittaudtantsur: wondering if a release note is needed for the extra-vars ?12:11
*** ricolin has joined #openstack-ironic12:15
*** ociuhandu_ has quit IRC12:15
*** ociuhandu has joined #openstack-ironic12:16
*** Goneri has joined #openstack-ironic12:17
*** k_mouza has quit IRC12:18
openstackgerritVerification of a change to openstack/ironic-inspector failed: Do not try to set local_gb to -1 when the matched root device size is 0  https://review.opendev.org/74193712:26
*** diurnalist has joined #openstack-ironic12:33
jandersregarding Ansible Collection I have one more idea for a potential fix12:34
janderssymlinking the collections install directory to the bifrost playbooks directory can remove the need of setting environment variable12:35
jandersI suppose that would be more cross-platform than tweaking activate script12:35
jandersdtantsur what do you think about this approach?12:35
jandersI'm still trying to get to the bottom of why the var isn't set right now but I wonder if this idea would be a fix material or not12:36
*** diurnalist has quit IRC12:37
dtantsurrpittau: likely, I can follow-up12:39
rpittaudtantsur: okay12:39
dtantsurjanders: thinking..12:39
dtantsurjanders: it may work, just needs to be documented (and added to .gitignore :)12:39
jandersI was thinking creating the symlink from env-setup.sh12:40
janderscause that's where collections are getting installed into the venv (if I understand correctly)12:41
jandersdoes this make sense?12:41
dtantsurjanders: yep, at least at first glance12:41
jandersshould I treat this and the ansible python interpreter as two separate change proposals, or should I combine the two?12:42
jandersmy gut feel = separate12:43
rpittaujanders: separate sounds right, thanks12:43
janderscool, thank you12:43
dtantsurwhatever works for you12:43
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support extra-vars in bifrost-cli  https://review.opendev.org/74024612:50
dtantsurrpittau: I've realized it was only the 1st +2 and the next patch won't pass ^^12:50
dtantsurso added now12:50
rpittauah yeah12:50
openstackgerritRiccardo Pittau proposed openstack/bifrost master: Make ansible ask for sudo password  https://review.opendev.org/74217512:52
TheJuliabrraaaaiiinnnssss12:53
dtantsurmorning TheJulia12:53
rpittauthanks, I'd like one :)12:53
TheJuliagood morning everyone12:53
dtantsur:)12:53
rpittaugood morning TheJulia :)12:53
TheJuliais everyone ready for another exciting OpenDev discussion!?12:53
jandersg'day TheJulia o/12:53
rpittauI missed the one yesterday, how was it ?12:53
dtantsurTheJulia: I have a visit from the house management company, so will be randomly on and off12:53
*** rcernin_ has quit IRC12:54
* arne_wiebalck prepared coffee and is hence totally ready12:54
dkingTheJulia: Actually, I never got my email for this one with the details, even though I registered.12:54
rpittaummmmm brain + coffee12:54
dkingDo you know where we could get that?12:54
arne_wiebalckthe combo of success12:54
TheJuliadtantsur: ack12:55
TheJuliadking: eek12:55
TheJuliaI can get you the link12:55
TheJuliahttps://zoom.us/j/97532742477?pwd=b2ZydG50eXBFekdFMGNaTVNzVUFYZz09#success12:56
*** ricolin has quit IRC12:56
*** tzumainn has joined #openstack-ironic12:58
rpittauthe music is very relazing12:58
rpittauor relaxing12:58
TheJulia++13:00
*** rloo has joined #openstack-ironic13:02
openstackgerritJacob Anders proposed openstack/bifrost master: Replace ANSIBLE_COLLECTIONS_PATHS var with symlink  https://review.opendev.org/74219013:24
openstackgerritJacob Anders proposed openstack/bifrost master: Replace ANSIBLE_COLLECTIONS_PATHS var with symlink  https://review.opendev.org/74219213:37
*** bnemec has joined #openstack-ironic13:38
janderswrapping up for the night, see you tomorrow o/13:39
*** tenbrae has joined #openstack-ironic13:47
*** bfournie has joined #openstack-ironic13:48
*** bfournie has quit IRC14:03
*** yaawang has quit IRC14:05
*** yaawang has joined #openstack-ironic14:05
openstackgerritRiccardo Pittau proposed openstack/bifrost master: Consolidate env-setup  https://review.opendev.org/74220114:09
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Use random passwords by default  https://review.opendev.org/74220314:15
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Use random passwords by default  https://review.opendev.org/74220314:16
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196414:17
*** ricolin has joined #openstack-ironic14:21
*** markmcclain has quit IRC14:23
*** markmcclain has joined #openstack-ironic14:24
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Use random passwords by default  https://review.opendev.org/74220314:30
*** diurnalist has joined #openstack-ironic14:32
*** abdysn has quit IRC14:36
openstackgerritDmitry Tantsur proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196414:39
*** k_mouza has joined #openstack-ironic14:45
openstackgerritMerged openstack/ironic master: Remove File type  https://review.opendev.org/74186214:45
openstackgerritMerged openstack/ironic master: Remove Link type  https://review.opendev.org/74186314:45
openstackgerritMerged openstack/ironic master: Stop running test_schedule_to_all_nodes in the multinode job  https://review.opendev.org/74188414:45
openstackgerritDmitry Tantsur proposed openstack/ironic stable/ussuri: Stop running test_schedule_to_all_nodes in the multinode job  https://review.opendev.org/74221414:51
*** rcernin_ has joined #openstack-ironic14:52
openstackgerritDmitry Tantsur proposed openstack/ironic stable/train: Stop running test_schedule_to_all_nodes in the multinode job  https://review.opendev.org/74221514:53
*** rcernin_ has quit IRC14:57
openstackgerritShivanand Tendulker proposed openstack/ironic master: Adds raid validation for in-band AgentRAID deploy step  https://review.opendev.org/74221715:05
*** stendulker has quit IRC15:09
openstackgerritVerification of a change to openstack/ironic failed: Auto extend the timeout for RAX hosts  https://review.opendev.org/74185315:16
openstackgerritShivanand Tendulker proposed openstack/ironic master: Adds raid validation for in-band AgentRAID deploy step  https://review.opendev.org/74221715:35
*** aedc_ has quit IRC15:38
*** gyee has joined #openstack-ironic15:41
*** yolanda has quit IRC15:42
*** yolanda has joined #openstack-ironic15:44
*** stendulker_ has joined #openstack-ironic15:53
stendulker_Hi dtantsur15:54
dtantsuro/15:54
stendulker_This is regarding https://review.opendev.org/#/c/74089315:54
patchbotpatch 740893 - ironic - Adds support SUM based firmware update as deploy step - 1 patch set15:54
stendulker_Apart from software raid this gets used in Ilo RAID as well15:55
dtantsuryou probably mean another patch15:55
stendulker_Do you suggest to remove volume related args in the AgentRAID /15:55
dtantsurstendulker_: okay, this makes sense, but then you need to use the arguments you add15:56
stendulker_yes https://review.opendev.org/#/c/742217/215:56
patchbotpatch 742217 - ironic - Adds raid validation for in-band AgentRAID deploy ... - 2 patch sets15:56
dtantsuryep, you don't pass the arguments anywhere in https://review.opendev.org/#/c/742217/2/ironic/drivers/modules/agent.py15:56
stendulker_Yes, I understand that... It was a quick copy paste :(15:56
patchbotpatch 742217 - ironic - Adds raid validation for in-band AgentRAID deploy ... - 2 patch sets15:56
stendulker_Will add the argsinfo15:57
dtantsuryou need something like raid.filter_target_raid_config I guess?15:57
dtantsurthinking of it, why are these arguments needed?15:57
dtantsurstendulker_: a user provides the raid_config, why cannot they just provide only the bits they need?15:58
dtantsurnot a hard objection, just wondering15:58
stendulker_yes I will update the filter_target_raid_config15:58
stendulker_dtantsur: I agree with you. But in the base method we are supporting it and I think that is how implemented in DRAC as well, I suppose15:59
TheJuliaOkay, Opendev day #2 is a wrap15:59
*** ricolin has quit IRC16:02
stendulker_dtantsur: I think it makes sense to drop create_root_volume and create_nonroot_volumes16:03
stendulker_Will post the patch shortly.16:04
*** k_mouza has quit IRC16:04
*** k_mouza has joined #openstack-ironic16:05
*** strigazi has joined #openstack-ironic16:06
*** diurnalist has quit IRC16:10
*** bfournie has joined #openstack-ironic16:12
*** ociuhandu_ has joined #openstack-ironic16:17
openstackgerritShivanand Tendulker proposed openstack/ironic master: Adds raid validation for in-band AgentRAID deploy step  https://review.opendev.org/74221716:18
*** k_mouza has quit IRC16:20
*** ociuhandu has quit IRC16:20
openstackgerritRiccardo Pittau proposed openstack/sushy master: Update system schema to 1.10  https://review.opendev.org/74223616:21
*** ociuhandu_ has quit IRC16:21
TheJuliaWell this is a new one16:22
TheJuliahttps://www.irccloud.com/pastebin/pvmCchsR/16:22
rpittauoO16:24
dkingI have a couple questions about node cleaning. First, on https://docs.openstack.org/ironic/latest/admin/cleaning.html, it seems to list cleaning steps, but only lists the specific steps for the Management Interfaces. Is there a list of cleaning steps for Deploy, etc.?16:28
*** priteau has quit IRC16:28
*** k_mouza has joined #openstack-ironic16:28
TheJuliaI wonder if we broke API version negotiation :\16:35
TheJuliaI have to take my cat to the vet shortly, but I can dig into it afterwards16:35
TheJuliahttps://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_691/727480/7/gate/ironic-tempest-ipa-wholedisk-direct-tinyipa-multinode/691d51e/testr_results.html16:35
*** diurnalist has joined #openstack-ironic16:38
TheJuliadking: so, there are steps for deploy but we've also not ever been able to gain consensus as to how to programmatically present such data because of the dynamic nature combined with the chicken/egg problem that exists with such. Mainly they are part of the deployment process in newer ironic BUT! by default the clean steps available by default are erase_devices and erase_devices_metadata16:38
*** bfournie has quit IRC16:42
dkingTheJulia: Thanks for the answer. So, right now, other than steps done by Management Interfaces, I should just count on erase_devices and erase_devices_metadata ?16:43
TheJuliadking: yes, now if you have other steps in a hardware manager, I believe they all become "deploy" interface backed16:45
dkingAnother question, is there a way to hook to an external script? For instance, I want to be able to update the BMC and BIOS firmware using redfish, which doesn't seem to be an option listed for the interface. But if I could somehow trigger an external script, it would make it easier for me to manually add that to the cleaning process.16:45
TheJuliathe management ones are bmc/driver specific through the management interface in the hardware type16:45
*** k_mouza has quit IRC16:45
TheJuliaan external script running on the conductor?16:46
TheJuliaat present, no. I think the security risk there would really prevent it.... although we could likely come to agreement on local files only16:46
TheJuliaand it really wouldn't be much code I think16:46
*** JamesBenson has joined #openstack-ironic16:47
dkingI'm really looking for anything that I can do to trigger that sort of work. I'm assuming that it isn't included in the redfish driver because the option isn't universal enough, but that would be best. But other options would be things like having an option to run a shell script, or perhaps even send an API request.16:48
TheJuliaAutomated? every time cleaning is run? Generally we've recommended using a hardware manager in the agent because older BMCs were incapible of flashing all devices16:52
TheJuliaI think disk backplanes are the only thing that are not supported by most vendors these days16:52
TheJuliaAlthough some may have gotten that going16:52
TheJuliaI guess it also depends on the vendor, because there are some differences between vendors even with redfish, aiui16:53
dkingYeah. Even currently, I'm having trouble with the implementation from our vendor. However, I have a ticket in for that.16:54
TheJuliaso I guess kind of goes back to the manual or automatic16:56
dkingThe purpose for the update is the example that Jadon gave in the discussion earlier. If an attacker gains access to a server, they can modify the firmware, which then gets passed back into the pool when they are finished with it. So, to be clean, we need the firmware updated between each deployment.16:56
*** stendulker_ has quit IRC16:56
*** tenbrae has quit IRC16:56
dkingIdeally, I'd prefer if that could all happen anytime Ironic cleaning is triggered (going into available). Otherwise, I have to write a system in between that takes calls to remove a system, and then have that manually do the cleaning, and maybe have to find a way to flag the servers in Ironic so that we know they're being manually cleaned.16:58
*** derekh has quit IRC17:01
*** priteau has joined #openstack-ironic17:01
TheJuliayeah, so automatic17:03
TheJuliahmm17:03
TheJuliaIdeally this is the purpose of hardware managers17:03
TheJuliabut doing it out of band is... different17:03
* TheJulia wonders if driver_info could be populated with specific fields that could be read17:04
* TheJulia needs to take the elder cat to the vet, bbl17:04
*** yolanda has quit IRC17:10
*** yolanda has joined #openstack-ironic17:13
*** dtantsur is now known as dtantsur|afk17:15
dtantsur|afko/17:15
*** priteau has quit IRC17:21
*** klarin has joined #openstack-ironic17:27
*** Lucas_Gray has quit IRC17:40
klarinHi guys! We're on Train/CentOS7 and have an issue on BM instance deployment via nova: info in files under /var/lib/neutron/dhcp is missing for some nodes, and servers cannot PXE boot17:44
klarinFor some nodes though /var/lib/neutron/dhcp becomes populated and nodes successfully boot. Working and non-working nodes may be even assigned to the same conductor17:44
klarinWhat might be the cause?17:45
openstackgerritMerged openstack/ironic-inspector stable/train: Do not try to set local_gb to -1 when the matched root device size is 0  https://review.opendev.org/74193717:50
openstackgerritVerification of a change to openstack/ironic failed: Use native oslo.concurrency execution timeout in ipmitool  https://review.opendev.org/73859117:50
arne_wiebalckBye everyone o/17:51
*** diurnalist has quit IRC18:24
*** diurnalist has joined #openstack-ironic18:35
*** diurnalist has quit IRC18:39
*** rloo has quit IRC18:42
*** rloo has joined #openstack-ironic18:43
*** ociuhandu has joined #openstack-ironic18:48
*** diurnalist has joined #openstack-ironic18:49
*** ociuhandu has quit IRC18:53
*** ociuhandu has joined #openstack-ironic18:56
TheJuliaklarin: anything in the neutron logs? I think there was a neutron bug around then that caused dhcp updates to fail under certian conditions19:02
TheJuliaklarin: Which reminds me, there is also something on centos with dhcp....19:05
TheJuliaMaybe one of my fellow redhat folks remembers19:05
TheJuliaunfortunately I don't remember19:05
klarinTheJulia: I apologize, I was wrong, DHCP must be fine, since manually booted image gets proper hostname and address19:06
TheJuliaklarin: so funny thing about this bug19:07
TheJuliait is like a delay and dnsmasq doesn't pickup the configuration in time19:08
TheJuliaor maybe not at all until another change occurs19:08
klarinif I boot from local disk with centos, node receives the address, but then if I reboot PXE interface does not get it, so must be something with firmware or missing dhcp opts19:12
klarinnot sure why it works on other nodes, as they are all identical19:12
TheJuliaklarin: are the dhcp ops present on the port configuration and/or the actual dnsmasq config files?19:13
TheJuliaklarin: I ask because they are separate operations/interactions19:13
klarintag:port-<...>,150,192.168.<...>tag:port-<...>,210,/tftpboot/tag:port-<...>,66,192.168.<...>tag:port-<...>,67,EFI/bootx64.efitag:port-<...>,option:server-ip-address,192.168.<...>19:14
klarinsorry, no line breaks19:14
TheJuliaso the question is... if you hup dnsmasq or touch that file with another change... will the machine pxe boot with those options?19:15
TheJuliaanother thought, is the network interface in use actually enabled to pxe boot?19:15
klarinnot sure what options to add there, but manually loading EFI/bootx64.efi with tftp command works19:17
klarinand yes, PXE prompt for specific interface shows up19:17
klarinI'll try firmware update or boot using another interface19:18
klarinTheJulia: thank you so much for your response!19:18
TheJuliaklarin: has it bene over an hour?19:20
TheJuliain other words, has ironic purged the files from disk?19:21
klarindhcp data gets removed properly, I was probably looking in the wrong direction, expecting to see entries show up in leases file19:22
openstackgerritJulia Kreger proposed openstack/ironic master: Extend base build timeouts  https://review.opendev.org/74181819:25
TheJuliaAhh yeah, your basically saying when the bmc is asked to network boot, it just doesn't work19:25
TheJuliaand I guess your manually asking from uefi shell?19:26
klarinyes, it's probably firmware that does not work19:29
klarindid not try anything from uefi shell19:30
TheJuliacould the default pxe interface in the firmware config be wrong?19:30
klarinno, it actually tries all interfaces one after another, and also displays its MAC19:31
TheJuliaso identically configured machines?19:33
klarinyes, there are 4 intel interfaces: 2 built-in 10G, 2 add-in 40G, and at least 2 nodes boot OK from 40G19:36
klarinbb thank you again19:51
*** klarin has quit IRC19:51
openstackgerritJulia Kreger proposed openstack/ironic-python-agent-builder master: Strip some excess from the centos dib image  https://review.opendev.org/74181919:58
openstackgerritJulia Kreger proposed openstack/ironic-inspector master: Respond so Apache HTTPd doesn't think the request failed  https://review.opendev.org/74034420:20
openstackgerritJulia Kreger proposed openstack/ironic master: Guard conductor from consuming all of the ram  https://review.opendev.org/72648320:30
*** lbotell has quit IRC20:49
*** JamesBenson has quit IRC21:12
openstackgerritMerged openstack/sushy-cli master: Set min version of tox to 3.2.1  https://review.opendev.org/74118921:32
*** tkaprol has joined #openstack-ironic21:37
openstackgerritMerged openstack/ironic-inspector master: Set min version of tox to 3.2.1  https://review.opendev.org/74118621:42
*** tkaprol has quit IRC21:45
openstackgerritVerification of a change to openstack/ironic failed: Stop running test_schedule_to_all_nodes in the multinode job  https://review.opendev.org/74221421:45
*** Goneri has quit IRC21:50
openstackgerritMerged openstack/bifrost master: Support extra-vars in bifrost-cli  https://review.opendev.org/74024621:56
*** rcernin_ has joined #openstack-ironic22:22
*** rcernin_ has quit IRC22:22
jandersgood morning / evening Ironic! o/22:26
*** rcernin has joined #openstack-ironic22:34
*** bnemec has quit IRC22:47
openstackgerritSteve Baker proposed openstack/ironic-inspector master: Add public api support to basic auth  https://review.opendev.org/74230622:53
*** tkajinam has joined #openstack-ironic22:55
*** Goneri has joined #openstack-ironic22:59
TheJuliagood morning janders23:12
TheJuliaSo question folks. Dmitry wrote support in Train to bounce the node if it had not checked-in after PXE for a while. I'm wondering if we should go through the process of backporting that feature as one of those "generally this will make lives better" sort of thing23:13
jandersTheJulia just to make sure I'm undestanding this correctly:  don't we have something similar in place already (or is it introspection only)?23:18
openstackgerritSteve Baker proposed openstack/bifrost master: Support HTTP basic auth and switch bifrost-cli to it  https://review.opendev.org/74196423:18
TheJuliathere are two levels23:18
jandersI remember my nodes recovering from intermittent portfast issues on their own23:18
TheJuliathere is the rendered template "please retry"23:18
TheJuliaand then there is a "kick the node again" if it doesn't check-in with-in x amount of time23:19
jandersright!23:19
jandersin general this is a useful feature for sure. Is there a config option to match allowing the operator to disable it?23:21
TheJuliayes, and it is disabled by default23:21
janders9/10+ I think it will help. Just thinking of that rare case where the operator is troubleshooting something so doesn't want the node to reboot in the middle - or if the environment has some rare hard to reproduce issue and the operator wants to know when things go wrong, as opposed to having the system auto-recover23:22
jandersI think it is useful, might prevent a fair bit of support tickets from being raised.23:22
jandersas much as pxe booting and building nodes should work 100/100 my experience is it rarely does and often this is due to hardware being non-deterministic23:23
openstackgerritJulia Kreger proposed openstack/ironic stable/stein: Allow retrying PXE boot if it takes too long  https://review.opendev.org/74231323:23
jandersthis is a good way to relieve the pain caused by this and it will likely be appreciated by more conservative folk running older long-life releases23:24
TheJuliahmm. to go beyond rocky is going to require manually doing so23:24
TheJuliamaybe something for tomorrow23:24
TheJuliaYeah, I'm actually looking at an internal ticket thinking "if this was present, it would have likely prevented a lot of this"23:25
jandersthis makes me appreciate more why we are putting effort into vmedia-based builds :)23:28
TheJulia++23:29
*** Goneri has quit IRC23:36
*** iurygregory has quit IRC23:42

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!