Friday, 2025-07-25

iurygregorysnmp-pxe job seems unhappy today... a lot of failures00:57
iurygregoryhttps://zuul.opendev.org/t/openstack/builds?job_name=ironic-tempest-ramdisk-bios-snmp-pxe&project=openstack/ironic 00:57
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521102:08
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521102:20
opendevreviewKaifeng Wang proposed openstack/sushy master: PCIeDevice and PCIeFuntion  https://review.opendev.org/c/openstack/sushy/+/94704705:24
opendevreviewKaifeng Wang proposed openstack/sushy master: PCIeDevice and PCIeFuntion  https://review.opendev.org/c/openstack/sushy/+/94704705:28
opendevreviewVerification of a change to openstack/ironic master failed: Clear `last_error` on power match/sync  https://review.opendev.org/c/openstack/ironic/+/95543206:50
rpittauTheJulia: thanks for checking! I was lost :/06:56
rpittaugood morning ironic! happy friday! o/06:57
opendevreviewMerged openstack/ironic-python-agent master: Hint at sector sizes when reporting an invalid written image  https://review.opendev.org/c/openstack/ironic-python-agent/+/95449807:10
queensly[m]Good morning o/08:02
abongaleGood morning Ironic o/08:22
masgharThanks for taking a look at the grenade job, TheJulia08:41
mumesan[m]Good morning!10:50
FreemanBoss[m]o/10:50
iurygregoryI'm wondering if neutron broke us "neutron-metadata-agent: no process found" :eyes:11:42
iurygregoryhttps://zuul.opendev.org/t/openstack/build/186edb85a98c4e2bb06c615e8bece800/log/controller/logs/screen-q-ovn-metadata-agent.txt this doesn't look good :D11:43
opendevreviewMithun Krishnan Umesan proposed openstack/networking-generic-switch master: Autogenerate list of NGS compatible devices  https://review.opendev.org/c/openstack/networking-generic-switch/+/95579812:02
iurygregoryI saw some  ovs-vsctl: unix:/var/run/openvswitch/db.sock: database connection failed (Permission denied) in the logs, wondering if is a neutron side issue or if we need to fix something in how we call ovs...12:22
rpittauiurygregory: I think this is a question for neutron directly, either way12:39
rpittauthey can probably at least clarify the metadata agent behavior12:39
TheJuliaregarding the neutron job, the l3 agent is failing and ultimately the upgrade test fails on verification of l3 traffic movement13:01
TheJuliaerr, grenade job13:02
* TheJulia starts the coffee sipping13:02
iurygregoryrpittau, agree, I've asked in their channel already13:07
TheJuliahttps://review.opendev.org/c/openstack/neutron/+/945497 <-- here's our problem :\13:19
iurygregoryso, do we need to bump versions on our side?13:19
iurygregorythis is the grenade problem right?13:19
TheJuliano, re-consulting the code13:20
TheJuliaso, at a glance it looks like neutron's goes sideways when 1.4 is requested13:21
rpittauyay13:23
iurygregory<insert it's fine gif>13:28
TheJuliahttps://review.opendev.org/c/openstack/neutron/+/945497/8/neutron/agent/rpc.py#10513:30
TheJuliaI added a comment13:30
TheJuliaI think we need to basically add logic to downgrade the version13:31
TheJuliaand or just revert13:31
TheJuliabut, this is a super long standing bug13:31
TheJuliaso if memory serves, grenade intentionally puts neutron in a half upgraded state13:33
TheJuliaand we're seeing the sign of that13:33
TheJulialets shift the discussion over to neutron, if there is a open bug someplace then should be relatively easy to write a fix13:34
iurygregoryTheJulia, I'm going to push a patch to move the snmp job to non-voting14:06
TheJuliaiurygregory: I'm already working on one for it and the grenade job14:06
iurygregoryoh ok14:07
TheJuliasince fixing it won't be quick14:07
iurygregoryhappy to review when you finish14:07
TheJuliaokay, shouldn't be much longer, I'm doing the needful paperwork w/r/t neutron and grenade14:12
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521114:24
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521114:26
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521114:29
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521114:35
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521114:38
opendevreviewJulia Kreger proposed openstack/ironic master: ci: mark failing jobs non-voting pending fixes/removal  https://review.opendev.org/c/openstack/ironic/+/95591615:07
TheJuliaiurygregory: ^^^15:10
TheJuliatook a little while since it doesn't seem like an actual code fix to neutron is needed but our job config is out of date15:10
iurygregoryTheJulia, ack15:14
iurygregorygoing to grab lunch, will review when I'm back15:14
TheJuliaok15:14
opendevreviewNahian Pathan proposed openstack/ironic master: Reduce API calls when collecting sensor data with redfish  https://review.opendev.org/c/openstack/ironic/+/95548415:15
cardoeTheJulia: do we need to patch neutron to fix the RPC issue?15:20
TheJuliano15:20
TheJuliathe logs show the upgrade sequence is being violated15:21
TheJuliai.e. rpc stuffs not getting restarted before end rpc services15:21
cardoeAll of a sudden I see the stream of convo in neutron now15:21
TheJuliaend services are getting restarted which is why it is blowing up15:21
opendevreviewNahian Pathan proposed openstack/sushy master: Support expanded Chassis and Storage for redfish  https://review.opendev.org/c/openstack/sushy/+/95521115:33
cardoeSo need some ideas here with the sushy mocking in ironic16:53
cardoeSo I'm trying to avoid duplicating code and I'd really also like to take out as much mocking as possible cause I swear everytime I fix a bug it's cause someone wrote the mock and passed the tests but the mock doesn't operate like the actual code/interface.16:53
TheJuliaoh yes, that is totally a thing16:54
cardoeSo we mock get_members() for Sushy but we also mock the summary property. The summary property is something we put in sushy that just calls get_members() and does some stuff.16:54
TheJuliait would be easier to look at it, fwiw16:55
cardoeThe issue is that if I try to use real sushy objects we've got this cache_it() thing in Sushy which interferes with the tests. The cached value is returned back in some cases instead of changed data.16:55
cardoehttps://review.opendev.org/c/openstack/ironic/+/95553616:55
cardoeFirst file... test_inspect.py16:55
cardoeI'm gonna duplicate and clean it up so it works.16:57
cardoeBut wondering if we should move away from using summary in Ironic16:57
TheJuliabut because we're mocking parts where there is still underlying sushy code we're calling, we're getting odd results16:58
TheJuliabecause we're actually then testing sushy in ironic16:58
TheJulianot testing ironic with sushy16:58
cardoeOr if we should do something like running a sushy emulator with real data and letting the tests hit a real redfish endpoint16:58
TheJuliaeh, no16:58
TheJuliathats just a lot of complexity16:58
cardoeyeah16:58
TheJuliaand then what are we *really* testing16:58
cardoeWell that's a great question16:59
cardoeWe're trying to test that the "shape" of the data we upload in the inspector is the right shape.16:59
cardoeWhich goes back to my other idea around the shape.17:01
cardoeIf we used json schema and validated it, all these tests don't matter.17:02
TheJuliawe're testing that the inspect method does what we expect with certian data and cases we get from the driver, the issue is likely the shape of our sushy interactions are wrong for testing, or we're testing at a too high level when we need to decompose it17:02
TheJuliaugh17:05
TheJuliainit_system_mock wow17:05
TheJuliahmm17:06
iurygregoryomg CI is busted? POST_FAILURE17:17
cardoeSo yeah TheJulia you see what I mean... we're not really testing anything good. But the question is... what do we want to test?19:18
cardoeI think that nailing down what inspection data should look like would go a long way.19:19
cardoeI mean I just saw a patch from John Garbutt that points out a field we're not correctly populating.19:19
TheJuliaWell, each driver and usage is going to do what is felt to be right. That is going to vary by driver and it should. It’s like the redfish driver should test/represent based upon what it knows. The challenge is it’s being done as a giant monolithic thing and likely need to be decomposed into smaller/distinct chunks20:20
TheJuliaGet data, do needful to data, save/commit data20:20
TheJuliaregarding our work to eradicate eventlet: Please review this blog post https://github.com/OpenStackweb/ironic-website/pull/62/files20:40
opendevreviewMerged openstack/ironic master: Drop remaining reference to openSUSE  https://review.opendev.org/c/openstack/ironic/+/95558120:57

Generated by irclog2html.py 4.0.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!