Thursday, 2023-06-29

opendevreviewJulia Kreger proposed openstack/ironic master: CI: minor fix to irmc driver clean_up_instance testing  https://review.opendev.org/c/openstack/ironic/+/88723900:31
TheJuliawould anyone object to us just reverting the 30 second time limit on tests, or force merging anything to change it?00:39
iurygregoryI don't 00:41
TheJuliamaybe my perception is a red herring, but we see wild performance shifts00:42
TheJuliaand the timeout itself seems blocked on freaky weirdness with irmc tests (one of which, seems weird, the other... I think is just a lacking mock)00:42
iurygregorymaybe reverting the changes we added specific values to see how CI will behave?00:59
TheJuliaperhaps01:03
TheJuliaI'm wondering if we just disable the unit with driver 01:03
iurygregorymakes sense to me01:04
TheJuliaiurygregory: https://review.opendev.org/c/openstack/ironic/+/88698501:08
TheJuliait is v+1 from CI01:08
TheJuliaif that fails, I'll disable driver libs job01:08
TheJuliabut only makes sense for someone else to wf it01:08
iurygregoryon it, let me grab my phone :D01:09
TheJuliaThanks!01:27
iurygregoryyw01:30
opendevreviewVerification of a change to openstack/ironic master failed: CI: Change migrations timeout to be >60 seoncds  https://review.opendev.org/c/openstack/ironic/+/88698501:32
opendevreviewVerification of a change to openstack/ironic master failed: CI: Change migrations timeout to be >60 seoncds  https://review.opendev.org/c/openstack/ironic/+/88698501:45
iurygregoryseems like now I can access gerrit \o/01:49
iurygregorythe change failed .-.01:49
iurygregoryagain the IRMCVirtualMediaBootTestCase test >.<01:51
opendevreviewJulia Kreger proposed openstack/ironic master: CI: minor fix to irmc driver clean_up_instance testing  https://review.opendev.org/c/openstack/ironic/+/88723901:51
TheJuliaIt is just bizarre02:00
opendevreviewJulia Kreger proposed openstack/ironic master: Execute tests by class, not randomly  https://review.opendev.org/c/openstack/ironic/+/88716602:03
opendevreviewMerged openstack/ironic master: CI: minor fix to irmc driver clean_up_instance testing  https://review.opendev.org/c/openstack/ironic/+/88723902:35
TheJuliaSerious, whiskey tango02:53
TheJuliaYa know, they are all py310. I don’t have 3.1003:01
JayFI have py310 on my local box, are you doing anything special to repro?03:22
JayFmaybe toss it in an etherpad and I'll try it in the morn03:22
JayFor literally anywhere03:22
ashinclouds[m]Well, just off the driver lib one fails so readily03:23
JayFhell, I have py311 too03:24
JayFlets break and fix all the things 03:24
JayFbetter than fixing and fixing and fixing and it's still broken03:24
JayFI'll get a driver libs env setup tomorrow and give it a shot03:24
ashinclouds[m]It is just a rod command, fwiw03:25
ashinclouds[m]Rod03:25
TheJuliaTox03:25
* TheJulia glares at autocorrect of the other app03:25
* TheJulia returns to fire while awaiting ice cream03:26
TheJuliahttps://usercontent.irccloud-cdn.com/file/7Vyr1Jgf/1688009194.JPG03:26
JayFyeah I'd JFDI if I wasn't on my windows desktop, about to cut off for the night03:31
opendevreviewHuy Mai proposed openstack/sushy-tools master: Add fake_ipa inspection, lookup and heartbeater to fake system  https://review.opendev.org/c/openstack/sushy-tools/+/87536605:27
rpittaugood morning ironic! o/07:01
opendevreviewVerification of a change to openstack/ironic master failed: Use jammy for base jobs  https://review.opendev.org/c/openstack/ironic/+/86905207:40
opendevreviewVerification of a change to openstack/ironic master failed: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550708:00
opendevreviewVerification of a change to openstack/ironic master failed: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550708:13
rpittauok, at least it's a failure and not a timeout08:19
rpittauoh I see, it's the problem with the foreign keys pymysql.err.OperationalError: (3730, "Cannot drop table 'nodes' referenced by a foreign key constraint 'bios_settings_ibfk_1' on table 'bios_settings'.")08:24
rpittauJayF, TheJulia, not sure where you were with this ^08:41
opendevreviewMerged openstack/ironic-python-agent stable/train: Use utf-16-le if BOM not present  https://review.opendev.org/c/openstack/ironic-python-agent/+/86120009:21
opendevreviewDmitry Tantsur proposed openstack/ironic master: Add the initial skeleton of the agent inspect interface  https://review.opendev.org/c/openstack/ironic/+/87781410:06
opendevreviewDmitry Tantsur proposed openstack/ironic master: [WIP] Very basic in-band inspection with the "agent" interface  https://review.opendev.org/c/openstack/ironic/+/88545010:06
opendevreviewVerification of a change to openstack/ironic master failed: Use jammy for base jobs  https://review.opendev.org/c/openstack/ironic/+/86905211:13
iurygregorygood morning11:20
iurygregoryrpittau, I don't think we saw this (we saw the one for allocation and node)11:20
iurygregoryI'm still waking up, but I can push a patch for this11:23
iurygregoryunless you want to do it =) 11:23
* iurygregory brb more coffee required 11:23
rpittauiurygregory: thanks, go ahead, I will approve it11:36
* iurygregory is awake now11:58
iurygregoryrpittau, oh I think this was a different thing, I was thinking it was a warning, but is an actual error right?11:58
rpittauyep11:59
iurygregorydo you have a link from the failure?11:59
iurygregoryhttps://github.com/openstack/ironic/commit/402c32094b8003db6186f17742a97925405657a611:59
rpittauhere https://review.opendev.org/c/openstack/ironic/+/88550711:59
iurygregoryI thought this approach from the commit would help 11:59
rpittauthat's for alloactions, I think this is different11:59
iurygregoryyeah, it was just a warning, I though we had the same thing12:00
iurygregorybut is an actual error12:00
dtantsurWhy, WHY does our BadRequest exception has HTTP code 500???12:31
iurygregory'-'12:31
dtantsursame Unauthorized. WUT.12:32
dtantsurfortunately, nothing uses the latter12:32
opendevreviewDmitry Tantsur proposed openstack/ironic master: Fix the HTTP code of the BadRequest exception  https://review.opendev.org/c/openstack/ironic/+/88727112:38
dtantsurthis is funny, I wonder what is affected ^^12:38
rpittaudtantsur: why removing Unauthorized?12:44
dtantsurrpittau: it's not used and has a wrong code12:44
dtantsur(should have mentioned in the commit message, hold on)12:44
rpittauyeah, I asked as it was not mentioned :)12:44
opendevreviewDmitry Tantsur proposed openstack/ironic master: Fix the HTTP code of the BadRequest exception  https://review.opendev.org/c/openstack/ironic/+/88727112:46
dtantsurrpittau: should be better now ^^12:46
rpittauthanks!12:49
opendevreviewDmitry Tantsur proposed openstack/ironic master: Correct two mistakes in the /continue_inspection API  https://review.opendev.org/c/openstack/ironic/+/88720212:51
dtantsurHeads-up (cc JayF): metal3 now uses our ramdisk images from tarballs.o.o. Please be gentle :)13:09
iurygregorymetal3-integration is broken? .-.13:27
dtantsurI've seen one failure one pass13:27
iurygregoryit failed on https://review.opendev.org/c/openstack/ironic/+/883062,node-1 introspection completed but the node is not available .-.13:29
iurygregoryplaning to recheck, but will avoid since I still need feedback =)13:30
rpittauiurygregory: I haven't seen failures in other patches13:39
opendevreviewVerification of a change to openstack/ironic master failed: Use jammy for base jobs  https://review.opendev.org/c/openstack/ironic/+/86905213:44
iurygregoryrpittau, https://zuul.opendev.org/t/openstack/builds?job_name=metal3-integration&project=openstack/ironic13:48
opendevreviewVerification of a change to openstack/ironic master failed: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550713:48
iurygregorythe last two failures are identical, haven't look at the other ones13:48
JayFfwiw those tox-with-driver-libs tests are passing pretty regularly on my box15:15
JayFI'm rerunning them over and over but, yeah15:15
JayFthat was on 3.11, doing again on 3.1015:17
rpittauJayF: not passing in CI, we're gtting random failures from migration tests15:20
rpittauexample https://9744b8fb79d09b5cacd8-8985f81df699324d9133cf83275f9aa5.ssl.cf1.rackcdn.com/885507/6/check/ironic-tox-unit-with-driver-libs/b96489b/job-output.txt15:20
JayFyeah15:21
TheJulialets get one of the tox machines held15:21
JayFI'm not sure how much capacity I'll have to help with that today15:21
TheJuliaspecifically from the driver test15:21
TheJuliaI have meetings, but I can help in the background15:21
JayFwoke up sick but trying to make it thru :| 15:21
TheJuliaif I have a machine where I can proroduce maybe15:21
JayFI just need the sudafed to kick in so my head stops throbbing15:21
TheJuliaeek :(15:21
rpittauI've seen failures in py38, tried to reproduce locally to no avail15:21
rpittauso failure seems legit15:23
rpittauand it's not only on driver-libs15:24
JayFI wonder if my change made it worse15:25
JayFlooking at the failures now vs before15:25
TheJuliawe're chasing issues where things are dramatically worse based upon when in the day cycle15:47
TheJuliawhen CI is more heavily loaded, things fail more horrifically and frequently15:47
TheJulialater in the day, not so much15:47
dtantsurnoisy neighbors?15:48
JayFI mean, yeah, very, but that doesn't mean our tests shouldn't run at any speed15:49
JayFI think a fix that goes the direction of "our jobs aren't running fast enough" is likely to get nowhere :/15:49
TheJuliaagree15:49
TheJuliaI think we need to slow them down15:49
dtantsurWe could potentially split migration tests into a new job15:50
TheJuliahttps://review.opendev.org/c/openstack/ironic/+/887166 seems like a reasonable first step to isolate things down by class15:50
TheJuliawe *could* also do that... I think15:50
TheJuliaWe likely should15:50
* TheJulia pulls up the stestr docs15:51
JayFI think that's not a bad idea, but I also am highly suspicious it'd make any difference15:51
dtantsurat the very least, it will allow us to make it non-voting (and avoid merging any db changes) until we figure out15:51
rpittauTheJulia: I don't mind merging that, I'll rebase mine 15:51
dtantsurbut if we somehow end up choking up mysql, it may help as well15:52
opendevreviewRiccardo Pittau proposed openstack/ironic master: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550715:53
TheJuliaso "stestr run --exclude-regex -- WhateverTheMySQLClassNameIs"15:54
TheJuliaand we can do the inverse sort of like the driver one15:54
dtantsurOr literally move the files to another subdirectory15:56
TheJuliaso the test runner can't find it?15:56
dtantsurat least that's how some projects distinguish unit and functional tests15:57
TheJulia... hmmm15:57
TheJuliaI'm not sure how that would look pratically, but I'm trying to sort the regex now15:57
dtantsurlike https://opendev.org/openstack/openstacksdk/src/branch/master/tox.ini#L5615:57
dtantsursee you tomorrow folks o/15:59
TheJuliaokay, I'd kind of like us to work towards fixing this eventually and moving files seem like we're admitting some level of defeat15:59
TheJuliabut... dunno15:59
* TheJulia is running an exclude test now15:59
rpittaugood night! o/16:00
dtantsurI feel like these are actually a different kind of tests16:00
dtantsurwe don't even run them normally locally16:00
JayFI'm going to step away for a bit, I'm fairly miserable and am going to try and get un-miserable so I can be useful later today o/16:00
* dtantsur goes for real16:00
TheJuliaI did at home...16:00
dtantsuroh get better JayF!16:00
* TheJulia waits for tox to return16:00
opendevreviewMerged openstack/ironic master: Execute tests by class, not randomly  https://review.opendev.org/c/openstack/ironic/+/88716616:12
* TheJulia dances16:12
opendevreviewJulia Kreger proposed openstack/ironic master: Unit tests: Isolate mysql test migrations  https://review.opendev.org/c/openstack/ironic/+/88729716:16
TheJuliaso the other issue boils back to what rpittau was encountering, the unwind breaking down16:16
TheJuliawhich is what Jay and I were looking at yesterday16:16
TheJulianone of that explains the irmc virtual media failures16:16
TheJuliawhich are the most bizzare failures known to human kind16:17
opendevreviewJulia Kreger proposed openstack/ironic master: Unit tests: Isolate mysql test migrations  https://review.opendev.org/c/openstack/ironic/+/88729716:23
opendevreviewMohammed Boukhalfa proposed openstack/sushy-tools master: Add fake_ipa inspection, lookup and heartbeater to fake system  https://review.opendev.org/c/openstack/sushy-tools/+/88729916:27
opendevreviewMohammed Boukhalfa proposed openstack/sushy-tools master: Add fake_ipa inspection, lookup and heartbeater to fake system  https://review.opendev.org/c/openstack/sushy-tools/+/87536616:32
opendevreviewVerification of a change to openstack/ironic master failed: Use jammy for base jobs  https://review.opendev.org/c/openstack/ironic/+/86905216:40
opendevreviewMerged openstack/ironic master: Use tox env variables in coverage tests  https://review.opendev.org/c/openstack/ironic/+/88550716:48
iurygregory_woot16:58
*** iurygregory_ is now known as iurygregory16:58
iurygregoryTheJulia, we are still having irmc test failures even after your fix? .-.16:59
TheJuliawell, attempted one of the tests, the other one everything was mocked17:00
TheJuliait makes zero sense that they both still fail sporatically17:00
iurygregory(╯°□°)╯︵ ┻━┻17:00
TheJuliayes. Very yes.17:01
iurygregoryTheJulia, RHOSP 16.1 is train right?17:23
TheJuliaaye17:23
iurygregoryhave you seen any case where the information from inspection is wrong? like the serial and by_path ?17:25
TheJuliaby path is dependent upon initialization order on some machines, but that is basically the case17:25
TheJuliaand that is driver and bus order id17:25
TheJuliait varies 17:26
iurygregorythe fact the person is saying that idrac shows a different serial is making me wonder if they have outdated firmware17:31
iurygregoryR750... .-.17:32
TheJuliadifferent device serial numbers?17:59
iurygregoryyeah, the one we report from all disks https://paste.opendev.org/show/bCYOGcOfCKdkkGxSN8gW/ 18:54
TheJuliaWUT https://8b0ae7fc14ab169ab4ff-19e17e5db2abd0e280dd7dd93f01d60e.ssl.cf2.rackcdn.com/887297/2/check/ironic-tox-unit-with-driver-libs/83243cc/testr_results.html20:01
TheJuliaiurygregory: so, I'd almost ask for someone to power th emachine down, take photos of the disk and lets see who iw right20:03
TheJuliawe mgiht be seeing an adapter20:03
TheJuliadunno!20:03
iurygregoryTheJulia, yeah I told the person to try to access the machine and check21:30
iurygregorynot WUT but WTF re https://8b0ae7fc14ab169ab4ff-19e17e5db2abd0e280dd7dd93f01d60e.ssl.cf2.rackcdn.com/887297/2/check/ironic-tox-unit-with-driver-libs/83243cc/testr_results.html21:31
TheJuliayeah21:39
TheJuliathe only way I could see that happen is if the db base test class was not done initalizing21:40
TheJuliadunno, maybe something like this....21:53
opendevreviewJulia Kreger proposed openstack/ironic master: DNM check and re-trigger db init for testing  https://review.opendev.org/c/openstack/ironic/+/88734321:53
TheJulialaying down for a little bit to let Ci think on that one22:01
*** dtantsur_ is now known as dtantsur23:50

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!