Friday, 2023-07-28

gmann: kopecmartin: https://review.opendev.org/c/openstack/tempest/+/884804 [00:03]
gmann: kopecmartin: and this unblocks the nova-lvm job https://review.opendev.org/c/openstack/tempest/+/889895 [00:04]
opendevreview: Ghanshyam Mann proposed openstack/tempest master: Test Nova and Glance RBAC policy old defaults https://review.opendev.org/c/openstack/tempest/+/884764 [00:18]
frickler: gmann: kopecmartin: there are timeouts for tools/generate-tempest-plugins-list.py, I think this was mentioned earlier, but not sure what the issue is there [07:57]
opendevreview: Jakub Skunda proposed openstack/tempest master: [WIP] Catching broken tests in tempest-full-test-account-* jobs https://review.opendev.org/c/openstack/tempest/+/889683 [09:32]
*** gthiemon1e is now known as gthiemonge [09:52]
opendevreview: Merged openstack/tempest master: Disable dhcpcd in test_port_security_macspoofing_port https://review.opendev.org/c/openstack/tempest/+/889713 [11:13]
*** gibi is now known as gibi_pto [16:15]
gmann: kopecmartin: this is ready too https://review.opendev.org/c/openstack/tempest/+/884764 [17:35]
gmann: frickler: yeah, locally it takes less than 3 min, but we observed that on a particular provider, rax, and it might be a slow node case? this is not just the plugin generate script; we see that timeout in the doc job also, and on the rax provider [17:37]
JayF: TheJulia: ^ gmann frickler: I'll note that we're seeing slower than usual nodes causing failures in Ironic this morning as well [17:40]
gmann: JayF: is it always (most of the time) the rax provider? [17:42]
* JayF looks at the one glaring example we've been digging into [17:42]
JayF: yes [17:42]
JayF: rax [17:42]
TheJulia: it kind of felt like someone paused the vm on that one, but nothing stood out as a solid point in time [17:42]
TheJulia: two specific spans where things just went weirdly sideways [17:43]
gmann: humm, we might need to check this provider, as in tempest we see the doc/plugin sanity jobs, which used to take <20 min, timing out a lot nowadays [17:43]
TheJulia: we *really* need a generalized failure-rate-by-provider graph [17:44]
TheJulia: it feels like it could be a shame thing, but if we're consistently seeing a higher failure rate on a provider, there is a reason we need to understand [17:44]
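Note: the per-provider failure rate TheJulia asks for could be computed roughly as follows. This is only a minimal sketch, assuming build records are already available as (provider, result) pairs. The function name, the record shape, and the sample values are all hypothetical; they are not taken from any Zuul API or from real CI data.

```python
from collections import defaultdict


def failure_rate_by_provider(builds):
    """Return {provider: failed / total} for (provider, result) pairs.

    Assumption: any result other than "SUCCESS" counts as a failure,
    which lumps timeouts together with ordinary test failures.
    """
    totals = defaultdict(int)
    failures = defaultdict(int)
    for provider, result in builds:
        totals[provider] += 1
        if result != "SUCCESS":
            failures[provider] += 1
    return {provider: failures[provider] / totals[provider] for provider in totals}


# Made-up sample records, purely illustrative; not real CI data.
sample = [
    ("rax", "TIMED_OUT"),
    ("rax", "SUCCESS"),
    ("ovh", "SUCCESS"),
    ("ovh", "FAILURE"),
    ("ovh", "SUCCESS"),
    ("ovh", "SUCCESS"),
]

for provider, rate in sorted(failure_rate_by_provider(sample).items()):
    print(f"{provider}: {rate:.0%} failed")
```

A rate like this only becomes meaningful over many builds of the same job, since providers may run different mixes of jobs.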
TheJulia: wow, a docs job on ovh failed the other day... the same task took like 2.5 minutes on my desktop; there it never finished and had a runtime of ~7 minutes to get halfway along [17:55]
TheJulia: 3 minutes to do basic bindep install too [17:58]
dansmith: TheJulia: that's pretty much what we're seeing.. yuuuge delays in the middle of a run [18:17]
dansmith: I'm highly suspicious of major IO problems or throttling going on there [18:17]
dansmith: the docs timeout is a different thing, I think.. it has to survey all the openstack namespace projects and I think it gets stuck doing that sometimes, probably for network reasons [18:18]
dansmith: gmann: I thought scenario tests ran single-threaded? [18:24]
dansmith: or did you change that recently? [18:24]
gmann: dansmith: in tempest-slow, the slow scenario tests run in parallel, this is what i changed, but in tempest-full (and inherited jobs) it is all single threaded. [18:25]
dansmith: the nova-next job is running scenario with everything else (and thus multi-threaded) [18:26]
dansmith: 2023-07-28 18:11:22.355120 | controller | {0} setUpClass (tempest.scenario.test_volume_migrate_attached.TestVolumeMigrateRetypeAttached) ... SKIPPED: Cinder multi-backend feature disabled [18:26]
dansmith: 2023-07-28 18:12:08.098934 | controller | {1} tempest.scenario.test_volume_boot_pattern.TestVolumeBootPattern.test_volume_boot_pattern [230.835610s] ... ok [18:26]
gmann: yes that is multi threaded https://github.com/openstack/nova/blob/master/.zuul.yaml#L349 [18:27]
gmann: but it excludes a few of the network ones [18:27]
dansmith: okay [18:27]
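Note: the two job-output lines dansmith pasted show different numbers in braces ({0} and {1}), which is how the parallel run can be spotted. The following is a minimal sketch of that check, assuming the braced number after the host column is the test worker id; the regular expression and the helper name are hypothetical and written only for this excerpt.

```python
import re

# Assumption: a "{N}" token right after the "| host |" columns is the worker id.
WORKER_RE = re.compile(r"\|\s*\{(\d+)\}\s")


def workers_seen(lines):
    """Collect the distinct worker ids found in job-output lines."""
    workers = set()
    for line in lines:
        match = WORKER_RE.search(line)
        if match:
            workers.add(int(match.group(1)))
    return workers


# The two lines quoted in the log above.
lines = [
    "2023-07-28 18:11:22.355120 | controller | {0} setUpClass "
    "(tempest.scenario.test_volume_migrate_attached.TestVolumeMigrateRetypeAttached)"
    " ... SKIPPED: Cinder multi-backend feature disabled",
    "2023-07-28 18:12:08.098934 | controller | {1} "
    "tempest.scenario.test_volume_boot_pattern.TestVolumeBootPattern"
    ".test_volume_boot_pattern [230.835610s] ... ok",
]

ids = workers_seen(lines)
print(sorted(ids), "parallel" if len(ids) > 1 else "single worker in this excerpt")
```

More than one id means the tests were spread over several workers. Seeing a single id in a short excerpt does not by itself prove the run was single-threaded.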
gmann: we are trying to run all of them in parallel in tempest-full-parallel but seeing some issues [18:28]
dansmith: yeah [18:28]
TheJulia: dansmith: well, yes if all the docs are being built sure, but as a single project example that seems a bit... troublesome when in the past we've seen that job take no more than 10-15 minutes [18:28]
dansmith: I just didn't expect it in this job but sounds like it's intentional [18:28]
dansmith: TheJulia: I'm just saying I've been seeing it fail a lot recently with a timeout, which it really shouldn't [18:29]
TheJulia: dansmith: agree completely [18:29]
melwitt: hah, came here to ask about an openstack-tox-docs job timeout [18:48]
melwitt: seemingly no reason for it that I can see in the job-output.txt [18:49]
dansmith: no earthly reason [18:51]
dansmith: my high school calculus teacher used to prefix every penultimate step in a problem with "and if you've been living your life right...." [18:52]
melwitt: haha, awesome [18:59]
melwitt: I feel like our CI is haunted [19:10]
dansmith: it's definitely not healthy lately [19:12]
dansmith: haunted is probably easier to reconcile [19:13]
melwitt: I guess it is named zuul after all. need to call the ghostbusters [19:13]
gmann: the ghost becomes more active when we try to fix it :) [19:32]
dansmith: maybe I should watch the movie again this weekend and take notes to apply to our activities next week [19:35]
opendevreview: Merged openstack/tempest master: Increase the default concurrency for tempest run https://review.opendev.org/c/openstack/tempest/+/887220 [19:58]
gmann: finally it is merged [20:12]
melwitt: OMG a thing merged [20:12]
gmann: :) [20:13]
gmann: seems the ghost is sleeping [20:13]
melwitt: ghostbuster talk must have scared them away momentarily [20:13]
gmann: yeah, we should add that comment during recheck [20:15]
melwitt: haha :) [20:16]
JayF: Somebody called Whoopi :) [20:42]
opendevreview: Merged openstack/tempest master: Test Nova and Glance RBAC policy old defaults https://review.opendev.org/c/openstack/tempest/+/884764 [22:06]
opendevreview: Merged openstack/tempest master: Default to /tmp for scenario (create|get)_timestamp() https://review.opendev.org/c/openstack/tempest/+/884804 [22:38]
melwitt: heyohhh [22:40]
