Thursday, 2021-01-14

*** tosky has quit IRC00:36
openstackgerritArtom Lifshitz proposed openstack/whitebox-tempest-plugin master: [WIP] Different approach to TripleO job  https://review.opendev.org/c/openstack/whitebox-tempest-plugin/+/76286600:45
openstackgerritmelanie witt proposed opendev/elastic-recheck master: Add query for bug 1911574  https://review.opendev.org/c/opendev/elastic-recheck/+/77068800:58
openstackbug 1911574 in OpenStack-Gate "SSH to guest sometimes fails publickey authentication: AuthenticationException: Authentication failed." [Undecided,New] https://launchpad.net/bugs/191157400:59
openstackgerritArtom Lifshitz proposed openstack/whitebox-tempest-plugin master: [WIP] Different approach to TripleO job  https://review.opendev.org/c/openstack/whitebox-tempest-plugin/+/76286602:31
openstackgerritmelanie witt proposed openstack/devstack-plugin-ceph master: DNM debug logging  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/77069702:48
*** rcernin has quit IRC03:04
*** rcernin has joined #openstack-qa03:25
*** rcernin has quit IRC03:27
*** brinzhang0 has joined #openstack-qa03:27
*** brinzhang0 has quit IRC03:29
*** brinzhang0 has joined #openstack-qa03:29
*** brinzhang_ has quit IRC03:30
*** brinzhang0 has quit IRC03:30
*** brinzhang0 has joined #openstack-qa03:31
*** rcernin has joined #openstack-qa03:31
*** rcernin has quit IRC03:36
*** rcernin has joined #openstack-qa03:37
*** rcernin has quit IRC03:38
*** rcernin has joined #openstack-qa03:38
*** rcernin has quit IRC03:39
*** rcernin has joined #openstack-qa03:39
*** ricolin_ has joined #openstack-qa04:13
*** whoami-rajat__ has joined #openstack-qa04:22
*** ricolin_ has quit IRC04:24
*** artom has quit IRC04:24
*** ramishra has quit IRC04:25
*** ramishra has joined #openstack-qa04:26
*** amotoki has quit IRC04:44
*** amotoki has joined #openstack-qa04:44
openstackgerritmelanie witt proposed openstack/devstack-plugin-ceph master: DNM debug logging  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/77069705:12
*** dave-mccowan has quit IRC05:43
*** ricolin has quit IRC06:17
*** brinzhang_ has joined #openstack-qa06:25
*** gcheresh has joined #openstack-qa06:25
*** brinzhang0 has quit IRC06:28
*** abdysn has joined #openstack-qa06:28
openstackgerritmelanie witt proposed openstack/devstack-plugin-ceph master: DNM debug logging  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/77069706:55
*** ricolin has joined #openstack-qa07:39
*** ccamposr has joined #openstack-qa07:39
*** rcernin has quit IRC07:40
*** ccamposr__ has quit IRC07:41
*** ramishra has quit IRC07:44
*** akahat|rover is now known as akahat|lunch07:45
*** ramishra has joined #openstack-qa07:45
*** rpittau|afk is now known as rpittau07:47
*** ralonsoh has joined #openstack-qa07:51
*** jpena|off is now known as jpena07:52
*** ralonsoh_ has joined #openstack-qa07:56
*** rcernin has joined #openstack-qa07:56
*** ralonsoh has quit IRC07:59
*** rcernin has quit IRC08:01
*** slaweq has joined #openstack-qa08:03
*** gfidente|afk is now known as gfidente08:11
*** rcernin has joined #openstack-qa08:26
*** ralonsoh has joined #openstack-qa08:28
*** ralonsoh_ has quit IRC08:28
*** brinzhang0 has joined #openstack-qa08:45
*** brinzhang_ has quit IRC08:48
*** tosky has joined #openstack-qa08:49
*** slaweq has quit IRC08:55
*** slaweq has joined #openstack-qa09:00
*** lucasagomes has joined #openstack-qa09:08
*** yamamoto has quit IRC09:21
*** yamamoto has joined #openstack-qa09:53
*** abdysn has quit IRC09:57
*** abdysn has joined #openstack-qa09:58
*** yamamoto has quit IRC10:05
*** rcernin has quit IRC10:14
*** rcernin has joined #openstack-qa10:23
*** rcernin has quit IRC10:40
*** dtantsur|afk is now known as dtantsur10:56
*** brinzhang has joined #openstack-qa10:57
*** brinzhang has quit IRC10:58
*** brinzhang has joined #openstack-qa10:58
*** brinzhang0 has quit IRC10:59
*** brinzhang has quit IRC11:25
*** brinzhang has joined #openstack-qa11:25
*** brinzhang_ has joined #openstack-qa11:26
*** brinzhang_ has quit IRC11:28
*** brinzhang_ has joined #openstack-qa11:29
*** brinzhang has quit IRC11:30
*** rcernin has joined #openstack-qa11:55
*** rcernin has quit IRC11:58
*** ramishra has quit IRC12:02
*** brinzhang0 has joined #openstack-qa12:03
*** ramishra has joined #openstack-qa12:05
*** ramishra has quit IRC12:05
*** ramishra has joined #openstack-qa12:05
*** brinzhang_ has quit IRC12:05
*** jpena is now known as jpena|lunch12:33
*** akahat|lunch is now known as akahat|rover12:36
*** rfolco has left #openstack-qa12:36
soniya29gmann, thanks for the links, I will go through them and then get in touch with you12:55
*** rh-jelabarre has joined #openstack-qa13:02
*** brinzhang0 has quit IRC13:13
*** brinzhang0 has joined #openstack-qa13:14
*** sboyron has joined #openstack-qa13:26
openstackgerritLee Yarwood proposed openstack/devstack-plugin-ceph master: WIP zuul: Introduce a multinode ceph job  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/71162513:27
*** jpena|lunch is now known as jpena13:29
*** artom has joined #openstack-qa13:30
*** nweinber has joined #openstack-qa13:54
*** yamamoto has joined #openstack-qa14:03
*** yamamoto has quit IRC14:08
*** dave-mccowan has joined #openstack-qa14:20
*** abdysn has quit IRC14:55
*** ramishra has quit IRC14:57
*** andrebeltrami_ has joined #openstack-qa15:00
*** ramishra has joined #openstack-qa15:01
*** sboyron has quit IRC15:31
gmannkopecmartin: can you backport this to stable/stein too https://review.opendev.org/q/I71102095f3603915f0bc7d21f2e18c4eac4e95ec16:23
gmannsoniya29: hanks, anytime16:23
*** dosaboy has quit IRC16:24
*** vhari has quit IRC16:24
*** dosaboy has joined #openstack-qa16:25
*** gcheresh has quit IRC16:26
*** vhari has joined #openstack-qa16:33
openstackgerritMartin Kopec proposed openstack/devstack stable/stein: Remove tempest deprecated img_dir option  https://review.opendev.org/c/openstack/devstack/+/77083916:45
kopecmartingmann: sure ^^16:45
openstackgerritLee Yarwood proposed openstack/devstack-plugin-ceph master: nova: Make configure_ceph_nova multinode compatible  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/75632316:48
openstackgerritLee Yarwood proposed openstack/devstack-plugin-ceph master: zuul: Introduce a multinode ceph job  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/71162516:48
gmannkopecmartin: thanks17:01
*** lucasagomes has quit IRC17:02
*** jpena is now known as jpena|off17:07
*** raildo has quit IRC17:42
*** rpittau is now known as rpittau|afk17:42
*** gfidente is now known as gfidente|afk17:53
dansmithgmann: so one of the glance multistore tempest tests is failing periodically17:56
dansmithit's reported by subunit as "inprogress" and doesn't actually show as a fail in the test run output,17:57
dansmithbut does in the testr-results, but with no useful error17:57
dansmith.MultiStoresImportImagesTest.test_glance_direct_import_image_to_all_stores [] ... inprogress17:57
dansmithI've been running tempest in a loop locally trying to repro, but I can't17:57
dansmithdoes inprogress mean test timeout maybe?17:57
dansmithtotal runtime is 1887s which would be just a little over 1800s == 30m18:04
*** andrebeltrami_ has quit IRC18:05
*** dtantsur is now known as dtantsur|afk18:10
gmanndansmith: humm, it should return timeout in that case18:17
gmannis it happening frequently?18:17
dansmithyeah, ish18:18
dansmithenough that I've seen it a few times, and people have asked me to fix it :)18:18
gmannohk18:18
dansmithgmann: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22tempest.api.image.v2.test_images.MultiStoresImportImagesTest.test_glance_direct_import_image_to_all_stores.*inprogress%5C%2218:21
dansmithso, maybe just once a day, so not so much, but... still noticeable18:21
*** dosaboy has quit IRC18:21
gmanndansmith: checking..18:22
gmannit passing scenario, it took only 31 sec - https://zuul.opendev.org/t/openstack/build/ad4328066cdc4526aa1c9526022cbada/log/job-output.txt#2763418:24
dansmithyeah18:24
dansmithI noticed that in the one I'm looking at, there's an OOM in dmesg18:24
dansmithI think it was just haproxy, but could have stalled something maybe18:25
dansmithoh, nm,18:25
dansmithhaproxy triggered, but it killed a python18:25
dansmiththat must be it.. if one of the workers just gets kill -9'd it reports as "inprogress...never finished" maybe?18:25
dansmithbecause the test report also says workerN never returned a value18:26
*** raildo has joined #openstack-qa18:26
dansmiththat python doesn't show up as using a lot of memory in the dstat18:27
dansmithbut I wonder if we're pushing the limits with ceph18:28
gmannbut its failing in non-ceph job too https://zuul.opendev.org/t/openstack/build/28c47bb373084072ae3e8d1ca7f01359/logs18:29
dansmithah, okay I haven't seen it other than on ceph multistore jobs, but... okay18:30
dansmithKilled process 103758 (python) total-vm:1532428kB, anon-rss:1295772kB, file-rss:5272kB, shmem-rss:0kB, UID:1002 pgtables:2660kB oom_score_adj:018:36
dansmithI'm guessing that's a test runner that got to 1.5g?18:36
*** slaweq has quit IRC18:38
*** ralonsoh has quit IRC18:41
dansmitheven with just running v2.images tests, my subunit runner master process swells to 1.5G over the course of a 36s run18:48
dansmithI wonder if we're generating a ton of debug logs somewhere or something18:49
gmannhumm, may be18:50
gmannshowing running with 0 duration  https://9bc86082a26342dc4413-58cf4a3a4fc59e9f2b77716e9b3e3ff8.ssl.cf1.rackcdn.com/743695/25/check/glance-multistore-cinder-import/28c47bb/controller/logs/stackviz/index.html#/stdin/test-details/tempest.api.image.v2.test_images.MultiStoresImportImagesTest.test_glance_direct_import_image_to_all_stores18:59
dansmithgmann: https://pastebin.com/6waTd7Ab19:00
dansmithmultistore tests make it use 1.5g, 65m otherwise19:00
gmannhumm19:01
dansmithgmann: help me with math.. this is 1m not 1g, right? https://github.com/openstack/tempest/blob/79650295718af6eeb4d4438a40fed531e19a253d/tempest/api/image/v2/test_images.py#L16119:02
gmann10 m19:03
dansmithI don't think so19:03
dansmithokay, maybe it is, but it takes a looong time to run in a REPL shell19:04
dansmithmaybe just because of random(), I dunno19:05
gmannits 1e7.19:05
gmannis it adding more19:05
dansmiththought maybe that was going haywire but it returns 10m of stuff, just takes a long time19:05
gmannin other non multi store test, we just do randombyte - https://github.com/openstack/tempest/blob/2262cced388fea86afc6e645232e406d6ba36bae/tempest/api/image/v2/admin/test_images.py#L10119:07
gmannwhich is default 102419:08
dansmithyeah19:08
*** gcheresh has joined #openstack-qa19:08
gmannwith multistore it is 10M  for loop19:09
dansmithrandint() for 10M bytes, still generates 10M19:10
gmannwe can add log in random_bytes which i think will be helpful for many other cases too19:10
dansmithtakes a while, but doesn't seem like that would inflate it19:11
dansmithI ran it locally myself and it generated 10M of stuff19:11
*** dosaboy has joined #openstack-qa19:11
dansmithwe do duplicate that in stage_image, so that's 20m19:12
dansmithlemme knock that down and see if it runs smaller19:12
gmannmay be with 102419:13
dansmithwhoa, yeah19:14
dansmiththat made a difference :)19:14
dansmithwith 148576019:14
dansmithMaximum resident set size (kbytes): 26292419:14
dansmith256m instead of 1.5g19:14
gmannwow19:14
dansmithso I dunno why that's happening, but I also don't think we need to use 10m there, do you/19:15
gmannyeah, we can go with default only19:15
gmannchecking review of that change if any rational behind that19:16
dansmithsetting to default makes it run at 68m, basically the same as when I exclude that test19:16
dansmithso... yeah19:16
gmannnothing specific comment of discussion on this, https://review.opendev.org/c/openstack/tempest/+/745712/119:18
gmannyeah we can go with the default.19:18
dansmithpatching19:18
gmanndansmith: what log you think we can addfor future help may be before return in data_utils.random_bytes ?19:19
*** whoami-rajat__ has quit IRC19:20
dansmithgmann: I was thinking maybe random_bytes should raise if you try to generate more than 1M or so?19:20
dansmithwe don't know what the actual problem is, so I dunno if it's with random_bytes, or our use of it later19:20
dansmithbut I'm not sure when we'd need that much data anyway19:20
gmannyeah we should limit it with some value may be 10M if 1M is less19:21
gmannbut yeah i am not sure who need that much data19:21
openstackgerritDan Smith proposed openstack/tempest master: Fix memory explosion in multi-store image tests  https://review.opendev.org/c/openstack/tempest/+/77085019:23
gmannanyways let's do data_utils.random_bytes limit separately as it is lib function and need reno and all19:24
dansmithack19:24
gmanngate is taking too long as you know I will check result tomorrow if available :)19:25
* dansmith nods19:25
dansmithI'm hoping for results by at least next week :)19:26
gmannyour devstack constraint one still in queue- data_utils.random_bytes19:26
gmannhttps://review.opendev.org/c/openstack/devstack/+/77066219:26
gmannheh19:26
dansmithyeah19:26
*** raildo has quit IRC19:37
*** raildo_ has joined #openstack-qa19:37
dansmithgmann: what reno section would this go under? deprecations == more than 1M is deprecated? :)19:37
gmanndansmith:  i think we can add in upgrade as we are going to reject more than that now onwards19:38
gmannor we can just log warning if it is more than 1M ?19:39
dansmithack, didn't know if a test suite had "upgrade" problems.. :P19:39
dansmithwe'll never see it I think19:39
gmannok19:39
dansmithand when we're OOM killed, our subunit buffer dies19:39
gmannyeah19:39
dansmithI can still try to figure out why it's exploding, but I don't think tests really need to use large data blobs in almost all cases, despite the human need to think it makes it more realistic ;)19:42
gmanntrue.19:43
openstackgerritDan Smith proposed openstack/tempest master: Make random_bytes() enforce sane size limits  https://review.opendev.org/c/openstack/tempest/+/77085219:47
*** hyang has joined #openstack-qa19:47
*** ccamposr__ has joined #openstack-qa19:54
*** ccamposr has quit IRC19:56
openstackgerritCyril Roelandt proposed openstack/tempest master: Make _create_and_stage_image always return a list of stores  https://review.opendev.org/c/openstack/tempest/+/75568619:58
*** slaweq has joined #openstack-qa20:19
*** gcheresh has quit IRC20:22
*** slaweq has quit IRC20:27
*** dosaboy has quit IRC21:00
*** dosaboy has joined #openstack-qa21:02
*** raildo_ has quit IRC21:15
openstackgerritGhanshyam proposed openstack/tempest master: Create default network for scenario tests  https://review.opendev.org/c/openstack/tempest/+/77016921:15
*** hamalq has joined #openstack-qa21:22
openstackgerritLingxian Kong proposed openstack/patrole master: Remove the tests for unsupported Nova APIs  https://review.opendev.org/c/openstack/patrole/+/77086721:22
*** nweinber has quit IRC21:42
*** vhari has quit IRC21:50
*** rcernin has joined #openstack-qa22:05
*** yamamoto has joined #openstack-qa22:18
*** hyang has quit IRC22:31
openstackgerritMerged openstack/devstack stable/ussuri: Remove tempest deprecated img_dir option  https://review.opendev.org/c/openstack/devstack/+/77055222:42
*** yamamoto has quit IRC22:52
*** yamamoto has joined #openstack-qa22:52
openstackgerritMerged openstack/devstack stable/train: Remove tempest deprecated img_dir option  https://review.opendev.org/c/openstack/devstack/+/77055523:07
*** nweinber has joined #openstack-qa23:08
*** nweinber has quit IRC23:11
openstackgerritMerged openstack/tempest master: Delete wrong argument from creating HTTP connection  https://review.opendev.org/c/openstack/tempest/+/77065923:51

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!