Thursday, 2023-04-27

opendevreviewGoutham Pacha Ravi proposed openstack/devstack-plugin-ceph master: Run the native CephFS job with cephadm  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/88151902:35
opendevreviewLukas Piwowarski proposed openstack/tempest master: [WIP] Catching broken tests in tempest-full-test-account-* jobs  https://review.opendev.org/c/openstack/tempest/+/88063007:09
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422312:16
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422314:57
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422315:04
opendevreviewLuigi Dino Tamagnone proposed openstack/tempest master: Avoiding the clean of not tempest created resources  https://review.opendev.org/c/openstack/tempest/+/87992315:13
*** haleyb_ is now known as haleyb15:14
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: Revert "Temporary pin the ceph jobs nodeset to Focal"  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/86531515:45
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422315:45
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422316:01
opendevreviewFrancesco Pantano proposed openstack/devstack-plugin-ceph master: [DNM][CI] Add CEPHADM_DEPLOY flag to py3 tests  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/83422316:05
opendevreviewGoutham Pacha Ravi proposed openstack/devstack-plugin-ceph master: Revert "Temporary pin the ceph jobs nodeset to Focal"  https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/86531516:11
opendevreviewDan Smith proposed openstack/tempest master: Always validate in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167516:12
opendevreviewribaudr proposed openstack/tempest master: [WIP] Test tempest for Nova and Manila  https://review.opendev.org/c/openstack/tempest/+/83874316:13
dansmithgmann: I probably need to pass validation_resources to create_server() as well right?17:14
gmanndansmith: we do not need that as the method itself is creating them, and wait_until=SSHABLE means we need to create validation_resources either in a common place or have the test pass them17:17
gmanndansmith: 1 comment i missed earlier https://review.opendev.org/c/openstack/tempest/+/881675/2/tempest/scenario/manager.py#32317:17
dansmithgmann: okay sorry so get_test_validation_resources() will set up and cache those and somehow create_test_server() will see them and I don't need to pass them?17:18
gmanndansmith: you are passing those in kwargs to create_test_server() right?  https://review.opendev.org/c/openstack/tempest/+/881675/1..2/tempest/scenario/manager.py#32517:19
dansmithoh hehe, right17:20
dansmithI did that yesterday and this morning I'm like "wtf, I don't even pass them"17:20
gmann:)17:20
dansmithgmann: even with what I have up there, I would think it should work right?17:35
dansmithhttps://zuul.opendev.org/t/openstack/build/b4cb70dc8b5e4e239233072462fead31/log/controller/logs/grenade.sh_log.txt#198617:35
dansmithI assume that's a failure to setup validation_resources?17:35
dansmiththe other failure looks different, but perhaps due to lack of network setup? https://zuul.opendev.org/t/openstack/build/b4cb70dc8b5e4e239233072462fead31/log/controller/logs/grenade.sh_log.txt#48917:36
dansmithperhaps I should just take the approach of trying to make sure all the volume tests wait for ssh'able, but..it just feels like whack-a-mole17:37
gmanndansmith: yeah, if we setup network we should have proper network for tenant 17:37
gmannssh failure, need to check, this is with class level validation resource right17:39
dansmithah, that test class has set_network_resources()17:39
dansmithgmann: oh, is that why?17:39
gmanndansmith: ohk, yes. so we need to pass wait_until=ACTIVE from there17:39
gmannI think we have a few classes doing that: ipv6 and sec grp tests?17:39
dansmithack17:40
dansmithokay so I might as well push up another PS because I assume the regular non-grenade jobs will fail this way too17:41
gmannyeah, it fails on tempest-full also17:41
dansmithokay.. I see tempest-slow passed one scenario test with volumes, so hopefully that's good news17:41
opendevreviewDan Smith proposed openstack/tempest master: Always validate in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167517:42
dansmithgmann: ^17:42
gmannchild class set_network_resources takes precedence 17:42
dansmithI hope this is going to simultaneously improve stability for regular tests and enable the jammy ceph job to work17:42
dansmithfeeling a little hopeless at the moment :/17:43
gmanndansmith: there are a few more tests not setting network resources that do their own validation, we need to pass wait_until=SSHABLE from there too 17:50
gmann1. https://github.com/openstack/tempest/blob/master/tempest/scenario/test_network_v6.py#L63 17:50
gmann2. https://github.com/openstack/tempest/blob/master/tempest/scenario/test_network_advanced_server_ops.py#L6117:50
gmann3. https://github.com/openstack/tempest/blob/master/tempest/scenario/test_security_groups_basic_ops.py#L16017:50
gmannsorry, *pass wait_until=ACTIVE17:51
dansmithgmann: you mean wait_until=ACTIVE?17:51
gmannyeah17:52
dansmithI'm trying to get my devstack locally back to working so I can probe for more17:52
gmannk17:52
dansmithgmann: okay that one where we need to pass wait_until=ACTIVE worked, but still getting this:18:27
dansmithhttps://zuul.opendev.org/t/openstack/build/7238bf8b049c4e71b0a2a8c9a085ae8f/log/controller/logs/grenade.sh_log.txt#150018:27
dansmiththat looks like not having a keypair configured properly, I assume in validation_resources18:28
dansmiththat one already does verify ssh itself, so maybe I should wait_until=ACTIVE that one as well?18:28
gmannyeah, in that case we do not need to double SSH18:29
dansmithah, it creates its own keypair18:29
dansmithso I assume that's why18:29
dansmithyeah18:29
dansmith okay18:29
dansmithI missed understanding that earlier18:29
gmannohk, that is why.18:29
gmannvalidation resources do create a keypair, but there was a mismatch between what the server got and what the test was trying18:30
dansmithyeah18:30
opendevreviewDan Smith proposed openstack/tempest master: Default to SSHABLE in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167518:31
opendevreviewDan Smith proposed openstack/tempest master: Default to SSHABLE in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167519:12
dansmithclarkb: has zuul started automatically restarting jobs when a depends-on is updated while the jobs are running?19:17
clarkbdansmith: I don't think so I would expect it to continue to kick them out19:49
dansmithclarkb: yeah, that's the normal behavior but it surely seems like it's restarting them20:03
opendevreviewDan Smith proposed openstack/tempest master: Default to SSHABLE in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167520:08
dansmithclarkb: so that ^ just restarted the jobs on the patch that depends on it, change 88158520:09
dansmithno recheck required on my end20:09
clarkbthat must be new then. I don't recall reviewing a change to make that happen20:16
dansmithyeah, twice now I've pushed up a patch and then gone to the dependent and refreshed in a loop until I see the zuul -1 abort, only to never see it and realize the jobs got restarted21:13
dansmithgmann: https://4950815d31f217aabeea-33bb14cb0ac75615a3427d1ffce62a66.ssl.cf1.rackcdn.com/881675/6/check/tempest-multinode-full-py3/9d0a185/testr_results.html21:26
dansmithgmann: the shelve one sets its own keypair as well, so I assume that's why we can't verify it, but it also doesn't do its own sshable check.. 21:27
dansmithand the first one seems to be failing on secgroup, but that seems like it's unrelated, unless it's part of set_network stuff changing things?21:27
gmanndansmith: checking21:30
gmanndansmith: it does ssh to server in create_timestamp method so we can skip SSH in create_server https://github.com/openstack/tempest/blob/e4ec47a063ab03ae3d5d621556377717186668c2/tempest/scenario/test_shelve_instance.py#L8821:31
gmanncreate_timestamp does ssh to the server and writes a file21:31
gmannsec group seems not related to me too, let's check again. 21:32
dansmithgmann: ahh, makes sense.. I figured it must be because it creates the keypair, I just wasn't thinking timestamp21:33
dansmithokay yeah I'm hoping secgroup is just a fluke.. i'll see if the nova job hits that too21:33
gmannk21:34
dansmithgmann: there's another one I just found, in cinder encrypted volumes test, which definitely needs wait-for-ssh21:38
dansmithwe create with keypair, but I'm not sure I see us actually use it, which is weird21:38
dansmithhttps://github.com/openstack/tempest/blob/e4ec47a063ab03ae3d5d621556377717186668c2/tempest/scenario/test_encrypted_cinder_volumes.py#L4821:38
dansmithis there some easy way I can just create a validation_resources with that keypair in it or something?21:39
dansmithmight be able to just remove this one, but not sure21:39
gmannin this case the test one is used for the server - https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/common/compute.py#L224   but the validation one for ssh - https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/common/compute.py#L14021:41
gmannwe should fix that or just override the created keypair in validation_resource['keypair']21:41
dansmithoh we pass the name to create_server so it gets created that way in nova, but why?21:42
gmanndansmith: or yeah, remove one from test and let create_server() handle that will be easy21:42
dansmithwe don't use it again right?21:42
gmannyeah21:42
dansmithmaybe just copy/pasta?21:43
gmannmay be21:43
gmannmaking create_server() handle all will be consistent 21:44
dansmithyeah21:45
dansmiththe nova job has failed three things, that shelve test from the tempest run and two cinder tests for this encrypted case21:45
dansmithhoping we're getting close21:46
gmannyeah21:46
dansmithalthough it's definitely taking a while21:46
dansmithoh actually that timer is wrong21:47
dansmithclarkb: when it restarts the jobs it doesn't reset the time-in-queue (not surprisingly) so this looks like it's been going for 3.5 hours :)21:47
dansmithugh, cinder tempest plugin has some tests I need to fix21:49
gmanndansmith: oh, thinking of plugins. there might be cases in many other plugins like neutron tempest plugin etc21:54
dansmithgmann: UGH21:54
gmanndansmith: I think we can keep arg default as ACTIVE and explicitly change tempest required scenario tests to pass SSHABLE ?21:54
dansmithwell, the thing is,21:56
dansmithwith this ceph thing it seems like it affects even non-volume-related tests because our instance storage is all based on ceph21:57
dansmithalthough I admit I've lost track of the range of failures through the steps I've made here21:57
dansmithman that really ruined my afternoon :)21:58
dansmithokay so.. I guess I need to revert this to default to ACTIVE and then let it run again and see what fails and decorate those as SSHABLE and see21:59
gmanndansmith: or another way is to check in create_server() if network resources are created and then go for self.get_tenant_network() and sshable things ? and plugins will not fail and can pass ACTIVE explicitly?22:01
dansmithgmann: yeah I thought about that, but we also have these keypair problems and it seems like maybe it's getting too magical there22:02
dansmithlike also hard to tell what the validations are that it's doing for you that way22:02
dansmithlet me just switch to opt-in and we'll see22:02
gmannyeah22:03
gmannok22:03
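The opt-in approach agreed on above can be sketched as follows. This is a minimal illustration, not tempest code: `create_server` here is a hypothetical stand-in that only reports which waits it would perform, and the idea that SSHABLE implies waiting for ACTIVE first is an assumption of the sketch.

```python
ACTIVE = "ACTIVE"
SSHABLE = "SSHABLE"


def create_server(wait_until=ACTIVE, **kwargs):
    """Hypothetical stand-in for a scenario create_server() helper.

    Returns the list of waits that would be performed instead of
    talking to a real cloud.
    """
    # Every caller waits for the server to become ACTIVE.
    checks = ["wait for status ACTIVE"]
    # Only callers that explicitly opt in get the extra SSH check, so
    # plugin tests that never pass wait_until keep their old behaviour.
    if wait_until == SSHABLE:
        checks.append("wait for SSH login")
    return checks


# A plugin test that passes nothing is unaffected.
print(create_server())
# A volume scenario test opts in explicitly.
print(create_server(wait_until=SSHABLE))
```

With the default left at ACTIVE, only the tests that are changed to pass SSHABLE need their keypair and network setup checked.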
dansmithactually the problem scenario was in the cinder test plugin anyway22:07
dansmithso I should probably start there after I roll back this default22:07
dansmithgmann: okay what do we do about this: https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/scenario/manager.py#L120122:12
dansmithtests can optionally pass a keypair in, so we can't just wait_until sshable there22:13
dansmithshould I get_test_validation_resources() and then just .update(keypair=keypair) ?22:13
gmanndansmith: I think that will work instead of fixing many tests to not pass the keypair, which we can do later22:14
dansmithI mean I will do the update if the test passes keypair= to create_server()22:15
dansmithoh wait22:16
dansmithvalidation_resources[keypair] is a boolean not a keypair?22:16
dansmithhmm, no it must be22:19
gmannyeah it should be storing actual keypair in validation_resources['keypair']22:19
gmannthe validation fixture takes keypair as a boolean to create the keypair22:19
dansmithoh I see the fixture okay22:23
gmannideally we should be able to pass validation resource bools in get_test_validation_resources() so that a test can override the configured ones, and the validation fixture can create only the requested resources 22:24
gmannbut anyways that is something we should optimize later. 22:24
gmanncurrently, it creates all the configured ones for all tests. we can make it a per-test request also22:25
gmannbut for this change, let's override it in validation_resources dict itself22:26
opendevreviewDan Smith proposed openstack/tempest master: Default to SSHABLE in scenario create_server()  https://review.opendev.org/c/openstack/tempest/+/88167522:27
dansmithah crap, need to update the commit message (again)22:27
gmanndansmith: wait22:29
* dansmith just ctrl-c'd git review22:29
dansmithwaiting :)22:29
gmannyou need to copy the whole keypair dict including private_key, not just the name https://review.opendev.org/c/openstack/tempest/+/881675/7/tempest/scenario/manager.py#32822:29
dansmithwe don't get the whole dict, just the name, but compute.create_test_server doesn't seem to need more than the name?22:30
gmannbecause that is what is used in the ssh logic in common.compute.py22:30
gmannyeah, that is the issue22:30
dansmithgmann: I was going based on this: https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/common/compute.py#L22622:31
gmanndansmith: this only passes it to nova to inject in the server, but later when tempest does ssh it uses private_key https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/common/compute.py#L14022:31
dansmithah, validation_resources needs it for the ssh part you mean22:32
gmannyeah22:32
dansmithokay, that's going to be harder22:32
dansmithI guess I need to make it take keypair or key_name and then fix the tests to pass all of keypair22:32
gmannthat needs some plugins to change too, which is the main issue here. I am thinking in common.compute.py we can get the keypair and override it in validation_resources in the else part https://github.com/openstack/tempest/blob/e64f4c2b07e048ea77d08a29b7afbd17448abc32/tempest/common/compute.py#L22622:33
gmannget keypair from nova using name22:33
dansmithwell, I was thinking of making it either or, and I can fix these tests and the tests in the cinder test plugin (at least) to pass the whole keypair22:33
dansmithand then I can strip out the name for the actual create22:34
gmanndansmith: ohk that will work as those are the only ones where you are passing SSHABLE now22:34
gmanngot it. +122:34
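The "either keypair or key_name" idea discussed above might look roughly like this. It is a sketch under stated assumptions, not the change that was proposed: `build_create_args` is a hypothetical helper, the dict layout (`name`, `private_key`) follows what the conversation describes for `validation_resources['keypair']`, and rejecting SSHABLE with a bare `key_name` is a design choice of this sketch.

```python
import copy


def build_create_args(validation_resources, wait_until="ACTIVE",
                      keypair=None, key_name=None):
    """Hypothetical helper: reconcile a test's keypair with validation.

    validation_resources is assumed to be a dict whose 'keypair' entry
    holds a full keypair dict (name and private_key).
    """
    if keypair is not None and key_name is not None:
        raise ValueError("pass either keypair or key_name, not both")
    # With only a name we cannot SSH: the private key is unknown, and
    # the validation keypair would not match what the server received.
    if wait_until == "SSHABLE" and key_name is not None:
        raise ValueError("SSHABLE needs the whole keypair, not just key_name")

    # Copy so the shared, cached validation resources stay untouched.
    resources = copy.deepcopy(validation_resources)
    if keypair is not None:
        # The SSH check must use the same key the server was booted with.
        resources["keypair"] = keypair
        # Only the name is needed for the actual server create call.
        key_name = keypair["name"]

    create_kwargs = {"wait_until": wait_until,
                     "validation_resources": resources}
    if key_name is not None:
        create_kwargs["key_name"] = key_name
    return create_kwargs


shared = {"keypair": {"name": "validation-key", "private_key": "VALIDATION"}}
test_keypair = {"name": "test-key", "private_key": "TEST"}
args = build_create_args(shared, wait_until="SSHABLE", keypair=test_keypair)
print(args["key_name"])
print(args["validation_resources"]["keypair"]["name"])
```

Passing the whole keypair keeps the key injected by nova and the key used for the SSH check in agreement, which is the mismatch described earlier in the conversation.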
gmannthis ssh thing is messed up in tempest, i should clean it up sometime22:35
dansmithin all your free time? :)22:37
gmannyeah :)22:37
opendevreviewDan Smith proposed openstack/tempest master: Allow wait_until=SSHABLE in scenario manager  https://review.opendev.org/c/openstack/tempest/+/88167522:40
dansmithgmann: is tempest pinned for xena?23:32
dansmithah, no I'm failing on zed too23:33
opendevreviewDan Smith proposed openstack/tempest master: Allow wait_until=SSHABLE in scenario manager  https://review.opendev.org/c/openstack/tempest/+/88167523:42
