*** dviroel|ruck|afk is now known as dviroel|ruck | 00:48 | |
*** rlandy|ruck|bbl is now known as rlandy|ruck | 00:57 | |
*** dviroel|ruck is now known as dviroel|ruck|out | 00:57 | |
*** dviroel|ruck|out is now known as dviroel|out | 00:57 | |
*** rlandy|ruck is now known as rlandy|out | 04:06 | |
*** amoralej|off is now known as amoralej | 08:11 | |
amoralej | rlandy|out, ack, sorry for that i'll prepare a review to reproduce it before moving to it again | 08:14 |
---|---|---|
*** jpena|off is now known as jpena | 08:31 | |
*** sshnaidm|afk is now known as sshnaidm | 08:54 | |
*** ysandeep|out is now known as ysandeep | 09:01 | |
marios | need reviews please https://review.opendev.org/c/openstack/tripleo-ci/+/828110 thank you | 09:19 |
ysandeep | marios: looking | 09:25 |
marios | thanks ysandeep | 09:30 |
ysandeep | pojadhav, pm | 09:44 |
*** rlandy|out is now known as rlandy|ruck | 11:06 | |
rlandy|ruck | amoralej: thanks | 11:07 |
rlandy|ruck | yeah dviroel|out and I looked at that enable opstools section | 11:07 |
rlandy|ruck | I am not sure why it is not enabled | 11:08 |
*** dviroel|out is now known as dviroel|ruck | 11:10 | |
marios | rlandy|ruck: this is the one i was referring to https://review.opendev.org/c/openstack/tripleo-quickstart/+/828126 | 11:19 |
soniya29 | arxcruz, kopecmartin, raukadah, ysandeep, rlandy|ruck, please add/edit the today's agenda for tempest meeting:- https://hackmd.io/fIOKlEBHQfeTZjZmrUaEYQ | 11:27 |
rlandy|ruck | https://review.opendev.org/c/openstack/octavia-tempest-plugin/+/816369 | 11:33 |
marios | rlandy|ruck: https://bugs.launchpad.net/tripleo/+bug/1959014/comments/11 | 11:33 |
rlandy|ruck | dviroel|ruck: akahat|rover: let's sync | 11:35 |
rlandy|ruck | meet.google.com/vzm-nrah-qqf | 11:35 |
rlandy|ruck | amoralej: chatted with marios | 11:35 |
rlandy|ruck | we're going to update our tripleo-quickstart release files | 11:36 |
rlandy|ruck | so we also enable opstools | 11:36 |
amoralej | yes, currently it's duplicated | 11:36 |
amoralej | between delorean-deps.repo and quickstart-centos-opstools | 11:36 |
amoralej | actually, the quicstart one is disabled in providers | 11:37 |
amoralej | so, the plan is to enable quickstart one and disable the one in delorean-deps.repo? | 11:37 |
marios | amoralej: yeah we think the issue surfaced when we switched c8 container&image builds touse the release files (previously using some other code in tripleo-repos) | 11:38 |
amoralej | ahh, got it | 11:39 |
marios | amoralej: so we need something similar /along the same lines like https://opendev.org/openstack/tripleo-quickstart/src/commit/2ce46f8205098327b0dbf2cec08d16bd52638296/config/release/tripleo-ci/CentOS-9/master.yml#L58-L66 | 11:39 |
marios | amoralej: so it doesn't get disabled | 11:39 |
amoralej | marios, why not simply keeping it enabled by default? | 11:41 |
amoralej | getting it enabled shouldn't break anything | 11:42 |
marios | amoralej: not sure ... i guess nothing needs it during the deployment | 11:42 |
marios | amoralej: and we only needed it for containers | 11:42 |
amoralej | that's right, but i'd help to keep things easier | 11:42 |
marios | amoralej: i don't know why it was decided to disable it on the normal run other than 'it isn't needed' ? maybe that is reason enough | 11:42 |
marios | :D | 11:42 |
amoralej | maybe :) | 11:42 |
amoralej | so that fixes the enablement of that repo | 11:43 |
amoralej | but not the duplicated | 11:43 |
amoralej | actually, efectively, we've being having the repo enabled in all jobs via delorean-deps.repo | 11:43 |
marios | amoralej: well the duplicated one should be disabled or set priority ... ah right ;) ^^ | 11:43 |
rlandy|ruck | arxcruz: can you take a look at: https://bugs.launchpad.net/tripleo/+bug/1960355 | 11:44 |
amoralej | marios, so, there is no point in adding logic to enable/disable in quickstart but keeping it always enabled in delorean-deps | 11:47 |
marios | amoralej: was it alwyas in delorean-deps or some recent addition? yeah i thnk your suggestion is OK if they are the same content frmo deps vs the 8-stream one | 11:48 |
amoralej | i was checking exactly that :) | 11:49 |
amoralej | it has been in delorean-deps.repo since the beginning of centos8 at least | 11:49 |
amoralej | https://review.rdoproject.org/r/c/rdo-infra/puppet-dlrn/+/26508 | 11:49 |
marios | amoralej: ack | 11:49 |
akahat|rover | rlandy|ruck, https://review.rdoproject.org/r/c/testproject/+/36756 | 11:50 |
marios | tripleo ci o/ need reviews please https://review.opendev.org/c/openstack/tripleo-ci/+/828110 thank you | 11:52 |
rlandy|ruck | amoralej: just discussing with ruck/rovers | 11:52 |
rlandy|ruck | we are going o do what is in the centos-9 release files | 11:52 |
marios | rlandy|ruck: see 13:47 < amoralej> marios, so, there is no point in adding logic to enable/disable in quickstart but keeping it always enabled in delorean-deps | 11:52 |
akahat|rover | https://logserver.rdoproject.org/56/36756/28/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-master/d848579/logs/supplemental/home/centos/ipa_prep.sh.log.txt.gz | 11:52 |
rlandy|ruck | remove it from our release files? | 11:52 |
marios | rlandy|ruck: i think it makes sense.. so could do that instead and just make sure it isn't disabled | 11:53 |
rlandy|ruck | I think there is a diff whether there is a change to build or not | 11:53 |
marios | rlandy|ruck: yeah 'remove from our release files' will remove the duplicate | 11:53 |
akahat|rover | failed: mirror issues. | 11:53 |
marios | rlandy|ruck: so maybe do the easiest thing first to unblock the gate | 11:53 |
marios | rlandy|ruck: and add the conditional | 11:53 |
marios | rlandy|ruck: hmm... not sure though we will have a duplicate... though it should be the same content | 11:53 |
marios | which should not be a problem | 11:53 |
amoralej | marios, but the gates are not longer blocked, right? | 11:54 |
marios | amoralej: ah right yes rlandy|ruck merged a revert correct | 11:54 |
amoralej | you scared me :) | 11:54 |
marios | but i wonder... do we have delorean-deps in container build? in that case why did i need to explicitly enable it with https://opendev.org/openstack/tripleo-ci/src/commit/f052b61dbbd229e1776d3f34c2252ad636b7f78a/roles/tripleo-build-jobs-repos/tasks/load-vars.yaml#L16 | 11:55 |
marios | i.e. in that case it wasn't using the delorean-deps one but instead i had to explicitly enable the opstools repo | 11:56 |
amoralej | i'd say, if we don't have delorean-deps in containers builds, provider jobs whould have been failing as there were not opstools repo enabled | 11:57 |
marios | checking sthing sec | 11:57 |
amoralej | also, i wouldn't break those builds jobs with my change in delorean-deps ... | 11:57 |
arxcruz | rlandy|ruck hmmm, never saw this error, i might need more time to debug | 12:00 |
marios | rlandy|ruck: amoralej: in 9 we don't have a duplicate opstools repo only for 8 | 12:00 |
marios | amoralej: it seems we are relying on the delorean-deps for 9? | 12:00 |
marios | amoralej: rlandy|ruck: https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-9-push-master/3795657/etc/yum.repos.d/ | 12:00 |
marios | amoralej: rlandy|ruck: no 'quickstart-centos-opstools' repo defined inhttps://opendev.org/openstack/tripleo-quickstart/src/commit/2ce46f8205098327b0dbf2cec08d16bd52638296/config/release/tripleo-ci/CentOS-9/master.yml#L198 | 12:01 |
amoralej | mmm | 12:01 |
amoralej | that reminds me i need to move from buildlogs to mirror.centos.org | 12:02 |
marios | amoralej: so it is what you suggested i.e. only using the delorean-deps | 12:02 |
marios | rlandy|ruck: but then i wonder what that is meant to do https://opendev.org/openstack/tripleo-quickstart/src/commit/2ce46f8205098327b0dbf2cec08d16bd52638296/config/release/tripleo-ci/CentOS-9/master.yml#L198 dviroel|ruck we don't have quickstart-centos-opstools' repo defined | 12:02 |
marios | inhttps://opendev.org/openstack/tripleo-quickstart/src/commit/2ce46f8205098327b0dbf2cec08d16bd52638296/config/release/tripleo-ci/CentOS-9/master.yml#L198 | 12:02 |
soniya29 | arxcruz, kopecmartin, facing some network issues | 12:02 |
* marios stops spamming channel. | 12:02 | |
marios | rlandy|ruck: dviroel|ruck: let me know if you want me to poke more at this otherwise i'll let you propose something and we can discuss on gerrit | 12:03 |
amoralej | marios, but note it's using the variable to enable/disable the repo in delorean-deps.repo | 12:04 |
amoralej | https://github.com/openstack/tripleo-quickstart/blob/master/config/release/tripleo-ci/CentOS-9/master.yml#L198 | 12:04 |
amoralej | centos9-opstools that's the one in delorean-deps.repo | 12:04 |
marios | amoralej: ah i see it | 12:04 |
rlandy|ruck | reading back | 12:04 |
marios | amoralej: right makes more sense... but confirms what you suggest anyway we don't actually even need the extra definition in there just use deps | 12:05 |
amoralej | so if you check in some other job which is not building containers | 12:05 |
amoralej | that repo should be disabled | 12:05 |
amoralej | marios, yep, i'd keep it simple by using delorean-deps.repo | 12:05 |
rlandy|ruck | marios: can you join review time again? | 12:05 |
rlandy|ruck | on with dviroel|ruck | 12:05 |
marios | rlandy|ruck: sure on my way | 12:05 |
amoralej | then, if we want to be able to enable/disable, let's do similar to centos9 | 12:05 |
soniya29 | rlandy|ruck, ysandeep, raukadah, i am not able to join the meeting, facing network issues | 12:06 |
rlandy|ruck | amoralej: ack - we just discussed ... | 12:11 |
rlandy|ruck | step 1: make centos-8 release files match c9 | 12:11 |
rlandy|ruck | step 2: investigate dropping the disable/enable line | 12:11 |
rlandy|ruck | so there will no longer be a dup | 12:11 |
marios | to be clear amoralej we're doing what you suggested... 'make 8 like 9' means remove the extra opstools repo defined in the 8 files and just use the deps one | 12:12 |
amoralej | ok, nice | 12:12 |
amoralej | actually, mmm let me see if there is any other duplication | 12:12 |
amoralej | we may have more | 12:13 |
amoralej | centos9 lgtm | 12:14 |
amoralej | yeah, i think opstools is the only duplication | 12:15 |
rlandy|ruck | akahat|rover: https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/stream/ec13046365624fc29a4a65a17cc97a63?logfile=console.log - the last run on c9 kvm job failed | 12:21 |
rlandy|ruck | so the fix was not yet in master | 12:21 |
rlandy|ruck | we will have to track those hashes | 12:21 |
rlandy|ruck | maybe when octavia component promotes | 12:21 |
*** ysandeep is now known as ysandeep|break | 12:41 | |
rlandy|ruck | dviroel|ruck: akahat|rover: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/39240 Temp remove fs002 from c8 master criteria | 12:44 |
dviroel|ruck | rlandy|ruck: marios https://review.opendev.org/c/openstack/tripleo-quickstart/+/828522 - I trying to make all c8 files work with the same logic; deleted the other opstools repo; let me know if makes sense | 12:47 |
rlandy|ruck | dviroel|ruck: https://review.opendev.org/c/openstack/tripleo-quickstart/+/828522 - left comment | 12:47 |
rlandy|ruck | re: promoting-testing-hash files | 12:47 |
rlandy|ruck | the c9 ones do not have that clause | 12:47 |
rlandy|ruck | idk if they should or should not | 12:47 |
rlandy|ruck | marios can weigh in there | 12:48 |
rlandy|ruck | akahat|rover: thanks for putting in the fs039 rerun | 12:48 |
rlandy|ruck | akahat|rover: anything else we missing ruck/rover -wise in your morning? | 12:48 |
akahat|rover | rlandy|ruck, no. | 12:49 |
marios | rlandy|ruck: k will check the review dviroel|ruck | 12:49 |
dviroel|ruck | rlandy|ruck: yes, i'm thinking to make things follow the same logic in all files - not sure why we have in some files and other don't | 12:50 |
rlandy|ruck | mystery of the week | 12:50 |
rlandy|ruck | to opstools or not to opstools | 12:50 |
rlandy|ruck | akahat|rover: ah - octavia component should run next | 12:51 |
rlandy|ruck | we will see those new fixes clear then - I hope | 12:51 |
dviroel|ruck | rlandy|ruck: yeah, maybe doesn't have because is harmless to have it always enabled | 12:51 |
rcastillo | o/ | 13:05 |
rlandy|ruck | rcastillo: hey | 13:07 |
rlandy|ruck | rcastillo: master c9 promoted | 13:07 |
rlandy|ruck | and the ipa job should be passing now | 13:07 |
*** amoralej is now known as amoralej|lunch | 13:07 | |
rlandy|ruck | can you add the patch to get that voting/gating again? | 13:07 |
rlandy|ruck | rcastillo: ^^ | 13:07 |
rcastillo | rlandy|ruck: ack | 13:10 |
*** ysandeep|break is now known as ysandeep | 13:13 | |
*** dasm|off is now known as dasm | 13:17 | |
rcastillo | rlandy|ruck: https://review.opendev.org/c/openstack/tripleo-ci/+/828524 | 13:17 |
dasm | o/ | 13:17 |
rlandy|ruck | rcastillo: thank you | 13:27 |
rlandy|ruck | akahat|rover: hey ... can you help out with promoter question ... | 13:27 |
rlandy|ruck | https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/view/phase-1-pipelines/job/rdo_trunk-promote-wallaby-centos8-current-tripleo/88/ | 13:28 |
rlandy|ruck | passed | 13:28 |
rlandy|ruck | all | 13:28 |
akahat|rover | rlandy|ruck, yes .. | 13:28 |
rlandy|ruck | so we should have promoted wallaby c8 current-tripleo-rdo | 13:28 |
rlandy|ruck | Feb 6 | 13:28 |
rlandy|ruck | http://promoter.rdoproject.org/config/CentOS-8/ | 13:29 |
rlandy|ruck | akahat|rover: ah ... we need to remove the packstack jobs from criteria | 13:30 |
rlandy|ruck | hmm .. no | 13:30 |
rlandy|ruck | those ran | 13:30 |
akahat|rover | rlandy|ruck, weirdo-*-packstack jobs? | 13:30 |
rlandy|ruck | akahat|rover: no - I was wrong | 13:31 |
rlandy|ruck | the jobs look right | 13:31 |
rlandy|ruck | but we didn't promote | 13:31 |
rlandy|ruck | current-tripleo-rdo/2022-01-27 22:42 | 13:31 |
rlandy|ruck | why? | 13:31 |
rlandy|ruck | should have promoted feb 6th | 13:31 |
rlandy|ruck | akahat|rover: see what I am looking at | 13:32 |
rlandy|ruck | can you trace that promoter log? | 13:32 |
akahat|rover | rlandy|ruck, yes. looking in it. | 13:33 |
marios | tripleo ci o/ need reviews please https://review.opendev.org/c/openstack/tripleo-ci/+/828110 thank you | 13:34 |
rlandy|ruck | akahat|rover: ussuri as well | 13:40 |
rlandy|ruck | https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/view/phase-1-pipelines/job/rdo_trunk-promote-ussuri-centos8-current-tripleo/ | 13:40 |
rlandy|ruck | but we're not promoting | 13:40 |
rlandy|ruck | amoralej|lunch: hey ... can you help me understand the dlrn hash reporting here ... | 13:51 |
rlandy|ruck | https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/job/rdo-promote-get-hash-ussuri-centos8-current-tripleo/65/console | 13:51 |
rlandy|ruck | + export NEW_HASH=7d23a38590b5cf6750e96a41ca2f22ca | 13:52 |
rlandy|ruck | + NEW_HASH=7d23a38590b5cf6750e96a41ca2f22ca | 13:52 |
rlandy|ruck | https://trunk.rdoproject.org/centos8-ussuri/current-tripleo/delorean.repo.md5 | 13:52 |
rlandy|ruck | https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/job/tripleo-quickstart-promote-ussuri-current-tripleo-delorean-minimal/93/console | 13:53 |
rlandy|ruck | reports dlrn ... | 13:53 |
rlandy|ruck | dlrnapi --url https://trunk.rdoproject.org/api-centos8-ussuri --username <> report-result --job-id tripleo-quickstart-promote-ussuri-current-tripleo-delorean-minimal --agg-hash 665b36551e955e31e5ed4c82108b17bd | 13:54 |
rlandy|ruck | two diff hashes | 13:54 |
ysandeep | pojadhav, bhagyashris rlandy|ruck takashi is proposing patches in puppet related repos to move to c9. | 13:54 |
rlandy|ruck | nice | 13:54 |
ysandeep | pojadhav, rlandy|ruck bhagyashris marios fyi.. https://review.opendev.org/q/topic:tripleo-c9 for puppet related repos | 13:55 |
marios | thanks ysandeep | 13:56 |
*** amoralej|lunch is now known as amoralej | 13:59 | |
akahat|rover | rlandy|ruck, only promotion we got current-tripleo-rdo: http://promoter.rdoproject.org/promoter_logs/centos8_wallaby_2022-01-27T22:42.log-20220128 | 14:06 |
akahat|rover | for wallaby ^^ | 14:06 |
pojadhav | ysandeep, ack | 14:06 |
*** pojadhav is now known as pojadhav|brb | 14:06 | |
akahat|rover | and ussuri current-tripleo-rdo: http://promoter.rdoproject.org/promoter_logs/centos8_ussuri_2022-01-27T05:35.log-20220128 | 14:06 |
* akahat|rover leaving for the day. | 14:11 | |
*** akahat|rover is now known as akahat|PTO | 14:11 | |
arxcruz | rlandy|ruck regarding https://bugs.launchpad.net/tripleo/+bug/1960355 i believe it was a race condition, i check the logs and I see no error, also, the latest fs035 run fails in a complete different issue | 14:16 |
arxcruz | dviroel|ruck ^ | 14:16 |
arxcruz | https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-9-ovb-3ctlr_1comp-featureset035-master/8d6c815/logs/undercloud/var/log/tempest/stestr_results.html.gz | 14:16 |
arxcruz | oh, nevermind, the tests were removed | 14:17 |
dviroel|ruck | arxcruz: yep, skipping | 14:17 |
dviroel|ruck | skip or never promote again | 14:18 |
dviroel|ruck | arxcruz: yesterday i was comparing the latest success one with the first failure | 14:19 |
arxcruz | dviroel|ruck there's a server internal error | 14:19 |
dviroel|ruck | arxcruz: last success was still using ctrlplane ip | 14:19 |
arxcruz | probably a bug in keystone/keystonemiddleware | 14:19 |
arxcruz | dviroel|ruck but it's being able to do a post and create the token | 14:19 |
arxcruz | but it's being invalidated | 14:20 |
arxcruz | it seems to be an error in keystone service | 14:20 |
dviroel|ruck | arxcruz: yeah, i think that i saw it invalidating the token | 14:20 |
arxcruz | dviroel|ruck yup on the delete | 14:21 |
rlandy|ruck | frenzy_friday: https://github.com/openstack/tripleo-quickstart/tree/master/config/release | 14:21 |
rlandy|ruck | https://codesearch.opendev.org/?q=trunk.registry.rdoproject.org&i=nope&literal=nope&files=&excludeFiles=&repos= | 14:22 |
rlandy|ruck | https://review.rdoproject.org/codesearch/?q=trunk.registry.rdoproject.org&i=nope&files=&repos= | 14:22 |
dviroel|ruck | arxcruz: the request after 500 error | 14:22 |
dviroel|ruck | arxcruz: "500 POST https://[2001:db8:fd00:1000::5]:13000/v3/OS-OAUTH1/request_token" | 14:22 |
arxcruz | dviroel|ruck 2022-02-08 19:00:28.790 149 WARNING keystone.server.flask.application [req-8e2bc385-ce46-4eaf-a36c-ce8d9fc414a1 6f72adab6a0f4209935f9f2c53c64bef 1bd34489e9814280b3e27f11a32acc22 - default default] Invalid mapping id: 9dc88feb7d1e41cf8d4b060912c6785d: keystone.exception.ValidationError: Invalid mapping id: 9dc88feb7d1e41cf8d4b060912c6785d | 14:23 |
dviroel|ruck | arxcruz: this error comes after request_token, right? | 14:23 |
rlandy|ruck | https://opendev.org/openstack/tripleo-ci/src/branch/master/toci-quickstart/config/testenv/multinode-vexxhost.yml#L52 | 14:24 |
*** pojadhav|brb is now known as pojadhav | 14:24 | |
rlandy|ruck | command: "podman pull docker://trunk.registry.rdoproject.org/ceph/{{ item.split('/')[-1] }}" | 14:25 |
arxcruz | dviroel|ruck that's the thing, i can't tell you, there's not much info there | 14:28 |
marios | rlandy|ruck: ok to merge that ? (reviewed but wasnt sure what the story was on that) https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/39181/1#message-5cbc3efafcf8266b2a8923c51304a9d5a5812cd5 | 14:28 |
rlandy|ruck | meeting ... reading back | 14:29 |
rlandy|ruck | marios: yeah - idk what it will mess up tbh | 14:29 |
rlandy|ruck | I could hose some stuff here | 14:30 |
marios | rlandy|ruck: k wfing | 14:30 |
arxcruz | dviroel|ruck what I was able to find, in the same timestamp was this: | 14:31 |
arxcruz | dviroel|ruck 2022-02-08 19:00:30.293 117 DEBUG keystone.auth.core [req-8b5a104c-9bb6-463a-9e65-b8a2e1e7ba45 - - - - -] MFA Rules not processed for user `6f72adab6a0f4209935f9f2c53c64bef`. Rule list: `[]` (Enabled: `True`). check_auth_methods_against_rules /usr/lib/python3.9/site-packages/keystone/auth/core.py:438 | 14:31 |
arxcruz | and checking the code, it throws an exception if not found | 14:31 |
rlandy|ruck | marios: basically, we only have the option to have one file per release tested in ci.centos | 14:31 |
arxcruz | but i couldn't find the error 500 | 14:31 |
rlandy|ruck | there was no way for master on c8 and c9 | 14:31 |
rlandy|ruck | so that's basically an attempt at creating the same dir structure as tripleo-ci | 14:31 |
rlandy|ruck | so we have the option | 14:31 |
rlandy|ruck | https://review.opendev.org/c/openstack/tripleo-quickstart/+/828171 | 14:32 |
rlandy|ruck | marios: ^^ would require this to take effect | 14:33 |
marios | rlandy|ruck: ack adding to my reviews but it is still wip | 14:33 |
rlandy|ruck | let me unwip that | 14:33 |
rlandy|ruck | marios; the one you merge is just an extra var atm | 14:33 |
rlandy|ruck | so it should be ok | 14:34 |
dviroel|ruck | arxcruz: ack, please comment in LP - thanks | 14:35 |
marios | rlandy|ruck: ack | 14:36 |
rlandy|ruck | marios: https://review.opendev.org/c/openstack/tripleo-quickstart/+/828171 - updated commit message there | 14:36 |
marios | rlandy|ruck: thanks | 14:38 |
*** pojadhav is now known as pojadhav|dinner | 15:00 | |
rlandy|ruck | dviroel|ruck: hi - need anything? ... otherwise will continue looking into current-tripleo-rdo promotions | 15:01 |
rlandy|ruck | ysandeep: ^^ need anything for downstream? | 15:01 |
rlandy|ruck | dasm: let's touch base re: uefi boot ... | 15:02 |
rlandy|ruck | it's working on 17? | 15:02 |
rlandy|ruck | and we confirmed we are actually using that boot mode? | 15:02 |
rlandy|ruck | when we use the uefi ipxe boot image? | 15:02 |
dasm | rlandy|ruck: logs show it's using right boot type: uefi. | 15:03 |
rlandy|ruck | dasm++ nice | 15:03 |
dasm | but i haven't had a successful test run yet | 15:04 |
rlandy|ruck | ysandeep; ^^ thought about moving to that boot mode for 17? | 15:04 |
rlandy|ruck | dasm: next step is to try the same with fs001 wallaby job | 15:04 |
rlandy|ruck | c8 and c9 | 15:04 |
ysandeep | rlandy|ruck, need anything for downstream? -> please see pcci chat channel on internal | 15:04 |
rlandy|ruck | or master | 15:04 |
rlandy|ruck | looking | 15:05 |
dasm | ok. i'll add that and i'll see if it's running | 15:05 |
rlandy|ruck | dasm: awesome | 15:05 |
rlandy|ruck | if that works, | 15:06 |
rlandy|ruck | we will need to make some similar trigger arrangement like scenario010-kvm | 15:06 |
dasm | ok | 15:06 |
rlandy|ruck | where we trigger the job in downstream to run with upstream specified hash | 15:06 |
rlandy|ruck | Tengu: ^^ fyi | 15:07 |
rlandy|ruck | uefi boot testing in progress by dasm | 15:07 |
dasm | rlandy|ruck: are you talking about this one? "periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-baremetal-wallaby" ? | 15:10 |
rlandy|ruck | dasm: w/o baremetal - that's a component job | 15:11 |
dasm | k | 15:11 |
rlandy|ruck | just periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-wallaby | 15:11 |
rlandy|ruck | or c9 | 15:11 |
dasm | ack | 15:11 |
Tengu | rlandy|ruck: uefi boot failing here with downstream. | 15:16 |
dviroel|ruck | rlandy|ruck: no, nothing for now | 15:23 |
* dviroel|ruck was on lunch | 15:23 | |
ysandeep | rlandy|ruck, +1 to moving to uefi, But I am not in sync with progress dasm made.. If you want to meet and discuss - I am available/ share in tomorrow's scrum | 15:26 |
ysandeep | Tengu, did uefi boot finally worked for you on wallaby? | 15:27 |
Tengu | ysandeep: I didn't test wallaby, it was plain master. | 15:28 |
Tengu | ysandeep: but Kevin is working with wallaby on cs9 nowadays. | 15:28 |
Tengu | You want to ping him | 15:28 |
ysandeep | Tengu, ack.. thanks! | 15:29 |
Tengu | np | 15:29 |
dasm | ysandeep: sure. I'm still checking few things, but it seems to be working with small caveats. | 15:31 |
amoralej | rlandy|ruck, what's the question? | 15:36 |
rlandy|ruck | amoralej: it looks like the hash being reported in rdo-promote-get-hash-ussuri-centos8-current-tripleo is not the same hash reported on | 15:37 |
rlandy|ruck | in https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/job/tripleo-quickstart-promote-ussuri-current-tripleo-delorean-minimal/93/ | 15:37 |
amoralej | mmm lemme check those jobs | 15:37 |
rlandy|ruck | unless I am misreading this | 15:37 |
rlandy|ruck | + export NEW_HASH=7d23a38590b5cf6750e96a41ca2f22ca | 15:37 |
rlandy|ruck | vs | 15:37 |
ysandeep | dasm, great.. lets discuss more in tomorrow's scrum | 15:37 |
rlandy|ruck | what is reported | 15:38 |
rlandy|ruck | in dlrn | 15:38 |
amoralej | i need to page out stuff from my swap :) | 15:38 |
amoralej | +that comes from old days... | 15:38 |
rlandy|ruck | so we are not promoted | 15:38 |
dasm | ysandeep, sure! | 15:41 |
rlandy|ruck | dviroel|ruck: we can ping with card on #sec-interop | 15:43 |
*** ysandeep is now known as ysandeep|out | 15:43 | |
rlandy|ruck | the caesar there | 15:44 |
dviroel|ruck | rlandy|ruck: ack | 15:46 |
rlandy|ruck | rerunning failed octavia tests | 15:47 |
rlandy|ruck | amoralej: does that make sense? | 16:00 |
rlandy|ruck | trying to match if we are reporting all jobs on the same hash | 16:00 |
rlandy|ruck | echo export AGG_HASH=7d23a38590b5cf6750e96a41ca2f22ca | 16:01 |
rlandy|ruck | https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/job/weirdo-ussuri-centos8-promote-packstack-scenario001/65/consoleFull | 16:01 |
amoralej | rlandy|ruck, nop | 16:01 |
amoralej | i'm trying to understand | 16:01 |
rlandy|ruck | so the weirdo job reports on the right hash | 16:02 |
amoralej | so, we didn't get any promotion after that? | 16:02 |
rlandy|ruck | no ... | 16:02 |
rlandy|ruck | so I think all the weirdo jobs do the right thing - and report on the right hash | 16:02 |
amoralej | mmm | 16:02 |
rlandy|ruck | I think the quickstart job does not | 16:03 |
amoralej | so the get-hash jobs and the oooq one take the hashes independently | 16:03 |
amoralej | gimme some minutes please | 16:03 |
rlandy|ruck | sure | 16:05 |
rlandy|ruck | ok - so I think this could happen if we promote current-tripleo | 16:08 |
rlandy|ruck | while the job is running | 16:08 |
rlandy|ruck | 665b36551e955e31e5ed4c82108b17bd/2022-02-08 19:16 | 16:09 |
rlandy|ruck | 7d23a38590b5cf6750e96a41ca2f22ca/2022-02-06 04:23 | 16:09 |
rlandy|ruck | quickstart job pulls DELOREAN_URL | 16:09 |
rlandy|ruck | which can change | 16:10 |
amoralej | yes | 16:10 |
amoralej | i was about to say that | 16:10 |
amoralej | that's what happened | 16:10 |
amoralej | rlandy|ruck, now i have the doubt which repo was actually testing the job | 16:11 |
amoralej | in weirdo jobs we are passing the hash as a parameter to the job | 16:12 |
rlandy|ruck | ack | 16:12 |
rlandy|ruck | looking at how that gets done | 16:12 |
amoralej | so it tests and reposts in the same one even if there is a promotion in the middle | 16:12 |
amoralej | i'd saly... | 16:13 |
amoralej | say | 16:13 |
amoralej | in the oooq one i see it gets the agg hash at reporting time | 16:14 |
amoralej | from delorean.repo.md5, but i'm not sure how it picks the delorean-repo at build time | 16:14 |
*** pojadhav|dinner is now known as pojadhav | 16:14 | |
rlandy|ruck | yeah - trying to figure that out now | 16:16 |
rlandy|ruck | looking at those jobs | 16:16 |
* rlandy|ruck think the release file just resolves it | 16:17 | |
rlandy|ruck | checking repos | 16:17 |
amoralej | i'd say it used the good one | 16:17 |
amoralej | 7d23 | 16:17 |
amoralej | look at ok: [172.19.3.90] => {"attempts": 1, "changed": false, "commit_hash": null, "distro_hash": null, "dlrn_url": "https://trunk.rdoproject.org/centos8-ussuri/current-tripleo/delorean.repo.md5", "error": "", "extended_hash": null, "full_hash": "7d23a38590b5cf6750e96a41ca2f22ca", "success": true} | 16:17 |
amoralej | repo-setup : tripleo-get-hash current-tripleo | 16:18 |
amoralej | TASK [repo-setup : Set fact dlrn_hash->7d23a38590b5cf6750e96a41ca2f22ca] ******* | 16:18 |
rlandy|ruck | ok: [172.19.3.90] => { | 16:21 |
rlandy|ruck | "msg": [ | 16:21 |
rlandy|ruck | "the tq release is: ussuri", | 16:21 |
rlandy|ruck | "the distro is: centos8", | 16:21 |
rlandy|ruck | "dlrn_hash is: 7d23a38590b5cf6750e96a41ca2f22ca" | 16:21 |
rlandy|ruck | ] | 16:21 |
rlandy|ruck | } | 16:21 |
rlandy|ruck | but it reports on the newer hash | 16:21 |
amoralej | because to report | 16:21 |
amoralej | it pulls the .repo.md5 from current-tripleo | 16:21 |
rlandy|ruck | correct | 16:22 |
rlandy|ruck | it should take whatever was passed | 16:22 |
rlandy|ruck | ugh - so complicated | 16:22 |
amoralej | too much | 16:23 |
amoralej | complexity everywhere :) | 16:24 |
rlandy|ruck | https://github.com/rdo-infra/ci-config/blob/master/jenkins-v2/jobs/scripts/dlrn-api-report.sh | 16:24 |
rlandy|ruck | https://github.com/rdo-infra/ci-config/blob/master/jenkins-v2/jobs/scripts/dlrn-api-report.sh#L7 | 16:24 |
rlandy|ruck | AGG_HASH=`curl -L ${DELOREAN_URL}.md5` | 16:24 |
rlandy|ruck | ^^ actually that | 16:24 |
rlandy|ruck | looking for the var that is passed | 16:25 |
rlandy|ruck | HASH_FILE="$WORKSPACE/delorean_master_centos9_current_tripleo_hash" | 16:27 |
rlandy|ruck | Copied 1 artifact from "rdo-promote-get-hash-ussuri-centos8-current-tripleo" build number 65 | 16:31 |
rlandy|ruck | [EnvInject] - Injecting environment variables from a build step. | 16:31 |
rlandy|ruck | where does that go? | 16:31 |
rlandy|ruck | echo export INFO_URL="https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/job/${JOB_NAME}/${BUILD_NUMBER}/artifact/logs.html" >> ${WORKSPACE}/${BUILD_ID}/dlrnapi-varfile | 16:32 |
rlandy|ruck | ^^ weirdo jobs | 16:32 |
rlandy|ruck | Skipping script : . ${WORKSPACE}/${BUILD_ID}/dlrnapi-varfile | 16:33 |
rlandy|ruck | rm -rf ${WORKSPACE}/${BUILD_ID} | 16:33 |
rlandy|ruck | export JOB_SUCCESS="false" | 16:33 |
rlandy|ruck | export DLRNAPI_USER="ciuser" | 16:33 |
rlandy|ruck | # Report | 16:33 |
rlandy|ruck | # DLRNAPI_PASSWD is provided by a credential binding | 16:33 |
rlandy|ruck | cd weirdo | 16:33 |
rlandy|ruck | tox -e ansible-playbook -- -i ${WORKSPACE}/hosts playbooks/dlrn-api-report.yml | 16:33 |
rlandy|ruck | ^ that's how weirdo gets it right | 16:33 |
rlandy|ruck | ok - will have to edit our jobs somehow to do the same | 16:33 |
rlandy|ruck | <brain hurts> | 16:34 |
ykarel | rlandy|ruck, amoralej i recall i attempted to fix ^ long back with https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/10434 | 16:37 |
ykarel | it got abandoned as obsoleted, so likely similar can be used to fix it | 16:37 |
amoralej | wow 2017 :) | 16:39 |
* dviroel|ruck brb | 16:39 | |
amoralej | promote-newton :) | 16:39 |
ykarel | :D | 16:40 |
rlandy|ruck | ykarel: thanks - looking at that | 16:40 |
rlandy|ruck | and in all these years nobody bothered to revive it | 16:40 |
rlandy|ruck | might still be relevant wrt https://bugs.launchpad.net/tripleo/+bug/1859213, will check jobs. | 16:41 |
rlandy|ruck | lol | 16:41 |
rlandy|ruck | dviroel|ruck: woohoo https://review.rdoproject.org/zuul/stream/a30904dea33f4281bfd2c366ee90b6c6?logfile=console.log periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-master | 16:42 |
rlandy|ruck | success: true | 16:42 |
rlandy|ruck | master c8 promotion - here we come | 16:42 |
ykarel | yeah that's my comment, but don't recall what all happened that time | 16:44 |
marios | rlandy|ruck: when you have time please see https://review.opendev.org/c/openstack/tripleo-ci/+/828110 - I'll workflow it in the morning if it is still around then ;) or if there are comments | 16:45 |
ykarel | but seems this should work, but would need some adjustment due to introduction of agg_hash | 16:45 |
marios | rlandy|ruck: so we can start considering the child patch for c8 teardown by friday would be good | 16:45 |
rlandy|ruck | marios: ack - thanks | 16:46 |
marios | bhagyashris: thanks for updating the topic on takashi patches ;) | 16:47 |
*** ykarel is now known as ykarel|away | 16:48 | |
rlandy|ruck | https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-stable4-centos7 hmmm | 16:52 |
rlandy|ruck | rekicking | 16:52 |
*** marios is now known as marios|out | 16:53 | |
rlandy|ruck | lunch ... brb | 16:54 |
*** jpena is now known as jpena|off | 17:27 | |
dviroel|ruck | master c8 promoting :) | 17:31 |
rlandy|ruck | will revert the fs002 patch | 17:32 |
dviroel|ruck | rlandy|ruck: https://review.opendev.org/c/openstack/tripleo-ansible/+/828417 - merged | 17:33 |
dviroel|ruck | fs002 should be good now | 17:33 |
rlandy|ruck | ack - I'll revert after promote goes through | 17:33 |
rlandy|ruck | https://review.opendev.org/c/openstack/tripleo-ci/+/828524 | 17:59 |
rlandy|ruck | ^^ vote there pls | 17:59 |
rlandy|ruck | dviroel|ruck: ^^ | 17:59 |
rlandy|ruck | https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/38808 | 18:00 |
dviroel|ruck | rlandy|ruck: promoter failed? | 18:00 |
rlandy|ruck | another vote pls | 18:00 |
rlandy|ruck | didn't promote | 18:01 |
rlandy|ruck | looking | 18:01 |
rlandy|ruck | 2022-02-09 17:44:25,878 2013128 ERROR promoter Candidate hash 'aggregate: 6fdbde2576bd48e3e9624ac5a1dd61f8, commit: a9058bb68beff87f08a3462163d25dd45b019cc2, distro: df726a5c620cd1d0b5dff471247268382bc0db5b, extended: None, component: validation, timestamp: 1644383082': client registries_client FAILED promotion attempt to current-tripleo | 18:02 |
rlandy|ruck | 2022-02-09 17:44:25,878 2013128 ERROR promoter Failed to promote overcloud images | 18:02 |
rlandy|ruck | Traceback (most recent call last): | 18:02 |
rlandy|ruck | -02-09 17:44:25,874 2013128 ERROR promoter Command '['env', 'ANSIBLE_LOG_PATH=/home/promoter/web/promoter_logs/container-push/20220209-165207.log', 'ANSIBLE_DEBUG=False', 'ansible-playbook', '-v', '-e', '@/tmp/tmpv840s1_1.yaml', '/home/promoter/ci-config/ci-scripts/container-push/container-push.yml']' returned non-zero exit status 2. | 18:02 |
rlandy|ruck | Traceback (most recent call last): | 18:02 |
dviroel|ruck | [localhost]: FAILED! => {"censored": "the output has been hidden due to the fact that 'no_log: true' was specified for this result", "changed": true} | 18:03 |
dviroel|ruck | love that one ^ | 18:03 |
rlandy|ruck | lol | 18:05 |
rlandy|ruck | it keeps the mystery alive | 18:05 |
rlandy|ruck | train8 is promoting now | 18:05 |
rlandy|ruck | should try again | 18:05 |
rlandy|ruck | then we will see if it's a real error | 18:05 |
rlandy|ruck | leaving fs002 in skip until promotion is done | 18:06 |
dviroel|ruck | ack | 18:06 |
rlandy|ruck | woohoo 17 promoted | 18:11 |
*** amoralej is now known as amoralej|off | 18:17 | |
dasm | brb | 18:44 |
rlandy|ruck | https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/828115 | 18:52 |
rlandy|ruck | dviroel|ruck: ^^ one more - pls w+ if you agree | 18:52 |
rlandy|ruck | ugh master promotion failed agin | 18:54 |
rlandy|ruck | again | 18:54 |
rlandy|ruck | dviroel|ruck: ^^ real issue now | 18:54 |
rlandy|ruck | looking | 18:54 |
rlandy|ruck | http://promoter.rdoproject.org/promoter_logs/container-push/20220209-182943.log | 18:54 |
rlandy|ruck | now to find the one image that failed to promote | 18:56 |
dviroel|ruck | rlandy|ruck: ack - reviewing code - will help on debug in a min | 18:59 |
rlandy|ruck | dviroel|ruck: no worries - I got a little time before next meeting | 18:59 |
rlandy|ruck | so looking | 18:59 |
rlandy|ruck | will pass back to you then | 19:00 |
* dasm is back | 19:20 | |
rlandy|ruck | 6fdbde2576bd48e3e9624ac5a1dd61f8 - no tag on that in openstack-ceilometer-central | 19:20 |
rlandy|ruck | https://hub.docker.com/r/tripleomaster/openstack-ceilometer-central/tags | 19:45 |
rlandy|ruck | it is there now | 19:45 |
dviroel|ruck | rlandy|ruck: great :) | 20:00 |
rlandy|ruck | on promoter | 20:00 |
rlandy|ruck | trying to find unsanitized logs | 20:01 |
rlandy|ruck | meeting time | 20:01 |
rlandy|ruck | looks like it's pushing again | 20:02 |
dasm | rlandy|ruck: unfortunately i called shots too quickly. all tests running with uefi are failing due to issues with introspection of provisioned bare metal machines. It probably means they're just not starting correctly. Right now I'm going through logs and searching for some reliable information. | 20:11 |
rlandy|ruck | k | 20:12 |
rlandy|ruck | dviroel|ruck: hey - out of meeting | 20:46 |
dviroel|ruck | rlandy|ruck: o/ | 20:47 |
rlandy|ruck | dviroel|ruck: any magic insights | 20:47 |
rlandy|ruck | still staring at nothing useful atm | 20:48 |
dviroel|ruck | rlandy|ruck: https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-centos-7-train-containers-build-push&project=openstack/tripleo-ci | 20:48 |
dviroel|ruck | real issue | 20:48 |
* dviroel|ruck was fixing some molecule jobs in tripleo-ansible too | 20:49 | |
rlandy|ruck | | ImportError: No module named collections_abc | 20:51 |
dviroel|ruck | virtualenv issue in py2.7 | 20:51 |
rlandy|ruck | ah ok | 20:51 |
rlandy|ruck | was staring at master promotion logs | 20:51 |
dviroel|ruck | rlandy|ruck: my guess - trying to see which version is | 20:52 |
rlandy|ruck | but can't see the issue yet | 20:52 |
rlandy|ruck | ok - let's bug the c7 one | 20:52 |
rlandy|ruck | you can pass it on to arx tomorrow | 20:52 |
dviroel|ruck | yeah, does not reproduce in my vm | 20:58 |
dviroel|ruck | http://mirror.regionone.vexxhost-nodepool-tripleo.rdoproject.org/centos/7/updates/x86_64/Packages/python-virtualenv-15.1.0-5.el7_9.noarch.rpm | 21:06 |
dviroel|ruck | same version that i installed here | 21:06 |
rlandy|ruck | ugh - still no idea why container push is failing | 21:13 |
rlandy|ruck | no obvious error | 21:14 |
rlandy|ruck | 2022-02-09 20:29:21,395 p=2147676 u=promoter n=ansible | TASK [containers-promote : Tag and push images to registry] ******************** | 21:14 |
dviroel|ruck | did you enabled the logs? | 21:15 |
rlandy|ruck | was looking if there was a fuller copy elsewhere but I don't see it | 21:24 |
rlandy|ruck | if I enable, it will spew all the info they want to hide to public | 21:25 |
dviroel|ruck | make sense, we would need to remove this log from public access and enable the log then | 21:28 |
rlandy|ruck | https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/container-push/container-push.yml#L68 | 21:30 |
rlandy|ruck | # Setting no_log as this can potentially expose passwords | 21:30 |
rlandy|ruck | dviroel|ruck: going to leave a note for bhagyashris to remove no_log on other promoter server | 21:30 |
rlandy|ruck | promote master | 21:31 |
rlandy|ruck | and store logs somewhere we can see them | 21:31 |
rlandy|ruck | https://github.com/rdo-infra/ci-config/blob/4ff0a90c9de3002c64707d051b4ed9d77dd41ff8/ci-scripts/container-push/roles/containers-promote/tasks/registry-push.yml#L56 | 21:32 |
rlandy|ruck | ^^ failing piece | 21:32 |
rlandy|ruck | but idk why | 21:32 |
* dasm is leaving for now. Be back later. | 21:33 | |
* dasm dasm|bbl | 21:33 | |
dviroel|ruck | rlandy|ruck: ack, seems to be a good idea - hard to say what is happening - master c8 is refusing to promote in all ways it can | 21:35 |
rlandy|ruck | it's a revolution | 21:35 |
rlandy|ruck | https://hub.docker.com/r/tripleomaster/nop/tags | 21:39 |
rlandy|ruck | dviroel|ruck: interesting | 21:39 |
rlandy|ruck | don;t know where that came from | 21:39 |
rlandy|ruck | https://github.com/rdo-infra/ci-config/search?q=nop | 21:41 |
dviroel|ruck | oh, just to validate login | 21:44 |
rlandy|ruck | honestly idk | 21:46 |
rlandy|ruck | need to stare at something else now | 21:46 |
dviroel|ruck | rlandy|ruck: going out for today - tty tomorrow | 22:32 |
*** dviroel|ruck is now known as dviroel|out | 22:33 | |
rlandy|ruck | dviroel|out: sure good night | 22:36 |
*** rlandy|ruck is now known as rlandy|out | 23:23 |
Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!