*** marios is now known as marios|ruck | 05:38 | |
*** pojadhav- is now known as pojadhav | 06:24 | |
*** jpena|off is now known as jpena | 07:00 | |
marios|ruck | chandankumar: quick question https://review.opendev.org/c/openstack/tripleo-quickstart/+/803134/2#message-b9756e12bf60266583a5067a2298ffd5e876e1f1 | 07:01 |
---|---|---|
marios|ruck | chandankumar: but otherwise see previous comment are we waiting for testing or good to go? otherwise why cant we put that into the release files per ^^ | 07:02 |
zbr | sshnaidm_: if you could release a new version of podman collection it would be great. | 07:21 |
zbr | with bit of luck, that is all we need to finish switching from docker to podman on molecule scenarios | 07:22 |
chandankumar | ykarel: hello, is it possible to test this change https://review.opendev.org/c/openstack/tripleo-quickstart/+/803134 via testproject? | 07:32 |
ykarel | chandankumar, no not possible with testproject as that's jenkins jobs | 07:35 |
ykarel | can do some tests with jenkins hack, but the change should be good to go | 07:35 |
chandankumar | marios|ruck: ^^ Done, thanks :-) | 07:37 |
marios|ruck | chandankumar: ack | 07:42 |
ykarel | chandankumar, i see https://ci.centos.org/job/tripleo-quickstart-gate-master-delorean-full-featureset052/ too failing | 07:47 |
chandankumar | hmm | 07:51 |
chandankumar | this one failed at Build and push container images to the local registry | 07:52 |
chandankumar | if I see here https://artifacts.ci.centos.org/rdo/jenkins-tripleo-quickstart-gate-master-delorean-full-featureset052-1099/172.19.3.93/home/stack/builder-undercloud.log | 07:52 |
chandankumar | tripleo-operator-ansible got installed | 07:52 |
ykarel | chandankumar, actually it's failing while looking for that role on virthost | 07:53 |
ykarel | so likely need more fixes your patch | 07:54 |
ykarel | as similar error is in promotion pipeline, | 07:54 |
ykarel | can hold your patch | 07:54 |
zbr | marios|ruck: chandankumar: can we please do https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/792652 ? | 07:57 |
zbr | https://review.opendev.org/c/openstack/tripleo-repos/+/800462 is also waiting for your reviews. | 07:58 |
chandankumar | ykarel: Do you have logs from promotion pipeline? | 07:59 |
ykarel | https://artifacts.ci.centos.org/rdo/jenkins-tripleo-quickstart-promote-wallaby-current-tripleo-delorean-minimal-24/console.log | 07:59 |
chandankumar | marios|ruck: please remove +w from above patch | 08:07 |
chandankumar | more fix is needed for standalone | 08:07 |
*** sshnaidm_ is now known as sshnaidm | 08:20 | |
sshnaidm | zbr, yeah, will do today | 08:25 |
marios|ruck | chandankumar: which one | 08:26 |
chandankumar | marios|ruck: sorry just updated the patch | 08:26 |
marios|ruck | ack | 08:26 |
*** ykarel is now known as ykarel|lunch | 08:34 | |
sshnaidm | zbr, before release, need to fix: https://github.com/ansible-community/molecule-podman/pull/46 | 09:09 |
zbr | sshnaidm: done, merged. | 09:20 |
zbr | sshnaidm: https://github.com/ansible-community/molecule-podman/issues/45 could also prove a major issue but is not a blocker. use of syncronize is very popular. | 09:22 |
sshnaidm | zbr, yeah, working on this.. | 09:24 |
sshnaidm | zbr, after this pr is merged, I'll release: https://github.com/containers/ansible-podman-collections/pull/279 | 09:25 |
sshnaidm | need to wait for tripleo jobs :D | 09:25 |
sshnaidm | zbr, don't you have tripleo-ansible molecule jobs in molecule-podman repo? | 09:25 |
zbr | nope, there is nothing specific to openstack in molecule & drivers. | 09:28 |
zbr | the only exception is for linter where we have an "eco"-system pipeline that tests outcomes while running on a set of 3rd party repositories, some of them are openstack/tripleon ones https://github.com/ansible-community/ansible-lint/blob/master/playbooks/eco.yml#L8-L37 | 09:29 |
zbr | i would prefer to keep it simple, especially as we use GHA, not zuul. Only two projects have a 3rd party zuul due to limitations on GHA (molecule-libvirt and molecule-vagrant) | 09:30 |
zbr | it should not matter, but if we find a bug that affects us, we can easily adapt the tests and improve them. | 09:31 |
zbr | marios|ruck: do we have known blockers or should I be confident to use retry when getting what looks like random failure? | 09:32 |
marios|ruck | zbr: link? | 09:32 |
marios|ruck | zbr: not aware of gate issues today | 09:32 |
zbr | https://review.opendev.org/c/openstack/tripleo-ansible/+/787767 got one failure, looks random to me. | 09:33 |
marios|ruck | zbr: no looks like new issue | 09:34 |
marios|ruck | zbr: i will file a bug but need to dig a bit | 09:34 |
marios|ruck | zbr: i see two so far similar fail https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ci-centos-8-content-provider | 09:34 |
marios|ruck | zbr: 2021-08-02 08:25:05 | fatal: unable to access 'https://github.com/rdo-packages/tripleo-ansible-distgit.git/': Failed to connect to github.com port 443: Connection timed out | 09:35 |
marios|ruck | zbr: go ahead recheck lets see | 09:39 |
zbr | sshnaidm: stuff like ^ is why I would oppose adding tripleo jobs to molecule ;) | 09:42 |
sshnaidm | zbr, you don't like challenges | 09:57 |
zbr | in fact I like those where there is slight chance for me to win | 09:58 |
marios|ruck | soniya29|rover: o/ i am going afk for a few in a couple mins | 10:06 |
sshnaidm | zbr, in GHA: fatal: unable to access 'https://github.com/ansible/ansible.git/': Failed to connect to github.com port 443: Connection timed out | 10:09 |
sshnaidm | so tripleo is not related :) | 10:09 |
marios|ruck | soniya29|rover: fyi https://bugs.launchpad.net/tripleo/+bug/1938684 not sure if it is transient yet | 10:09 |
sshnaidm | marios|ruck, ^^ seems like problem with github | 10:09 |
*** ykarel|lunch is now known as ykarel | 10:10 | |
marios|ruck | sshnaidm: yeah just dont know how widespread yet if transient etc | 10:11 |
* marios|ruck back in ~40 mins | 10:13 | |
marios|ruck | thanks soniya|rover | 11:03 |
soniya|rover | marios|ruck, ack | 11:11 |
marios|ruck | anyone know what's up with SKIPPED @ https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-main ? | 11:14 |
*** jpena is now known as jpena|lunch | 11:30 | |
*** rlandy is now known as rlandy|ruck | 11:32 | |
rlandy|ruck | marios|ruck: soniya|rover: hi - how are things today? any emergencies? | 11:34 |
marios|ruck | rlandy|ruck: something wrong with promoter master | 11:41 |
rlandy|ruck | marios|ruck: last week it was failing on manifest push | 11:41 |
marios|ruck | rlandy|ruck: http://10.0.148.74/promoter_logs/centos8_master.log failed promotion 2021-08-02 11:28:20,193 480062 ERROR promoter TASK [containers-promote : Fail if there are missing containers] *************** | 11:41 |
rlandy|ruck | still the issue? | 11:41 |
marios|ruck | rlandy|ruck: tried to promote but fails pushing containers | 11:41 |
* rlandy|ruck looks | 11:41 | |
marios|ruck | rlandy|ruck: "no such manifest: | 11:41 |
marios|ruck | rlandy|ruck: that what you meant? T^^^ | 11:41 |
rlandy|ruck | idk if it's the same - looking at logs | 11:42 |
marios|ruck | rlandy|ruck: commented on gchat and ccd you... that and also master line skipped all the things some infra issue? | 11:42 |
rlandy|ruck | akahat: ^^ did you turn off manifest push> | 11:42 |
rlandy|ruck | k | 11:43 |
marios|ruck | rlandy|ruck: also gate blocker? transient? filed https://bugs.launchpad.net/tripleo/+bug/1938684 | 11:43 |
marios|ruck | rlandy|ruck: but otherwise, yeah living the dream \o/ :D | 11:43 |
* rlandy|ruck looks at promoter and then the line | 11:43 | |
marios|ruck | rlandy|ruck: hoping that is not a gate bloker and that it is transient ^^^ | 11:43 |
rlandy|ruck | woohoo - another week of fun | 11:43 |
akahat | rlandy|ruck, yes. i've turned it off. | 11:43 |
rlandy|ruck | marios|ruck: ^^ ok - then we have another issue | 11:44 |
rlandy|ruck | looking | 11:44 |
rlandy|ruck | http://10.0.148.74/promoter_logs/container-push/20210802-112540.log | 11:46 |
rlandy|ruck | marios|ruck: ^^ pushing containers now | 11:46 |
rlandy|ruck | transient? | 11:47 |
akahat | is this hash exists in registry: b18a7a08a93f7cf4643d1fd721d96583 | 11:47 |
rlandy|ruck | docker.io/tripleomaster/openstack-designate-producer:6392edc0eaefb612986de9809a0195ba | 11:47 |
akahat | it shows ""no such manifest: trunk.registry.rdoproject.org/tripleomaster/openstack-octavia-health-manager:b18a7a08a93f7cf4643d1fd721d96583"" | 11:47 |
akahat | rlandy|ruck, is is right url: http://trunk.registry.rdoproject.org:8443/oapi/v1/namespaces/tripleomaster/imagestreamtag/ | 11:48 |
marios|ruck | rlandy|ruck: looking | 11:48 |
marios|ruck | rlandy|ruck: early on in that file there is still errror so didn't scroll to bottom i mean there http://10.0.148.74/promoter_logs/container-push/20210802-112540.log | 11:48 |
marios|ruck | rlandy|ruck: lets see if it manages to promote then thanks for checking b18a7a08a93f7cf4643d1fd721d96583 | 11:49 |
rlandy|ruck | 2021-08-02 11:28:20,067 p=480121 u=promoter n=ansible | TASK [containers-promote : Fail if there are missing containers] *************** | 11:50 |
rlandy|ruck | 2021-08-02 11:28:20,085 p=480121 u=promoter n=ansible | fatal: [localhost]: FAILED! => {"changed": false, "msg": "There are missing containers"} | 11:50 |
rlandy|ruck | I see that | 11:50 |
rlandy|ruck | "docker.io/tripleomaster/openstack-zaqar-wsgi:current-tripleo-rdo"} | 11:51 |
rlandy|ruck | pushing current-tripleo-rdo | 11:51 |
rlandy|ruck | trunk.registry.rdoproject.org/tripleomaster/openstack-rsyslog:b18a7a08a93f7cf4643d1fd721d96583", "no such manifest: trunk.registry.rdoproject.org/tripleomaster/openstack-unbound:b18a7a08a93f7cf4643d1fd721d96583"], | 11:52 |
rlandy|ruck | claims it's missing all the manifests from b18a7a08a93f7cf4643d1fd721d96583 | 11:56 |
rlandy|ruck | checking previous logs | 11:57 |
rlandy|ruck | sshnaidm: thank you - cockpit looks sane again | 11:57 |
rlandy|ruck | marios|ruck: testproject in the failed fs001 in master line | 11:59 |
rlandy|ruck | hopefully this new hash will then promote | 11:59 |
rlandy|ruck | 2021-08-02 10:54:41 | 2021-08-02 10:54:41.378 216382 ERROR openstack [-] Message queue for ephemeral heat not created in time.: tripleoclient.exceptions.HeatPodMessageQueueException: Message queue for ephemeral heat not created in time.[00m | 12:00 |
rlandy|ruck | ^^ watch that | 12:00 |
marios|ruck | rlandy|ruck: the latest master periodic line is all skipped https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-main | 12:01 |
marios|ruck | rlandy|ruck: i noted in the gchat, maybe some issue | 12:01 |
rlandy|ruck | openstack-periodic-integration-main | 12:01 |
rlandy|ruck | it's running right now | 12:01 |
rlandy|ruck | marios|ruck: ^^ am I missing something??? | 12:02 |
marios|ruck | ratailor__: Invalid dateSKIPPED @ https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-main | 12:04 |
marios|ruck | ratailor__: sorry for noise was meant for someone else | 12:04 |
ratailor__ | marios|ruck, np :) | 12:04 |
marios|ruck | rlandy|ruck: ^^ i see it running now https://review.rdoproject.org/zuul/status if that is what you meant yea | 12:04 |
rlandy|ruck | that was for me :) | 12:04 |
rlandy|ruck | yes | 12:04 |
rlandy|ruck | just reran fs001 | 12:04 |
marios|ruck | rlandy|ruck: but donno why previous run is skipped | 12:04 |
rlandy|ruck | probably infra | 12:05 |
rlandy|ruck | oh | 12:05 |
rlandy|ruck | gerrit was out | 12:05 |
rlandy|ruck | maybe some result | 12:05 |
rlandy|ruck | 2021-08-02 11:46:52,437 p=481816 u=promoter n=ansible | PLAY RECAP ********************************************************************* | 12:08 |
rlandy|ruck | 2021-08-02 11:46:52,437 p=481816 u=promoter n=ansible | localhost : ok=30 changed=18 unreachable=0 failed=0 skipped=8 rescued=0 ignored=0 | 12:08 |
rlandy|ruck | it supposed to have finished pushing | 12:08 |
rlandy|ruck | akahat: ^^ | 12:09 |
rlandy|ruck | if manifest push is off | 12:09 |
rlandy|ruck | and the previous completed | 12:09 |
rlandy|ruck | what's it doing now | 12:09 |
rlandy|ruck | oh nvm | 12:10 |
rlandy|ruck | tag | 12:10 |
rlandy|ruck | I see | 12:10 |
akahat | :) | 12:11 |
rlandy|ruck | weshay|ruck: hey | 12:16 |
rlandy|ruck | weshay|ruck: so you want to run the CIX meeting or should I? | 12:16 |
weshay|ruck | I can do it | 12:16 |
rlandy|ruck | ok | 12:16 |
marios|ruck | brb for cix call o/ weshay|ruck welcome back ;) | 12:17 |
chandankumar | weshay|ruck: welcome back sir, Hope you had a great vacation :-) | 12:18 |
weshay|ruck | hola hola everyone :) | 12:21 |
Tengu | he's back! | 12:22 |
*** jpena|lunch is now known as jpena | 12:28 | |
rlandy|ruck | 2021-08-02 12:19:19,098 514147 INFO promoter Qcow promote 'aggregate: 6392edc0eaefb612986de9809a0195ba, commit: b38561c845156f014892d3910972faf38d560cec, distro: ae83045246b1725cbc365b60068f180c76cda314, extended: None, component: validation, timestamp: 1627581069' to current-tripleo-rdo: Successful promotion | 12:30 |
rlandy|ruck | getting there | 12:30 |
rlandy|ruck | akahat: 2021-08-02 12:32:59,550 p=548357 u=promoter n=ansible | fatal: [localhost]: FAILED! => {"changed": false, "msg": "There are missing containers"} | 12:35 |
rlandy|ruck | ^^ still failing master | 12:35 |
akahat | rlandy|ruck, looking | 12:35 |
akahat | rlandy|ruck, i'm not able to find this hash: b18a7a08a93f7cf4643d1fd721d96583 in the rdo registry | 12:38 |
akahat | could you please recheck it. | 12:38 |
rlandy|ruck | maybe it got purged | 12:38 |
* rlandy|ruck checks | 12:38 | |
marios|ruck | ykarel: can you check https://trello.com/c/OmLNBwGT/2041-cixlp1938283tripleociproa-master-ci-jobs-failing-randomly-as-pcs-resource-operations-actions-times-out please can we close this out ? | 12:45 |
ykarel | marios|ruck, ack looking | 12:47 |
marios|ruck | ykarel: thanks just discussed in cix call can you update with some comment and we can close | 12:48 |
ykarel | sure updating | 12:49 |
rlandy|ruck | akahat: what registry link are you checking? | 12:52 |
akahat | rlandy|ruck, https://trunk.registry.rdoproject.org:8443/oapi/v1/namespaces/tripleomaster/imagestreamtags | 12:52 |
rlandy|ruck | :6392edc0eaefb612986de9809a0195ba | 12:54 |
rlandy|ruck | that one - previous | 12:54 |
rlandy|ruck | c7581691e115f2708651475b1d91a8e1 | 12:55 |
rlandy|ruck | but yeah - no b18a7a08a93f7cf4643d1fd721d96583 | 12:55 |
rlandy|ruck | idk what happened to that hash | 12:55 |
rlandy|ruck | if the current run works | 12:56 |
rlandy|ruck | we can get that promoted | 12:56 |
* rlandy|ruck checks for current hash | 12:56 | |
rlandy|ruck | f50dfd1b5edf85ec026b5ae4a32b4219 hash is there | 12:57 |
rlandy|ruck | marios|ruck: akahat: ^^ one running now | 12:57 |
rlandy|ruck | so if we get current tests to pass that are in rerun, that hash should promote | 12:58 |
akahat | this hash is uploaded at 29 | 12:58 |
akahat | https://trunk.rdoproject.org/centos8-master/tripleo-ci-testing/b1/8a/b18a7a08a93f7cf4643d1fd721d96583/ | 12:58 |
akahat | ah.. great | 12:58 |
rlandy|ruck | f50dfd1b5edf85ec026b5ae4a32b4219 hoping for | 13:00 |
rlandy|ruck | zbr: scrum time' | 13:01 |
chandankumar | https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/790926 | 13:07 |
dviroel | reviews on these please: | 13:14 |
dviroel | first: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34612 | 13:14 |
dviroel | then: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34767 | 13:14 |
dviroel | the second patch adds a molecule job that test the roles added | 13:15 |
marios|ruck | https://bugs.launchpad.net/tripleo/+bug/1938684 | 13:16 |
marios|ruck | rlandy|ruck: ^^^^ | 13:16 |
rlandy|ruck | zbr: https://27d56bbd07be34be0cc1-9b6cbe136e0e918c88bba727d81cded7.ssl.cf1.rackcdn.com/787502/9/check/tox-ansible-test-sanity/db8ab1e/job-output.txt | 13:26 |
marios|ruck | b18a7a08a93f7cf4643d1fd721d96583 fails container push missing containers ? http://10.0.148.74/promoter_logs/container-push/20210802-112540.log | 13:34 |
marios|ruck | rlandy|ruck: ^^ | 13:34 |
rlandy|ruck | soniya29|rover: f50dfd1b5edf85ec026b5ae4a32b4219 | 13:38 |
marios|ruck | soniya29|rover: rlandy|ruck: https://docs.openstack.org/tripleo-docs/latest/ci/chasing_promotions.html#specifying-a-particular-hash | 13:39 |
zbr | please review and +W https://review.opendev.org/c/openstack/tripleo-repos/+/800462 | 13:48 |
soniya|rover | rlandy|ruck, the hash- f50dfd1b5edf85ec026b5ae4a32b4219 which you specified above, is it the same one which we want to specify by dlrn_hash_tag parameter for fs35 master with the test-project? | 14:13 |
rlandy|ruck | soniya|rover: yes | 14:14 |
soniya|rover | rlandy|ruck, okay, thanks | 14:15 |
*** ykarel is now known as ykarel|away | 14:16 | |
rlandy|ruck | soniya|rover: marios|ruck: testproject added for retry_limit job periodic-tripleo-ci-centos-8-scenario000-multinode-oooq-container-updates-ussuri | 14:20 |
rlandy|ruck | will try to get ussuri to promote today | 14:21 |
soniya|rover | rlandy|ruck, ack | 14:21 |
marios|ruck | rlandy|ruck: thanks | 14:29 |
sshnaidm | zbr, released | 14:49 |
soniya|rover | rlandy|ruck, marios|ruck, testproject added for job periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-master | 14:50 |
soniya|rover | rlandy|ruck, marios|ruck, https://review.rdoproject.org/r/c/test-project/+/34799 | 14:50 |
rlandy|ruck | soniya|rover: thanks - looking | 14:51 |
rlandy|ruck | soniya|rover: so ... https://review.rdoproject.org/r/c/test-project/+/34799 - | 14:53 |
rlandy|ruck | https://review.rdoproject.org/r/c/test-project/+/34799/1/.zuul.yaml is correct but your review is not showing on https://review.rdoproject.org/zuul/status | 14:53 |
rlandy|ruck | your repo is wrong | 14:53 |
rlandy|ruck | testproject - no hash | 14:54 |
rlandy|ruck | Repo | Branch test-project | master | 14:54 |
rlandy|ruck | I mean no dash :) | 14:54 |
rlandy|ruck | repo testproject | 14:54 |
soniya|rover | rlandy|ruck, okay, thanks :) | 14:56 |
marios|ruck | rlandy|ruck: wana talk promotion criteria sorry just finished call | 15:04 |
marios|ruck | rlandy|ruck: fs35 isn't even in criteria? | 15:06 |
marios|ruck | rlandy|ruck: at least on the promoter | 15:06 |
marios|ruck | #- periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-master | 15:06 |
marios|ruck | cd ci-config/ci-scripts/dlrnapi_promoter/config_environments/rdo/CentOS-8 | 15:07 |
rlandy|ruck | marios|ruck: either way - let's get a run on it | 15:07 |
marios|ruck | rlandy|ruck: k | 15:07 |
marios|ruck | rlandy|ruck: am gonna copy paste master there https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/dlrnapi_promoter/config_environments/rdo/CentOS-8/master.yaml#L14 now is that OK ? | 15:07 |
marios|ruck | rlandy|ruck: it will include 36 | 15:07 |
marios|ruck | 35 | 15:07 |
rlandy|ruck | sec | 15:07 |
marios|ruck | rlandy|ruck: https://github.com/rdo-infra/ci-config/blob/4a68c3b7b062f9768410f5cafa171ff256dc0972/ci-scripts/dlrnapi_promoter/config_environments/rdo/CentOS-8/master.yaml#L18 | 15:08 |
rlandy|ruck | marios|ruck: hold on | 15:08 |
rlandy|ruck | that will include fs035 | 15:08 |
rlandy|ruck | promoter is still trying to push commit: b38561c845156f014892d3910972faf38d560cec | 15:09 |
marios|ruck | rlandy|ruck: ok then | 15:10 |
marios|ruck | rlandy|ruck: bump tomorrow | 15:10 |
marios|ruck | np | 15:10 |
rlandy|ruck | on tmux | 15:10 |
rlandy|ruck | marios|ruck: so I think we should restart promoter with current config | 15:10 |
marios|ruck | rlandy|ruck: i am on tmux | 15:11 |
marios|ruck | rlandy|ruck: switching pvt sec | 15:11 |
rlandy|ruck | fs035 is in here | 15:11 |
soniya29|rover | rlandy|ruck, marios|ruck, https://review.rdoproject.org/r/c/testproject/+/34800 | 15:18 |
sshnaidm | zbr, does molecule take collection from galaxy? or from rpm? | 15:19 |
rlandy|ruck | soniya29|rover; +1 - thanks | 15:19 |
*** jpena is now known as jpena|off | 15:25 | |
soniya29|rover | rlandy|ruck, :) | 15:31 |
zbr | sshnaidm: galaxy only but only when missing or outdated | 15:34 |
soniya29|rover | rlandy|ruck, marios|ruck, anything else where I can help, please let me know | 15:34 |
sshnaidm | zbr, ok, because rpm is not ready yet | 15:34 |
zbr | the condition for required collection is done in such way that i should not attempt to use galaxy if is already present | 15:35 |
rlandy|ruck | soniya29|rover: think we are ok for now ... but if you have time, we want to check which components have not promoted and may need help/bugs | 15:35 |
rlandy|ruck | so all components train -> master | 15:35 |
soniya29|rover | rlandy|ruck, okay | 15:37 |
marios|ruck | k thanks soniya29|rover | 15:59 |
marios|ruck | rlandy|ruck: am getting out in a bit | 15:59 |
rlandy|ruck | marios|ruck: sure - have a good night | 16:00 |
rlandy|ruck | chandankumar: hmm, ... looks like we are hitting the same problem on baremetal jobs: https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-16.2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-bm_envD-3ctlr_1comp-featureset035-rhos-16.2/7e94eea/job-output.txt | 16:12 |
rlandy|ruck | OVB downstream looks good | 16:12 |
soniya|rover | rlandy|ruck, weshay|ruck, leaving for the day | 16:21 |
weshay|ruck | soniya|rover, k.. thank you! | 16:21 |
*** sshnaidm is now known as sshnaidm|afk | 16:25 | |
*** marios|ruck is now known as marios|out | 16:26 | |
rlandy|ruck | arxcruz: 1-on-1? | 16:35 |
zbr | dviroel: please double check https://review.opendev.org/c/openstack/tripleo-repos/+/800462 | 16:36 |
dviroel | zbr: hi, i was checking this few minutes ago | 16:38 |
dviroel | zbr: where are the sanity test outputs? | 16:38 |
zbr | look at zuul output (tox) | 16:39 |
zbr | you will still see some failures, but these will be fixed later. | 16:39 |
zbr | (later = in follow-up) | 16:40 |
dviroel | i still don't see any errors on those tox outputs | 16:43 |
dviroel | but np, if you will continue to work on these test in a follow up | 16:44 |
dviroel | the code lgtm | 16:44 |
rlandy|ruck | lunch - brb | 16:45 |
rlandy|ruck | name: tripleo_overcloud_image_upload - also hitting baremetal | 17:30 |
weshay|ruck | rlandy|ruck, http://dashboard-ci.tripleo.org/d/mOvYIiOMk/component-pipeline-train?orgId=1 is fixed | 17:42 |
weshay|ruck | rlandy|ruck, anything else busted on the dashboard you noticed? | 17:42 |
rlandy|ruck | thank you | 17:50 |
rlandy|ruck | yep ussuri and victroa lines were mixed up | 17:50 |
* rlandy|ruck gets | 17:50 | |
weshay|ruck | rlandy|ruck, looks right to me.. http://dashboard-ci.tripleo.org/d/SjLc_1cGk/component-pipeline-victoria?orgId=1 | 17:50 |
rlandy|ruck | idk - maybe we fixed it | 17:51 |
weshay|ruck | ussuri also looks ok | 17:51 |
rlandy|ruck | oh ... | 17:52 |
rlandy|ruck | no - looks right now | 17:53 |
rlandy|ruck | will check again | 17:53 |
rlandy|ruck | shoot gate failures begin | 18:28 |
weshay|ruck | rlandy|ruck, https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/803246 | 18:51 |
weshay|ruck | rlandy|ruck, re: gate | 18:51 |
weshay|ruck | I did something wrong w/ the query though | 18:52 |
weshay|ruck | 2021-08-02 18:44:38.521122 | ubuntu-focal | ERROR: Git reported dirty status. Git should never report dirty status at the end of testing, regardless if status is passed, failed or aborted. | 18:52 |
rlandy|ruck | what's the question? | 18:53 |
rlandy|ruck | why review has zuul -1 | 18:54 |
rlandy|ruck | you're just giving a heads up | 18:54 |
rlandy|ruck | https://bugs.launchpad.net/tripleo/+bug/1938684 | 18:55 |
rlandy|ruck | marios loged ^^ as possible gate blocker | 18:56 |
weshay|ruck | rlandy|ruck, that's the gate failure | 18:56 |
rlandy|ruck | k - so then we have a real issue | 18:56 |
weshay|ruck | the current one didn't fail w/ github... failed on pulling ubi8 | 18:56 |
rlandy|ruck | different then ok | 18:57 |
weshay|ruck | rlandy|ruck, although we should create a query for | 18:57 |
weshay|ruck | 2021-08-02 08:25:05 | fatal: unable to access 'https://github.com/rdo-packages/tripleo-ansible-distgit.git/': Failed to connect to github.com port 443: Connection timed out | 18:57 |
weshay|ruck | or atleast " Failed to connect to github.com port 443" | 18:58 |
weshay|ruck | rlandy|ruck, can you help me please w/ https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/803246 any idea why that file is dirty? | 19:00 |
rlandy|ruck | looking | 19:00 |
rlandy|ruck | ha - used to see that in my local runs | 19:02 |
rlandy|ruck | Git reported dirty status. Git should never report dirty status at the end of testing, regardless if status is passed | 19:03 |
rlandy|ruck | 2021-08-02 18:44:38.526359 | ubuntu-focal | Untracked files: | 19:03 |
rlandy|ruck | 2021-08-02 18:44:38.526374 | ubuntu-focal | (use "git add <file>..." to include in what will be committed) | 19:03 |
rlandy|ruck | 2021-08-02 18:44:38.526395 | ubuntu-focal | output/elastic-recheck/1938718.yaml | 19:03 |
weshay|ruck | am I suppose to create that file? | 19:04 |
* rlandy|ruck looks if that was added | 19:04 | |
rlandy|ruck | hold on | 19:04 |
rlandy|ruck | weshay|ruck: can I edit | 19:11 |
weshay|ruck | rlandy|ruck, always | 19:12 |
weshay|ruck | ssamal, you want/need an irc bouncer? | 19:12 |
ssamal | weshay|ruck, I was gonna ask you but since I am not really working/communicating, thought would ask later. | 19:13 |
weshay|ruck | ok.. ya.. I'll create an account on mine for you and email details.. you can take setup the client side later though. thought I would ask | 19:14 |
rlandy|ruck | ERROR: Git reported dirty status. Git should never report dirty status at the end of testing, regardless if status is passed, failed or aborted. | 19:15 |
weshay|ruck | ya.. not sure what to do w/ that | 19:15 |
rlandy|ruck | I get that always in my personal runs | 19:16 |
rlandy|ruck | let's see if it still bothers it | 19:16 |
ssamal | weshay|ruck, that would be awesome. Thanks :) | 19:18 |
weshay|ruck | bah.. quotes | 19:19 |
weshay|ruck | thanks | 19:19 |
ssamal | weshay|ruck, i wanted to go back and forth, I am not sure about my boundaries here. lol | 19:20 |
weshay|ruck | ssamal, back and forth w/ regards to irc? | 19:21 |
ssamal | yes | 19:21 |
rlandy|ruck | 2021-08-02 19:28:44.429614 | ubuntu-focal | py38 finish: run-test after 142.60 seconds | 19:30 |
rlandy|ruck | 2021-08-02 19:28:44.456943 | ubuntu-focal | ERROR: Git reported dirty status. Git should never report dirty status at the end of testing, regardless if status is passed, failed or aborted. | 19:30 |
rlandy|ruck | 2021-08-02 19:28:44.460517 | ubuntu-focal | On branch master | 19:30 |
rlandy|ruck | weshay|ruck: ^^ also got that error | 19:30 |
rlandy|ruck | output/elastic-recheck/1938718.yaml | 19:30 |
weshay|ruck | ah | 19:30 |
rlandy|ruck | ^^ problem with that file | 19:30 |
rlandy|ruck | issue with the repo | 19:32 |
rlandy|ruck | frenzy_friday: ^^ | 19:32 |
rlandy|ruck | health-query has a repo issue | 19:32 |
rlandy|ruck | .git/COMMIT_EDITMSG:Related-Bug: #1938718 | 19:33 |
rlandy|ruck | src/data/queries.yml: url: https://bugs.launchpad.net/tripleo/+bug/1938718 | 19:33 |
rlandy|ruck | recloned | 19:40 |
rlandy|ruck | when I recloned, I didn;t get that error | 19:43 |
rlandy|ruck | Powered by Gerrit Code Review (3.2.11-13-g94b0943907-dirty) | 19:43 |
rlandy|ruck | promoter Containers promote 'aggregate: f50dfd1b5edf85ec026b5ae4a32b4219, commit: 07e484a8616a4f51e998f0e8ae865e0781d0b20b, distro: 926d273316c9dc7cf18b78c6ff88dae77078ea01, extended: None, component: validation, timestamp: 1627921043' to current-tripleo: Attempting promotion | 19:54 |
rlandy|ruck | woohoo | 19:54 |
rlandy|ruck | master is on the run | 19:54 |
rlandy|ruck | o .. health | 19:56 |
rlandy|ruck | ussuri also promoting | 20:07 |
rlandy|ruck | weshay|ruck: looks to be some stash error ... I can reproduce it if I leave tox dir over from fist run | 20:19 |
rlandy|ruck | first | 20:19 |
rlandy|ruck | frenzy_friday: ^^ when you get in - pls | 20:19 |
rlandy|ruck | I even added the file with git | 20:19 |
rlandy|ruck | no luck | 20:19 |
*** dviroel is now known as dviroel|out | 20:53 | |
weshay|ruck | rlandy|ruck, anything you want me to look at? | 21:11 |
weshay|ruck | rlandy|ruck, I've completed some updates to the dashboards, cockpit and component lines.. if you have other suggestions holla | 21:11 |
rlandy|ruck | that's all for the moment | 21:23 |
rlandy|ruck | recheck the gate fail | 21:24 |
rlandy|ruck | weshay|ruck: ^^ | 21:24 |
rlandy|ruck | master and ussuri are promoting | 21:24 |
weshay|ruck | promotions are flowing.. we'll have to be careful it may be too many | 21:25 |
rlandy|ruck | need to decide what to do about downstream | 21:25 |
rlandy|ruck | git some debug to do there | 21:25 |
rlandy|ruck | multinode | 21:25 |
weshay|ruck | I haven't looked.. re: 16.2 promotion or something else/ | 21:26 |
rlandy|ruck | baremetal is failing due to chandan's change | 21:26 |
weshay|ruck | ? | 21:26 |
rlandy|ruck | no need | 21:26 |
rlandy|ruck | 17 and 16.2 | 21:26 |
weshay|ruck | rlandy|ruck, operators right? | 21:27 |
rlandy|ruck | yep | 21:27 |
rlandy|ruck | been a barrel of laughs | 21:28 |
weshay|ruck | hrm.. if you want to fill me in to see if I can help.. I'm avail | 21:28 |
rlandy|ruck | there is an issue with the current patch | 21:29 |
rlandy|ruck | left a note | 21:29 |
rlandy|ruck | we can get involved if no fix by tomorrow | 21:29 |
weshay|ruck | k | 21:29 |
rlandy|ruck | I am sure they will sort it out by then | 21:29 |
rlandy|ruck | weshay|ruck: we got a response re: rhos-1 | 21:30 |
rlandy|ruck | should be fine | 21:30 |
rlandy|ruck | weshay|ruck: hey | 22:16 |
rlandy|ruck | weshay|ruck: promoter issue | 22:16 |
rlandy|ruck | can you view master promoter? | 22:17 |
rlandy|ruck | it pushed the containers and images | 22:18 |
rlandy|ruck | never promoted the hash | 22:19 |
weshay|ruck | rlandy|ruck, hey | 22:19 |
weshay|ruck | ugh | 22:19 |
rlandy|ruck | http://10.0.148.74/promoter_logs/centos8_master.log | 22:19 |
rlandy|ruck | on the tmux | 22:19 |
rlandy|ruck | allows_clients is commented out | 22:19 |
weshay|ruck | k.. getting on tmux | 22:20 |
rlandy|ruck | https://images.rdoproject.org/centos8/master/rdo_trunk/current-tripleo/ | 22:21 |
rlandy|ruck | there | 22:21 |
rlandy|ruck | weshay|ruck: ^^ | 22:21 |
weshay|ruck | ya | 22:21 |
weshay|ruck | so.. I wonder if there isn't a default | 22:21 |
weshay|ruck | for allowed_clients | 22:21 |
rlandy|ruck | https://hub.docker.com/r/tripleomaster/openstack-base/tags?page=1&ordering=last_updated | 22:22 |
rlandy|ruck | has the f50... hash | 22:22 |
rlandy|ruck | f50dfd1b5edf85ec026b5ae4a32b4219 | 22:22 |
rlandy|ruck | Last pushed24 minutes ago | 22:22 |
weshay|ruck | ya.. I don't see in the previous logs where it attempted the promotion yet | 22:23 |
weshay|ruck | http://10.0.148.74/promoter_logs/centos8_master_2021-07-29T22:37.log | 22:24 |
weshay|ruck | may have to wait until this finishes w/ containers | 22:24 |
rlandy|ruck | it just rotates | 22:24 |
rlandy|ruck | ussuri promoted ok | 22:24 |
rlandy|ruck | oh and now the gate is having a fit | 22:25 |
rlandy|ruck | undercloud-deploy : Install the undercloud | 22:27 |
rlandy|ruck | maybe since we just updated ussuri | 22:28 |
rlandy|ruck | weshay|ruck: and log rotate is messed up here | 22:30 |
rlandy|ruck | looks like it's writing over the last log | 22:30 |
weshay|ruck | ya.. I've mentioned the log rotate issue to amol | 22:30 |
rlandy|ruck | there should be multiple for 08/02 | 22:30 |
rlandy|ruck | it got to pushing images | 22:31 |
rlandy|ruck | and the just restarted | 22:31 |
rlandy|ruck | looks like it skipped the dlrn_client | 22:31 |
rlandy|ruck | can we confirm that? | 22:31 |
rlandy|ruck | #allowed_clients: dlrn_client,qcow_client | 22:31 |
weshay|ruck | checking ussuri mirrors | 22:31 |
rlandy|ruck | explicitly define allowed_clients to include dlrn_client | 22:32 |
weshay|ruck | several mirrors not synced | 22:32 |
rlandy|ruck | gate - yeah - | 22:32 |
rlandy|ruck | just updated ussuri | 22:32 |
rlandy|ruck | threw the gate off | 22:32 |
rlandy|ruck | will recheck that | 22:32 |
weshay|ruck | rlandy|ruck, master just promoted | 22:34 |
weshay|ruck | http://images.rdoproject.org/centos8/master/rdo_trunk/?C=M;O=D | 22:34 |
weshay|ruck | oops | 22:34 |
weshay|ruck | hold on | 22:34 |
rlandy|ruck | that's what it did before | 22:35 |
rlandy|ruck | push conatiners, images and die | 22:36 |
rlandy|ruck | no dlrn promotion | 22:36 |
weshay|ruck | ya... and we shouldn't need the allowed_clients | 22:36 |
weshay|ruck | http://promoter.rdoproject.org/config/CentOS-8/wallaby.yaml | 22:36 |
rlandy|ruck | the code base is diff on the two promoters | 22:38 |
rlandy|ruck | I think | 22:38 |
rlandy|ruck | correct for rdoproject one | 22:38 |
rlandy|ruck | idk about master one | 22:38 |
weshay|ruck | the other gate failure.. I think failed on pulling ubi8 again | 22:39 |
weshay|ruck | I don't see the base failure | 22:39 |
weshay|ruck | the base build log rather | 22:39 |
rlandy|ruck | weshay|ruck: wrt master | 22:40 |
rlandy|ruck | can we redefine allowed_clients and retry? | 22:40 |
weshay|ruck | rlandy|ruck, we should probably turn off container promotion and retry | 22:41 |
rlandy|ruck | funny - we promoted fine last week | 22:41 |
rlandy|ruck | and image | 22:41 |
rlandy|ruck | weshay|ruck: ^^ just dlrn | 22:41 |
rlandy|ruck | extended: None, component: validation, timestamp: 1627921043': SUCCESSFUL promotion to current-tripleo | 22:43 |
rlandy|ruck | checking dlrn | 22:43 |
rlandy|ruck | https://trunk.rdoproject.org/centos8-master/current-tripleo/delorean.repo.md5 | 22:44 |
rlandy|ruck | weshay|ruck: ^^ yep | 22:44 |
rlandy|ruck | got it now | 22:44 |
weshay|ruck | ya.. bug some where | 22:44 |
rlandy|ruck | we need to put back all three | 22:44 |
rlandy|ruck | weshay|ruck: ^^ | 22:44 |
rlandy|ruck | and restart | 22:44 |
weshay|ruck | rlandy|ruck, no.. let's leave it off | 22:44 |
weshay|ruck | and afaict... we should have allowed_clients remarked out | 22:45 |
weshay|ruck | totally right? | 22:45 |
rlandy|ruck | yes - if the code on this promoter is updated | 22:45 |
rlandy|ruck | totally correct on rdoproject promoter | 22:45 |
weshay|ruck | ya.. they need to update the code | 22:45 |
* rlandy|ruck will check with akahat first thing tomorrow | 22:45 | |
weshay|ruck | let's just leave master off for now | 22:45 |
rlandy|ruck | leaving ruck/rover note | 22:45 |
rlandy|ruck | may throw the gate | 22:46 |
weshay|ruck | ya | 22:46 |
rlandy|ruck | more promotions | 22:46 |
weshay|ruck | rlandy|ruck, now we get into the grey area w/ lots of opinions w/ regards to how often to promote... | 22:47 |
weshay|ruck | the mirrors suck | 22:47 |
weshay|ruck | rlandy|ruck, I can recheck the failed gate jobs.. you should get out of here | 22:48 |
rlandy|ruck | weshay|ruck: thanks ... yeah - we need to get this mirror thing sorted | 22:48 |
weshay|ruck | rlandy|ruck, we may not have the powa | 22:49 |
rlandy|ruck | weshay|ruck: thanks for gate recheck ... going to step away for a bit mincha and then running time | 22:49 |
weshay|ruck | rlandy|ruck, although.. part of that script chandan is writing.. should check and pound the mirrors that are not synced | 22:49 |
rlandy|ruck | will check back in a bit | 22:49 |
rlandy|ruck | yep - should help | 22:49 |
rlandy|ruck | there is no good time | 22:49 |
rlandy|ruck | unless we only promoted on the weekends | 22:50 |
rlandy|ruck | which is not a possibility | 22:50 |
rlandy|ruck | weshay|ruck: ^^ if we only had master, wallaby and train, we could though | 22:50 |
weshay|ruck | heh | 22:50 |
weshay|ruck | rlandy|ruck, come to the #tripleo mtg tomorrow | 22:50 |
rlandy|ruck | a girl can dream, right? | 22:50 |
weshay|ruck | next step.. change release gov | 22:50 |
rlandy|ruck | k - yep | 22:51 |
rlandy|ruck | k - back later | 22:51 |
*** rlandy|ruck is now known as rlandy|ruck|bbl | 22:51 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!