*** rlandy|ruck2 has quit IRC | 00:01 | |
*** dsneddon has quit IRC | 00:27 | |
*** rfolco has quit IRC | 00:28 | |
*** dsneddon has joined #oooq | 00:29 | |
*** jmasud has quit IRC | 00:42 | |
*** jmasud has joined #oooq | 00:55 | |
*** Goneri has quit IRC | 01:07 | |
*** jmasud has quit IRC | 01:12 | |
*** jmasud has joined #oooq | 01:20 | |
*** jmasud has quit IRC | 02:14 | |
*** jmasud has joined #oooq | 03:02 | |
*** jmasud has quit IRC | 03:23 | |
*** jmasud has joined #oooq | 03:25 | |
*** ysandeep|away is now known as ysandeep | 04:05 | |
*** jmasud has quit IRC | 04:12 | |
*** skramaja has joined #oooq | 04:35 | |
*** apetrich has quit IRC | 04:42 | |
*** ykarel|away is now known as ykarel | 04:51 | |
*** jmasud has joined #oooq | 04:59 | |
*** soniya29 is now known as soniya29|ruck | 05:08 | |
*** udesale has joined #oooq | 05:34 | |
*** pojadhav|away is now known as pojadhav | 05:38 | |
*** soniya29|ruck is now known as soniya29|rover | 05:40 | |
*** marios has joined #oooq | 06:00 | |
*** jfrancoa has joined #oooq | 06:02 | |
*** ratailor has joined #oooq | 06:08 | |
*** ratailor has quit IRC | 06:22 | |
*** tosky has joined #oooq | 06:30 | |
*** ratailor has joined #oooq | 06:30 | |
*** jbadiapa has quit IRC | 06:34 | |
*** jmasud has quit IRC | 07:01 | |
*** dtantsur|afk is now known as dtantsur | 07:17 | |
*** amoralej|off is now known as amoralej | 07:19 | |
*** bhagyashris|away is now known as bhagyashris | 07:39 | |
*** jpena|off is now known as jpena | 07:57 | |
*** jmasud has joined #oooq | 08:27 | |
*** jbadiapa has joined #oooq | 08:32 | |
jbadiapa | Hi giulio, | 08:32 |
---|---|---|
jbadiapa | I talked to randy regarding https://review.opendev.org/#/c/711423/ | 08:33 |
jbadiapa | He told me that if we are sure that only ceph-nfs is the client for the mds to go down is ok with the changes...that's why I came back to you | 08:36 |
jbadiapa | is there any other client supported? | 08:36 |
*** soniya29|rover is now known as soniya29_lunch|r | 08:43 | |
*** soniya29_lunch|r is now known as soniya29|rover | 08:44 | |
*** ykarel is now known as ykarel|lunch | 08:45 | |
zbr | panda: morning | 08:48 |
zbr | look at https://logserver.rdoproject.org/38/28038/20/check/molecule-tripleo-common-delegated-centos-7/e753f3c/job-output.txt | 08:50 |
zbr | search for FAILED!, basically it creates the directory but next task fails to run | 08:51 |
zbr | my suspicion is that someone messed file permissions for zuul user, likely by running some commands as root | 08:51 |
zbr | if test-python environment (or a subfolder) is created by root, for example. | 08:52 |
zbr | neverming, i found the culprit, someone added a become: true to creation of venv.... | 08:54 |
*** apetrich has joined #oooq | 09:13 | |
*** ysandeep is now known as ysandeep|brb | 09:29 | |
*** chem has quit IRC | 09:30 | |
*** chem has joined #oooq | 09:32 | |
*** ykarel|lunch is now known as ykarel | 09:48 | |
*** dsneddon has quit IRC | 10:24 | |
soniya29|rover | chandankumar, ykarel, periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-stein job is failing , as per logs undercloud is failing | 10:33 |
soniya29|rover | chandankumar, ykarel, logs:- https://logserver.rdoproject.org/openstack-periodic-24hr/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-stein/338314c/job-output.txt | 10:35 |
*** jmasud has quit IRC | 10:38 | |
*** pojadhav is now known as pojadhav|afj | 10:47 | |
*** pojadhav|afj is now known as pojadhav|afk | 10:47 | |
ykarel | soniya29|rover, as per logs, tempest is failing not undercloud https://logserver.rdoproject.org/openstack-periodic-24hr/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-stein/338314c/logs/tempest.html.gz | 10:48 |
ykarel | u seeing where ? | 10:48 |
soniya29|rover | ykarel, i looked here:-https://logserver.rdoproject.org/openstack-periodic-24hr/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-stein/338314c/job-output.txt | 10:52 |
ykarel | soniya29|rover, in ^^ "cmd": "set -o pipefail &&subunit2junitxml /home/zuul/tempest/testrepository.subunit --output-to /home/zuul/tempest/tempest.xml 2>&1 >> tempest.log\n", failed | 10:53 |
ykarel | may be u got confused with undercloud : ok=18 changed=9 unreachable=0 failed=1 skipped=36 rescued=0 ignored=1 | 10:53 |
soniya29|rover | ykarel, yes. I took it as undercloud failure | 10:54 |
ykarel | soniya29|rover, ack, no it's not undercloud failure | 10:55 |
ykarel | see https://logserver.rdoproject.org/openstack-periodic-24hr/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset021-stein/338314c/logs/ | 10:56 |
ykarel | _Tempest_test_failed.log | 10:56 |
*** marios has quit IRC | 11:08 | |
chandankumar | soniya29|rover, ykarel fs21 job is meant to be ignored | 11:08 |
*** ratailor has quit IRC | 11:09 | |
ykarel | only if it's tempest failure | 11:09 |
soniya29|rover | chandankumar, ykarel, i didn't get? | 11:09 |
chandankumar | ykarel, does fs020 job is working fine | 11:10 |
ykarel | chandankumar, i didn't checked | 11:10 |
chandankumar | soniya29|rover, fs021 and fs020 are same job only difference is fs21 runs skip tests as a tempest tests | 11:10 |
chandankumar | soniya29|rover, please check fs020 job also | 11:10 |
ykarel | also in addition fs021 exercise cr repo | 11:10 |
chandankumar | ykarel, cr repo one needs to be fixed also for ussuri and master | 11:11 |
soniya29|rover | chandankumar, it is also failing in fs20 | 11:11 |
ykarel | chandankumar, yes | 11:11 |
chandankumar | soniya29|rover, job link | 11:11 |
soniya29|rover | chandankumar, https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-24hr&job_name=periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-stein | 11:12 |
soniya29|rover | chandankumar, ykarel, overcloud deployment failed | 11:17 |
chandankumar | arxcruz, kopecmartin agenda for today's tempest meeting https://hackmd.io/fIOKlEBHQfeTZjZmrUaEYQ?view#2020-06-19 feel free to add or update | 11:18 |
*** derekh has joined #oooq | 11:36 | |
*** pojadhav|afk is now known as pojadhav | 11:37 | |
*** jpena is now known as jpena|lunch | 11:41 | |
*** rlandy has joined #oooq | 11:56 | |
*** rlandy is now known as rlandy|ruck | 11:57 | |
rlandy|ruck | chandankumar: hi | 11:58 |
chandankumar | rlandy|ruck, Hello | 11:58 |
rlandy|ruck | ch: looks like we got dlrn fails in downstream. do we need to make a change there now? https://sf.hosted.upshift.rdu2.redhat.com/logs/84/200084/38/check/tripleo-ci-rhel-8-standalone-rhos-17/fbb97f7/logs/undercloud/home/zuul/dlrn.log | 11:59 |
rlandy|ruck | chandankumar: ^^ | 11:59 |
chandankumar | rlandy|ruck, I am checking with jpena|lunch on that | 11:59 |
rlandy|ruck | chandankumar: thanks - I left in your suggested setting in the release file | 12:00 |
*** marios has joined #oooq | 12:01 | |
rlandy|ruck | soniya29|rover: hello ... | 12:04 |
rlandy|ruck | weshay|ruck: hey | 12:04 |
rlandy|ruck | soniya29|rover: how are things today? | 12:04 |
rlandy|ruck | https://hackmd.io/XcuH2OIVTMiuxyrqSF6ocw | 12:05 |
rlandy|ruck | following ^^ | 12:05 |
*** ysandeep|brb is now known as ysandeep | 12:11 | |
*** rfolco has joined #oooq | 12:12 | |
rlandy|ruck | soniya29|rover: have you had a meeting with weshay|ruck and former ruck/rovers? | 12:20 |
rlandy|ruck | pojadhav: rfolco: ^^ did you all transfer info? | 12:21 |
rfolco | talked to sagi yesterday | 12:22 |
pojadhav | rlandy|ruck, yup | 12:28 |
rlandy|ruck | marios: hey | 12:31 |
rlandy|ruck | https://review.opendev.org/#/c/736816/ | 12:31 |
rlandy|ruck | need your w+ on that | 12:31 |
rlandy|ruck | paying the usual rate | 12:31 |
weshay|ruck | rlandy|ruck, w/ just soniya29|rover | 12:31 |
rlandy|ruck | weshay|ruck: np - as long as you guys touched base | 12:32 |
marios | rlandy|ruck: o/ looking | 12:32 |
chandankumar | weshay|ruck, tempest meeting time | 12:32 |
marios | rlandy|ruck: is it causing explosions for you? blocker for d/stream maybe or ovb or something? | 12:32 |
*** derekh has quit IRC | 12:33 | |
rlandy|ruck | marios: upstream rather | 12:33 |
rlandy|ruck | marios: <EmilienM> rlandy|ruck: yes please approve it - I can't though | 12:34 |
marios | rlandy|ruck: ack done | 12:34 |
rlandy|ruck | marios: weshay|ruck: thanks both | 12:34 |
*** amoralej is now known as amoralej|lunch | 12:39 | |
rfolco | chandankumar, would you review this one pls ? https://review.opendev.org/#/c/733676/ | 12:39 |
*** jpena|lunch is now known as jpena | 12:43 | |
*** udesale_ has joined #oooq | 12:44 | |
*** udesale has quit IRC | 12:46 | |
*** derekh has joined #oooq | 12:51 | |
zbr | marios: rlandy|ruck weshay|ruck : https://review.rdoproject.org/r/#/c/28038/ -- please - switch to py3 on promoter | 13:00 |
zbr | big in number of files, but not so much in the nature of changes | 13:00 |
zbr | rfolco: ^ | 13:01 |
rlandy|ruck | zbr: yep - looks good - +2 | 13:02 |
chandankumar | arxcruz++ | 13:03 |
arxcruz | soniya29|rover++ :) \ | 13:03 |
soniya29|rover | rlandy|ruck, hello | 13:07 |
soniya29|rover | rlandy|ruck, had meeting with pojadhav and weshay|ruck today | 13:08 |
rlandy|ruck | soniya29|rover: ok - as long as you are all set | 13:08 |
*** dtantsur is now known as dtantsur|brb | 13:11 | |
chandankumar | rlandy|ruck, weshay|ruck https://review.opendev.org/#/c/736999/ | 13:12 |
marios | zbr: ack adding to queue will check if its still around on next run but looks like it is on final approach | 13:19 |
rlandy|ruck | weshay|ruck: soniya29|rover: I renamed the new bmc-template on rdocloud | 13:21 |
rlandy|ruck | we will have to see if that helps | 13:21 |
*** amoralej|lunch is now known as amoralej | 13:21 | |
rlandy|ruck | so stacks created from a couple mins ago should get the new bmc template | 13:21 |
zbr | who can give me access to tripleo-infra project on rdo? | 13:23 |
zbr | panda: ok, do it but I still need access to it, just in case hell breaks | 13:24 |
ykarel | rlandy|ruck, new bmc template got tested in https://review.rdoproject.org/r/#/c/28004/ and looks good | 13:25 |
ykarel | can u also update in vexx host | 13:25 |
rlandy|ruck | ykarel: sshnaidm|off updated vexx | 13:26 |
rlandy|ruck | ykarel: rdocloud was set as public | 13:26 |
ykarel | rlandy|ruck, okk good, may be he did earlier so it's already used in job | 13:26 |
rlandy|ruck | so we needed admin help there | 13:26 |
ykarel | okk got it | 13:26 |
ykarel | i seen the chat | 13:26 |
ykarel | earlier | 13:26 |
*** ykarel is now known as ykarel|afk | 13:28 | |
rlandy|ruck | my gosh we are running a lot of jobs | 13:28 |
rlandy|ruck | no OVB started on rdocloud in a while | 13:28 |
*** pojadhav is now known as pojadhav|afk | 13:30 | |
panda | ... I thought it would be quicker, but we are upgrading the promtoer server to CentOS7.8 | 13:33 |
panda | so it may not be accessible | 13:33 |
panda | for a while | 13:33 |
panda | it's taking time to make a snapshot ... | 13:33 |
*** Goneri has joined #oooq | 13:36 | |
chandankumar | marios, you are looking at train c8 user stories? | 13:36 |
marios | chandankumar: not yet go ahead just update if you pick up something put your name on it? | 13:38 |
marios | chandankumar: we can sync next week i will pickup something different | 13:38 |
chandankumar | marios, I will start picking it from monday | 13:39 |
marios | chandankumar: sounds right | 13:39 |
*** TrevorV has joined #oooq | 13:42 | |
weshay|ruck | zbr, 1-1 | 13:51 |
ysandeep | chandankumar, I am interested in train c8 user stories too .. Please keep in in loop too o/ I will probably bug you for context and planning | 13:53 |
chandankumar | ysandeep, sure | 13:58 |
ysandeep | chandankumar, thank you :) | 13:58 |
rlandy|ruck | ysandeep: marios: weshay|ruck: looks like we have a downstream registry problem ... component pipeline turns red from glance | 14:02 |
rlandy|ruck | onwards | 14:02 |
rlandy|ruck | looking into it | 14:02 |
rlandy|ruck | no outage reported | 14:02 |
rlandy|ruck | https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?pipeline=openstack-periodic-rhos-17 - that is pretty though | 14:03 |
ysandeep | rlandy|ruck, :( not again | 14:03 |
* weshay|ruck looks | 14:04 | |
rlandy|ruck | panda: ^^ we had a clean pass on integration line | 14:04 |
marios | rlandy|ruck: ack thanks for heads up | 14:04 |
rlandy|ruck | why didn't we promote to current-tripleo | 14:04 |
rlandy|ruck | panda: ^^ still not wired up? | 14:05 |
rlandy|ruck | can we promote, pls | 14:05 |
weshay|ruck | rlandy|ruck, so just the glance component is failing .. this is good | 14:05 |
weshay|ruck | also not seen in master | 14:05 |
weshay|ruck | so will be interesting to see the root cause of the failure | 14:05 |
panda | rlandy|ruck: upstream or downstream ? | 14:05 |
rlandy|ruck | panda: downstream | 14:05 |
weshay|ruck | panda, downstream | 14:05 |
rlandy|ruck | network failed as well | 14:06 |
rlandy|ruck | maybe a hitch | 14:06 |
panda | rlandy|ruck: ok, promoting now. | 14:08 |
*** tosky has quit IRC | 14:08 | |
rlandy|ruck | panda: ok - so we're not automated yet | 14:08 |
panda | rlandy|ruck: no, promoting 8f5b47d2198ea3eefdae76a9d7789f66 | 14:08 |
panda | rlandy|ruck: then I'll automate. | 14:08 |
weshay|ruck | requests.exceptions.HTTPError: 401 Client Error: Unauthorized for url: http://mirror.regionone.vexxhost-nodepool-tripleo.rdoproject.org:8082/v2/tripleoussuri/centos-binary-cinder-api/manifests/62340a910277dfc38f77529a3e80a1d6 | 14:09 |
weshay|ruck | anyone see container pulls failing in periodic vexx? | 14:09 |
rlandy|ruck | panda: perfect - thanks | 14:10 |
rlandy|ruck | weshay|ruck: looking | 14:10 |
rlandy|ruck | vexx was doing well | 14:10 |
rlandy|ruck | at least ovb was passing | 14:10 |
rlandy|ruck | ipa cleared up | 14:11 |
rlandy|ruck | which is also nice | 14:11 |
weshay|ruck | rlandy|ruck, soniya29|rover sshnaidm|off FYI.. this may be an easier way to visualize periodic promtion jobs http://dashboard-ci.tripleo.org/d/QrkFP-zGz/upstream-and-rdo-promotions?orgId=1 | 14:12 |
rlandy|ruck | what's up with cloudops | 14:12 |
rlandy|ruck | never promotes | 14:12 |
rlandy|ruck | upstream or downstream | 14:12 |
weshay|ruck | rlandy|ruck, check consistent against | 14:12 |
weshay|ruck | the promoted hash | 14:12 |
weshay|ruck | same | 14:12 |
rlandy|ruck | yeah - know | 14:12 |
rlandy|ruck | so inactive | 14:12 |
weshay|ruck | so that's why | 14:12 |
rlandy|ruck | just musing why we have such an inactive component | 14:13 |
*** ysandeep is now known as ysandeep|away | 14:14 | |
weshay|ruck | rlandy|ruck, aye.. may be pinned.. but low on my hit list atm | 14:14 |
rlandy|ruck | we have other things to worry about | 14:14 |
weshay|ruck | aye | 14:15 |
chandankumar | weshay|ruck, rlandy|ruck for enabling cr repos https://review.opendev.org/#/c/736999/ and https://review.rdoproject.org/r/#/c/28168/ | 14:15 |
chandankumar | will take care of that | 14:15 |
*** pojadhav|afk is now known as pojadhav | 14:15 | |
weshay|ruck | rlandy|ruck, any updates on ovb generally? I see 2 passes in vexx.. you/sagi fixed bmc I assume | 14:16 |
weshay|ruck | I still see lots of stacks in rdo | 14:16 |
rlandy|ruck | weshay|ruck: I updated the bmc-template on rdocloud per sagi's email | 14:16 |
rlandy|ruck | he updated the template on vexx | 14:16 |
rlandy|ruck | that only deals with introspection errors | 14:16 |
weshay|ruck | k.. | 14:17 |
* weshay|ruck pokes around | 14:17 | |
rlandy|ruck | weshay|ruck: so https://review.rdoproject.org/zuul/stream/b30a11358e984c5dbd513ae95fbe55cf?logfile=console.log | 14:17 |
rlandy|ruck | for example - started late enough to catch the change | 14:17 |
rlandy|ruck | it is deploying now | 14:18 |
rlandy|ruck | weshay|ruck: so the update went in 8:56 my time | 14:19 |
rlandy|ruck | it 10:19 now | 14:19 |
rlandy|ruck | so that should give you an idea of when the change would get picked up | 14:19 |
weshay|ruck | ah k | 14:20 |
* rlandy|ruck checks vexx | 14:20 | |
rlandy|ruck | weshay|ruck: also - we're still battling stuck stacks ... | 14:20 |
rlandy|ruck | <rlandy|ruck> I think our current scripts would tackle a stuck server | 14:21 |
rlandy|ruck | <kforde> rlandy|ruck ack. Might need some DB massaging | 14:21 |
rlandy|ruck | weshay|ruck: ^^ there we are atm | 14:21 |
chandankumar | weshay|ruck, rlandy|ruck https://review.opendev.org/#/q/topic:crrepo+(status:open+OR+status:merged) | 14:23 |
rlandy|ruck | vexx has other issues ... 2020-06-19 14:09:35.689181 | TASK [ovb-manage : Attach instance to provision OVB network] | 14:27 |
rlandy|ruck | weshay|ruck: vexx looks like diff failures in diff places - afaict | 14:28 |
*** ykarel|afk is now known as ykarel | 14:28 | |
*** dtantsur|brb is now known as dtantsur | 14:28 | |
rlandy|ruck | 2020-06-19 05:56:42 | 14:36 |
rlandy|ruck | periodic-tripleo-ci-rhel-8-bm_envA-3ctlr_1comp-featureset001-baremetal-rhos-17 | 14:36 |
rlandy|ruck | 4 hours, 53 minutes | 14:36 |
rlandy|ruck | SUCCESS | 14:36 |
rlandy|ruck | YES | 14:36 |
rlandy|ruck | ha | 14:36 |
rlandy|ruck | ysandeep|away: weshay|ruck: ^^ it's official baremetal is running in component pipeline | 14:38 |
weshay|ruck | saw :) | 14:39 |
weshay|ruck | rlandy++ ysandeep++ | 14:39 |
rlandy|ruck | weshay|ruck: you're out next week, right? | 14:41 |
rlandy|ruck | ovb on the other hand - no so much | 14:42 |
chandankumar | weshay|ruck, rlandy|ruck I am leaving early today, see ya on monday | 14:45 |
chandankumar | happy weekend | 14:45 |
rlandy|ruck | chandankumar: have a good weekend | 14:45 |
*** chandankumar is now known as raukadah | 14:45 | |
rlandy|ruck | see you monday | 14:46 |
*** ccamacho has quit IRC | 14:48 | |
rlandy|ruck | 2020-06-19 10:32:12.566739 | primary | /home/zuul/workspace/dlrnapi_venv/bin/activate: line 31: $1: unbound variable | 14:50 |
rlandy|ruck | no what? | 14:50 |
*** skramaja has quit IRC | 14:59 | |
*** jtomasek has quit IRC | 15:08 | |
weshay|ruck | rlandy|ruck, where is that? | 15:20 |
rlandy|ruck | weshay|ruck: possible hitch | 15:22 |
rlandy|ruck | didn;t see it again | 15:22 |
zbr | weshay|ruck: talked with panda, promoter is back online after snapshot. we will do the work on tuesday because monday panda is recharging | 15:24 |
zbr | i am not confident to do this alone | 15:25 |
weshay|ruck | k | 15:25 |
rlandy|ruck | https://logserver.rdoproject.org/30/735930/2/openstack-check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001/4da5204/job-output.txt | 15:25 |
rlandy|ruck | yeah - so possible dlnr issues | 15:25 |
rlandy|ruck | weshay|ruck: so we are seeing OVB failures but not introspection | 15:26 |
weshay|ruck | ya.. in deployment | 15:26 |
* weshay|ruck out of 1-1's | 15:26 | |
rlandy|ruck | 2 hr 9 min is the last introspection failure I see | 15:26 |
rlandy|ruck | weshay|ruck: k - we have a REAL issue with promote jobs ... | 15:27 |
rlandy|ruck | https://logserver.rdoproject.org/openstack-component-security/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-centos-8-master-component-security-promote-consistent-to-component-ci-testing/3082b35/job-output.txt | 15:27 |
rlandy|ruck | upstream and downstream now showing the same issue | 15:27 |
rlandy|ruck | 2020-06-19 15:05:00.530944 | primary | /home/zuul/workspace/dlrnapi_venv/bin/activate: line 31: $1: unbound variable | 15:27 |
weshay|ruck | vex is failing on this again | 15:28 |
weshay|ruck | packages/metalsmith/_provisioner.py\", line 170, in _reserve_node\n 'Failed to reserve a node: %s' % exc)\nmetalsmith.exceptions.ReservationFailed: Failed to reserve a node: Allocation fd30a1f8-8766-42aa-985b-06133c659951 failed: Failed to process allocation fd30a1f8-8766-42aa-985b-06133c659951: none of the requested nodes are available and match the resource class baremetal.\n", "module_stdout": "", "msg": "MODULE FAILURE\nSee | 15:28 |
weshay|ruck | stdout/stderr for the exact error", "rc": 1} | 15:28 |
* rlandy|ruck looks into promote issue | 15:28 | |
weshay|ruck | rdo is failing on | 15:29 |
weshay|ruck | etalsmith.exceptions.DeploymentFailed: Failed to attach VIF 1df97598-93d0-433f-a6b5-cd376fc492da to bare metal node cd111159-1424-40f8-bdd8-9dedf96c7d55: Client Error for url: https://192.168.24.2:13385/v1/nodes/cd111159-1424-40f8-bdd8-9dedf96c7d55/vifs, Node cd111159-1424-40f8-bdd8-9dedf96c7d55 is locked by host undercloud.localdomain, please retry after the current operation is completed.\n", "module_stdout": "", "msg": "MODULE | 15:29 |
weshay|ruck | FAILURE\nSee stdout/stderr for the exact error", "rc": 1} | 15:29 |
rfolco | weshay|ruck, 1x1? | 15:29 |
weshay|ruck | rfolco, aye | 15:30 |
weshay|ruck | rlandy|ruck, there is some background on the cix board | 15:30 |
weshay|ruck | https://review.rdoproject.org/r/#/c/28131/ | 15:30 |
rlandy|ruck | weshay|ruck: on which error? | 15:30 |
weshay|ruck | https://trello.com/c/uDzT39AG/1509-cixlp1879472tripleociproa-ovb-overcloud-deploy-fails-sporadically-with-not-enough-free-physical-ports-error | 15:31 |
*** Goneri has quit IRC | 15:32 | |
*** matbu has quit IRC | 15:32 | |
rlandy|ruck | oh - looking in dlrn failure | 15:32 |
*** Goneri has joined #oooq | 15:32 | |
*** matbu has joined #oooq | 15:32 | |
rlandy|ruck | zbr: possible dlrn failure from your change | 15:33 |
rlandy|ruck | 2020-06-19 15:05:00.530829 | primary | + unset VIRTUAL_ENV | 15:33 |
rlandy|ruck | 2020-06-19 15:05:00.530944 | primary | /home/zuul/workspace/dlrnapi_venv/bin/activate: line 31: $1: unbound variable | 15:33 |
rlandy|ruck | zbr: pls see https://logserver.rdoproject.org/openstack-component-security/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-centos-8-master-component-security-promote-consistent-to-component-ci-testing/3082b35/job-output.txt | 15:34 |
*** TrevorV has quit IRC | 15:35 | |
zbr | rlandy|ruck: unrelated, but easy to fix, just move the -u after the activate. | 15:35 |
zbr | activate does not work with "set -u" on ancient versions of virtualenv | 15:36 |
zbr | https://github.com/pypa/virtualenv/issues/1342 | 15:36 |
zbr | same applies to venv, if I am correct. | 15:36 |
* rlandy|ruck tries | 15:36 | |
zbr | rlandy|ruck: read https://github.com/pypa/virtualenv/issues/1029 -- OP and year ;) | 15:37 |
zbr | guess who fixed it | 15:37 |
zbr | try to define VIRTUAL_ENV_DISABLE_PROMPT=1 | 15:37 |
*** TrevorV has joined #oooq | 15:37 | |
rlandy|ruck | zbr: why the issue now? | 15:37 |
rlandy|ruck | this code has been running for months | 15:38 |
rlandy|ruck | the -u is after the activate | 15:43 |
rlandy|ruck | source $WORKSPACE/dlrnapi_venv/bin/activate | 15:43 |
rlandy|ruck | pip install -U dlrnapi_client shyaml | 15:43 |
rlandy|ruck | VIRTUAL_ENV_DISABLE_PROMPT=true is a workaround | 15:44 |
rlandy|ruck | zbr: ^^ I think this is related to your change due to the timing | 15:45 |
rlandy|ruck | I can try the workaround | 15:45 |
rlandy|ruck | but then people will yell because we used a workaround | 15:45 |
*** tosky has joined #oooq | 15:50 | |
rlandy|ruck | zbr: no show on https://review.rdoproject.org/r/28170 | 15:53 |
rlandy|ruck | still failes | 15:53 |
rlandy|ruck | fails | 15:53 |
*** amoralej is now known as amoralej|off | 15:56 | |
*** marios is now known as marios|out | 15:59 | |
rlandy|ruck | weshay|ruck: zbr: soniya29|rover: https://review.rdoproject.org/r/#/c/28171/ | 15:59 |
rlandy|ruck | to fix unbound var error above | 16:00 |
rlandy|ruck | zbr: I'm merging that revert when tests are done | 16:00 |
rlandy|ruck | panda: ^^ FYI | 16:02 |
*** ykarel is now known as ykarel|away | 16:03 | |
rlandy|ruck | hmmm .. tests still running ... brb | 16:09 |
*** rlandy|ruck is now known as rlandy|ruck|brb | 16:09 | |
*** jmasud has joined #oooq | 16:11 | |
*** dtantsur is now known as dtantsur|afk | 16:13 | |
weshay|ruck | rlandy|ruck|brb, FYI.. sagi and I discussed not loading vex as much as we are.. https://review.rdoproject.org/r/#/c/28146/ | 16:13 |
weshay|ruck | we may need to back more jobs off | 16:14 |
weshay|ruck | I think rdo is still hitting https://bugs.launchpad.net/tripleo/+bug/1879472 | 16:15 |
openstack | Launchpad bug 1879472 in tripleo "OVB overcloud deploy fails sporadically with "not enough free physical ports" error" [Critical,Triaged] | 16:15 |
*** udesale_ has quit IRC | 16:26 | |
weshay|ruck | rlandy|ruck|brb, fak https://logserver.rdoproject.org/openstack-component-baremetal/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-centos-8-train-component-baremetal-promote-consistent-to-component-ci-testing/27f8308/job-output.txt | 16:36 |
*** rlandy|ruck|brb is now known as rlandy|ruck | 16:37 | |
rlandy|ruck | weshay|ruck: merging the revert | 16:38 |
rlandy|ruck | https://review.rdoproject.org/r/#/c/28171/ | 16:38 |
weshay|ruck | k | 16:38 |
weshay|ruck | component pipeline upstream has issues | 16:38 |
rlandy|ruck | weshay|ruck: k - see the other error | 16:39 |
rlandy|ruck | weshay|ruck: looking into it | 16:39 |
weshay|ruck | https://logserver.rdoproject.org/openstack-component-baremetal/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-centos-8-master-component-baremetal-promote-consistent-to-component-ci-testing/259ab9d/job-output.txt | 16:40 |
weshay|ruck | rlandy|ruck, so.. I was accidently looking at train | 16:40 |
weshay|ruck | which .. that's a OK failure | 16:40 |
weshay|ruck | 2020-06-19 16:26:01.012107 | primary | /home/zuul/workspace/dlrnapi_venv/bin/activate: line 31: $1: unbound variable | 16:40 |
weshay|ruck | is the error in master | 16:40 |
rlandy|ruck | weshay|ruck: ^^ that error we just merged the revert for | 16:40 |
rlandy|ruck | weshay|ruck: the train thing is another problem | 16:41 |
weshay|ruck | rlandy|ruck, train is fine.. centos-8 train is now componentized | 16:41 |
weshay|ruck | we can ignore that, that's sprint work | 16:41 |
rlandy|ruck | weshay|ruck: periodic-tripleo-centos-8-train-component-baremetal-promote-consistent-to-component-ci-testing - ok error? | 16:41 |
rlandy|ruck | ok fantastic | 16:41 |
weshay|ruck | rlandy|ruck, how does https://review.rdoproject.org/r/#/c/28171/ impact the 2020-06-19 16:26:01.012107 | primary | /home/zuul/workspace/dlrnapi_venv/bin/activate: line 31: $1: unbound variable | 16:42 |
weshay|ruck | in consistent -> component-ci-testing | 16:42 |
rlandy|ruck | weshay|ruck: see my ranting above | 16:42 |
rlandy|ruck | weshay|ruck: testproject https://review.rdoproject.org/r/#/c/24960/ | 16:43 |
*** Goneri has quit IRC | 16:43 | |
rlandy|ruck | I tried the workaround zbr suggested | 16:43 |
rlandy|ruck | no dice | 16:43 |
rlandy|ruck | we revert | 16:43 |
weshay|ruck | fine | 16:43 |
rlandy|ruck | can't break promotions | 16:43 |
weshay|ruck | which file though? | 16:43 |
weshay|ruck | in that patch? | 16:43 |
rlandy|ruck | getting | 16:44 |
rlandy|ruck | https://review.rdoproject.org/r/#/c/28171/1/ci-scripts/tripleo-upstream/dlrnapi_venv.sh | 16:44 |
weshay|ruck | k | 16:45 |
weshay|ruck | thank you | 16:45 |
rlandy|ruck | weshay|ruck: ok - the overcloud deploy issue on OVB? | 16:45 |
weshay|ruck | rlandy|ruck, somebody promoted ironic over the compnent-pipeline to current tripleo | 16:45 |
rlandy|ruck | we got that sorted or I should investigate? | 16:45 |
rlandy|ruck | shoot them | 16:45 |
weshay|ruck | do we have any idea how that happened? | 16:45 |
* rlandy|ruck will check | 16:45 | |
rlandy|ruck | let's look at dlrn | 16:45 |
weshay|ruck | ironic-lib in current-tripleo > than promoted-components | 16:47 |
*** dsneddon has joined #oooq | 16:47 | |
rlandy|ruck | python-ironic-lib-4.3.0-0.20200605155925.df238ba.el8.src.rpm2020-06-05 16:00 101K | 16:49 |
rlandy|ruck | vs | 16:49 |
rlandy|ruck | python-ironic-lib-4.3.0-0.20200605155925.df238ba.el8.src.rpm2020-06-05 16:00 101K | 16:49 |
rlandy|ruck | weshay|ruck: ^^ am I crazy? | 16:49 |
rlandy|ruck | looks the same | 16:49 |
weshay|ruck | hrm | 16:49 |
* weshay|ruck looks | 16:49 | |
rlandy|ruck | https://trunk.rdoproject.org/centos8-master/component/baremetal/promoted-components/ | 16:50 |
rlandy|ruck | https://trunk.rdoproject.org/centos8-master/component/baremetal/current-tripleo/ | 16:50 |
rlandy|ruck | same | 16:50 |
weshay|ruck | ya.. but /me looks at repo | 16:50 |
weshay|ruck | hrm.. wth.. I'm crazy | 16:51 |
rlandy|ruck | weshay|ruck: well that would have been at least an explanation | 16:54 |
rlandy|ruck | weshay|ruck: maybe mirrors - pls post repo link | 16:54 |
rlandy|ruck | in fact afaik, you can't promote out of order | 16:56 |
rlandy|ruck | I messed up once and tried to promote the wrong name - dlrn is smart enough to stop me | 16:56 |
*** ChanServ has quit IRC | 16:59 | |
weshay|ruck | rlandy|ruck, I'll focus on the master baremetal component promotion after you patch lands | 17:01 |
weshay|ruck | bmc should be fixed so.. | 17:01 |
*** derekh has quit IRC | 17:02 | |
rlandy|ruck | weshay|ruck: we'll have to rerun that line, no> | 17:02 |
weshay|ruck | rlandy|ruck, just triggered it | 17:02 |
rlandy|ruck | weshay|ruck: k - anything else I should work on here? | 17:03 |
rlandy|ruck | gate failures /o\ | 17:03 |
weshay|ruck | rlandy|ruck, there's an ipa failure | 17:04 |
weshay|ruck | rlandy|ruck, probably ipa setup.. saw it yesterday | 17:04 |
*** jpena is now known as jpena|off | 17:04 | |
rlandy|ruck | 2020-06-19 15:48:35.834989 | primary | TASK [ipa-multinode : install tls dependencies] ******************************** | 17:04 |
rlandy|ruck | 2020-06-19 15:48:35.835120 | primary | Friday 19 June 2020 15:48:35 +0000 (0:00:00.584) 0:13:38.422 *********** | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.518933 | primary | fatal: [undercloud]: FAILED! => { | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519005 | primary | "changed": false, | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519033 | primary | "results": [] | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519062 | primary | } | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519103 | primary | | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519147 | primary | MSG: | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519181 | primary | | 17:04 |
rlandy|ruck | 2020-06-19 15:48:38.519206 | primary | Failed to download packages: Cannot download Packages/python3-qrcode-core-5.1-12.module_el8.2.0+370+b142e101.noarch.rpm: All mirrors were tried | 17:05 |
weshay|ruck | stupid mirrors | 17:05 |
rlandy|ruck | https://zuul.openstack.org/builds?job_name=tripleo-ci-centos-8-standalone-on-multinode-ipa | 17:05 |
rlandy|ruck | pretty bad lately | 17:05 |
rlandy|ruck | maybe legit | 17:05 |
rlandy|ruck | 2020-06-19 12:21:52.167862 | primary | Checking DNS domain 70.69.158.in-addr.arpa., please wait ... | 17:05 |
rlandy|ruck | 2020-06-19 12:21:52.167878 | primary | DNS zone 70.69.158.in-addr.arpa. already exists in DNS and is handled by server(s): ['ns10.ovh.ca.', 'dns10.ovh.ca.'] | 17:05 |
rlandy|ruck | other failures are different | 17:06 |
weshay|ruck | rlandy|ruck, I know ade is really needing that patch to land | 17:06 |
rlandy|ruck | weshay|ruck: eek ... passed on check | 17:07 |
weshay|ruck | aye | 17:07 |
weshay|ruck | the other is pip.. meh | 17:07 |
rlandy|ruck | weshay|ruck: what's the right thing to do here (recheck and pray)? | 17:07 |
weshay|ruck | gate is ok.. minus ipa | 17:07 |
weshay|ruck | rlandy|ruck, /me looks at job history | 17:07 |
rlandy|ruck | one of the cloud looks like it has a dns issue with that job | 17:08 |
weshay|ruck | rlandy|ruck, we're going to need to run a elastic-recheck query on it | 17:08 |
weshay|ruck | to really see | 17:08 |
rlandy|ruck | ack | 17:09 |
weshay|ruck | rlandy|ruck, let's put up a change to include | 17:09 |
weshay|ruck | 2020-06-19 13:10:54.125489 | primary | The ipa-server-install command failed. See /var/log/ipaserver-install.log for more information | 17:09 |
weshay|ruck | in the elastic-recheck file | 17:09 |
weshay|ruck | that isn't captured atm | 17:09 |
weshay|ruck | https://957cf5a247037a442a69-452dd4b4c84f55a29b71ee3516016f2d.ssl.cf2.rackcdn.com/729465/42/check/tripleo-ci-centos-8-standalone-on-multinode-ipa/b0e585a/job-output.txt | 17:09 |
weshay|ruck | 2020-06-19T13:10:53Z DEBUG The ipa-server-install command failed, exception: DNSZoneAlreadyExists: DNS zone 124.72.198.in-addr.arpa. already exists in DNS and is handled by server(s): ['ns11.iweb-hosting.com.', 'ns12.iweb-hosting.com.'] | 17:10 |
weshay|ruck | ^ | 17:10 |
rlandy|ruck | weshay|ruck: pls remind me where the rdoclould elastic recheck file is - been a while | 17:10 |
weshay|ruck | I'll get it.. | 17:10 |
weshay|ruck | rlandy|ruck, can you ping ade w/ ^ | 17:10 |
rlandy|ruck | weshay|ruck: so that second error only happens on one cloud | 17:10 |
weshay|ruck | why is that happening? | 17:10 |
rlandy|ruck | ack | 17:10 |
* rlandy|ruck checks with ade | 17:10 | |
weshay|ruck | https://bugs.launchpad.net/tripleo/+bug/1884287 | 17:12 |
openstack | Launchpad bug 1884287 in tripleo "ipa-server install error: 2020-06-19T13:10:53Z DEBUG The ipa-server-install command failed, exception: DNSZoneAlreadyExists: DNS zone" [Critical,Triaged] | 17:12 |
*** marios|out has quit IRC | 17:14 | |
rlandy|ruck | weshay|ruck: updated that LP = pinging ade | 17:17 |
*** ChanServ has joined #oooq | 17:37 | |
*** tepper.freenode.net sets mode: +o ChanServ | 17:37 | |
*** Goneri has joined #oooq | 17:37 | |
*** dsneddon has quit IRC | 18:11 | |
*** dsneddon has joined #oooq | 18:12 | |
*** jmasud has quit IRC | 18:17 | |
*** jmasud has joined #oooq | 18:21 | |
*** jmasud has quit IRC | 18:23 | |
*** jmasud has joined #oooq | 18:28 | |
*** jmasud has quit IRC | 18:29 | |
*** jmasud has joined #oooq | 18:33 | |
*** jmasud has quit IRC | 18:50 | |
*** jmasud has joined #oooq | 19:00 | |
*** jmasud has quit IRC | 19:13 | |
*** jmasud has joined #oooq | 19:21 | |
*** jmasud has quit IRC | 19:33 | |
*** yolanda has quit IRC | 19:41 | |
*** jfrancoa has quit IRC | 19:52 | |
*** sanjayu_ has quit IRC | 19:53 | |
*** saneax has joined #oooq | 19:54 | |
*** jfrancoa has joined #oooq | 20:01 | |
*** TrevorV has quit IRC | 20:02 | |
weshay|ruck | rlandy|ruck, no luck w/ ovb on master or ussir | 20:04 |
weshay|ruck | ussri | 20:04 |
rlandy|ruck | weshay|ruck: no luck with introspection downstream either | 20:05 |
rlandy|ruck | weshay|ruck: so baremetal promoted | 20:05 |
rlandy|ruck | or not yet | 20:05 |
weshay|ruck | it did not.. we could waive it.. but I don't think it matters.. that ironic-lib patch is in.. | 20:06 |
weshay|ruck | still a vif issue | 20:06 |
weshay|ruck | waiting on master logs | 20:06 |
weshay|ruck | rlandy|ruck, we should think about turning ovb off in a bunch of repos | 20:06 |
weshay|ruck | maybe give the clouds more capacity | 20:07 |
rlandy|ruck | vif issues is usually cloud related | 20:07 |
rlandy|ruck | weshay|ruck: vexx only? | 20:07 |
weshay|ruck | both rdo and vex | 20:07 |
rlandy|ruck | ugh | 20:07 |
* rlandy|ruck looks for sshnaidm|off's patch to remove jobs from vexx | 20:08 | |
weshay|ruck | perhaps we run ovb in periodic, component, and turn off 3rd party check | 20:08 |
rlandy|ruck | weshay|ruck: maybe - we are getting no value | 20:08 |
rlandy|ruck | weshay|ruck: disable all the tests moved from vexx here https://review.rdoproject.org/r/#/c/28146/2/zuul.d/ovb-jobs.yaml? | 20:09 |
rlandy|ruck | we are running too many versions/releases | 20:10 |
weshay|ruck | rlandy|ruck, https://review.opendev.org/737072 | 20:14 |
weshay|ruck | rlandy|ruck, ya.. maybe limit to master? | 20:14 |
weshay|ruck | for now.. until we get more capacity | 20:14 |
rlandy|ruck | weshay|ruck: it is only master | 20:15 |
rlandy|ruck | wait - what are we talking about?? | 20:15 |
weshay|ruck | rlandy|ruck, oh... sorry ovb.. set to master only.. no branchful jobs | 20:16 |
weshay|ruck | and only trigger on master | 20:16 |
rlandy|ruck | weshay|ruck: yeah | 20:16 |
rlandy|ruck | too much going on | 20:16 |
rlandy|ruck | leave the rest of the ovb jobs in periodic | 20:17 |
weshay|ruck | rlandy|ruck, w/ regards to containers.. spec'ing out upstream steps https://tree.taiga.io/project/tripleo-ci-board/epic/1819 | 20:17 |
weshay|ruck | rlandy|ruck, aye | 20:17 |
rlandy|ruck | weshay|ruck: ^^ ack - looks fine - only trick is | 20:18 |
rlandy|ruck | turning push off for one | 20:18 |
rlandy|ruck | and getting push to work on new | 20:18 |
rlandy|ruck | that might be take a bit | 20:18 |
rlandy|ruck | where we will either have dup containers | 20:18 |
rlandy|ruck | or none | 20:18 |
weshay|ruck | we can turn the old one off | 20:18 |
rlandy|ruck | unless we temp namespace it | 20:18 |
rlandy|ruck | ie: if push works well off the bat, fine | 20:19 |
weshay|ruck | aye | 20:19 |
rlandy|ruck | if it doesn't leave the old job there | 20:19 |
rlandy|ruck | weshay|ruck: can we push to temp namespace to test? | 20:19 |
rlandy|ruck | or I can try push downstream first | 20:19 |
rlandy|ruck | less impact | 20:19 |
rlandy|ruck | other than that, tasks look good | 20:20 |
weshay|ruck | sure.. we can poke at it donwstream.. you've done it :) | 20:23 |
rlandy|ruck | weshay|ruck: I haven't enabled push | 20:24 |
rlandy|ruck | but I can do it asap - if that will help upstream | 20:24 |
weshay|ruck | should be a number of people looking for new tasks | 20:25 |
weshay|ruck | sandeep, pooja, bhagyashri rfolco etc | 20:25 |
weshay|ruck | rfolco, added centos-8 train and new containers build user stories to the board | 20:26 |
weshay|ruck | rfolco, please encourage and esure people pick up the new work | 20:27 |
rfolco | weshay|ruck, cool will do | 20:27 |
*** jfrancoa has quit IRC | 20:46 | |
*** rfolco has quit IRC | 20:54 | |
rlandy|ruck | weshay|ruck: where are we with decreasing OVB job runs? should I work on that? | 20:55 |
weshay|ruck | rlandy|ruck, no one is going to be too worried about .. I think it's a smart idea until we get it working 95% of the time | 20:56 |
rlandy|ruck | weshay|ruck: ok - checking templates | 20:56 |
weshay|ruck | rlandy|ruck, we could adjust the template maybe | 20:56 |
weshay|ruck | ya | 20:56 |
rlandy|ruck | that's where it's kicked from | 20:56 |
* rlandy|ruck gets | 20:56 | |
rlandy|ruck | https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/project-templates.yaml | 20:57 |
rlandy|ruck | weshay|ruck: want to gm to change that ^^? | 20:57 |
weshay|ruck | sure | 20:59 |
weshay|ruck | https://meet.google.com/pca-svmt-kvn | 20:59 |
rlandy|ruck | joining | 20:59 |
weshay|ruck | rlandy|ruck, https://review.rdoproject.org/zuul/builds?job_name=tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset001&job_name=tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001 | 21:03 |
rlandy|ruck | https://review.rdoproject.org/r/28173 Reduce where OVB jobs are run - overloaded clouds | 21:13 |
*** saneax has quit IRC | 21:47 | |
*** weshay|ruck is now known as weshay_pto | 22:00 | |
*** jmasud has joined #oooq | 22:47 | |
*** rlandy|ruck has quit IRC | 22:59 | |
*** tosky has quit IRC | 23:45 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!