Wednesday, 2019-08-21

*** holser has joined #tripleo00:09
*** bnemec has joined #tripleo00:16
*** pkopec has quit IRC00:20
*** holser has quit IRC00:23
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491900:24
*** bnemec has quit IRC00:35
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491900:48
*** cdearborn has quit IRC00:58
*** spsurya has joined #tripleo01:00
openstackgerritMerged openstack/tripleo-heat-templates master: Revert "Point InternalTLSVncCAFile to /etc/ipa/ca.crt"  https://review.opendev.org/67754901:03
openstackgerritMerged openstack/tripleo-heat-templates stable/stein: Revert "Point InternalTLSVncCAFile to /etc/ipa/ca.crt"  https://review.opendev.org/67754801:03
*** rlandy|ruck has quit IRC01:06
openstackgerritMerged openstack/tripleo-heat-templates stable/rocky: Revert "Point InternalTLSVncCAFile to /etc/ipa/ca.crt"  https://review.opendev.org/67755001:11
openstackgerritTakashi Kajinami proposed openstack/tripleo-heat-templates master: Add *_domain_name in authtoken configuration in Octavia  https://review.opendev.org/67758701:20
openstackgerritTakashi Kajinami proposed openstack/tripleo-heat-templates master: Add *_domain_name in authtoken configuration in Sahara  https://review.opendev.org/67758801:28
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-heat-templates master: Convert firewall rules to use TripleO-Ansible  https://review.opendev.org/67723701:29
openstackgerritMerged openstack/diskimage-builder master: dracut-regenerate: catch failures and exit code  https://review.opendev.org/67603201:48
*** mschuppert has quit IRC01:53
openstackgerritMerged openstack/diskimage-builder master: block-device-efi : expand disk size calculation  https://review.opendev.org/67635402:30
*** rh-jelabarre has quit IRC02:37
openstackgerritMerged openstack/tripleo-common master: Add "rhel_containers" variable to skip containers for RHEL  https://review.opendev.org/67649702:56
openstackgerritMerged openstack/tripleo-quickstart-extras master: Define var undercloud_enable_nova  https://review.opendev.org/66416802:56
*** cjloader has joined #tripleo03:09
openstackgerritMerged openstack/python-tripleoclient stable/stein: Use reset to fix cmdline  https://review.opendev.org/67756903:33
*** psachin has joined #tripleo03:34
*** janki has joined #tripleo03:41
*** surpatil has joined #tripleo03:50
*** gkadam has joined #tripleo03:54
*** gkadam has quit IRC03:54
openstackgerritTakashi Kajinami proposed openstack/tripleo-heat-templates master: Use the special user role 'service' as service token role  https://review.opendev.org/67451604:05
*** dsneddon has quit IRC04:15
openstackgerritTakashi Kajinami proposed openstack/tripleo-heat-templates master: Use the special user role 'service' as service token role  https://review.opendev.org/67451604:16
openstackgerritTakashi Kajinami proposed openstack/tripleo-heat-templates master: Strictly require service token roles  https://review.opendev.org/67759904:17
*** dsneddon has joined #tripleo04:18
*** skramaja has joined #tripleo04:22
*** dsneddon has quit IRC04:24
*** jaosorior has quit IRC04:34
*** ade_lee has quit IRC04:35
*** ade_lee has joined #tripleo04:37
*** ramishra has joined #tripleo04:41
*** dsneddon has joined #tripleo04:42
*** soniya29 has joined #tripleo04:44
*** dsneddon has quit IRC04:46
*** ykarel|away has joined #tripleo04:48
*** ratailor has joined #tripleo05:02
*** shyamb has joined #tripleo05:06
*** kopecmartin|off is now known as kopecmartin05:08
*** udesale has joined #tripleo05:10
*** udesale has quit IRC05:14
*** raukadah is now known as chkumar|rover05:17
*** dsneddon has joined #tripleo05:22
*** soniya29 has quit IRC05:24
*** waleedm has joined #tripleo05:25
Tenguhello there :)05:25
openstackgerritMerged openstack/tripleo-common master: Close the http sessions of registry on image prepare  https://review.opendev.org/67638705:27
*** dmacpher has quit IRC05:31
*** dmacpher_ has joined #tripleo05:31
*** ykarel|away is now known as ykarel05:35
*** holser has joined #tripleo05:38
openstackgerritSaravanan KR proposed openstack/tripleo-common stable/stein: Close the http sessions of registry on image prepare  https://review.opendev.org/67760905:43
*** yprokule has joined #tripleo05:44
*** jaosorior has joined #tripleo05:49
openstackgerritCédric Jeanneret (Tengu) proposed openstack/python-tripleoclient stable/stein: Run Validations with ThreadPoolExecutor  https://review.opendev.org/67717006:02
openstackgerritCédric Jeanneret (Tengu) proposed openstack/tripleo-heat-templates master: Use tripleo-validations-package role instead of puppet  https://review.opendev.org/67742906:06
*** florianf has joined #tripleo06:06
openstackgerritCédric Jeanneret (Tengu) proposed openstack/tripleo-validations master: Add Molecule tests for check-network-gateway  https://review.opendev.org/67274506:07
*** hjensas has quit IRC06:12
*** sanjayu__ has joined #tripleo06:13
*** mschuppert has joined #tripleo06:18
*** jaosorior has quit IRC06:21
*** dsneddon has quit IRC06:24
*** dciabrin has joined #tripleo06:30
*** holser has quit IRC06:32
*** nawar has joined #tripleo06:32
nawarmorning06:33
nawaris there a problem with the docs on openstack site06:33
ramishranawar: It seems I can access those ex. https://docs.openstack.org/tripleo-docs/latest/install/index.html, what're you looking for?06:35
nawarramishra everything is deprecated and the installation of undercould is gone06:37
ramishranawar: I think it's due to https://github.com/openstack/tripleo-docs/commit/c6918e5da60a26ca9e64855778efe128f2ce6de206:46
ramishramoved to deploy guide.. I don't see the deploy guide published yet though06:46
*** waleedm_ has joined #tripleo06:51
*** waleedm__ has joined #tripleo06:52
*** mkisielewski has joined #tripleo06:53
*** waleedm has quit IRC06:55
*** dsneddon has joined #tripleo06:56
*** waleedm_ has quit IRC06:56
*** waleedm has joined #tripleo06:59
*** waleedm_ has joined #tripleo07:00
*** shyamb has quit IRC07:01
*** waleedm__ has quit IRC07:02
*** jaosorior has joined #tripleo07:04
*** waleedm has quit IRC07:05
openstackgerritMartin Schuppert proposed openstack/tripleo-docs master: Enhancement to cell v2 doc with split for stein/train  https://review.opendev.org/67274407:07
*** jpich has joined #tripleo07:12
*** rcernin has quit IRC07:14
ramishranawar: https://0b2142c78b3d718d4667-da9fdb6441bd81af0c9914e787df88dd.ssl.cf5.rackcdn.com/676720/3/check/openstack-tox-docs/4c88b1c/html/install/index.html is the new install guide picked from a doc check job, if need it in the time being07:14
*** dsneddon has quit IRC07:17
*** shyamb has joined #tripleo07:20
*** udesale has joined #tripleo07:24
*** amoralej|off is now known as amoralej07:25
skramajaEmilienM slagle could you take a look at https://review.opendev.org/#/c/667792/?07:27
*** xek has joined #tripleo07:28
*** triple-oh-noob has joined #tripleo07:34
triple-oh-noobHi. Um, has someone moved the openstack tripleo installer docs recently?07:34
triple-oh-noobgetting 404s for a few pages I was on only yesterday07:35
triple-oh-noobhttps://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/ceph_config.html#tuning-ceph-osd-cpu-and-memory07:35
triple-oh-noobeg07:35
ramishratriple-oh-noob: https://0b2142c78b3d718d4667-da9fdb6441bd81af0c9914e787df88dd.ssl.cf5.rackcdn.com/676720/3/check/openstack-tox-docs/4c88b1c/html/install/advanced_deployment/ceph_config.html#tuning-ceph-osd-cpu-and-memory, still in the process of being published..07:39
*** rpittau|afk is now known as rpittau07:40
Tenguhmmm.... does CI have some issue? https://e1f254eb8f81e7924f24-889c896421b71b87d6e141679c8ef033.ssl.cf5.rackcdn.com/672745/12/check/tripleo-ci-centos-7-containers-multinode/b195cd9/job-output.txt  "/etc/ci/mirror_info.sh: line 54: NODEPOOLMIRROR_HOST: unbound variable"07:40
Tengu(apparently the new logging infra doesn't provide the direct link for lines anymore ;_;)07:40
triple-oh-noobThat was just one example page.07:40
Tenguchkumar|rover: any idea for that "/etc/ci/mirror_info.sh: line 54: NODEPOOLMIRROR_HOST: unbound variable" ?07:41
ramishratriple-oh-noob: I pasted the full install guide earlier for another query07:41
ramishrahere it goes again https://0b2142c78b3d718d4667-da9fdb6441bd81af0c9914e787df88dd.ssl.cf5.rackcdn.com/676720/3/check/openstack-tox-docs/4c88b1c/html/install/index.html07:41
Tengumeh... same issue there: https://object-storage-ca-ymq-1.vexxhost.net/v1/86bbbcfa8ad043109d2d7af530225c72/logs_45/672745/12/check/tripleo-ci-centos-7-undercloud-containers/eae9aa1/job-output.txt07:42
ykarelramishra, yup https://review.opendev.org/#/c/677029/ moved the doc to deploy-guide and seems publish-deploy-guide job is not running, which resulting in docs not available07:42
ramishraThere were a few patches for all the reorganization of the docs and the publishing takes a little time..07:42
ykarelramishra, let me fire a patch to run that job and see if it does something07:43
ramishraykarel: the job is not running? what's the frequency?07:43
ykarelramishra, we need to include that job in project config of tripleo-docs project07:43
ramishraykarel: Isn't it already there?07:44
ykarelramishra, i don't see it https://github.com/openstack/tripleo-docs/blob/master/.zuul.yaml07:44
*** dsneddon has joined #tripleo07:45
ramishraykarel: ok, I thought it has some frequency and takes some time..I could be wrong though07:45
*** jpena|off is now known as jpena07:47
openstackgerrityatin proposed openstack/tripleo-docs master: Run deploy guide jobs  https://review.opendev.org/67766107:48
ykarelramishra, le't see now ^^07:49
triple-oh-noobramishra: Many thanks. Was disconnected last night otherwise i'd have had a look over the chat :-)07:49
*** bhagyashris has joined #tripleo07:50
*** jtomasek has joined #tripleo07:51
*** shyamb has quit IRC07:54
*** zbr is now known as zbr|ooo07:56
*** jaosorior has quit IRC08:00
*** lucasagomes has joined #tripleo08:01
*** ksambor has quit IRC08:04
openstackgerritCédric Jeanneret (Tengu) proposed openstack/python-tripleoclient stable/stein: Run Validations with ThreadPoolExecutor  https://review.opendev.org/67717008:04
*** ksambor has joined #tripleo08:06
*** cylopez has joined #tripleo08:09
*** jtomasek has quit IRC08:09
*** dtantsur|afk is now known as dtantsur08:11
*** pkopec has joined #tripleo08:12
Tenguksambor: sorry for that -^ - trying to find the right line(s) for the doc requirement....08:14
*** jtomasek has joined #tripleo08:14
*** gfidente|afk is now known as gfidente08:15
*** ykarel is now known as ykarel|lunch08:15
openstackgerrityatin proposed openstack/tripleo-docs master: Run deploy guide jobs  https://review.opendev.org/67766108:15
chkumar|roverTengu: checking08:16
ramishraykarel: I think promotion is part of publish-openstack-docs-pti https://github.com/openstack/openstack-zuul-jobs/blob/master/zuul.d/project-templates.yaml#L10308:16
ramishrahttps://github.com/openstack/project-config/blob/master/zuul.d/jobs.yaml#L30308:17
ramishrathere have been some changes around that recently, so may be something is broken08:17
ksamborTengu if you will put sphinx!=1.6.6,!=1.6.7,!=2.1.0,>=1.6.2; python_version>='3.4' # BSD like is in tripleo-common it should work08:19
*** holser has joined #tripleo08:19
Tenguksambor: it was complaining about the 2.1.0 in fact :)08:25
chkumar|roverTengu: https://review.opendev.org/#/c/677669/08:26
chkumar|roverwill fix the issue08:26
Tenguchkumar|rover: ah, great!08:26
*** suuuper has joined #tripleo08:27
*** tkajinam has quit IRC08:29
*** hjensas has joined #tripleo08:30
*** shyamb has joined #tripleo08:32
triple-oh-noobIf I had deployed previously with masquerade = true for the undercloud, can I "undo" this option simply?08:36
*** shyam89 has joined #tripleo08:45
*** shyamb has quit IRC08:46
*** pierreprinetti has joined #tripleo08:53
*** gkadam has joined #tripleo08:54
Tenguwow, that was a fast merge, chkumar|rover :). thanks!08:56
*** gkadam is now known as gkadam-brb08:57
shyam89Hi08:58
shyam89We are doing minor updat on osp13 cloud08:58
shyam89Step - "openstack overcloud update run --nodes Controller" has failed08:58
shyam89no ansible.log found in /var/lib/mistral/*/08:59
shyam89What other log files/debugging steps we should start with?08:59
*** gkadam-brb is now known as gkadam09:00
shyam89Tengu: Any thoughts ^^^09:04
*** surpatil has quit IRC09:09
*** jtomasek has quit IRC09:11
chkumar|roverTengu: it broke everywhere so fast :-)09:17
*** jaosorior has joined #tripleo09:19
ykarel|lunchramishra, no, publish-deploy-guide will publish deploy-guide09:25
*** ykarel|lunch is now known as ykarel09:25
ykarelramishra, https://review.opendev.org/#/c/677661 is green now, once this merges, guide should be published with job in post pipeline09:25
*** derekh has joined #tripleo09:26
ramishraykarel: I don't know..., AFAIU publish-deploy-guide is an old job before PTI and the cotents changed/missing are part of the install guide https://0b2142c78b3d718d4667-da9fdb6441bd81af0c9914e787df88dd.ssl.cf5.rackcdn.com/676720/3/check/openstack-tox-docs/4c88b1c/html/install/index.html09:29
ykarelramishra, okk then need to check, ramishra is https://object-storage-ca-ymq-1.vexxhost.net/v1/86bbbcfa8ad043109d2d7af530225c72/logs_61/677661/2/check/build-openstack-deploy-guide/7c544e1/docs/ not complete?09:31
ramishraykarel: I think it's incomplete...09:33
ykarelramishra, then it means migration is incomplete, but anyway job need to run and publish09:37
ykarelramishra, i just checked publish-deploy-guide only pushes deploy-guide09:37
ykarelsome projects are running it http://zuul.openstack.org/builds?job_name=publish-deploy-guide09:37
ykareland from logs and corresponding playbooks it's publishing to project-deploy-guide09:38
ykarelhttps://opendev.org/openstack/project-config/src/branch/master/zuul.d/jobs.yaml#L858-L86609:39
ramishraykarel: As I mentioned publishing docs.openstack.org happens in the promote pipeline, but I'm not too famililar with PTI.. about the projects using publish-deploy-guide, may be those guys have not migrated to PTI.. good to check with infra...09:40
ramishrabetter to check with infra09:40
*** pierreprinetti has quit IRC09:40
ykarelramishra, i checked again and publish-openstack-docs-pti is publishing only docs, but will confirm from infra09:47
openstackgerritSergii Golovatiuk proposed openstack/python-tripleoclient stable/stein: Suppress output for ssh-keygen  https://review.opendev.org/67757709:52
ykarelchkumar|rover, is scenario001 failure known10:02
ykarelchkumar|rover, http://zuul.openstack.org/builds?job_name=tripleo-ci-centos-7-scenario001-standalone&branch=master10:05
ykarelchkumar|rover, caused with https://review.opendev.org/#/c/677427/110:05
*** shyam89 has quit IRC10:05
ykarelused here, https://opendev.org/openstack/puppet-tripleo/src/branch/master/manifests/profile/base/ceilometer.pp#L157 needs cleanup10:10
ykarelomg it's not just scenario001, other scenarios are also affected10:11
ykarelreporting bug10:12
*** ksambor is now known as ksambor|lunch10:13
*** rpittau is now known as rpittau|bbl10:14
ykarelchkumar|rover, https://bugs.launchpad.net/tripleo/+bug/184090110:16
openstackLaunchpad bug 1840901 in tripleo "Jobs failing with:- Could not find class ::ceilometer::dispatcher::gnocchi" [Undecided,New]10:16
openstackgerritChandan Kumar (raukadah) proposed openstack/puppet-tripleo master: Remove deprecated ceilometer::dispatcher::gnocchi  https://review.opendev.org/67769010:23
*** sshnaidm|afk is now known as sshnaidm10:23
*** shyamb has joined #tripleo10:30
*** triple-oh-noob has quit IRC10:31
*** bhagyashris has quit IRC10:31
*** gfidente has quit IRC10:35
*** gfidente has joined #tripleo10:41
*** ksambor|lunch is now known as ksambor10:46
*** jchhatbar has joined #tripleo10:47
*** janki has quit IRC10:49
openstackgerritFrancesco Pantano proposed openstack/tripleo-heat-templates master: [DNM] - Testing ceph_dashboard haproxy endpoint config  https://review.opendev.org/67769911:02
*** owalsh is now known as owalsh|away11:08
*** florianf has quit IRC11:11
*** sanjayu__ has quit IRC11:17
*** shyamb has quit IRC11:20
*** saneax has joined #tripleo11:20
*** raildo has joined #tripleo11:21
*** mcornea has joined #tripleo11:24
*** florianf has joined #tripleo11:24
*** paramite has joined #tripleo11:25
*** udesale has quit IRC11:26
*** udesale has joined #tripleo11:27
*** suuuper has quit IRC11:30
*** holser has quit IRC11:32
*** holser__ has joined #tripleo11:32
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates master: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770311:33
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates stable/stein: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770411:36
*** rh-jelabarre has joined #tripleo11:37
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates stable/stein: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770411:38
*** morazi has joined #tripleo11:38
*** jpena is now known as jpena|lunch11:39
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates master: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770311:42
*** shyamb has joined #tripleo11:42
openstackgerritMerged openstack/tripleo-heat-templates master: Allow combining system_upgrade_prepare and system_upgrade_run into system_upgrade  https://review.opendev.org/67618711:42
openstackgerritFrancesco Pantano proposed openstack/tripleo-common master: Change ceph dashboard service name to meet puppet requirements  https://review.opendev.org/67770511:42
openstackgerritFrancesco Pantano proposed openstack/tripleo-heat-templates master: Add the certificate specs in ceph_grafana composable service  https://review.opendev.org/67455611:43
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates stable/stein: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770411:43
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates stable/stein: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770411:44
*** florianf has quit IRC11:45
openstackgerritFrancesco Pantano proposed openstack/puppet-tripleo master: Replace ceph_grafana-server with ceph_grafana  https://review.opendev.org/67770811:47
openstackgerritFrancesco Pantano proposed openstack/tripleo-heat-templates master: [DNM] - Testing ceph_dashboard haproxy endpoint config  https://review.opendev.org/67769911:48
*** rlandy has joined #tripleo11:50
*** rlandy is now known as rlandy|ruck11:50
ykarelchkumar|rover, Playbook run of multinode-standalone.yml passed successfully for ceilometer11:51
chkumar|roverykarel: awesome11:51
*** florianf has joined #tripleo11:51
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates master: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770311:55
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates master: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770311:58
*** rlandy|ruck is now known as rlandy|ruck|mtg11:58
*** nawar has left #tripleo11:59
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates stable/stein: Use default value for NovaLiveMigrationWaitForVIFPlug  https://review.opendev.org/67770411:59
*** udesale has quit IRC12:02
*** ekultails has joined #tripleo12:10
*** rpittau|bbl is now known as rpittau12:10
openstackgerritSaravanan KR proposed openstack/tripleo-ansible master: WIP: Create a role for OvS-DPDK host configuration  https://review.opendev.org/67771212:15
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491912:24
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491912:26
*** udesale has joined #tripleo12:32
*** rfolco has quit IRC12:33
*** pbandark has joined #tripleo12:34
*** shyam89 has joined #tripleo12:37
*** shyamb has quit IRC12:37
*** jchhatba_ has joined #tripleo12:39
*** rfolco has joined #tripleo12:39
*** jpena|lunch is now known as jpena12:40
*** gkadam has quit IRC12:42
*** jchhatbar has quit IRC12:42
*** jchhatba_ has quit IRC12:43
*** rlandy|ruck|mtg is now known as rlandy|ruck12:43
*** amoralej is now known as amoralej|lunch12:50
*** shyam89 has quit IRC12:55
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-heat-templates master: Convert firewall rules to use TripleO-Ansible  https://review.opendev.org/67723712:58
*** pierreprinetti has joined #tripleo13:04
*** cdearborn has joined #tripleo13:05
openstackgerritSaravanan KR proposed openstack/tripleo-common master: Add filter plugins path to the ansible.cfg  https://review.opendev.org/67772613:17
*** tesseract has joined #tripleo13:22
*** tesseract has quit IRC13:22
openstackgerritMartin Schuppert proposed openstack/tripleo-docs master: Enhancement to cell v2 doc with split for stein/train  https://review.opendev.org/67274413:22
chkumar|roverEmilienM: mwhahaha https://review.opendev.org/#/c/677690/ needs +2 and +w on this to clear check queue13:24
openstackgerritSaravanan KR proposed openstack/tripleo-heat-templates master: WIP: Move OvS-DPDK deployment to ansible role  https://review.opendev.org/67772813:30
*** bfournie has joined #tripleo13:31
openstackgerritSaravanan KR proposed openstack/tripleo-heat-templates master: WIP: Move OvS-DPDK deployment to ansible role  https://review.opendev.org/67772813:31
openstackgerritSaravanan KR proposed openstack/tripleo-ansible master: WIP: Create a role for OvS-DPDK host configuration  https://review.opendev.org/67771213:32
*** ratailor has quit IRC13:36
*** hjensas has quit IRC13:37
*** Goneri has joined #tripleo13:42
ykarelEmilienM, mwhahaha can u check https://review.opendev.org/#/c/677661/13:44
ykareldeploy-guide13:44
mwhahahathx13:44
ade_leebandini, hey13:46
*** zbr|ooo is now known as zbr13:47
*** udesale has quit IRC13:47
*** ykarel is now known as ykarel|afk13:53
openstackgerritMerged openstack/tripleo-docs master: Run deploy guide jobs  https://review.opendev.org/67766113:53
*** bnemec has joined #tripleo13:55
bandinimwhahaha: https://review.opendev.org/#/c/677405/ ok to merge?13:57
mwhahahafor you? maybe13:57
bandinieheheh13:57
Tenguwooohooo! CI back on track - care to make that one merge? https://review.opendev.org/#/c/672745/   it's locking a bunch of other changes already approved :)13:59
cloudnulltripleo-transformation meeting time - sshnaidm cloudnull mnaser ekultails owalsh mwhahaha Tengu ykarel sshnaidm Vorrtex14:00
cloudnull#startmeeting tripleo14:00
openstackMeeting started Wed Aug 21 14:00:18 2019 UTC and is due to finish in 60 minutes.  The chair is cloudnull. Information about MeetBot at http://wiki.debian.org/MeetBot.14:00
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.14:00
*** openstack changes topic to " (Meeting topic: tripleo)"14:00
openstackThe meeting name has been set to 'tripleo'14:00
cloudnull#topic rollcall14:00
*** openstack changes topic to "rollcall (Meeting topic: tripleo)"14:00
cloudnullo/14:00
Tengu«o/14:00
ekultailso/14:00
openstackgerritArx Cruz proposed openstack/tripleo-quickstart master: Adapt os_tempest in FS001  https://review.opendev.org/67340014:01
*** amoralej|lunch is now known as amoralej14:01
cloudnullas a reminder, our etherpad can be found here14:01
mwhahahahi2u14:01
cloudnull#link https://etherpad.openstack.org/p/tripleo-ansible-agenda14:01
*** ykarel|afk has quit IRC14:02
cloudnullplease update the meeting details if you want to talk about something specifically14:02
cloudnullok so lets just jump into it.14:05
cloudnull#topic recap14:06
*** openstack changes topic to "recap (Meeting topic: tripleo)"14:06
cloudnulllast week mnaser got started on the ansible-sig14:06
mnaseryes, im emailing something out soon to setup sometime we can start meeting at :)14:06
cloudnullthere's a new IRC channel for folks to join if they're interested.14:06
cloudnull#openstack-ansible-sig14:07
cloudnullmnaser are there pending reviews that folks should be aware of?14:07
mnaser uh i need to unlazy and clean up the patch to add the ssh connection plugin under 'sig'14:08
mnaseri will get around that soon i hope14:08
cloudnullon that same note - https://review.opendev.org/#/c/67642114:08
mnaser(also i'll be at ansiblefest)14:08
cloudnulloh cool! that'll be fun :)14:08
cloudnullowalsh|away ekultails mnaser and I worked on getting the connection plugin into a stand-alone repo, that's done and should be imported into the openstack namespace just as soon as the sig bits get ironed out.14:09
cloudnullanything else folks want to talk about with regard to some of the things we covered here/last week ?14:10
cloudnullok moving on.14:12
cloudnull#topic Open Discussion14:12
*** openstack changes topic to "Open Discussion (Meeting topic: tripleo)"14:12
cloudnullwe'll start with14:13
cloudnullTripleO-Ansible roadmap seeking feedback - https://etherpad.openstack.org/p/tripleo-ansible-roadmap14:13
cloudnull^ ekultails ?14:13
*** aakarsh|2 has joined #tripleo14:13
*** rfolco is now known as not_rlandy14:13
*** not_rlandy is now known as folco14:14
ekultailsThat is something cloudnull and I were talking about last week. The Etherpad explains the general direction we're looking to take with TripleO-Ansible and TripleO in Ansible.14:14
*** folco is now known as rfolco14:14
*** gregwork has quit IRC14:14
cloudnulli believe this also ties into the spec from EmilienM -https://review.opendev.org/#/c/671563/14:15
EmilienMthis one is on hold for now14:15
EmilienM(I haven't convinced myself about costs vs benefits of this one)14:15
*** cdearborn has quit IRC14:15
*** mgagne has quit IRC14:16
*** csatari has quit IRC14:16
*** aakarsh has quit IRC14:16
*** irclogbot_1 has quit IRC14:17
cloudnullEmilienM anything we can help with in terms of ironing that out ?14:17
*** mgagne has joined #tripleo14:17
*** cdearborn has joined #tripleo14:18
EmilienMnot that I can think now14:18
cloudnullok14:18
*** gregwork has joined #tripleo14:19
*** irclogbot_1 has joined #tripleo14:19
*** csatari has joined #tripleo14:19
cloudnullif folks can add to etherpad ekultails started it'd be very helpful in general.14:19
cloudnulllast of the tripleo-common role import work is ready for review - https://review.opendev.org/#/q/topic:hieraroles+(status:open)14:19
cloudnullif folks can review that one it'd be greatly appreciated.14:20
cloudnullTripleo-Ansible also a few open-reviews that are now passing - https://review.opendev.org/#/q/project:%255Eopenstack/tripleo-ansible+status:open+label:verified%253D%252B1%252Cuser%253Dzuul - if folks have time to review these, that would also be greatly appreciated.14:21
cloudnullis there anything else folks want to talk about while we're here?14:21
*** mkisielewski has quit IRC14:25
openstackgerritRonelle Landy proposed openstack/tripleo-ci master: DMN: testing for periodic jobs  https://review.opendev.org/62915714:26
cloudnullok well if there's nothing else, I think we can call it.14:26
cloudnullthanks everyone!14:26
cloudnull#endmeeting14:26
*** openstack changes topic to "CI Status: GREENish RDOCloud Status: MEHish | community irc meeting Tues@1400 UTC - tripleo-ci-community meeting Tues@1330 UTC | https://docs.openstack.org/tripleo-docs/latest/"14:26
openstackMeeting ended Wed Aug 21 14:26:56 2019 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)14:26
openstackMinutes:        http://eavesdrop.openstack.org/meetings/tripleo/2019/tripleo.2019-08-21-14.00.html14:26
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/tripleo/2019/tripleo.2019-08-21-14.00.txt14:27
openstackLog:            http://eavesdrop.openstack.org/meetings/tripleo/2019/tripleo.2019-08-21-14.00.log.html14:27
*** ykarel|afk has joined #tripleo14:27
dmsimardcloudnull: I have two nodes for the standalone job, one from rax and one from ovh14:28
cloudnulldmsimard cool!14:28
*** ykarel|afk is now known as ykarel14:28
cloudnullhave you been able to dig into the failures ?14:28
ykarelramishra, deploy-guide published, https://docs.openstack.org/project-deploy-guide/tripleo-docs/latest/14:29
*** rlandy|ruck is now known as rlandy|ruck|mtg14:29
ykarelramishra, can u share what u find missing or report bug so it can be fixed14:29
*** pierreprinetti has quit IRC14:30
openstackgerritTakashi Kajinami proposed openstack/python-tripleoclient master: DNM: just to show possible improvement for the parent patch  https://review.opendev.org/67774114:31
dmsimardcloudnull: I have not, I'll have time a bit later but I've added your keys in the meantime: root@162.242.237.97 and root@158.69.67.23114:31
dmsimards/keys/key/14:32
*** waleedm_ has quit IRC14:32
ramishraykarel: I would have liked us to use htaccess redirects from install guide to deploy guide for users to still be able to access the old pages, but yeah I'll check later14:32
cloudnulldmsimard thanks!14:34
cloudnullwill take a look shortly :)14:34
ykarelramishra, ack14:34
*** mcornea has quit IRC14:36
*** mcornea has joined #tripleo14:37
*** chkumar|rover is now known as raukadah14:40
*** jaosorior has quit IRC14:41
*** Vorrtex has joined #tripleo14:41
*** rlandy|ruck|mtg is now known as rlandy|ruck14:45
*** cfontain_ has joined #tripleo14:46
openstackgerritEmilien Macchi proposed openstack/paunch stable/stein: Revert "Reduce the usage of "podman inspect" command"  https://review.opendev.org/67775414:48
*** ratailor has joined #tripleo14:49
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Fix mismatching fixed vs unique container names"  https://review.opendev.org/67775614:53
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Fix mismatching fixed vs unique container names"  https://review.opendev.org/67775614:53
openstackgerritFrancesco Pantano proposed openstack/tripleo-heat-templates master: Add the certificate specs in ceph_grafana composable service  https://review.opendev.org/67455614:54
*** PagliaccisCloud has joined #tripleo14:55
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Reduce the usage of "podman inspect" command"  https://review.opendev.org/67775714:57
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Optimize container CLI for getting unique names"  https://review.opendev.org/67775814:58
*** pierreprinetti has joined #tripleo14:58
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Optimize container CLI for getting unique names"  https://review.opendev.org/67775814:59
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Optimize container CLI for getting unique names"  https://review.opendev.org/67775814:59
openstackgerritEmilien Macchi proposed openstack/paunch master: Revert "Reduce the usage of "podman inspect" command"  https://review.opendev.org/67775714:59
*** yprokule has quit IRC15:03
*** jtomasek has joined #tripleo15:06
dmsimardcloudnull: again rsyslog failing -- can we nuke that image or something ?15:07
dmsimardweshay, rlandy|ruck: ^15:07
dmsimardhttps://18e51e15a34f17ffbc81-ffc80d196410a18186442d9badd30b78.ssl.cf2.rackcdn.com/674919/29/check/tripleo-ci-centos-7-standalone/79c7da9/logs/undercloud/var/log/tripleo-container-image-prepare.log.txt.gz15:07
weshaycloudnull dmsimard how about we put something in that randomizes which container is pulled first :)15:08
weshayand then test15:08
rlandy|ruckshould update15:08
dmsimardweshay: there's other containers that are working right before15:08
* weshay looks15:08
dmsimardcentos-binary-cinder-api, centos-binary-cinder-scheduler etc15:08
dmsimardI wonder if that tagged image would still be on the rdo registry /me looks15:09
weshaydmsimard we can also exclude it from a deployment I guess15:09
weshayur right.. in other containers are pulling ok15:09
*** suuuper has joined #tripleo15:12
dmsimardso it's trying to pull centos-binary-rsyslog:a447a10b12efed2e989ed61de5d0d1562a2919ea_d0e11ceb15:12
dmsimardwhich we still have in rdo registry15:13
weshayah cool15:13
dmsimardthere's some differences between the image in dockerhub and the one from rdo registry15:15
weshaywhat are you seeing?15:15
dmsimardah, actually, there aren't15:17
dmsimardhttps://www.diffchecker.com/lMQYR8pP15:17
dmsimardthe layer hashes are the same15:17
dmsimardit's the repo tags that are different, I don't think that matters15:17
dmsimard¯\_(ツ)_/¯15:18
weshaywe can try to repush a rsyslog container if you think it may help...  the containers are getting refreshed fairly often15:18
weshayalready.. if there was a bad push..   anything about the docker files stand out?15:19
weshayif that one container was bad.. we would see it other providers too.. so we're still chasing something odd here afaict15:20
dmsimardweshay: this is most definitely occurring in other places than ovh15:20
dmsimardthe one I linked yesterday came from limestone, I've seen failures in rax too15:21
rlandy|ruckabishop: hi - after changing the size of  CinderLVMLoopDeviceSize  - we've seem some instances of this failure: tempest.api.volume.admin.test_snapshots_actions.SnapshotsActionsTest.test_snapshot_force_delete_when_snapshot_is_error as in logs: http://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/9c5f8af/logs/undercloud/15:21
rlandy|ruckhome/zuul/tempest.log.txt.gz and https://logs.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/9c5f8af/logs/overcloud-controller-0/var/log/containers/cinder/cinder-scheduler.log.txt.gz?level=WARNING#_2019-08-21_04_27_15_82015:21
rlandy|ruckabishop: I am rerunning those tests to conform repeatability - but any other suggestions to modify?15:22
rlandy|ruckconfirm15:22
weshaydmsimard ah ur right.. this is new..  every time I've looked at this it was 99% ovh15:23
weshaybut not now15:23
weshaynode_provider15:23
weshay  23% limestone-regionone15:23
weshay  15% rax-ord15:23
weshay  14% ovh-bhs115:23
weshay  13% fortnebula-regionone15:23
weshay  12% rax-dfw15:23
weshayrlandy|ruck this is new to me15:23
weshaynews15:23
*** cylopez has quit IRC15:24
rlandy|ruckweshay: not really fornebula has been a problem for a while as well15:24
weshaywait.. ha15:24
weshaysorry wrong query15:24
dmsimardweshay: the rsyslog error is always the same on the same hash with the same hash mismatch -- I would at least try to push another one in case there's a bad blob somewhere15:24
weshaydang it15:24
rlandy|rucklimestone we fixed somewhat15:24
*** pierreprinetti has quit IRC15:24
mwhahahaweshay: dmsimard: do we ever see these container failures in stein? or is it only in master?15:25
abishoprlandy|ruck: checking...15:25
weshaythey may have fixed it15:25
weshay0 fails in 24 hrs / 168 fails in 10 days15:25
weshayhttp://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22Action%5C%5C%5C%22%3A%5C%5C%5C%22pull%5C%22%20AND%20message%3A%5C%22ERROR%20%2Fvar%2Flog%2Ftripleo-container-image-prepare.log%5C%22%20AND%20tags%3A%5C%22console%5C%22%20AND%20voting%3A115:25
dmsimardmwhahaha: I haven't witnessed it occurring in stein15:26
mwhahahaso if it hasn't happened in stein, it would point to some code change in master15:26
weshaydmsimard15:29
weshaynode_provider15:29
weshay  83% ovh-bhs115:29
weshay  9% ovh-gra115:29
weshay  1% rax-dfw15:29
weshay  1% vexxhost-sjc115:29
weshay  1% rax-iad15:29
rlandy|ruckha - ok15:29
weshaydmsimard it's still ovh :)15:29
weshayand it's not happened in the last 24hr15:30
dmsimardweshay: I think there are two different issues15:30
weshayk.. k15:30
dmsimard1) we need to do something about rsyslog15:30
weshayk15:30
dmsimard2) there are /sometimes/ authentication errors leading to failed pulls (of which rsyslog is also victim)15:31
weshayk15:31
dmsimard1) is a digest mismatch, i.e, this:15:32
dmsimardmsg="error copying src image [\"docker://192.168.24.1:8787/tripleomaster/centos-binary-rsyslog:bb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e7\"] to dest image [\"192.168.24.1:8787/tripleomaster/centos-binary-rsyslog:bb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e7\"] err: Error reading blob15:32
dmsimardsha256:eeed6d25dc3e17ce4389e478dbc4b5dc2f8148f9d22d7fb37991359feb98b31c: Digest did not match, expected sha256:eeed6d25dc3e17ce4389e478dbc4b5dc2f8148f9d22d7fb37991359feb98b31c, got sha256:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"15:32
dmsimard2) is something else15:32
dmsimard#1 is hurting us and happening all over, we need to fix it15:32
dmsimardthe work that cloudnull is doing will hopefully help us figure out #215:33
weshaydmsimard the logging confuses me at times..  so just to make sure I'm understanding it...     docker://192 is the local registry.. is that error pulling from docker.io/proxy or pushing into the local reg15:33
dmsimardweshay: that particular line came from https://storage.bhs1.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/logs_21/677521/3/check/tripleo-ci-centos-7-standalone/6406cc9/logs/undercloud/var/log/tripleo-container-image-prepare.log.txt.gz15:34
dmsimardI'm not the most knowledgeable around these bits of code but if I were to take a guess, that's image_uploader downloading the images off of docker hub to push them to the local registry, yes15:34
weshaydmsimard that looks like a push to the local reg error15:34
weshaycloudnull am I reading that incorrectly?15:35
weshayI guess the digest didn't match15:35
weshaytotal hits: 23115:37
weshaybuild_branch15:37
weshay  100% master15:37
weshaymwhahaha ^15:37
weshayso 100% on master 99% on ovh,  and either logstash had a hiccup or it hasn't happened in the last 24 hr15:37
weshaythe main change from stein -> master would be buildah built containers vs.. docker15:38
dmsimardweshay: what is your query and how are you figuring out 99% ovh ?15:38
* weshay gets15:38
weshaydmsimard https://opendev.org/opendev/elastic-recheck/commit/1d1121f111748ef90538318719b09cb3c51226e115:39
openstackgerritFrancesco Pantano proposed openstack/tripleo-heat-templates master: Add the certificate specs in ceph_grafana composable service  https://review.opendev.org/67455615:44
ykarelweshay, since when it's happening, buildah got updated in master on 12th August15:44
ykarelas per logstash query i see it's around that day only15:44
weshayinteresting15:45
ykarelbut why only ovh, may be performance related15:45
weshaythanks ykarel /me looks15:45
dmsimardweshay: that's #215:45
dmsimardI want to fix rsyslog :D15:45
dmsimardnot dismissing that there isn't a problem, though15:45
dmsimardI was not able to find anything that would be specific to ovh yesterday15:45
weshayaye.. thanks for looking...  fwiw it may be due to something we can't see15:46
weshaydmsimard ykarel may be on to something ( as usual )   I  don't see hits prior to 8/1215:47
dmsimardas far as I can tell the rsyslog image in the rdo registry is no different than the one in docker hub -- pulled both individually and everything between them matches15:47
ykarelweshay, dmsimard if node is hold where issue is reproduced, good to confirm by downgrading buildah(to buildah-1.8.2-1.gite23314b.el7), and then see what's issue with new buildah15:49
dmsimardykarel: we have two held nodes15:49
dmsimardI can add your key to them15:49
ykarelhttps://trunk.rdoproject.org/centos7-master/deps/latest/x86_64/buildah-1.8.2-1.gite23314b.el7.x86_64.rpm15:50
*** pierreprinetti has joined #tripleo15:50
dmsimard# rpm -qa |grep buildah15:50
dmsimardbuildah-1.10.1-2.git8c1c2c5.el7.x86_6415:50
ykarelyes ^^ the updated one15:50
ykarelhttps://review.rdoproject.org/r/#/c/21791/1/buildsys-tags/cloud7-openstack-train-testing.yml15:50
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491915:51
ykareldmsimard, https://ykarel.fedorapeople.org/mydata/pupkey.txt , but i can check only after some time(going for dinner)15:52
openstackgerritMerged openstack/python-tripleoclient stable/stein: Suppress output for ssh-keygen  https://review.opendev.org/67757715:52
weshayhttps://paste.pics/377e5067c36b5c648250e0167808d8af15:52
dmsimardykarel: root@162.242.237.97 and root@158.69.67.23115:53
dmsimardweshay: yeah ...15:53
ykareldmsimard, ack, both acccessible15:54
*** ykarel is now known as ykarel|afk15:54
*** mgagne has quit IRC15:54
*** mgagne has joined #tripleo15:55
*** jtomasek has quit IRC15:56
dmsimardweshay: when/where is buildah installed ? can we send a dummy patch to test with the previous package and see what happens ?15:56
dmsimardI see https://opendev.org/openstack/tripleo-ci/src/branch/master/roles/build-containers/tasks/pre.yaml#L4815:59
openstackgerritEmilien Macchi proposed openstack/paunch master: runner: return True to check if image/container exists  https://review.opendev.org/67777415:59
* weshay gets15:59
weshaydmsimard https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-centos-7-master-containers-build-push and http://zuul.openstack.org/builds?job_name=tripleo-build-containers-centos-7-buildah16:02
*** skramaja has quit IRC16:02
*** suuuper has quit IRC16:04
*** paramite has quit IRC16:04
*** rpittau is now known as rpittau|afk16:05
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-validations stable/stein: Set undercloud-connection to local by default  https://review.opendev.org/67777616:06
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-validations stable/stein: Add undercloud_key_file option in tripleo-ansible-inventory.  https://review.opendev.org/67777716:06
dmsimardok buildah is a dependency of openstack-tripleo-common16:07
dmsimard--> Processing Dependency: buildah for package: openstack-tripleo-common-11.1.1-0.20190821053435.80013b0.el7.noarch16:08
weshaythe version of buildah in centos7 may be older than what we want/need16:09
openstackgerritMerged openstack/puppet-tripleo master: Remove deprecated ceilometer::dispatcher::gnocchi  https://review.opendev.org/67769016:09
rlandy|ruckyay16:10
dmsimardweshay: latest upstream version is 1.10.1 which is what is currently provided16:11
weshayk16:11
dmsimardwe had 1.8.2 prior16:11
*** ratailor has quit IRC16:12
dmsimard225 commits between 1.8.2 and 1.10.1, would be a fun bisect16:15
dmsimardvirt-sig has a build for 1.9.216:15
weshaydmsimard I wonder if we should put this on hold for another 24hrs?16:18
mwhahahaso16:18
dmsimardso buildah is picked up from openstack-tripleo-common and openstack-tripleo-common is actually picked up from tripleoclient --> Processing Dependency: openstack-tripleo-common >= 10.7.0 for package: python2-tripleoclient-12.1.1-0.20190821060145.f160f47.el7.noarch16:18
mwhahahawe're theorizing that it's failing on cloudflare16:18
mwhahahawe query docker hub for the laywers which redirects us to cloudflare16:18
mwhahahawe're wondering if cloudflare is returning 401 for ovh16:19
dmsimardmwhahaha: I have a held node from ovh16:19
*** ykarel|afk is now known as ykarel16:20
weshayfak...  I'm only right on my second attempts today..  the centos7 master job is still using docker  I think.. rlandy|ruck can we look at that?16:20
*** xarlos has joined #tripleo16:20
weshayrhel8 is using buildah16:21
weshaywhich is not related to this issue16:21
*** lucasagomes has quit IRC16:21
weshaymight have taken you down a false path16:21
ykareldmsimard, weshay so https://bugs.launchpad.net/tripleo/+bug/1839532 is reported on 9th august, so issue is happening even before 12th august buildah got updated16:21
openstackLaunchpad bug 1839532 in tripleo "tripleo gate jobs are failing to pull containers when running on ovh provider with "UNAUTHORIZED" error" [Critical,Triaged]16:21
ykarelalso matt reported in a comment it's happening from 5th August16:22
weshayykarel ok.. makes more sense then16:22
ykarelweshay, looking further i found buildah is update to 1.9.0 in centos extras on 5th August16:22
weshaydo we have jobs failing today.. perhaps logstash is faking me out today too..16:22
* weshay looks16:22
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491916:23
ykareldmsimard, yes we can try installing different version of buildah in job16:25
ykarelafter tripleo packages are installed16:25
ykarelin install-packages.sh16:25
ykarelor we can blacklist doubted buildah version16:25
dmsimardykarel: do you want to try that ? I was going to hack something around here https://opendev.org/openstack/tripleo-quickstart/src/branch/master/config/release/tripleo-ci/CentOS-7/consistent-master.yml#L10016:26
dmsimard(or master.yml, or both)16:26
ykareldmsimard, in ci master.yml is used16:26
dmsimardykarel: ok, it is my understanding that https://opendev.org/openstack/tripleo-quickstart/src/branch/master/config/release/tripleo-ci/CentOS-7/consistent-master.yml#L116 picks up openstack-tripleo-common which picks up buildah16:26
ykareldmsimard, yes16:27
ykareldmsimard, you can try similar to https://review.opendev.org/#/c/636860/16:27
dmsimardI would naively add a "yum install https://trunk.rdoproject.org/centos7-master/deps/latest/x86_64/buildah-1.8.2-1.gite23314b.el7.x86_64.rpm" but not sure16:27
ykareldmsimard, we need yum downgrade https://trunk.rdoproject.org/centos7-master/deps/latest/x86_64/buildah-1.8.2-1.gite23314b.el7.x86_64.rpm16:27
ykarelif doing after buildah is installed latest version16:27
ykarelor if we change repo to exclude buggy version, 1.8.2 will auto install16:28
ykarelso we need to exclude 1.9 from extras, and 1.10 from rdo deps16:29
dmsimardok let me see16:30
*** amoralej is now known as amoralej|off16:30
ykarelack16:30
*** jpena is now known as jpena|off16:30
weshayykarel dmsimard I don't think buildah is in play16:31
weshayykarel dmsimard   for centos job I think we're still using docker16:31
ykarelweshay, why? as per dates atleast it seems related16:31
dmsimardweshay: it's great if it's not, it's not expensive to double check16:31
weshaydmsimard k16:32
weshayykarel so I've had my head in rhel8 too much.. we're using buildah there to build the containers in the periodic jobs, but centos-7 train is using docker16:32
weshayI think I lead you down the wrong road16:33
ykarelweshay, from job logs, container_build_tool': 'buildah'16:33
ykarelhttps://logs.opendev.org/63/674363/1/gate/tripleo-ci-centos-7-undercloud-containers/9285c4c/logs/undercloud/var/log/tripleo-container-image-prepare.log.txt.gz16:33
ykarelfor -push job, it's buildah rhel8, and docker centos716:34
weshayykarel ya.. after checking that is what I'm seeing yes16:34
*** florianf has quit IRC16:35
openstackgerritDavid Moreau Simard proposed openstack/tripleo-quickstart master: Do not merge: Test master with buildah 1.8  https://review.opendev.org/67778516:36
dmsimardykarel: ^ like this ?16:36
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates stable/stein: Add tags always into external update tasks.  https://review.opendev.org/67778616:36
ykareldmsimard, /me looks16:36
dmsimardah wait16:36
dmsimardmistake16:36
*** dtantsur is now known as dtantsur|afk16:37
openstackgerritDavid Moreau Simard proposed openstack/tripleo-quickstart master: Do not merge: Test master with buildah 1.8  https://review.opendev.org/67778516:37
ykareldmsimard, looks fine16:38
*** cfontain_ has quit IRC16:38
ykarelhopefuuly some jobs will land on ovh16:38
ykarelweshay, so on ovh it always fails, or it passes sometimes16:38
dmsimardmwhahaha: this hunch with ovh and cloudflare, would we be able to reproduce it on a held now ?16:38
dmsimards/now/node/16:38
*** pierreprinetti has quit IRC16:41
*** spsurya has quit IRC16:43
* rlandy|ruck chceks16:43
*** cfontain_ has joined #tripleo16:45
ykareldmsimard, i am running deploy on the holded node root@158.69.67.23116:46
ykarelafter downgrading buildah16:46
dmsimardykarel: cool, how long for feedback ?16:47
ykareldmsimard, i think 10-20 minutes16:47
*** xek has quit IRC16:47
dmsimard++16:47
mwhahahadmsimard: maybe16:49
*** Garyx has joined #tripleo16:49
weshaydmsimard ykarel it's not related16:53
*** cfontain_ has quit IRC16:53
dmsimardweshay: what isn't ?16:54
weshaybuildah16:54
dmsimardweshay: I'm out of ideas :)16:54
weshayI think mwhahaha is on to something w/ cloudflare16:55
rlandy|ruckweshay: we can change that to use buildah if you wish16:55
rlandy|ruckone setting16:55
weshayrlandy|ruck heh..  no.. I think we should pause all this work for another 24hrs16:55
weshaylet's see if they fixed something finally as this hasn't hit our radar today...16:56
weshayafaict16:56
rlandy|ruckok16:56
weshayI think the interwebs are afraid of dmsimard16:56
weshayas soon as he was invoked... it stopped16:57
dmsimardweshay: there's still #1 that needs to be taken care of :p16:58
weshayagree16:59
weshay<dmsimard>  1) is a digest mismatch, i.e, this:16:59
weshay09:32 <dmsimard>  msg="error copying src image [\"docker://192.168.24.1:8787/tripleomaster/centos-binary-rsyslog:bb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e7\"] to dest image [\"192.168.24.1:8787/tripleomaster/centos-binary-rsyslog:bb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e7\"] err: Error reading blob16:59
weshay09:32 <dmsimard>  sha256:eeed6d25dc3e17ce4389e478dbc4b5dc2f8148f9d22d7fb37991359feb98b31c: Digest did not match, expected sha256:eeed6d25dc3e17ce4389e478dbc4b5dc2f8148f9d22d7fb37991359feb98b31c, got sha256:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"16:59
dmsimardmaybe that one could be buildah related ? who knows16:59
weshaymwhahaha the above snippet is uploading to the local reg right?17:00
*** derekh has quit IRC17:00
openstackgerritAndreas Jaeger proposed openstack/tripleo-ansible master: Switch to promote docs job  https://review.opendev.org/67779117:03
*** psachin has quit IRC17:04
*** Chaserjim has joined #tripleo17:04
openstackgerritAndreas Jaeger proposed openstack/tripleo-validations master: Switch to promote docs job  https://review.opendev.org/67779217:06
ykarelweshay, ack, /me still doubt though17:07
ykarelas buildah in push job and in deploy job playing differently17:07
weshayplaying differently?17:10
weshayykarel don't understand yet what you mean17:11
*** PagliaccisCloud has quit IRC17:12
ykarelweshay, so in push job, rhel8 is buildah, centos7 is docker, in that job it just build and push, but in deploy container-image prepare, buildah pulls from docker.io and push to local registry, and then doing buildah pull from local registry and at that point it's failing(iiuc correctly the bug)17:15
ykareland also iirc promoter uses docker to push to docker.io17:16
ykareli am seeing DEBUG chardet.charsetprober [  ] IBM866 confidence = 0.0413828258123, below negative shortcut threshhold 0.05 in tripleo-container-image-prepare log in the reproducer, is that seen before?17:18
openstackgerritMerged openstack/tripleo-common stable/stein: Add python3 file for hardened images  https://review.opendev.org/67756417:22
*** fultonj has joined #tripleo17:23
xarlosfultonj: There's definately a connectivity problem.17:23
xarlosoh, hahaha. Noticed you joined just as I started to type that. Sorry. Welcome :-D17:23
xarlosbefore ceph even has issues, it seems that pacemaker isn't able to sync up with the other nodes either.17:25
xarlosSeems that it may be trying to talk over the wrong network for these services.17:26
openstackgerritMerged openstack/tripleo-common stable/stein: Add support for RHEL 8 and start using versionless element  https://review.opendev.org/67756517:26
openstackgerritMerged openstack/tripleo-common stable/rocky: Enable staging-ovirt (fence_rhevm) fencing agent.  https://review.opendev.org/67753117:27
openstackgerritMerged openstack/tripleo-common stable/queens: Enable staging-ovirt (fence_rhevm) fencing agent.  https://review.opendev.org/67642017:27
openstackgerritMerged openstack/python-tripleoclient stable/stein: Run Validations with ThreadPoolExecutor  https://review.opendev.org/67717017:27
*** jpich has quit IRC17:27
ykareldmsimard, on the reproducer node it failed with 1.8.2 also, not sure what's going on then17:43
dmsimardykarel: where is the failure ?17:44
ykareldmsimard, same place, rsyslog pull17:44
ykarelthe first image it tries, it fails and not move forward17:45
ykarelonly 1 image in registry, curl 192.168.24.1:8787/v2/_catalog17:45
dmsimardykarel: I've opened a tmux, can you join and show me ?17:48
ykareldmsimard, in17:51
openstackgerritEmilien Macchi proposed openstack/paunch master: runner: return True to check if image/container exists  https://review.opendev.org/67777417:53
*** dprince has joined #tripleo17:57
openstackgerritDirk Mueller proposed openstack/diskimage-builder master: zypper-minimal: Don't get confused by etc/resolv.conf symlink  https://review.opendev.org/67779617:57
*** gfidente is now known as gfidente|afk18:06
*** abishop is now known as abishop|afk18:11
*** pkopec has quit IRC18:14
*** morazi has quit IRC18:21
*** ramishra has quit IRC18:24
dmsimardweshay, mwhahaha: ykarel and I have been able to reproduce the rsyslog error but it's still mysterious18:26
mwhahahathe sha generation one or the 401 issue18:26
dmsimardthe digest mismatch18:28
*** Chaserjim has quit IRC18:28
dmsimardsee http://paste.openstack.org/raw/761107/18:32
*** jtomasek has joined #tripleo18:32
*** kopecmartin is now known as kopecmartin|off18:33
weshayk.. will look in a minute :) in 1-118:33
weshaydmsimard thank you!!18:33
mwhahahawait so you reproduced it but the source is the local registry18:34
mwhahahameaning when we wrote it out, it was bad18:34
mwhahahaor18:34
mwhahahaovh has odd disk issues?18:34
dmsimardmwhahaha: I would like to re-push it to the local registry but I'm not sure how18:35
mwhahahayou have to use this code that is currently giving us fits :D18:35
mwhahahacause that's the only way to do it18:35
mwhahahacurrently we are: ‎(ノಥ益ಥ)ノ sɹǝuıɐʇuoɔ18:36
dmsimardmwhahaha: yeah doing a manual buildah push gives a http 40518:36
mwhahahayea the local iamge is a read only implementation18:36
mwhahahawe have python code that fork lifts the layers on to disk18:36
mwhahahaso from that paste, did you just run those 3 commands so it failed teh first time but was successful later?18:37
* mwhahaha is trying to understand context18:37
*** holser__ has quit IRC18:37
dmsimardright, the issue reproduces only if the image isn't pulled already18:37
mwhahahabut we dont' use buildah pull18:38
dmsimardpulling it from dockerhub before trying to pull it from the local registry doesn't reproduce the digest mismatch18:38
mwhahahaoh yes we do18:39
dmsimardthe layer that the error is complaining about is apparently there: /var/lib/image-serve/v2/tripleomaster/centos-binary-rsyslog/blobs/sha256:8ba884070f611d31cb2c42eddb691319dc9facf5e0ec67672fcfa135181ab3df.gz18:40
dmsimardit says "expected sha256:8ba884070f611d31cb2c42eddb691319dc9facf5e0ec67672fcfa135181ab3df, got e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855"18:40
dmsimardI can't find e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855 anywhere18:40
mwhahahawell it's a generated hash based on the file content i think18:41
mwhahahait's supposed to match the manifest18:41
dmsimardoh wait18:41
mwhahahawas your reproducer on the ovh node or elsewhere?18:41
dmsimardovh but I don't think there's a correlation between ovh and the rsyslog issue18:42
mwhahahadid we see that elsewhere?18:42
dmsimardyes18:42
mwhahahak18:42
dmsimardso18:44
dmsimardin an interesting turn of events18:44
dmsimardit turns out that the sha256sum of /var/lib/image-serve/v2/tripleomaster/centos-binary-rsyslog/blobs/sha256:8ba884070f611d31cb2c42eddb691319dc9facf5e0ec67672fcfa135181ab3df.gz is e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b85518:44
dmsimardwhich is what buildah complains about18:44
mwhahaharight18:44
mwhahahathat likely doesn't match the expected output from the blob call18:45
dmsimardI sha256sum'd some other blobs and they all match18:45
mjturekdoes anyone know Gabriele Cerami's nick? He asked me to ping him but I can't  find it18:46
mwhahahamjturek: panda18:47
mjturekthanks mwhahaha !18:47
mjturekpanda you around?18:47
mwhahahadmsimard: so odd thing is that i don't know why that layer should be fetched, i don't see it in the manifest for rsyslog18:48
weshaymjturek probably too late for him18:49
dmsimardmwhahaha: where are you seeing that ? I see 8ba884070f611d31cb2c42eddb691319dc9facf5e0ec67672fcfa135181ab3df in both dockerhub and rdo registry https://www.diffchecker.com/lMQYR8pP18:49
mjturekfair enough! I'll try tomorrow morning weshay18:50
weshay:)18:50
mwhahahadmsimard: maybe i'm looking at the wrong tag, let me see18:50
dmsimardalso, for the record, the blob sum mismatch: http://paste.openstack.org/raw/761109/18:50
weshayhrm18:52
dmsimardlet me check the other node that we had held out of curiosity18:52
*** ksambor has quit IRC18:52
weshaywhen ur pulling from the held nodepool node.. is it going through ovh proxy/cdn ( whatever )18:53
dmsimardweshay: doesn't matter where I'm pulling from if the sha256 sum doesn't match on disk18:53
* weshay wonders if it's a round-robin style cdn.. and one server has an older copy of the container18:53
weshaytru18:54
dmsimardit's the push to the local registry that fails18:54
weshayya.. in that case... we need a diff logstash query to see how often that happens though18:54
dmsimardthis other node has a matching sha256 for that particular layer18:56
dmsimardbut it uses the tag a447a10b12efed2e989ed61de5d0d1562a2919ea_d0e11ceb instead of bb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e718:57
dmsimardah, there was a promotion, a447 is newer18:59
dmsimardbb7bc41730bc5be97fe27a9445337152c45f08f6_a47d70e7 is from 2019-08-18 while a447a10b12efed2e989ed61de5d0d1562a2919ea_d0e11ceb is from today19:00
dmsimardoh so it does still mismatch but on different layers now..19:02
mwhahahawhen you do a buildah pull from the local registry?19:02
dmsimardstill the same e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855 culprit though19:02
dmsimardError reading blob sha256:ea492b89a5eefe62721d9f4b129961d48b48e8a01b61ac53b42b24d1dc87cdd0: Digest did not match, expected sha256:ea492b89a5eefe62721d9f4b129961d48b48e8a01b61ac53b42b24d1dc87cdd0, got sha256:e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b85519:02
dmsimardmwhahaha: yeah "# buildah pull 192.168.24.1:8787/tripleomaster/centos-binary-rsyslog:a447a10b12efed2e989ed61de5d0d1562a2919ea_d0e11ceb"19:02
mwhahahaso that would point to the code that writes it out to /var/www/img-serve being bad19:03
mwhahahaeither we're cutting off a bit or something19:03
mwhahahaseems odd that it's the same layer repeatedly unless the cache is bad?19:03
*** morazi has joined #tripleo19:04
dmsimardit is suspicious that the same sha256sum is returned by two different layers from two different tags19:04
mwhahahaunless that's like the base centos layer19:04
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-heat-templates master: Convert firewall rules to use TripleO-Ansible  https://review.opendev.org/67723719:05
weshaymwhahaha dmsimard that is happening but not that often http://paste.openstack.org/show/761110/19:05
mwhahahacloudnull: fun fact if you request blobs from the registry and it's not found you actually do get a 404 not a 40119:08
dmsimardweshay: I'm not convinced19:09
dmsimardweshay: those results are from a zuul-operator job19:09
weshay some of them19:09
weshaysome are from   7% tripleo-ci-fedora-28-standalone19:09
dmsimardweshay: that's one hit in the last 7 days19:09
dmsimardweshay: I don't think that query is getting the stuff we're interested in or perhaps the file it's located in isn't indexed19:10
weshaylet me try a diff query but the file is indexed19:10
dmsimardI would expect at least a few hits even for just "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855" but logstash doesn't return anything19:10
weshaytbh it may not be indexing today and that's why we're not seeing ovh issues19:10
dmsimardweshay: logstash queue is a bit high http://grafana.openstack.org/d/T6vSHcSik/zuul-status?orgId=119:11
weshayah see that19:12
weshayso nice to have an infra guy here!!!119:12
dmsimardoh19:12
dmsimardturns out that "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855" is the sha256sum of an empty file :/ http://paste.openstack.org/show/761112/19:14
mwhahahaempty gzip or empty19:14
*** jtomasek has quit IRC19:15
dmsimardempty file19:15
dmsimardsha256sum'd an empty file created by touch19:16
* dmsimard sighs19:16
weshaythat's good to know though.. /me notes it19:16
dmsimardok so then the blob is empty on disk, now what ? something failed during the upload to the local registry ?19:17
*** saneax has quit IRC19:17
*** pierreprinetti has joined #tripleo19:18
*** jtomasek has joined #tripleo19:18
mwhahahayea trying to track down where that is19:18
weshaythis is all local on the node right? no upstream log?19:18
mwhahahai don't suppose the disk is full19:18
dmsimardit's not19:20
dmsimardthat was for mwhahaha ^19:20
dmsimardweshay: the two nodes that we have the issue on are held from the tripleo-ci-centos-7-standalone job19:21
weshayk k19:21
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1175-L119419:21
mwhahahai think that's teh fetching the content19:21
mwhahahathought maybe it's a buildah thing where buildah pull results in a 0 byte laywers19:23
mwhahahathis is the export code https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1429-L145819:24
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1504-L155019:25
weshayI don't see that sha in anything but some f28 jobs19:25
mwhahahadmsimard: do you still have the node with the 0 file?19:25
dmsimardmwhahaha: I have two of them19:25
mwhahahadmsimard: can you look in the manifest file to see if size = 0?19:25
mwhahahaor gimme teh ssh19:25
mwhahahahttps://launchpad.net/~alex-schultz/+sshkeys19:26
dmsimardmwhahaha: root@162.242.237.97 = rax, root@158.69.67.231 = ovh19:26
*** morazi has quit IRC19:27
dmsimardfrom the rax node: http://paste.openstack.org/raw/761114/19:28
weshayrsyslog just updated again fyi19:30
weshay4 hrs ago19:30
weshay0eff77ac8d8e39c967d275f15fb633a6cc97ecc4_5016c5e019:30
dmsimardohhhh19:31
weshaywhat's up19:32
mwhahahaprobably nothing, he just likes to get our hopes up19:32
weshaylolz19:32
dmsimardpaste from rax with a little bit more meat: http://paste.openstack.org/show/761116/19:33
mwhahahaare you ohhhh'ing the dropped conneciton thing?19:34
mwhahahaso it tries to fetch, conneciton gets drop, doesn't refetch because files already exists19:35
*** morazi has joined #tripleo19:35
mwhahahawe need to HEAD but check file size19:35
mwhahahalet me see if we properly return filesize19:37
dmsimardmwhahaha: the "Uploading layer" and "export layer to" parts that occur before "Fetching layer" are likely resulting in an empty file19:37
dmsimardI'm not sure why we would attempt to upload or export anything before fetching it though19:37
mwhahahai think this is the awkward part of our having to manually output the laywers to the fs19:38
mwhahahamy layers needs lawyers19:38
*** pierreprinetti has quit IRC19:40
*** abishop|afk is now known as abishop19:40
mwhahahawe don't provide a filesize19:41
mwhahahai found it19:42
mwhahahahmm maybe not19:43
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_export.py#L78-L12319:44
mwhahahaso that's the function that handles this bits19:44
mwhahahanow if it throws an exception, it's supposed to remove teh file19:44
mwhahahathat's called from https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L151219:45
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1205-L1221 is the function where teh Uploading layer comes in19:46
mwhahahawhich if that fails, gets retried and so the 2nd time thorugh, the file already exists19:47
mwhahahaso it seems liek https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_export.py#L117-L118 isn't working correctly19:48
mwhahahabut i don't see any "Error while" in the logs19:49
*** pbandark has quit IRC19:50
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart master: DNM: Test adding compute_feature_enabled.config_drive to fs035  https://review.opendev.org/67780719:52
mwhahahait looks like https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_export.py#L101-L105 doesn't get caught by the exception19:54
mwhahahabecause I don't see the "Error while writing" bits19:54
* mwhahaha flips tables19:54
mwhahahai don't think IOError inherits from Exception19:55
dmsimardI see plenty of "Calculated layer digest", no "Expected digest" or "Error while writing blob"19:55
mwhahaharight i think we need to catch the IOError19:57
mwhahahaor at least add cleanup code in there19:57
xarlosshould I specify the ha environment template AFTER my network isolation template, or does it not matter? :-/20:00
*** morazi has quit IRC20:01
dmsimardmwhahaha: cloudnull has been working on https://review.opendev.org/#/c/674919/31/tripleo_common/image/image_export.py20:01
mwhahahayea but i'm going to just target this one bit, we can rewrite it later20:02
openstackgerritArx Cruz proposed openstack/tripleo-quickstart master: Adapt os_tempest in FS001  https://review.opendev.org/67340020:02
mwhahahathis part should likely be backported beyond the 401 bits20:02
dmsimardmwhahaha: fwiw the rax node has the cloudnull image_export (bunch of "Provided layer digest" in logs) and it doesn't trip the removal of the blob either20:03
dmsimardso digest ==  layer_digest20:03
dmsimardhttp://paste.openstack.org/show/761157/20:04
*** morazi has joined #tripleo20:05
cloudnulldmsimard it should only trip the removal when the layer's are different -  https://review.opendev.org/#/c/674919/31/tripleo_common/image/image_export.py@10420:09
*** holser has joined #tripleo20:09
cloudnulland verify_digest is true20:10
dmsimardcloudnull: we found out that the rsyslog digest mismatches are actually because some blobs are empty in /var/lib/image-serve20:10
cloudnullso there's actually something wrong with that image?20:10
dmsimardcloudnull: no, something goes wrong during the upload of the image to the local registry -- you can see it happen here: http://paste.openstack.org/show/761116/20:11
cloudnullmwhahaha and I were looking at how that manifest / blob arrays are built this morning20:11
mwhahahacloudnull: so it seems like mid-export, it's getting an issue which is triggering reauth/retry20:11
mwhahahacloudnull: but it's not cleaning up the empty file before it does that20:11
mwhahahacloudnull: so when it retries, it skips redownloading it because it thinks it's there20:12
cloudnulli see20:12
mwhahahacloudnull: http://paste.openstack.org/show/761116/20:12
mwhahahathat past show its20:12
mwhahahauploading... export .. fetching .... reauth ... and later Layer already exists20:12
mwhahahaso somewhere iun the export code sadness is occuring20:13
*** dprince has quit IRC20:13
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_export.py#L90-L11920:13
*** ykarel has quit IRC20:13
mwhahahai think if f.write throws an IOerror, it doesn't trigger the Exception20:13
dmsimardmwhahaha: so "layer_stream" is actually empty ?20:14
mwhahahaalso possible20:14
cloudnullso the connection is severed mid stream20:15
mwhahahaif length is 0, we need to nuke it as well20:15
mwhahahathough i'm trying ot figure out how the "Error while writing" isn't triggered20:15
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1495 is the uploading log line20:15
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1512 should be the export20:16
dmsimardmwhahaha: https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1205-L1221 ?20:16
*** ykarel has joined #tripleo20:16
mwhahahai think that's the registry to registry20:17
mwhahahathi sis local to registry?20:17
mwhahahawould be nice if these lines were different :D20:17
cloudnullwhen we see the log lines - "Calculated layer digest" and "Provided layer digest" the write should already be complete.20:17
dmsimardyeah ..20:17
mwhahahaThe other issue is that this might be multithread related20:18
mwhahahacause this is all shoved in to the threadpool20:18
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1256-L126420:19
*** morazi has quit IRC20:19
dmsimardmwhahaha: it's kinda hard to tell if it's local to registry or registry to registry20:20
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1175-L1194 is the fetching laywer bits20:20
mwhahahaso if that session.get triggers a 401, that'd be a problem20:20
mwhahahanow the questio nis where does that get used from20:20
cloudnulllooking at https://review.opendev.org/#/c/674919/31/tripleo_common/image/image_export.py@91 we should see a write exception (IOError) if the stream is interrupted or is broken for some reason?20:20
cloudnullthat's used in _copy_layer_registry_to_registry20:21
*** jtomasek has quit IRC20:21
mwhahahai wonder if https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1166-L1174 is throwing it off20:22
mwhahahait really does seems like we ahve too many retries everywhere20:23
mwhahahaand that we should only retry the entire thing20:23
cloudnullmwhahaha +120:23
cloudnullthe use of tenacity is rampant20:24
dmsimardone could say it's .... tenacious20:24
cloudnulllooking at https://openstack.fortnebula.com:13808/v1/AUTH_e8fd161dc34c421a979a9e6421f823e9/logs_19/674919/31/check/tripleo-ci-centos-7-standalone/18a23f1/logs/undercloud/var/log/tripleo-container-image-prepare.log.txt.gz we can now see the manifests, search for Current image manifest20:24
cloudnullthe rsyslog manifest json is there.20:24
openstackgerritwes hayutin proposed openstack/tripleo-ci master: add an about doc section  https://review.opendev.org/67781420:26
mwhahahaso that one is the same problem20:26
mwhahahauploading layer... export.. fetch... 401...layer already exists20:26
openstackgerritwes hayutin proposed openstack/tripleo-ci master: add an about doc section  https://review.opendev.org/67781420:26
mwhahahaand then the subsequent issues are buildah trying to pull an empty layer20:26
mwhahahactrl+f the first ea492b89a5eefe62721d9f4b129961d48b48e8a01b61ac53b42b24d1dc87cdd020:27
cloudnullright after "Resetting dropped connection: mirror.ord.rax.opendev.org"20:27
mwhahahaso the connections went stale because it was doing stuff for a while20:29
mwhahahatook like 18 mins to download sha256:209971f216a96e86b32611cc51fb4d60289b328fda8149dac99cf93b12567d7a20:29
*** morazi has joined #tripleo20:31
dmsimardneed to be off for a bit, if there's anything infra related you'd like to check, feel free to ping me and I'll check in later20:33
mwhahahathanks dmsimard20:33
weshaythanks! dmsimard :)20:33
mwhahahacloudnull: so i think the entry point for this i _copy_layer_registry_to_registry20:33
dmsimardbtw we should probably update https://bugs.launchpad.net/tripleo/+bug/1839532 with the extent of what we know20:34
openstackLaunchpad bug 1839532 in tripleo "tripleo gate jobs are failing to pull containers when running on ovh provider with "UNAUTHORIZED" error" [Critical,Triaged]20:35
openstackgerritMerged openstack/tripleo-common master: Allow mixing of count and instances  https://review.opendev.org/67043720:35
mwhahahai filed https://bugs.launchpad.net/tripleo/+bug/1840973 for this specific 0 byte issue20:35
openstackLaunchpad bug 1840973 in tripleo "container image prepare doesn't cleanup files on filesystem connection issues occur mid export" [High,In progress] - Assigned to Alex Schultz (alex-schultz)20:35
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491920:37
cloudnulladded https://review.opendev.org/#/c/674919/32/tripleo_common/image/image_export.py@9920:37
mwhahahacloudnull: so i think https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L1469 is the retry issue. So when the connection drops, that function is retried even though it's already gone into cls._copy_stream_to_registry20:37
cloudnullshould there be a write error in the stream it will raise an IOError which _copy_layer_registry_to_registry seems to expect20:38
mwhahahait seems to be going _copy_layer_local_to_registry -> _copy_stream_to_registry -> image_export.export_stream -> _layer_stream_local -> sadness20:39
mwhahahaactually not _layer_stream_loca, it's trying to fetch it from teh remote which would be _layer_stream_registry20:40
mwhahahaso that would mean it's _copy_layer_registry_to_registry actually20:41
* mwhahaha cries a bit more20:41
mwhahahawonder if that means that _copy_registry_to_registry is the retry since that'll retry on RequestException20:42
mwhahahahttps://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_uploader.py#L123820:42
mwhahaha_copy_registry_to_registry should only trigger on IOError20:42
openstackgerritMerged openstack/diskimage-builder master: zypper-minimal: Don't get confused by etc/resolv.conf symlink  https://review.opendev.org/67779620:43
*** ykarel has quit IRC20:48
*** fultonj has quit IRC20:54
cloudnullwith the new change, and seemingly endless layers of abstractions it "should" force an IOError when it fails. i hope? i think? i hope...20:56
mwhahahano idea20:57
mwhahahathe other way to attack it is to do a content length check when seeing if the layer already exists but that might not work for real registries20:58
mwhahahait'd prevent this for our local registry but that doesn't seem to be right20:58
mwhahahait seems like tenacity is taking precedence over our exception handling in this case20:58
mwhahahayou'd think that https://opendev.org/openstack/tripleo-common/src/branch/master/tripleo_common/image/image_export.py#L89-L119 would be sufficient but it doesn't seem to be triggering in this case20:59
cloudnull(_copy_layer_registry_to_registry || _copy_layer_local_to_registry) > _copy_stream_to_registry > image_export.export_stream > sadness seems to be the ticket.21:00
mwhahahai think we should remove the trtry from _layer_stream_registry21:01
mwhahahabecause that should get caught elsewhere21:01
cloudnullbut _copy_layer_local_to_registry has tenacity only checking for `requests.exceptions.RequestException`21:01
cloudnull+121:01
mwhahahaso in this case i don't think it's copy_layer_local_to_rgistry21:01
mwhahahabecause that assumes it's available on disk (in this case it isn't yet)21:01
mwhahahaso this is a registry -> registry copy21:02
mwhahahaso the offending entry point is _copy_layer_registry_to_registry21:02
* cloudnull will push another review to remove tenacity on _layer_stream_registry21:02
mwhahaha_copy_layer_registry_to_registry should only retry on IOError which would be a digest problem21:02
cloudnulldone.21:03
mwhahahaso if we take tenacity off _layer_stream_registry, i wonder if it'll trigger the exception handling in export_stream21:03
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491921:03
cloudnull^ about to find out21:03
mwhahahasoooo come back tomorrow to find out ? :D21:03
cloudnullsame bat time, same bat channel21:03
* cloudnull will sit here and watch it for the next hour or two 21:04
cloudnullso while I have you ,21:04
cloudnullwhat is the best way to troubleshoot "TemplateOutputError: Error in 46 output role_data: Incorrect arguments: Items to merge must be maps."?21:05
mwhahahacloudnull: the items you're trying to merge together aren't dictionaries21:05
mwhahahaso data type problem21:05
mwhahahado you have a review in reference to that error?21:06
cloudnullhttps://review.opendev.org/#/c/677237/21:06
cloudnullat least this one I can reproduce locally.21:06
mwhahahai'm assuming it's https://review.opendev.org/#/c/677237/10/deployment/tripleo-firewall/tripleo-firewall-baremetal-ansible.yaml@6021:08
mwhahahai'd send a note to tabi maybe he can give you a hint tomorrow21:09
cloudnullwill do21:10
cloudnullthanks!21:10
*** Goneri has quit IRC21:12
openstackgerritMerged openstack/tripleo-common master: Add an undeploy_roles workflow  https://review.opendev.org/67284821:16
openstackgerritMerged openstack/tripleo-heat-templates master: Add ExtraKernelPackages  https://review.opendev.org/67650421:16
*** altlogbot_2 has quit IRC21:16
openstackgerritMerged openstack/python-tripleoclient master: Use python docstring.  https://review.opendev.org/67741821:17
openstackgerritMerged openstack/tripleo-heat-templates stable/queens: keystone: drop duplicate -DFOREGROUND  https://review.opendev.org/67743021:17
openstackgerritSteve Baker proposed openstack/tripleo-heat-templates stable/stein: Configure nova_compute for vendordata  https://review.opendev.org/67783621:20
openstackgerritSteve Baker proposed openstack/tripleo-heat-templates stable/rocky: Configure nova_compute for vendordata  https://review.opendev.org/67783821:21
*** rcernin has joined #tripleo21:27
openstackgerritTom Barron proposed openstack/tripleo-heat-templates master: Unescape IPv6 addresses for ceph_nfs_bind_addr  https://review.opendev.org/67784121:35
*** raildo has quit IRC21:36
*** altlogbot_3 has joined #tripleo21:38
*** altlogbot_3 has quit IRC21:38
*** sshnaidm is now known as sshnaidm|afk21:40
*** altlogbot_2 has joined #tripleo21:42
*** altlogbot_2 has quit IRC21:42
xarlosmwhahaha: If you get a moment, could you have a look at this and see if you can point me into the direction of fixing my HA config? :-/ http://pastebin.com/57wwhrHk21:52
xarlosI cannot quite tell if some of the things there are expected erronious, or a key indicator to my new deployment21:54
*** rlandy|ruck is now known as rlandy|ruck|bbl21:57
*** bnemec has quit IRC21:59
mwhahahaI don't have any suggestions21:59
xarlosokay, thanks.21:59
mwhahahaDoes pcs status return ok?22:00
xarlosNo. Says cluster is not currently running.22:00
xarlosI can telnet to the port.22:01
*** morazi has quit IRC22:07
mwhahahaSounds like pacemaker startup issues22:08
mwhahahaMaybe ntp/hostname/networking22:08
xarlosyeah. Could it be something I missed out configuring? Typically as long as I define the number of nodes, there's not a lot else to do :-/22:09
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-heat-templates master: Convert firewall rules to use TripleO-Ansible  https://review.opendev.org/67723722:09
*** morazi has joined #tripleo22:09
xarlosntp seems fine22:14
xarloshostname is correct in /etc/hosts22:15
openstackgerritTakashi Kajinami proposed openstack/python-tripleoclient master: DNM: just to show possible improvement for the parent patch  https://review.opendev.org/67774122:15
xarlosnetworking could be an issue, but in what sense? :-/22:15
openstackgerritMerged openstack/tripleo-ansible master: Switch to promote docs job  https://review.opendev.org/67779122:18
openstackgerritSteve Baker proposed openstack/tripleo-quickstart-extras master: Support direct provisioning of bare metal  https://review.opendev.org/66445622:18
openstackgerritSteve Baker proposed openstack/tripleo-quickstart-extras master: DNM: undercloud_enable_nova: false by default  https://review.opendev.org/66417022:18
openstackgerritSteve Baker proposed openstack/tripleo-quickstart-extras master: DNM: baremetal_provision:true by default  https://review.opendev.org/67784922:18
*** holser has quit IRC22:20
*** rh-jelabarre has quit IRC22:21
*** threestrands has joined #tripleo22:34
*** rcernin has quit IRC22:40
*** rcernin has joined #tripleo22:43
xarloswjere can I get the pcsd password from?22:45
*** ekultails has quit IRC22:50
*** tkajinam has joined #tripleo22:56
openstackgerritTakashi Kajinami proposed openstack/python-tripleoclient master: DNM: just to show possible improvement for the parent patch  https://review.opendev.org/67774122:59
*** morazi has quit IRC23:01
*** Vorrtex has quit IRC23:01
openstackgerritKevin Carter (cloudnull) proposed openstack/tripleo-common master: Log exceptions when checking status  https://review.opendev.org/67491923:04
xarlosmwhahaha: What would be stopping me from sshing from one controller to another?23:09
xarlos(iptables is not runnig)23:09
*** morazi has joined #tripleo23:10
xarlosI can talk over the provisioning network over ssh, but not over any of the bridged connections.23:12
xarlos(different nics)23:12
xarlosHaving masquerade on the undercloud has o bearing on the overcloud, does it?23:33
openstackgerritEmilien Macchi proposed openstack/paunch master: Add unique names support for cont_exec_args method  https://review.opendev.org/67785223:35
*** dsneddon has quit IRC23:36
*** dsneddon has joined #tripleo23:37
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add arm64 based functional test  https://review.opendev.org/67611123:39
*** dsneddon has quit IRC23:44
*** cdearborn has quit IRC23:45
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Add KOLLA_BOOTSTRAP=True to 2 bootstrap containers  https://review.opendev.org/67785523:57

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!