Wednesday, 2018-05-16

*** saneax is now known as saneax-_-|AFK00:00
*** agopi has joined #tripleo00:03
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177143500:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1771435 in tripleo "scenario001/002 failing on autoscaling with urllib3.exceptions.SSLError: [SSL: UNKNOWN_PROTOCOL] unknown protocol (_ssl.c:579)" [Critical,In progress] - Assigned to Alex Schultz (alex-schultz)00:10
*** myoung|ruck|afk is now known as myoung|ruck00:14
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: DNM - Testing integration of releases script in toci  https://review.openstack.org/56871700:15
myoung|ruckweshay, sshnaidm|rover, mwhahaha: fyi queens promoted (http://dashboards.rdoproject.org/queens, https://trunk.rdoproject.org/centos7-queens/a7/fb/a7fbaca2a97adda856c3ba5d5166fb1665f02bc0_85b157a9)00:16
*** rlandy is now known as rlandy|bbl00:16
*** mburned is now known as mburned_out00:17
*** agopi has quit IRC00:19
*** linhnm has joined #tripleo00:33
*** dtrainor has quit IRC00:52
*** psahoo has joined #tripleo00:57
*** toure is now known as toure|gone01:01
*** tiswanso has joined #tripleo01:16
*** tiswanso_ has quit IRC01:19
openstackgerritMerged openstack/tripleo-validations stable/ocata: Validate that there should not be XFS volumes with ftype=0  https://review.openstack.org/56473801:30
*** ssbarnea_ has quit IRC01:34
*** gyankum has joined #tripleo01:55
*** atoth has quit IRC01:59
*** lblanchard has joined #tripleo02:00
*** bkopilov has quit IRC02:00
openstackgerritMerged openstack/tripleo-common master: Install Octavia amphora image if Red Hat  https://review.openstack.org/56691302:01
openstackgerritMerged openstack/tripleo-common master: add tripleo update job as voting  https://review.openstack.org/56352602:01
*** linhnm has quit IRC02:03
openstackgerritMerged openstack/tripleo-heat-templates master: Deploy Designate in scenario003  https://review.openstack.org/55500702:06
openstackgerritMerged openstack/tripleo-common master: Adds Create Container Workflow  https://review.openstack.org/56361902:06
openstackgerritMerged openstack/tripleo-upgrade master: Use tripleo_role_name instead of role_name  https://review.openstack.org/56844302:06
openstackgerritMerged openstack/tripleo-common master: Include 'tripleo_role_name' in the inventory  https://review.openstack.org/56834002:06
*** rlandy|bbl has quit IRC02:16
openstackgerritMerged openstack/tripleo-heat-templates master: deploy-steps: switch to tripleo_role_name  https://review.openstack.org/56834302:20
openstackgerritMerged openstack/python-tripleoclient master: Improve heat launcher user retrieval  https://review.openstack.org/56090402:20
openstackgerritMerged openstack-infra/tripleo-ci master: add tripleo update job as voting  https://review.openstack.org/56552302:20
*** lblanchard has quit IRC02:23
openstackgerritMerged openstack/python-tripleoclient master: Error if deployment fails  https://review.openstack.org/56738402:32
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: Load roles list from yaml instead of awk parsing  https://review.openstack.org/56869602:33
*** dmacpher has joined #tripleo02:35
*** liverpooler has joined #tripleo02:36
*** gkadam has joined #tripleo02:36
*** dbecker has quit IRC02:46
*** liverpooler has quit IRC02:49
*** agopi has joined #tripleo02:50
*** psachin has joined #tripleo02:50
openstackgerritMerged openstack/tripleo-heat-templates master: Revert "Change default endpoint map entries to use TLS"  https://review.openstack.org/56869902:54
*** agopi has quit IRC02:54
*** ramishra has joined #tripleo02:57
*** agopi has joined #tripleo02:58
openstackgerritMerged openstack/tripleo-heat-templates master: update tht jobs to include network/endpoints  https://review.openstack.org/56870003:00
*** dbecker has joined #tripleo03:01
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: DNM, test  https://review.openstack.org/56873203:04
*** gkadam has quit IRC03:07
*** agopi has quit IRC03:10
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097203:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)03:10
*** agopi has joined #tripleo03:10
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: add container minimal check and gate  https://review.openstack.org/56873303:20
*** gyankum has quit IRC03:20
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: DNM, test  https://review.openstack.org/56873203:21
*** bkopilov has joined #tripleo03:23
*** alee_afk is now known as alee03:30
*** rajinir has quit IRC03:39
*** links has joined #tripleo03:41
EmilienMstevebaker: https://review.openstack.org/#/c/568716/ if you can look, thx03:42
stevebakerEmilienM: looking03:44
*** fragatina has quit IRC03:46
*** janki has joined #tripleo03:49
openstackgerritEmilien Macchi proposed openstack/ansible-role-container-registry master: Handle Docker rpm updates  https://review.openstack.org/56871403:53
openstackgerritSteve Baker proposed openstack/tripleo-quickstart-extras master: Populate /etc/yum/vars/contentdir  https://review.openstack.org/56870103:53
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: docker: cleanup update tasks  https://review.openstack.org/56871503:54
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Deploy Docker via Ansible and not Puppet  https://review.openstack.org/56137703:58
EmilienMjaosorior: sorry we had to revert one of your patches03:58
EmilienMjaosorior: see https://review.openstack.org/56869903:58
Tenguhello there03:59
openstackgerritEmilien Macchi proposed openstack/tripleo-common master: (cleanup) remove usage of role_name  https://review.openstack.org/56834704:02
jaosoriorEmilienM: fuck04:02
EmilienMjaosorior: the week isn't bright CI side04:03
jaosoriorEmilienM: where is the scenario001 and 002's definition?04:03
jaosoriorthey probably are missing the CA installation...04:03
*** sanjay__u has joined #tripleo04:03
*** tzumainn has quit IRC04:03
EmilienMjaosorior: in THT/ci/environments?04:03
EmilienMjaosorior: can you please review this quick one? https://review.openstack.org/#/c/568716/04:03
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: Revert "Revert "Change default endpoint map entries to use TLS""  https://review.openstack.org/56873604:04
jaosoriorEmilienM: gonna try to revert the revert. That will run the relevant scenarios, right?04:05
EmilienMjaosorior: thanks to https://review.openstack.org/#/c/568700/, yes04:06
jaosoriorgood04:06
jaosorioruhm... wait04:07
jaosoriorunknown protocol04:07
jaosoriorthat's not an SSL verification error04:07
jaosoriorthat's ceilometer using the wrong port04:08
*** gyankum has joined #tripleo04:10
*** fragatina has joined #tripleo04:11
EmilienMjaosorior: oops04:14
weshayEmilienM, behave04:15
openstackgerritAde Lee proposed openstack/tripleo-upgrade master: Add config_change role  https://review.openstack.org/56730004:16
Tenguhello weshay :)04:17
weshayTengu, hey brotha.. sorry I missed you today in the community mtg04:18
Tenguweshay: no problem, we'll catch up eventually ;). Now that I'm part of the Beast.04:18
jaosoriorEmilienM: if you have any chance to help debug this http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/tempest.html.gz I would really appreciate it. I had seen it before (even without TLS) but can't figure out what the issue is.04:21
Tengujaosorior: oh. I had that one yesterday04:21
Tenguon my fluentd config patch04:21
jaosoriorseems to call heat a bunch of times for some reason04:22
Tengujaosorior: (that is, the first: Details: {u'message': u'Unable to complete operation on subnet 64fc7190-ca23-4b5a-b838-731fd609fdec: One or more ports have an IP allocation from this subnet.', u'type': u'SubnetInUse', u'detail': u''}04:22
jaosoriorthen it fails and tries to call the cleanup04:22
jaosoriorwhich ends up using the wrong port04:22
jaosoriorTengu: right, that's the error in the cleanup04:23
jaosoriorbut the issue starts in telemetry_tempest_plugin.scenario.test_telemetry_integration.TestTelemetryIntegration04:23
jaosoriorEmilienM: anybody from the ceilo team I can ask about this?04:24
weshayjaosorior, pm04:25
EmilienMjaosorior: weird04:26
EmilienMI haven't seen it today04:26
openstackgerritSteve Baker proposed openstack/tripleo-quickstart-extras master: WIP Use "openstack tripleo container image prepare"  https://review.openstack.org/56840304:26
jaosoriorEmilienM: right I'm talking more than a month ago, exact same thing04:27
jaosorioranyway, Unknown protocol error means that something is trying to use https on an http port04:28
jaosoriorjust not sure what, cause the tempest logs are not very explicit about it04:28
jaosoriorbut the test calling heat a bunch of times, then failing, then calling the teardown and failing at that, is what I mean04:29
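The "unknown protocol" reading above (a TLS client talking to a plain-HTTP port) can be reproduced locally. This is a minimal Python sketch, not anything taken from the CI job: it starts a throwaway plain-HTTP responder on 127.0.0.1 and points a TLS client at it. The helper names are made up for illustration. Newer OpenSSL builds may word the failure differently (for example as a wrong version number), so the sketch only relies on the exception type.

```python
import socket
import ssl
import threading


def serve_plain_http_once(listener):
    """Accept one connection and answer in plain HTTP, whatever was sent."""
    conn, _ = listener.accept()
    with conn:
        conn.settimeout(5)
        conn.recv(4096)  # this is the client's TLS ClientHello, not HTTP
        conn.sendall(b"HTTP/1.1 400 Bad Request\r\nContent-Length: 0\r\n\r\n")


def probe_tls_against_plain_http():
    """Return the ssl.SSLError raised when TLS is spoken to a plain-HTTP port."""
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    listener.listen(1)
    port = listener.getsockname()[1]
    server = threading.Thread(
        target=serve_plain_http_once, args=(listener,), daemon=True
    )
    server.start()

    # Certificate checks are disabled on purpose: the handshake fails before
    # any certificate is exchanged, so this is not a verification error.
    ctx = ssl.create_default_context()
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    try:
        with socket.create_connection(("127.0.0.1", port), timeout=5) as raw:
            with ctx.wrap_socket(raw):
                return None  # handshake unexpectedly succeeded
    except ssl.SSLError as exc:
        return exc
    finally:
        server.join(timeout=5)
        listener.close()


if __name__ == "__main__":
    print(repr(probe_tls_against_plain_http()))
```

The point of the sketch is that the failure happens during the handshake, which matches the remark that this is not an SSL verification error.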
Tengui.e. race condition at some point?04:29
weshayEmilienM, fyi https://review.openstack.org/#/c/568733/04:29
jaosoriorTengu: not sure.04:35
jaosoriorTengu: apparently this trace is all I get from tempest...04:36
openstackgerritEmilien Macchi proposed openstack/tripleo-common master: (cleanup) remove usage of role_name  https://review.openstack.org/56834704:37
jaosoriorTengu: if you have time to look into this as well, I sure would appreciate the help04:38
*** moguimar has quit IRC04:38
*** pblaho has quit IRC04:38
*** dtantsur|afk has quit IRC04:38
Tengujaosorior: I might give it a try :)04:38
*** moguimar has joined #tripleo04:39
*** anilvenkata has joined #tripleo04:40
*** jaosorior has quit IRC04:40
Tengujaosorior: hmmm there's this log file, but I don't see error in it for now: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/tempest/tempest.log.txt.gz04:40
Tenguduh..04:40
*** jaosorior has joined #tripleo04:47
jaosoriorTengu: do you have a node where you can deploy?04:48
Tengujaosorior: not yet, I should get a tiny monster tomorrow (ordered yesterday in an online store)04:48
Tengujaosorior: and as I lack the RAM in my laptop, can't really afford to deploy anything on it.04:49
Tengujaosorior: you can follow the subnet life in this log file: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/tempest/tempest.log.txt.gz04:49
*** ykarel|away has joined #tripleo04:50
Tengualthough I don't really see anything weird :/.04:50
Tenguit ends up with the failure04:50
jaosoriorright04:50
jaosoriorI just don't know what it tried to do before the failure04:50
Tenguah, the log would show it.04:50
Tengujaosorior: the error starts here: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/tempest/tempest.log.txt.gz#_2018-05-15_18_19_03_47104:51
Tenguyou might want to check a bit above that line?04:51
*** saneax-_-|AFK is now known as saneax04:51
*** pblaho has joined #tripleo04:52
Tenguapparently.... hmm. there are a lot of redirections just before, if we take the XML file present in the parent dir.04:52
jaosoriorright04:53
jaosoriorthe redirection from heat04:53
Tenguah, port 13004 is heat?04:53
jaosorioryeah04:53
* Tengu takes note04:53
jaosoriorI don't really understand the redirections though04:53
jaosorior"Redirecting https://overcloud.localdomain:13004/v1/99fdd28b260b460c9040a043b5d763fe/stacks/integration_test -> https://overcloud.localdomain:13004/v1/99fdd28b260b460c9040a043b5d763fe/stacks/integration_test/156f4da9-ee97-42ab-bb76-8c66ccc6f774"04:54
jaosorioruhu04:54
jaosoriorbrb04:54
*** jaosorior has quit IRC04:54
*** paramite_ has joined #tripleo04:58
*** cshastri has joined #tripleo04:59
*** jaosorior has joined #tripleo05:02
Tengujaosorior: digging a bit further in the script launching tempest, it has a concurrency of 1, so it should NOT create some race condition.05:04
jaosoriorwell that's good to know05:04
jaosoriorbut... where are the redirections coming from?05:04
Tenguhttp://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/tempest_output.log.txt.gz#_2018-05-15_18_18_4105:04
TenguI think the best shot would be to check the telemetry_tempest_plugin.scenario.test_telemetry_integration.TestTelemetryIntegration.test_autoscaling content.05:05
*** aufi has joined #tripleo05:05
Tenguso that we might understand WHAT it does. this should be deployed on any undercloud I guess.05:05
Tenguas part of the tempest bundle05:05
Tengumeaning we should even get a hand on it via github05:05
jaosoriorTengu: https://github.com/openstack/telemetry-tempest-plugin05:06
*** yprokule has joined #tripleo05:06
Tenguthank you :)05:06
jaosoriorTengu: https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml05:06
jaosoriorTengu: oho https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml#L2905:07
jaosoriorthat's an explicit loop05:07
jaosoriorthat's why we see all those redirects05:07
jaosoriorfunky05:07
Tenguhmm05:08
Tenguso it loops 300 times on the same URL05:08
Tenguwith 1 second delay05:08
jaosoriorwell05:08
Tengu..... that's a DoS05:08
jaosoriorI guess it polls until it gets a status 20005:09
Tenguunless it "breaks" the loop when it gets a 20005:09
Tengu:D05:09
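The behaviour discussed above (up to 300 attempts, 1 second apart, stopping at the first 200) is an ordinary poll-until-success loop rather than 300 unconditional requests. A rough Python sketch of that pattern follows; `poll_until` and `fetch_status` are hypothetical names, and this is not how gabbi itself is implemented.

```python
import time


def poll_until(fetch_status, wanted=200, count=300, delay=1.0, sleep=time.sleep):
    """Call fetch_status() up to `count` times; return the attempt that matched.

    `fetch_status` is any zero-argument callable returning an HTTP status
    code. The loop stops at the first `wanted` status, so `count` is only an
    upper bound on the number of requests.
    """
    for attempt in range(1, count + 1):
        if fetch_status() == wanted:
            return attempt
        if attempt < count:
            sleep(delay)  # no point sleeping after the final attempt
    raise TimeoutError(f"no {wanted} after {count} attempts")


if __name__ == "__main__":
    # Simulated service: two redirects, then success.
    statuses = iter([302, 302, 200])
    print(poll_until(lambda: next(statuses), delay=0))
```

With this shape, a healthy service sees only a handful of requests; the full 300 are only sent when the expected status never arrives.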
Tenguanyway. maybe heat engine is just falling apart with that loop05:09
Tengujaosorior: you said it was random right?05:10
jaosoriorTengu: I think what happens is that it fails creating the stack05:10
Tenguhmm. do we actually get the 300 iterations? I don't think so. urllib3 fails before the end05:10
*** udesale has joined #tripleo05:11
Tengudue to some connection issue. funky part, it's not a "connection refused", meaning heat might still be running properly05:11
*** dxiri has quit IRC05:11
jaosoriorI don't see any issues in the heat logs05:12
jaosorioractually, someone does a stack update05:12
jaosoriorand it even gets to update complete05:12
jaosoriorTengu: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/containers/heat/heat-engine.log.txt.gz#_2018-05-15_18_18_39_99105:12
jaosoriorright05:12
Tengudarn. of course, "containers" -.-05:12
jaosoriorTengu: the stack update is here https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml#L10205:12
Tenguforgot that directory while searching.05:12
TenguL108 in fact05:13
Tengubut yep.05:13
Tengujaosorior: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/containers/heat/heat-engine.log.txt.gz#_2018-05-15_18_17_29_184  we do have the CREATE COMPLETE05:15
Tenguand, according to the log timestamp, it's BEFORE the crash05:15
jaosoriorSo, it's not a heat issue05:15
Tenguand there's a second stack created successfully, still before the crash: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/containers/heat/heat-engine.log.txt.gz#_2018-05-15_18_17_43_13305:16
*** elgxl has joined #tripleo05:17
jaosoriorTengu: I think the whole thing went well and it failed on the cleanup05:20
Tenguhmm. indeed.05:21
Tengumeaning the cleanup maybe isn't full05:21
Tengui.e. it tries to drop the subnet before the instances/ports are really down05:21
Tenguso this is another .yaml file right?05:21
*** elgxl has quit IRC05:21
jaosoriortrying to find it05:22
Tengumaybe https://github.com/openstack/telemetry-tempest-plugin/blob/b30a19214d0036141de75047b444d48ae0d0b656/telemetry_tempest_plugin/integration/gabbi/gabbits-live/aodh-gnocchi-threshold-alarm.yaml ?05:22
Tenguthe only one matching "teardown" in that repo05:22
*** ratailor has joined #tripleo05:23
Tengufunky.05:23
Tenguthis is in the test just BEFORE the one that fails05:23
*** saneax is now known as saneax-_-|AFK05:23
Tengujaosorior: https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml#L147 well..05:24
*** remus has joined #tripleo05:24
jaosoriorTengu: sure05:25
jaosoriorTengu: it deletes the stack; but we see that it deleted some image05:25
jaosoriorthe images are not handled as part of the stack05:25
Tenguthere's also: https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml#L129 just before the stack deletion05:25
*** agurenko has joined #tripleo05:26
jaosoriorTengu: shit05:28
jaosoriorTengu: so...05:28
jaosoriorTengu: this is where the integration tests get called https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/scenario/test_telemetry_integration.py05:29
Tenguyup05:29
jaosoriorhere, for instance is how they assign the endpoints https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/scenario/test_telemetry_integration.py#L8405:29
jaosoriormost of it is just getting the endpoints05:29
Tenguyup.05:29
jaosoriorexcept for the glance image05:29
jaosoriorwhich calls self.glance_image_create()05:30
jaosoriorthat function is part of the parent class05:30
jaosoriormanager.ScenarioTest05:30
jaosoriorwhich is part of the base tempest definitions05:30
Tenguhttps://github.com/openstack/tempest/blob/526468df52e4dcb8193259ebd55f100dddb97fd2/tempest/scenario/manager.py#L42905:31
jaosoriorright05:31
jaosoriorthat calls _image_create05:31
TenguL40005:31
jaosoriorwhich does this https://github.com/openstack/tempest/blob/526468df52e4dcb8193259ebd55f100dddb97fd2/tempest/scenario/manager.py#L42005:31
jaosoriorso, what we're seeing is the cleanup running all the "addCleanup" callbacks05:32
jaosoriorwhich, IIRC, are called at random05:32
jaosorioror maybe they aren't, but still, we would need to check each addCleanup that is used05:32
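On the ordering question above: in Python's standard `unittest`, functions registered with `addCleanup` run in reverse registration order (last in, first out), not at random. The sketch below shows this with the standard library only. Tempest builds on testtools rather than plain `unittest`, so treating its ordering as the same is an assumption here; the cleanup labels are invented for illustration.

```python
import unittest

calls = []  # records the order in which the cleanups actually run


class CleanupOrderDemo(unittest.TestCase):
    def test_registers_cleanups(self):
        # Registered in creation order: image first, port last.
        self.addCleanup(calls.append, "delete image")
        self.addCleanup(calls.append, "delete subnet")
        self.addCleanup(calls.append, "delete port")


def run_demo():
    """Run the test case once and return the observed cleanup order."""
    calls.clear()
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(CleanupOrderDemo)
    suite.run(unittest.TestResult())
    return list(calls)


if __name__ == "__main__":
    print(run_demo())
```

Because cleanups unwind in reverse, a resource created last is removed first, which is why each `addCleanup` call site is still worth checking when a subnet is reported as in use during teardown.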
jaosoriorTengu: oho05:36
jaosoriorwait up05:36
jaosoriorI skipped something05:36
Tenguhmm?05:36
* Tengu digging in the code05:36
jaosoriorTengu: So, check the SSL exception log05:37
*** jtomasek has joined #tripleo05:37
jaosoriorTengu: I had missed this:05:37
jaosoriorAssertionError: From test "check event" :05:37
Tenguso we know what test actually fails with that05:38
jaosoriorTengu: it's this https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/integration/gabbi/gabbits-live/autoscaling.yaml#L6605:38
Tengupanko?05:39
Tenguwhat service is that?05:39
jaosoriorTengu: https://docs.openstack.org/panko/latest/05:39
Tenguah, yep05:39
Tengustorage for ceilometer05:39
Tenguis panko listening with TLS?05:39
jaosoriorTengu: its public endpoint is05:40
jaosoriorbut hey! at least we know what service failed05:40
Tenguhttp://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/containers/httpd/panko-api/panko_wsgi_error.log.txt.gz05:41
jaosorioruuuh...05:41
jaosoriorfunky05:41
Tengualthough..05:41
Tenguthe access log actually does show working connections later.05:41
*** dparkes has quit IRC05:41
jaosoriorTengu: nothing in the panko logs either http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/containers/panko/app.log.txt.gz05:42
Tengujaosorior: was about to say that :]05:42
Tengujust the warning, but not an ssl-related.05:42
TenguBUT05:42
Tengussl is managed in haproxy right?05:43
jaosorioryes05:43
Tenguwhat does IT say.05:43
jaosoriorwas about to check the config05:43
*** marios has joined #tripleo05:43
Tenguhmmm where is it now with the containers....05:43
*** quiquell|off is now known as quiquell05:43
jaosoriorTengu: it still outputs its logs to journal05:44
Tenguhmmm yep, but its config file itself?05:44
Tenguoh. we're missing /var/lib/kolla directory ?05:45
jaosoriorTengu: http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/subnode-2/var/log/config-data/haproxy/etc/haproxy/05:45
Tenguah05:46
Tengujaosorior: wondering.... there's no support for the redirect in the yaml part regarding panko05:47
jaosoriorTengu: ??05:47
Tenguin the autoscaling.yaml that is05:48
jaosoriorTengu: what do you mean?05:48
Tengumissing "redirects: true" - but not sure it causes any issue for that one05:48
openstackgerritRajesh Tailor proposed openstack/tripleo-heat-templates master: Allow configuration of NFS backend for Nova  https://review.openstack.org/56417905:50
*** masco has joined #tripleo05:50
*** dciabrin has quit IRC05:50
jaosoriorTengu: so there's two options I can see right now05:51
jaosoriorone is that the redirect is messing things up05:51
jaosoriorthe second is that from the beginning we are getting the URL wrong05:51
jaosoriorBut I think the issue is in the call to $ENVIRON['PANKO_SERVICE_URL']/v2/events05:52
jaosoriorI'm trying to bring up an environment to reproduce it05:52
Tengujaosorior: May 15 18:18:39 centos-7-rax-dfw-0004034638 haproxy[53297]: 192.168.24.2:54498 [15/May/2018:18:18:39.577] panko panko/<NOSRV> -1/-1/-1/-1/9 400 187 - - PR-- 126/0/0/0/3 0/0 "<BADREQ>"05:53
Tengugotcha05:53
Tenguthank you, lovely grep :)05:53
*** fragatina has quit IRC05:55
*** fragatina has joined #tripleo05:55
*** dparkes has joined #tripleo05:56
jaosoriorTengu: don't really get that log05:56
Tengujaosorior: in journal05:57
jaosoriorright05:57
jaosoriorbut, don't really understand what happened there05:57
jaosoriorsomething did a bad request to the panko endpoint05:57
jaosoriorbut... why doesn't it show the endpoint that was used?05:57
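The haproxy line quoted above is easier to read once split into fields. Below is a small Python sketch that parses it with a regular expression written for this one line shape (it is not a general haproxy log parser, and the field names are chosen for illustration). In haproxy's termination-state codes, a leading `P` means the proxy itself aborted the session and `R` means it was still waiting for a complete, valid request. That fits the `<BADREQ>` and `<NOSRV>` markers: the request was never parsed as HTTP and never forwarded to a server, so there is no URL to log.

```python
import re

# Line copied from the chat, including the journal prefix.
JOURNAL_LINE = (
    "May 15 18:18:39 centos-7-rax-dfw-0004034638 haproxy[53297]: "
    "192.168.24.2:54498 [15/May/2018:18:18:39.577] panko panko/<NOSRV> "
    '-1/-1/-1/-1/9 400 187 - - PR-- 126/0/0/0/3 0/0 "<BADREQ>"'
)

# Pattern for the HTTP log layout seen in that line (illustrative only).
HAPROXY_HTTP_LOG = re.compile(
    r"(?P<client>\S+) \[(?P<accepted>[^\]]+)\] (?P<frontend>\S+) "
    r"(?P<backend>[^/\s]+)/(?P<server>\S+) (?P<timers>\S+) "
    r"(?P<status>\d{3}) (?P<bytes>\d+) (?P<req_cookie>\S+) (?P<resp_cookie>\S+) "
    r'(?P<term_state>\S{4}) (?P<conns>\S+) (?P<queues>\S+) "(?P<request>[^"]*)"'
)

# Only the two flags relevant to this line are described here.
TERM_STATE_HINTS = {
    "P": "session aborted by the proxy itself",
    "R": "proxy was waiting for a complete, valid request from the client",
}


def parse_haproxy_line(line):
    """Return the named fields of one haproxy HTTP log line, or None."""
    match = HAPROXY_HTTP_LOG.search(line)
    return match.groupdict() if match else None


if __name__ == "__main__":
    fields = parse_haproxy_line(JOURNAL_LINE)
    print(fields["frontend"], fields["status"], fields["term_state"])
    for flag in fields["term_state"][:2]:
        print(flag, "=", TERM_STATE_HINTS.get(flag, "see the haproxy docs"))
```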
Tengujaosorior: http://paste.openstack.org/show/721060/05:58
jaosorioroho05:59
* Tengu loves grep05:59
TenguI could isolate haproxy process logs as well if you want them :)05:59
jaosoriorthat would be nice :D05:59
jaosoriorbut yeah, still not entirely sure what the issue is :/06:00
Tengu wc -l process.log06:00
Tengu4526 process.log06:00
Tenguor maybe not that nice XD06:00
*** dparkes has quit IRC06:00
Tengujaosorior: `grep 'haproxy\[' journal.log.gz > process.log`06:01
Tenguwget apparently gets an uncompressed file.06:01
Tenguprobably an inflate from the web server.06:01
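Since the downloaded `journal.log.gz` may or may not still be compressed, the grep step above can be made indifferent to that. A minimal Python sketch, assuming only that the file is either gzip or plain text; `haproxy_lines` is a hypothetical helper name.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def haproxy_lines(path):
    """Return the lines containing 'haproxy[' from a plain or gzipped log."""
    with open(path, "rb") as fh:
        is_gzip = fh.read(2) == GZIP_MAGIC
    opener = gzip.open if is_gzip else open
    with opener(path, "rt", encoding="utf-8", errors="replace") as fh:
        return [line.rstrip("\n") for line in fh if "haproxy[" in line]


if __name__ == "__main__":
    import sys

    for line in haproxy_lines(sys.argv[1]):
        print(line)
```

Checking the magic bytes instead of the file extension covers the case described in the chat, where the web server or client has already inflated a file that is still named `.gz`.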
jaosoriorchandankumar: are you around?06:03
*** psachin has quit IRC06:03
*** jfrancoa has joined #tripleo06:03
*** pliu has quit IRC06:04
*** lucas-afk has quit IRC06:05
*** remus has quit IRC06:06
*** pliu has joined #tripleo06:06
jaosoriorTengu: I'm trying to figure out how tempest got the panko url06:06
jaosoriorTengu: this is how the urls are assigned https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/scenario/test_telemetry_integration.py#L8406:07
*** lucasagomes has joined #tripleo06:07
jaosoriorit eventually calls this function https://github.com/openstack/telemetry-tempest-plugin/blob/master/telemetry_tempest_plugin/scenario/test_telemetry_integration.py#L4506:07
*** yprokule_ has joined #tripleo06:07
jaosoriorthis is the tempest config that was used http://logs.openstack.org/45/560445/29/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/6c9f8c0/logs/undercloud/home/zuul/tempest/etc/tempest.conf.txt.gz06:07
Tengujaosorior: apparently the URLs are fed directly by the API06:10
*** yprokule has quit IRC06:10
*** yprokule_ is now known as yprokule06:10
*** pblaho has quit IRC06:11
*** moguimar has quit IRC06:11
*** psachin has joined #tripleo06:12
jaosoriorTengu: yeah, they're gotten from the keystone catalog apparently06:13
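If the URLs do come from the keystone catalog, the scheme and port handed to the test can be checked directly from a catalog dump. The Python sketch below assumes the keystone v3 catalog layout (a list of services, each with a `type` and a list of `endpoints` carrying `interface` and `url`). The sample entries, hostnames, ports, and the `event` service type used for panko are illustrative assumptions, not values taken from the failing job.

```python
from urllib.parse import urlsplit

# Hypothetical catalog excerpt; hostnames and ports are placeholders.
SAMPLE_CATALOG = [
    {
        "type": "event",
        "name": "panko",
        "endpoints": [
            {"interface": "internal", "url": "http://192.0.2.10:8977"},
            {"interface": "public", "url": "https://overcloud.example:13977"},
        ],
    },
]


def endpoint_url(catalog, service_type, interface="public"):
    """Return the URL of the first matching endpoint in a v3-style catalog."""
    for service in catalog:
        if service.get("type") != service_type:
            continue
        for endpoint in service.get("endpoints", []):
            if endpoint.get("interface") == interface:
                return endpoint["url"]
    raise LookupError(f"no {interface} endpoint for service type {service_type!r}")


def scheme_and_port(url):
    """Split a URL into (scheme, port) to spot an https URL on an http port."""
    parts = urlsplit(url)
    return parts.scheme, parts.port


if __name__ == "__main__":
    url = endpoint_url(SAMPLE_CATALOG, "event")
    print(url, scheme_and_port(url))
```

Comparing the result with the haproxy listener for the same service would show whether an https URL is being sent to a port that only speaks plain HTTP.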
*** moguimar has joined #tripleo06:13
jaosoriorTengu: still trying to bring up an environment06:14
jaosoriorI'll let you know when I do so we can test this out06:14
Tengujaosorior: tomorrow I should get all the ordered things, and I'll take a moment in order to build the computer, install it with some OS, and deploy a VM-based lab06:14
jaosoriorTengu: I have CentOS in my machine06:14
jaosoriorthat's what I've been using to deploy06:15
jaosoriorit's been working quite alright06:15
Tenguguess it doesn't really matter as the lab itself will be VM-based :).06:15
*** hjensas has quit IRC06:15
Tengubut I think I'll build something very similar to what I did in my previous job06:15
jaosoriorTengu: I think quickstart makes some assumptions as what the host OS is06:15
Tenguerf.. is RDO third party CI down?06:16
jaosoriorso, I do suggest either RHEL or CentOS06:16
jaosoriorno idea06:16
Tenguyup, will probably go centos.06:16
Tenguanyway, that's for tomorrow.06:16
openstackgerritMerged openstack/python-tripleoclient master: (cleanup) remove usage of vars when calling ansible  https://review.openstack.org/56834606:17
Tenguhttps://logs.rdoproject.org/28/566228/9/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z14cea5ca93a54c4b823f0d4d36faf804/undercloud/home/jenkins/overcloud_prep_images.log.txt.gz  grmbl.06:18
*** cylopez has joined #tripleo06:25
*** pblaho has joined #tripleo06:25
*** skramaja has joined #tripleo06:26
*** hewbrocca_afk has quit IRC06:26
*** markmc has quit IRC06:26
*** udesale has quit IRC06:27
*** dparkes has joined #tripleo06:29
*** udesale has joined #tripleo06:29
*** holser__ has joined #tripleo06:33
*** elgxl has joined #tripleo06:34
*** chandankumar has quit IRC06:38
*** apetrich has quit IRC06:38
*** chandankumar has joined #tripleo06:39
*** lvdombrkr has joined #tripleo06:39
chandankumarjaosorior: yes sensei, How can I help ?06:40
jaosoriorchandankumar: I had some tempest questions but ended sorting them out06:43
jaosoriorthanks!06:43
chandankumarjaosorior: cool!06:43
Tengujaosorior: you have caught the issue? :)06:44
chandankumarjaosorior: I need some help on https://review.openstack.org/#/q/topic:tempest_log+(status:open+OR+status:merged)06:44
openstackgerritMerged openstack/python-tripleoclient master: undercloud upgrade: include UndercloudUpgrade service  https://review.openstack.org/56871606:47
*** saneax-_-|AFK is now known as saneax06:47
chandankumarjaosorior: what was the issue by the way?06:47
*** quiquell is now known as quiquell|afk06:49
sshnaidm|rover chandankumar, arxcruz, do you know about telemetry tests failing? http://logs.openstack.org/21/528621/16/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/de36d70/logs/undercloud/home/zuul/tempest_output.log.txt.gz06:50
*** apetrich has joined #tripleo06:50
Tenguchandankumar: ah, well, that is the issue jaosorior was tracking down :) -^06:53
chandankumarTengu: is it the same telemetry issue?06:54
Tengu2s06:54
sshnaidm|roverchandankumar, arxcruz fyi: https://bugs.launchpad.net/tripleo/+bug/177150806:54
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)06:54
Tenguchandankumar: yes, exactly if I check this: http://logs.openstack.org/21/528621/16/check/tripleo-ci-centos-7-scenario002-multinode-oooq-container/de36d70/logs/undercloud/home/zuul/tempest/tempest.html.gz06:54
*** marrusl has quit IRC06:54
chandankumarsshnaidm|rover: ^^ jaosorior is hunting down06:54
sshnaidm|roverchandankumar, it's blocking gates currently, so I'd propose to exclude it from tempest until it's solved06:55
*** hjensas has joined #tripleo06:55
chandankumarsshnaidm|rover: ack!06:55
*** slaweq has quit IRC06:56
sshnaidm|roverchandankumar, arxcruz  can you please add it to exclude list for now?06:56
jaosoriorTengu: haven't caught the issue... still deploying the environment06:57
jaosoriorjust waiting for ansible to run06:58
jaosoriorlike watching paint dry06:58
*** masco has quit IRC06:58
*** udesale has quit IRC06:58
*** slaweq has joined #tripleo06:58
TenguXD06:58
Tengujaosorior: btw, you use quickstart for such deploy?06:59
*** olap has joined #tripleo07:07
*** udesale has joined #tripleo07:07
*** marrusl has joined #tripleo07:08
*** ccamacho has joined #tripleo07:08
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Add test_autoscaling tests to skip list  https://review.openstack.org/56876607:09
chandankumarsshnaidm|rover: ^^07:09
openstackgerritMartin André proposed openstack/tripleo-quickstart-extras master: Remove duplicated undercloud_enable_tempest key  https://review.openstack.org/56876707:10
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097207:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150807:10
*** ooolpbot has quit IRC07:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)07:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)07:10
Tenguoh. the issue I got (introspection fails) :).07:10
sshnaidm|roverchandankumar, thanks!07:10
*** rcernin has quit IRC07:10
Tenguso won't obsess on the rechecks for now.07:10
*** masco has joined #tripleo07:11
chandankumarsshnaidm|rover: do we gate gnocchi github pr with tripleo?07:12
sshnaidm|roverchandankumar, I don't think so07:12
chandankumarsshnaidm|rover: I think we need one07:12
jaosoriorTengu: yep, quickstart07:12
sshnaidm|roverchandankumar, why github pr? gnocchi is in gerrit07:12
chandankumarsshnaidm|rover: it is on github, moving outside openstack long time ago07:13
jaosoriorsshnaidm|rover: not anymore AFAIK07:13
chandankumarsshnaidm|rover: https://github.com/gnocchixyz/gnocchi07:13
sshnaidm|roverI see, didn't know about that..07:14
sshnaidm|roverand why do it? to prevent gating and error detection? :)07:14
chandankumarsshnaidm|rover: yup, because telemetry-tempest-plugin gets out of sync07:15
chandankumarsshnaidm|rover: below is the list of jobs running https://github.com/gnocchixyz/gnocchi/pull/87807:16
sshnaidm|roverit's only linter jobs, they don't check functionality07:17
*** agopi has quit IRC07:17
*** tesseract has joined #tripleo07:17
sshnaidm|roverwell, I don't remember we ever gated gnocchi with tripleo07:17
chandankumarsshnaidm|rover: adding a card in trello07:17
bandinijaosorior: apologies, was away yesterday. I see https://review.openstack.org/#/c/554926/ has merged \o/07:19
*** udesale has quit IRC07:19
*** ykarel|away is now known as ykarel07:19
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: WIP move unfencing to meta_params  https://review.openstack.org/56876907:23
*** udesale has joined #tripleo07:24
*** zoli is now known as zoli|sickday07:26
*** zoli|sickday is now known as zoli07:27
*** quiquell|afk is now known as quiquell07:33
arxcruzchandankumar: sshnaidm|rover hi, sorry, was on the train, anything i can do ?07:33
*** florianf has joined #tripleo07:34
*** ffiore has joined #tripleo07:38
*** tosky has joined #tripleo07:38
*** elgxl has quit IRC07:39
sshnaidm|roverarxcruz, thanks, I think we are good atm07:43
*** agopi has joined #tripleo07:43
sshnaidm|roverarxcruz, but need to investigate why tempest test fail later07:43
arxcruz:)07:44
*** moshele has joined #tripleo07:45
arxcruzsshnaidm|rover: k07:46
chandankumararxcruz: https://review.openstack.org/#/c/568766/07:47
arxcruzchandankumar: maybe we should contact pradik07:49
arxcruzsshnaidm|rover: ^07:49
chandankumararxcruz: yup, better assign to them07:49
*** jpena|off is now known as jpena07:50
*** dmacpher has quit IRC07:50
*** dmacpher has joined #tripleo07:51
*** kopecmartin has joined #tripleo07:53
*** amoralej|off is now known as amoralej07:53
*** psahoo has quit IRC07:55
*** ykarel is now known as ykarel|lunch07:56
openstackgerritMartin Kopec proposed openstack/tripleo-quickstart-extras master: Remove hardcoded cinder v1 option for tempestconf  https://review.openstack.org/56877607:58
*** shardy has joined #tripleo08:00
*** mdnadeem has joined #tripleo08:04
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Add python script to dynamically compose releases  https://review.openstack.org/56752108:05
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Move unfencing to meta_params  https://review.openstack.org/56876908:07
sshnaidm|roverarxcruz, yeah, definitely08:09
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097208:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150808:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)08:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)08:10
*** psahoo has joined #tripleo08:11
*** akrivoka has joined #tripleo08:14
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: [WIP] Implement --output-file to write the bash script  https://review.openstack.org/56828508:15
*** markmc has joined #tripleo08:21
*** hewbrocca_afk has joined #tripleo08:22
openstackgerritMartin André proposed openstack/tripleo-common master: Revert "Revert "Pass connection info via ansible config file""  https://review.openstack.org/56878108:23
openstackgerritMartin André proposed openstack/tripleo-common master: Revert "Revert "Pass connection info via ansible config file""  https://review.openstack.org/56878108:25
openstackgerritDoug Szumski proposed openstack/diskimage-builder master: Remove duplicate GRUB command line entry  https://review.openstack.org/56860008:31
openstackgerritArx Cruz proposed openstack/tripleo-quickstart-extras master: Fix generation of docs  https://review.openstack.org/56878308:33
*** ssbarnea_ has joined #tripleo08:35
*** dbecker has quit IRC08:37
*** ykarel|lunch is now known as ykarel08:37
*** derekh has joined #tripleo08:41
*** agopi has quit IRC08:44
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: FFU Add cinder-backup missing fast_forward_upgrade_tasks  https://review.openstack.org/56852008:45
*** wolverineav has joined #tripleo08:46
openstackgerritJiri Tomasek proposed openstack/tripleo-ui master: Run undeploy_plan workflow to delete deployment  https://review.openstack.org/56636608:51
*** gkadam has joined #tripleo08:53
*** salmankhan has joined #tripleo08:54
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: Resolve deprecation syntax warning  https://review.openstack.org/56878908:59
*** radeks_ has joined #tripleo09:00
*** gfidente has joined #tripleo09:06
*** gfidente has quit IRC09:06
*** gfidente has joined #tripleo09:06
*** links has quit IRC09:07
*** jistr has quit IRC09:09
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097209:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150809:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)09:10
*** ooolpbot has quit IRC09:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)09:10
*** jistr has joined #tripleo09:12
openstackgerritJose Luis Franco proposed openstack/tripleo-upgrade master: Separate scripts creation tasks for under/overcloud.  https://review.openstack.org/56629109:16
openstackgerritJose Luis Franco proposed openstack/tripleo-upgrade master: Move SSL undercloud validation out of UC script create tasks.  https://review.openstack.org/56719009:16
*** panda|bbl is now known as panda09:18
*** links has joined #tripleo09:23
*** bogdando has joined #tripleo09:23
*** pmannidi has quit IRC09:27
openstackgerritBogdan Dobrelya proposed openstack/python-tripleoclient master: Persist generated undercloud parameters t-h-t  https://review.openstack.org/56576409:36
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart-extras master: Fix path and wire-in UC deploy role data file  https://review.openstack.org/56398809:36
openstackgerritCarlos Goncalves proposed openstack/tripleo-common stable/queens: Install Octavia amphora image if Red Hat  https://review.openstack.org/56880109:40
*** dtantsur has joined #tripleo09:43
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Add python script to dynamically compose releases  https://review.openstack.org/56752109:47
jaosoriorbogdando: we have logrotate configured in the overcloud nodes, right?09:48
derekhsshnaidm|rover:  I'm looking into https://bugs.launchpad.net/tripleo/+bug/1770972 again09:50
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)09:50
sshnaidm|roverderekh, thanks, seems like the problem is back09:50
derekhsshnaidm|rover: is the image download location logged for this job somewhere? https://logs.rdoproject.org/65/566565/7/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z14cd35a437b546d0a25ba0721c29b5d0/undercloud/home/jenkins/overcloud_prep_images.log.txt.gz#_2018-05-15_18_57_2109:50
sshnaidm|roverderekh, looking..09:51
sshnaidm|roverderekh, only those tasks: https://logs.rdoproject.org/65/566565/7/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z14cd35a437b546d0a25ba0721c29b5d0/console.txt.gz#_2018-05-15_17_21_34_64509:52
sshnaidm|roverderekh, but I see that md5 checking is skipping, very odd.. checking now09:53
sshnaidm|roverhttps://logs.rdoproject.org/65/566565/7/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z14cd35a437b546d0a25ba0721c29b5d0/console.txt.gz#_2018-05-15_17_21_37_47609:53
sshnaidm|roverderekh, damn.. I suspect I know what happens..09:55
derekhAlso I noticed that the 2 jobs lined to as examples in the bug appear to be using ramdisks with different sizes from each other09:55
derekh2018-05-15 17:55:58 | | e5b21dfb-c043-4c26-8a0f-e1396c4d9ac6 | bm-deploy-ramdisk |     ari     | 391551106 | active |09:55
derekh2018-05-15 17:11:29 | | 8fe45a7e-e3b9-447f-8dbb-08762b79b9a7 | bm-deploy-ramdisk |     ari     | 393831283 | active |09:55
derekhsshnaidm|rover: ya?09:56
derekh*linked09:56
sshnaidm|roverderekh, we check md5 here: https://github.com/openstack/tripleo-quickstart/blob/dc188905a1f5498a1562a4dda9963f79d25fe1ac/roles/fetch-images/tasks/fetch.yml#L13609:56
sshnaidm|roverderekh, but we use ansible cache for variables.. I'm not sure - with set_fact do we override cached variables or not?09:57
sshnaidm|roverderekh, because if not - we check md5 for the previously downloaded image - overcloud.qcow209:57
derekhsshnaidm|rover: I got no idea09:57
sshnaidm|roverderekh, we run this role twice - for overcloud image and ipa, overcloud is first09:57
sshnaidm|roverderekh, ok, I'll submit a patch09:58
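The suspicion discussed above (a checksum computed for the first image being reused when the same role runs again for the second image) can be illustrated with a minimal Python sketch. This is not the tripleo-quickstart role and not Ansible's fact cache; `md5_of_file`, `verify_image`, and the `cache` dict are hypothetical names used only to show how a stale cached value can make an MD5 check pass or fail for the wrong file.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_image(path, expected_md5, cache=None):
    """Compare the MD5 of `path` with `expected_md5`.

    Hypothetical illustration: when a `cache` dict is passed and already
    holds "md5_actual", that value is reused instead of being recomputed,
    which mimics a cached variable that is never overridden.
    """
    if cache is not None and "md5_actual" in cache:
        actual = cache["md5_actual"]  # possibly stale: belongs to another file
    else:
        actual = md5_of_file(path)
        if cache is not None:
            cache["md5_actual"] = actual
    return actual == expected_md5
```

Calling `verify_image` without a cache (or clearing the cached value between runs) forces the digest to be recomputed for each image, which is the behaviour a "don't cache the calculated md5" change would be aiming for.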
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart master: Don't cache calculated md5 variables  https://review.openstack.org/56880510:03
sshnaidm|roverderekh, let's see ^^10:03
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart master: Don't cache calculated md5 variables  https://review.openstack.org/56880510:04
sshnaidm|roverderekh, btw, we changed image yesterday again in promotion10:05
sshnaidm|roverderekh, if this image is corrupted, maybe we have problems with uploading the image..10:05
*** bkopilov has quit IRC10:05
derekhsshnaidm|rover: I'm testing the wrong image so, this didn't change yesterday https://images.rdoproject.org/master/rdo_trunk/current-tripleo/10:05
derekhsshnaidm|rover: where should I be looking?10:06
sshnaidm|roverderekh, image that is used in jobs now is https://images.rdoproject.org/master/rdo_trunk/current-tripleo/10:06
derekhsshnaidm|rover: that's what I was looking at, but the timestamp is from monday10:07
sshnaidm|roverderekh, ok, so is that image ok?10:08
derekhsshnaidm|rover: it worked for me,10:08
sshnaidm|roverok.. so at least upload worked well10:08
derekh 100% of the 1 time I tried it10:08
openstackgerritmelissaml proposed openstack/tripleo-docs master: Trivial: Update pypi url to new url  https://review.openstack.org/56324110:10
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097210:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150810:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)10:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)10:10
*** salmankhan has quit IRC10:14
sshnaidm|roverpanda, can you please +2 on https://review.openstack.org/#/c/568766/ ?10:16
*** wolverineav has quit IRC10:17
*** salmankhan has joined #tripleo10:18
*** wolverineav has joined #tripleo10:18
*** agurenko_ has joined #tripleo10:18
*** agurenko has quit IRC10:19
sshnaidm|roverchandankumar, what is your username in LP?10:19
*** wolverineav has quit IRC10:23
*** jaganathan_ has quit IRC10:26
*** agurenko_ is now known as agurenko10:28
*** olap has quit IRC10:33
*** olap has joined #tripleo10:34
*** olap has quit IRC10:37
*** olap has joined #tripleo10:41
*** agurenko is now known as agurenko_10:42
*** olap has quit IRC10:44
*** olap has joined #tripleo10:44
*** links has quit IRC10:44
openstackgerritNir Magnezi proposed openstack/tripleo-heat-templates master: Make lb-mgmt-subnet a class B subnet  https://review.openstack.org/56813810:47
*** dciabrin has joined #tripleo10:47
*** masco has quit IRC10:47
*** olap has quit IRC10:49
*** olap has joined #tripleo10:50
openstackgerritNir Magnezi proposed openstack/tripleo-common master: Make lb-mgmt-subnet a class B subnet  https://review.openstack.org/56808910:51
*** dciabrin has quit IRC10:52
*** dciabrin has joined #tripleo10:53
*** dciabrin has quit IRC10:54
*** olap has quit IRC10:54
*** mburned_out is now known as mburned10:55
*** leanderthal has joined #tripleo10:56
*** agurenko has joined #tripleo10:56
*** dciabrin has joined #tripleo10:57
*** agurenko_ has quit IRC10:57
*** dciabrin has quit IRC10:57
*** links has joined #tripleo10:57
chandankumarsshnaidm|rover: chkumar24610:58
*** dciabrin has joined #tripleo10:59
*** dciabrin has quit IRC10:59
*** dciabrin has joined #tripleo10:59
*** masco has joined #tripleo11:00
sshnaidm|roverchandankumar, ok, I assigned https://bugs.launchpad.net/tripleo/+bug/1771508 to pradk for now, but he isn't online, please feel free to ping him about that11:02
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Pradeep Kilambi (pkilambi)11:02
chandankumarsshnaidm|rover: ack11:02
*** janki has quit IRC11:03
*** lucasagomes is now known as lucas-hungry11:04
*** dciabrin_ has joined #tripleo11:04
jaosoriorchandankumar, sshnaidm|rover: I'm also taking a look at it11:04
sshnaidm|roverjaosorior, great, thanks!11:04
jaosoriorsshnaidm|rover: I'll poke pradk when he's online about this11:05
jaosoriorso we can both try to sovle it11:05
jaosorior*solve11:05
chandankumarjaosorior: sshnaidm|rover I have pinged pradk on #rhos-mm internally11:05
*** dciabrin has quit IRC11:07
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097211:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150811:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)11:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Pradeep Kilambi (pkilambi)11:10
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Add method for getting dlrn_hash from release and hash_name  https://review.openstack.org/56732011:13
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Add releases script pytest tests to tox.ini  https://review.openstack.org/56764911:13
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: [WIP] Add CLI argument parser and YAML file parser  https://review.openstack.org/56793611:13
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: [WIP] Implement --output-file to write the bash script  https://review.openstack.org/56828511:13
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: Remove support for classic drivers  https://review.openstack.org/56782711:13
openstackgerritDmitry Tantsur proposed openstack/instack-undercloud master: Remove support for classic drivers  https://review.openstack.org/56788611:13
openstackgerritDmitry Tantsur proposed openstack/tripleo-common master: Fix handling hardware types and drivers when generating fencing parameters  https://review.openstack.org/56789611:14
*** ssbarnea_ has quit IRC11:14
openstackgerritDmitry Tantsur proposed openstack/tripleo-common master: Convert classic drivers to hardware types on enrollment  https://review.openstack.org/56790011:14
*** dciabrin_ has quit IRC11:18
openstackgerritArx Cruz proposed openstack/tripleo-quickstart-extras master: Fix generation of docs  https://review.openstack.org/56878311:23
*** jaosorior has quit IRC11:23
*** dciabrin_ has joined #tripleo11:27
florianfjtomasek: so I ran into an interesting problem with the deployment status tracking11:28
*** olap has joined #tripleo11:29
*** abishop has joined #tripleo11:31
*** janki has joined #tripleo11:31
florianfjtomasek: I can't pinpoint exactly what happened, because I was switching ui patches between deployment attempts. But in the end what happened is: I started a deployment through the UI. then deleted it with `openstack stack delete overcloud` while it was deploying. this resulted in an error message shown in the UI ("updating a stack when it's deleting isn't supported")11:32
*** janki has quit IRC11:32
*** quiquell is now known as quique|lunch11:32
florianfjtomasek: this error message is the last message stored in swift11:32
*** janki has joined #tripleo11:32
florianfjtomasek: But since then the stack has been deleted, but the error message still shows up.11:32
florianfjtomasek: which makes sense, because I guess if you delete a stack through `openstack stack delete` it will not leave a message in swift and thus the UI will not recognize the status change.11:34
*** abishop has quit IRC11:34
*** rh-jelabarre has joined #tripleo11:34
*** panda is now known as panda|lunch11:36
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: Run Ceph upgrade before converge.  https://review.openstack.org/56628211:36
*** dciabrin__ has joined #tripleo11:38
*** dciabrin_ has quit IRC11:41
*** jaosorior has joined #tripleo11:42
*** gkadam has quit IRC11:43
*** gkadam has joined #tripleo11:43
*** dciabrin__ has quit IRC11:43
*** raildo has joined #tripleo11:44
*** rmascena has joined #tripleo11:44
akrivokawhen deploying oooq on rdo cloud using devmode.sh, what is the proper place to make customizations for the deployment? I've tried changing the values in ~/.quickstart/config/environments/rdocloud.yml but they are getting overwritten with defaults when I run the devmode.sh11:45
openstackgerritMarios Andreou proposed openstack/python-tripleoclient master: Add .deployment.v1.deploy_on_servers to ffwd-upgrade prepare  https://review.openstack.org/56633611:45
akrivokae.g. I want to change undercloud_flavor and latest_guest_image11:46
*** jpena is now known as jpena|lunch11:47
*** ssbarnea_ has joined #tripleo11:47
*** jcoufal has joined #tripleo11:50
*** atoth has joined #tripleo11:51
jtomasekflorianf: yeah, the new undeploy workflow should be used to delete the deployment, so the status in swift gets updated too. When a user just deletes the stack, it gets into an incorrect state. This situation is roughly equivalent to a user deleting a swift object, for example. It is like deleting something from a database. But your issue would be resolved with these patches: https://review.openstack.org/#/c/564315/3 and https://review.openstack.org/#/c/566699/411:51
jtomasekflorianf: https://review.openstack.org/#/c/566699/411:51
jtomasekflorianf: one retrieves the deployment status from swift and other recovers deployment status based on stack status11:52
*** abishop has joined #tripleo11:52
jtomasekI raised a point of combining those together11:53
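The idea of combining the two sources (the last status stored in swift and the status of the Heat stack) can be sketched roughly as follows. This is a hypothetical illustration and not the code from the patches under review: `effective_status` and the labels it returns ("UNDEPLOYED", "DEPLOYING", "DELETING", "FAILED", "DEPLOYED") are made up for the example; only the `<ACTION>_IN_PROGRESS` / `<ACTION>_FAILED` shape of Heat stack statuses is taken as given.

```python
def effective_status(stored_status, stack_status):
    """Reconcile a stored deployment status with the live stack status.

    stored_status: last status recorded in object storage, or None.
    stack_status:  Heat-style stack status such as "CREATE_IN_PROGRESS",
                   or None when no stack exists.
    All returned labels are hypothetical.
    """
    if stack_status is None:
        # The stack is gone (e.g. deleted from the CLI), so any stored
        # message about it is stale.
        return "UNDEPLOYED"
    if stack_status.endswith("_IN_PROGRESS"):
        action = stack_status.split("_", 1)[0]
        return "DELETING" if action == "DELETE" else "DEPLOYING"
    if stack_status.endswith("_FAILED"):
        return "FAILED"
    # Stack is in a settled, successful state: trust the stored status
    # when there is one, otherwise assume a completed deployment.
    return stored_status or "DEPLOYED"
```

With this ordering, the case described above (stack deleted outside the workflow while an old error message is still stored) resolves to "UNDEPLOYED" instead of showing the stale error.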
openstackgerritBogdan Dobrelya proposed openstack/python-tripleoclient master: Fix hiera data override file writing  https://review.openstack.org/56881811:53
bogdandojaosorior: yes11:54
*** abishop has quit IRC11:55
florianfjtomasek: ok cool, so this is not an unknown issue. :-)11:55
chandankumarsshnaidm|rover: jaosorior as per sileht it appears that haproxy is not running ssl while the panko endpoint is https11:58
silehtor the reverse :)11:58
*** trown|outtypewww is now known as trown11:59
jaosoriorchandankumar: isn't it?11:59
silehtI just checked the haproxy config and it looks good11:59
silehtI'm looking for the keystone catalog content to see what the endpoint is11:59
silehtjaosorior, ^11:59
*** wolverineav has joined #tripleo12:00
bogdandomwhahaha: hi, do you remember that bug for broken exit code of undercloud install?12:00
jaosorior#startmeeting TripleO Security Squad12:00
openstackMeeting started Wed May 16 12:00:47 2018 UTC and is due to finish in 60 minutes.  The chair is jaosorior. Information about MeetBot at http://wiki.debian.org/MeetBot.12:00
openstackUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.12:00
*** openstack changes topic to " (Meeting topic: TripleO Security Squad)"12:00
openstackThe meeting name has been set to 'tripleo_security_squad'12:00
jaosoriorI'll wait a little bit for more folks to log in12:00
lhindshey oz12:00
jaosoriorhey lhinds! how's it going?12:01
lhindsgood thanks12:01
jaosorior#topic Public TLS work update12:06
*** openstack changes topic to "Public TLS work update (Meeting topic: TripleO Security Squad)"12:06
jaosoriorright! so12:07
*** amoralej is now known as amoralej|lunch12:07
jaosoriorpublic TLS by default merged12:07
jaosorior....and it was reverted :D12:07
jaosoriorIt was reverted here https://review.openstack.org/#/c/568699/12:08
jaosoriorbecause of this bug https://bugs.launchpad.net/tripleo/+bug/177143512:08
openstackLaunchpad bug 1771435 in tripleo "scenario001/002 failing on autoscaling with urllib3.exceptions.SSLError: [SSL: UNKNOWN_PROTOCOL] unknown protocol (_ssl.c:579)" [Critical,Fix released] - Assigned to Alex Schultz (alex-schultz)12:08
jaosoriorit seems that tempest (the telemetry plugin) is poking panko12:09
*** lucas-hungry is now known as lucasagomes12:09
jaosoriorand it gets a TLS endpoint with a non-TLS port (for some strange reason)12:09
jaosoriorI'm still not sure why that happens12:09
jaosoriorbut I'm looking into it12:09
jaosoriorseems sileht is also looking into it12:10
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097212:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150812:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Pradeep Kilambi (pkilambi)12:10
jaosoriorif someone wants to help with that12:10
jaosoriorI can provide details about how to reproduce it12:10
jaosoriorso let me know12:10
jaosoriorhelp is very much appreciated12:11
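One way to look for the kind of mismatch described above (an https endpoint that points at a port where no TLS is terminated, or the reverse) is to compare the URL scheme of each catalog endpoint with a list of ports known to terminate TLS. The sketch below is only an illustration under stated assumptions: `find_scheme_mismatches` is a hypothetical helper, and the set of TLS ports has to be supplied by whoever knows the load balancer configuration; nothing here queries keystone or haproxy.

```python
from urllib.parse import urlsplit


def find_scheme_mismatches(endpoints, tls_ports):
    """Return (url, reason) pairs where scheme and listener type disagree.

    endpoints: iterable of endpoint URLs, e.g. taken from a service catalog.
    tls_ports: set of port numbers assumed to terminate TLS (caller-supplied).
    """
    mismatches = []
    for url in endpoints:
        parts = urlsplit(url)
        port = parts.port
        if port is None:
            # Fall back to the default port implied by the scheme.
            port = 443 if parts.scheme == "https" else 80
        is_tls_port = port in tls_ports
        if parts.scheme == "https" and not is_tls_port:
            mismatches.append((url, "https URL on a non-TLS port"))
        elif parts.scheme == "http" and is_tls_port:
            mismatches.append((url, "http URL on a TLS port"))
    return mismatches
```

A client that speaks TLS to a plain-HTTP listener typically fails during the handshake, which would be consistent with the "unknown protocol" error in the bug title, though that connection is an assumption here and not something confirmed in the discussion.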
*** shardy_ has joined #tripleo12:11
jaosorioronce that merges, then just docs are missing and we'll have public TLS by default :D12:11
Tengucan't help more than what I did for now :/12:11
Tengulearning curve is nice :312:11
sshnaidm|roverderekh, weshay I suspect there is a different problem with images12:12
jaosoriorTengu: you're getting your system tomorrow, right?12:12
Tenguthe builder? yep.12:12
sshnaidm|roverderekh, we update our images in the job: https://logs.rdoproject.org/15/568715/2/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z5df1951657694a9ebaad63e71362a76a/console.txt.gz#_2018-05-16_04_15_05_34612:12
sshnaidm|roverderekh, it's done so: https://github.com/openstack/tripleo-quickstart-extras/blob/69ad943adda9000f79277f0230a5751869de9cb3/roles/modify-image/tasks/manual.yml#L33-L7012:13
*** shardy has quit IRC12:13
jaosoriorTengu: let me know and I can help you reproduce the issue12:13
sshnaidm|roverderekh, weshay but what we have when running update: https://logs.rdoproject.org/15/568715/2/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Z5df1951657694a9ebaad63e71362a76a/undercloud/home/jenkins/repo_setup.sh.1526444104.log.txt.gz12:13
sshnaidm|roverit may be a reason for failures..12:14
jaosoriorany other questions/feedback on the public TLS stuff?12:14
Tengujaosorior: ok :).12:14
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci master: DNM: build image in every OVB job  https://review.openstack.org/56825812:14
weshayoof12:15
*** ansmith has quit IRC12:16
Tenguhello weshay :)12:16
*** tiswanso has quit IRC12:16
jaosorior#topic Secret management12:16
*** openstack changes topic to "Secret management (Meeting topic: TripleO Security Squad)"12:16
*** dprince has joined #tripleo12:16
jaosoriorSo, I sent out a mail about enabling swift volume encryption by default http://lists.openstack.org/pipermail/openstack-dev/2018-May/130529.html12:17
jaosoriormwhahaha: are you around? I saw you reviewed the patch and had some concerns12:17
mwhahahaSorta12:18
jaosoriormwhahaha: swift isn't really poked that much anymore12:19
weshaymatbu, chem https://review.openstack.org/#/c/568680/12:19
mwhahahaSo the perf thing probably ok12:19
jaosoriormwhahaha: just to store the plan and get the plan out12:19
jaosoriorupdate the plan from the UI12:19
jaosoriorthat's about it AFAIK12:19
jaosoriorooh and get artifacts from the overcloud12:19
openstackgerritEmilien Macchi proposed openstack/tripleo-upgrade master: add container minimal check and gate  https://review.openstack.org/56873312:19
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: WIP: Reproduce CI multinode job with libvirt  https://review.openstack.org/54342912:20
mwhahahaBut more services is kinda a problem, also how secure is a generic barbican12:20
openstackgerritEmilien Macchi proposed openstack/tripleo-upgrade master: add container minimal check and gate  https://review.openstack.org/56873312:20
*** salmankhan has quit IRC12:20
mwhahahaLike would luks be better12:21
*** tcw has quit IRC12:21
jaosoriormwhahaha: it isn't great, but from there we can move forward to using the pkcs11 plugin for the more security concerned12:21
Tengumwhahaha: you'd still get the key somewhere, or have to manually enter the encryption password after each reboot12:21
mwhahahaLuks solves the data at rest problem better imho12:22
mwhahahaAnd the undercloud is less of a problem for automatic reboots12:22
*** salmankhan has joined #tripleo12:23
mwhahahaSince we don't assume 100% uptime12:23
*** fultonj has joined #tripleo12:23
*** fultonj has joined #tripleo12:23
mwhahahaHaving dealt with hsm's before I'd rather we recommend luks for the undercloud12:24
mwhahahaThat's my take on it12:24
jaosoriormwhahaha: some people require hardware security12:24
jaosoriorsome folks even want to tie luks to an hsm12:24
mwhahahaThen those people enable it12:24
mwhahahaBut not by default12:24
EmilienMbogdando: thx for https://review.openstack.org/#/c/568818/12:25
mwhahahaI don't see upside to it being on by default12:25
jaosorioralright, those are valid points; I'll leave the commit up there for a bit and see what other folks think; more feedback is always good :)12:25
mwhahahaFalse sense of security is bad :D12:26
jaosorioragreed12:27
Tengusmall question: is there a way to trigger an rdo third party CI without triggering zuul?12:27
*** pkovar has joined #tripleo12:27
beaglesare the current containerized undercloud install docs in https://docs.openstack.org/tripleo-docs/latest/install/installation/installing.html correct?12:27
mwhahahaTengu: check-rdo12:27
Tengumwhahaha: thank you!12:28
jaosorior#topic Kerberos auth for keystone12:28
*** openstack changes topic to "Kerberos auth for keystone (Meeting topic: TripleO Security Squad)"12:28
jaosoriorAlright, something else I wanted to bring up was a (relatively) low hanging fruit12:28
*** abishop has joined #tripleo12:28
jaosoriorkeystone supports kerberos for authentication, and I don't think it would be too hard to do (you can do a TLS everywhere deployment if you need kerberos around)12:29
beaglesI'm getting what appears to be issues in configuring nova_placement, heat_api, ironic_api, mysql, ironic, mistral, zaqar, nova, keystone...well basically everything I think12:29
jaosoriorsome folks have expressed interest about it, so I thought it would be a good thing to have12:29
*** rlandy has joined #tripleo12:29
sshnaidm|roverweshay, well, seems like we can't update images at all, jobs pass only when we build them..12:29
jaosoriorso, if someone wants to pick up that work, I can provide details on how to do it12:29
jaosoriorso, let me know :D12:29
Tengujaosorior: is there some open issue for that?12:30
jaosoriorTengu: there isn't; didn't think about tracking it with launchpad given it's not a bug but a feature request :D12:30
Tenguthere are RFE on launchpad :).12:31
jaosoriorOK, I can write one then12:31
*** abishop has quit IRC12:31
jaosorior#action jaosorior to write an RFE bug about Kerberos authentication12:31
Tenguthat would be best in order to follow12:32
*** abishop has joined #tripleo12:32
jaosoriorI'll provide all the details needed to get that working on that bug12:32
jaosorior#topic Any other business12:33
*** openstack changes topic to "Any other business (Meeting topic: TripleO Security Squad)"12:33
weshaysshnaidm|rover, ok.. I like the patch12:33
jaosoriorAnything else folks want to bring up to the meeting?12:33
weshaythanks sshnaidm|rover12:33
lhindsjaosorior: yup12:33
lhinds#topic limiting heat-admin12:33
lhindsso I have my new machine now and have been thinking of taking the following approach to get a list of every sudo call.12:34
jaosorior#topic limiting heat-admin12:34
*** openstack changes topic to "limiting heat-admin (Meeting topic: TripleO Security Squad)"12:34
*** panda|lunch is now known as panda12:34
lhindsin audit you can track all sudo calls:12:34
lhindshttps://github.com/openstack/tripleo-heat-templates/blob/master/environments/auditd.yaml#L10912:34
lhinds/var/log/audit/*12:35
*** liverpooler has joined #tripleo12:35
lhindsThe puppet service can be used to set this up in the overcloud with an environment file, but seeking advice on how I could do this for the undercloud12:35
jaosoriorlhinds: well, we're moving towards having a containerized undercloud, which would be deployed with t-h-t as well12:36
lhindsI guess I could use guestfs into the image and set it up there. I could also add a grub2.conf option to enable it early in the boot phase.12:36
jaosoriorlhinds: so you could enable the same functionality for the undercloud that way12:36
lhindsjaosorior: ack, see what you mean. So would I be able to pull in an -environment file to configure audit within the undercloud12:37
lhindscontainer or vm12:37
jaosoriorright12:37
lhindsit won't be a feature, just a debug method to help me see sudo calls12:37
jaosoriorunderstood12:38
jaosoriorthat's a good start for that12:38
lhindsI guess I can ping you with this outside the meeting if you can help me jaosorior12:38
jaosoriorlhinds: sure!12:39
lhindsjust need to grok the best way to do it, and then I will be on my way to getting it scoped out and a patch submitted12:39
lhindslet's do that (will send a DM to you)12:39
jaosoriorawesome12:40
lhindsI can then see a complete list of every user who calls sudo (so validations, nova, keystone etc)12:40
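Once sudo calls are being audited, a list of who calls sudo could be pulled out of the audit log with something like the sketch below. It is a rough illustration, not a replacement for the audit tooling: it assumes each relevant record is a single line of key=value fields that carries an `auid=<number>` field and `exe="/usr/bin/sudo"`, and `count_sudo_by_auid` is a hypothetical helper name. Real logs may encode fields differently, so the patterns would need checking against actual records.

```python
import re
from collections import Counter

# Assumed record shape: space-separated key=value fields on one line.
AUID_RE = re.compile(r"\bauid=(\d+)")
SUDO_EXE = 'exe="/usr/bin/sudo"'


def count_sudo_by_auid(lines):
    """Count audit records that mention the sudo binary, grouped by auid."""
    counts = Counter()
    for line in lines:
        if SUDO_EXE not in line:
            continue  # not a record about the sudo executable
        match = AUID_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

The keys are numeric audit user IDs as strings; mapping them back to account names (heat-admin, service users, and so on) is left out of the sketch.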
jaosoriorsounds like a plan to get this started12:40
lhindscool. that's it for me.12:40
jaosoriorthe main concern I guess is heat-admin and validations12:40
jaosorioropenstack services have their own sudoer rules, which look alright, as far as I've seen12:40
lhindsyup, validations is the big one..so i also need to think about making sure validations makes lots of noise and gets used a lot12:41
lhindsjaosorior: there is also rootwrap which nicely limits things12:41
jaosorior#topic Any other business12:43
*** openstack changes topic to "Any other business (Meeting topic: TripleO Security Squad)"12:43
*** jpena|lunch is now known as jpena12:43
jaosoriorAnything else folks want to bring up?12:43
*** pchavva has joined #tripleo12:44
*** masco has quit IRC12:44
jaosoriorAlright, thanks for joining folks!12:45
jaosorior#endmeeting12:45
*** openstack changes topic to "Welcome to Rocky. CI status: YELLOW (OVB failing) | http://tripleo.org/ | https://docs.openstack.org/tripleo-docs/latest"12:45
openstackMeeting ended Wed May 16 12:45:08 2018 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)12:45
openstackMinutes:        http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-05-16-12.00.html12:45
openstackMinutes (text): http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-05-16-12.00.txt12:45
openstackLog:            http://eavesdrop.openstack.org/meetings/tripleo_security_squad/2018/tripleo_security_squad.2018-05-16-12.00.log.html12:45
*** tiswanso has joined #tripleo12:45
*** rbowen has joined #tripleo12:45
*** tcw has joined #tripleo12:45
lhindsthanks jaosorior12:45
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: DNM: disable update of IPA image in jobs  https://review.openstack.org/56883312:45
*** tiswanso has quit IRC12:45
*** tiswanso has joined #tripleo12:46
*** Goneri has joined #tripleo12:51
openstackgerritBogdan Dobrelya proposed openstack/python-tripleoclient master: Log errors with raised exceptions  https://review.openstack.org/56883712:54
derekhsshnaidm|rover: nice one, that DNM patch should confirm it12:56
sshnaidm|roverderekh, I think we hit this: https://serverfault.com/questions/911781/yum-rpm-failed-to-initialize-nss-library-in-chroot12:57
*** moguimar has quit IRC12:57
*** masco has joined #tripleo12:57
openstackgerritMerged openstack/tripleo-quickstart-extras master: Add test_autoscaling tests to skip list  https://review.openstack.org/56876612:58
openstackgerritMerged openstack/tripleo-ui master: Exclude nodes deployed with another deployment plan  https://review.openstack.org/56364612:58
openstackgerritMerged openstack/tripleo-ui master: Add sanitizeMessage function  https://review.openstack.org/56450212:58
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Mount /dev for chrooted environment  https://review.openstack.org/56883812:59
sshnaidm|roverderekh, but I'm not sure if i do it right ^^12:59
jaosoriorsileht: have you found out anything else about the bug?12:59
silehtjaosorior, not really, do we drop the keystone catalog somewhere ?12:59
jaosoriorsileht: I tried reproducing it, but my deployment is failing cause it can't scale the stack for some reason13:00
jaosoriorsileht: not AFAIK13:00
sshnaidm|roverderekh, it's the same as here: https://github.com/openstack-infra/tripleo-ci/blob/3a8922988fb915ba8830fc00abf28beb10c45ecb/scripts/common_functions.sh#L11913:00
slaglejtomasek: hi. i was thinking that before someone would use https://review.openstack.org/#/c/564315/, they would've upgraded their overcloud, so it doesn't need to support both old and new13:00
silehtjaosorior, I only found a partial output that doesn't really help13:00
slaglejtomasek: as the workflow you're adding would do the conversion13:00
*** rmascena has quit IRC13:01
*** toure|gone is now known as toure13:01
sshnaidm|roverderekh, please advise if you know how to apply this solution for our case..13:01
*** ratailor has quit IRC13:01
derekhsshnaidm|rover: trying to find out13:02
silehtjaosorior, can you list the keystone catalog on your installation ?13:05
dtantsurjaosorior: hey! are you The One to ask about SSL in the undercloud? :)13:05
jaosoriordtantsur: yes13:06
derekhsshnaidm|rover: looks ok to me http://paste.openstack.org/show/721090/13:07
jaosoriorsileht: no; I just nuked it to deploy a bigger deployment to try to deal with the scaling issues13:07
derekhsshnaidm|rover: but you probably also need to unmount it13:07
jaosoriorsorry :(13:07
dtantsurjaosorior: I have an issue here, which is probably no one's fault, just imperfection of the world. on an undercloud with SSL nothing using requests (e.g. openstack clients) works in a venv13:08
dtantsurdo you think we could provide some CA environment variables in stackrc to help requests find the SSL bundle?13:08
*** ukalifon has joined #tripleo13:09
jaosoriordtantsur: yes we could13:10
sshnaidm|roverderekh, seems like I need to do it after image archived and before removing mountdir? https://github.com/openstack/tripleo-quickstart-extras/blob/4f205f4cea20a3c690eda9ff7856b197b42e4f9a/roles/modify-image/tasks/manual.yml#L7013:11
dtantsurjaosorior: I can try giving it a shot, but I really don't know where to start13:11
sshnaidm|roverderekh, I mean umount13:11
*** ooolpbot has joined #tripleo13:11
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:11
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097213:11
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)13:11
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177150813:11
*** ooolpbot has quit IRC13:11
openstackLaunchpad bug 1771508 in tripleo "Telemetry tests fail in scenario-001 and 002 jobs" [Critical,Triaged] - Assigned to Pradeep Kilambi (pkilambi)13:11
jaosoriordtantsur: How was it deployed? with a user-provided cert? or with the autogenerated one?13:11
dtantsurjaosorior: autogenerated one, I guess. at least I did not provide anything :)13:12
*** derekh has quit IRC13:12
*** ansmith has joined #tripleo13:12
*** derekh has joined #tripleo13:12
jaosoriordtantsur: it depends on the CA used, the default local CA (which is what it's using I think), uses '/etc/pki/ca-trust/source/anchors/cm-local-ca.pem'13:13
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Mount /dev for chrooted environment  https://review.openstack.org/56883813:13
trozetmwhahaha: i've gotten past the pacemaker issue and made it to step 4 (yay), but now it gets to step 4 in ansible and just hangs forever13:13
*** dhill_ has joined #tripleo13:13
sshnaidm|roverderekh, ^^13:13
trozetmwhahaha: do you know of any bugs filed around that?13:13
jaosoriordtantsur: if it would be using FreeIPA, it would be /etc/ipa/ca.crt13:13
*** mcornea has joined #tripleo13:13
derekhsshnaidm|rover: before the "find . -print" I'd say13:13
jaosoriorand if it would be user-provided... then who knows :D13:13
dtantsurwell, I have /etc/pki/ca-trust/source/anchors/cm-local-ca.pem13:13
dtantsuroh, this starts sounding even more complex..13:13
openstackgerritGabriele Cerami proposed openstack-infra/tripleo-ci master: Add ability to use a different release per playbook  https://review.openstack.org/56656513:14
jaosoriordtantsur: PKI is pain13:14
jaosoriordtantsur: but yeah, the most common case would be /etc/pki/ca-trust/source/anchors/cm-local-ca.pem13:14
Tengu:]13:14
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Mount /dev for chrooted environment  https://review.openstack.org/56883813:15
sshnaidm|roverderekh, like that? ^^13:15
derekhsshnaidm|rover: for "Close initramfs " I'd move it up 2 lines, we were not zipping up the dev directory before so no need to start13:15
derekhsshnaidm|rover: yup13:15
*** lblanchard has joined #tripleo13:15
sshnaidm|rovergreat, let's see13:15
* sshnaidm|rover is crossing fingers on both hands and legs13:15
*** tzumainn has joined #tripleo13:17
dtantsurjaosorior: okay, setting --os-cacert to this path fixes the problem for me13:18
dtantsurnow I need to understand where to wire it in...13:18
jaosoriordtantsur: well, there is the OS_CACERT environment variable. That could go in the stackrc13:19
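[Editor's sketch] The idea discussed above could look roughly like this, assuming stackrc only needs an `export OS_CACERT=...` line pointing at whichever CA file is present. The two candidate paths are the ones named in the conversation. The helper names `pick_ca_bundle` and `stackrc_lines` are hypothetical, and additionally exporting `REQUESTS_CA_BUNDLE` (an environment variable python-requests honours) is an assumption, not something agreed in the channel.

```python
import os

# CA locations named in the discussion: certmonger's local CA, then FreeIPA.
CANDIDATE_CA_FILES = [
    "/etc/pki/ca-trust/source/anchors/cm-local-ca.pem",
    "/etc/ipa/ca.crt",
]


def pick_ca_bundle(candidates=CANDIDATE_CA_FILES, exists=os.path.isfile):
    """Return the first candidate CA file that exists, or None."""
    for path in candidates:
        if exists(path):
            return path
    return None


def stackrc_lines(ca_file):
    """Build the export lines to append to stackrc (empty if no CA file)."""
    if ca_file is None:
        return []
    return [
        "export OS_CACERT=%s" % ca_file,
        # Assumption: also point python-requests at the same bundle.
        "export REQUESTS_CA_BUNDLE=%s" % ca_file,
    ]


if __name__ == "__main__":
    print("\n".join(stackrc_lines(pick_ca_bundle())))
```

A user-provided certificate would not be covered by the candidate list, which matches the "then who knows" caveat raised earlier in the log.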
*** lblanchard has quit IRC13:20
dtantsurnow I need to figure out where stackrc is generated for containerized undercloud. does anyone remember off the top of their head?13:20
*** lblanchard has joined #tripleo13:20
jaosoriorI don't :(13:21
mwhahahatrozet: no i'm not aware of that13:22
dtantsurjaosorior: filed https://bugs.launchpad.net/tripleo/+bug/177156513:23
openstackLaunchpad bug 1771565 in tripleo "SSL custom certificates do not work with anything using request in a venv" [Medium,Triaged] - Assigned to Dmitry Tantsur (divius)13:23
*** quique|lunch is now known as quiquell13:23
jaosoriordtantsur: where is requests coming from in that case? pip?13:25
dtantsurbogdando: hey! in the new shiny containerized world where is stackrc generated?13:26
*** amoralej|lunch is now known as amoralej13:26
*** hjensas has quit IRC13:26
dtantsurjaosorior: anything using python-requests in a venv13:26
dtantsurokay, I guess it's here: https://github.com/openstack/tripleo-heat-templates/blob/fea5bfbcc81f322812720f29459d5ae648a4647b/extraconfig/post_deploy/undercloud_post.sh#L1413:27
bogdandodtantsur: /root/stackrc13:27
bogdandoor /home/stack/stackrc13:27
dtantsuryeah, I was wondering about the code generating it. I think I've found it13:28
dtantsurjaosorior: okay, I have to update https://github.com/openstack/tripleo-heat-templates/blob/fea5bfbcc81f322812720f29459d5ae648a4647b/extraconfig/post_deploy/undercloud_post.sh#L14 apparently. Is there a variable in THT that points at the CA file?13:28
sshnaidm|rovermwhahaha, I opened 2 bugs today about fs010 failing in pike and queens, which block them, should I add "alert" there? https://bugs.launchpad.net/tripleo/+bug/1771551  and  https://bugs.launchpad.net/tripleo/+bug/177154913:29
openstackLaunchpad bug 1771551 in tripleo "Containers multinode jobs fails on stable pike because of pacemaker" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)13:29
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)13:29
jaosoriordtantsur: there isn't (but there should be); it's in puppet.13:31
* dtantsur takes a big sip of vodka13:31
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add dry run option to toci_quickstart  https://review.openstack.org/56706013:31
slagleare we still YELLOW? ovb appears to be "passing" when the mood strikes13:32
openstackgerritJiri Stransky proposed openstack/tripleo-quickstart master: Deploy TLS overcloud in fs051 (sc000 upgrade job)  https://review.openstack.org/56884313:33
aleejaosorior, mcornea so it looks like the right tags are being set - but the password job still is skipping the password related tasks .. http://logs.openstack.org/97/567897/4/experimental/tripleo-ci-centos-7-scenario000-multinode-oooq-container-password-changes/9118617/13:33
jaosoriordtantsur: there is a variable (I had forgotten)13:33
jaosoriordtantsur: it's InternalTLSCAFile13:33
aleelooks like we're still missing something to get them to be invoked13:33
dtantsur\o/13:34
jaosoriordtantsur: check environments/public-tls-undercloud.yaml13:34
dtantsurthanks jaosorior13:34
* dtantsur puts vodka away13:34
jistrjaosorior: fyi our current blocking issue on upgrade job is related to TLS https://bugs.launchpad.net/tripleo/+bug/177156713:34
openstackLaunchpad bug 1771567 in tripleo "sc000 upgrade failing on haproxy TLS config during initial deployment" [High,Triaged]13:34
aleemcornea, made the changes you suggested in tripleo-upgrade13:35
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: overcloud plan deployment failures  https://review.openstack.org/56867313:35
mcorneaalee: hey, I noticed but I didn't get to review yet13:35
jaosoriorjistr: I see; so as part of that upgrade we should pass the no-tls environment13:35
jistrjaosorior: it fails on deploy, before trying any upgrade, i suspect it's because we use Rocky UC to deploy Queens OC, and the client+common from Rocky doesn't work well with Queens templates with something TLS related. I'm trying to take the easy way out by enabling TLS in the upgrade job on deploy too https://review.openstack.org/#/c/568843/13:35
aleemcornea, cool - it's just that despite the changes, it seems like the tasks are still not being executed13:37
jaosoriorjistr: yeah, that would be the easiest option13:37
jaosoriorjistr: but yeah, for non-tls deployments, we should use the no-tls environment when upgrading13:37
aleejaosorior, trown -- any ideas?13:37
jaosorior(still needs to write the docs)13:37
*** cshastri has quit IRC13:38
shardy_jtomasek: Hi, just been thinking about how we might adapt capabilities-map.yaml to also work with non-openstack plans, have you already looked into that?13:38
*** shardy_ is now known as shardy13:38
jaosoriortrown, sshnaidm|rover: Do all the tasks in quickstart need to have tags? Would that be what alee is missing?13:38
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: overcloud plan deployment failures  https://review.openstack.org/56867313:38
sshnaidm|roverjaosorior, if you don't use "-t all", then tasks should have tags to be executed13:39
jistrjaosorior: yea makes sense re passing the env file. We don't get far enough to attempt an upgrade there though :/ We might need to tweak the Rocky client/common to be able to deploy Queens at all.13:39
shardyjtomasek: There seem to be two missing parts, a way to restrict a plan to only specific environment_groups, and a way to do globbing in environment_groups so we can e.g have an OpenShift group that points to environments/openshift or whatever13:39
sshnaidm|roverjaosorior, usually tags are in high levels for files/roles, not for each task13:39
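[Editor's sketch] The tag behaviour described above can be modelled in a few lines. This is a simplified model of Ansible's `--tags` selection, not Ansible's own code: it ignores `--skip-tags` and the special `tagged`/`untagged` values, and it assumes `task_tags` already includes tags inherited from the play or role. The function name `task_runs` and the tag `overcloud-upgrade` are hypothetical; `overcloud-config-change` is the tag mentioned in the channel.

```python
def task_runs(task_tags, requested_tags=("all",)):
    """Simplified model of whether one task runs under -t/--tags.

    task_tags: tags on the task, including any inherited from its play or role.
    requested_tags: values passed with -t/--tags ("all" is the default).
    """
    task_tags = set(task_tags)
    requested = set(requested_tags)
    if "never" in task_tags:
        # 'never' tasks run only when one of their other tags is requested.
        return bool((task_tags - {"never"}) & requested)
    if "always" in task_tags or "all" in requested:
        return True
    # Otherwise the task needs at least one of the requested tags.
    return bool(task_tags & requested)


if __name__ == "__main__":
    # A play tagged only for upgrade is skipped when a different tag is requested.
    print(task_runs(["overcloud-upgrade"], ["overcloud-config-change"]))
    print(task_runs(["overcloud-config-change"], ["overcloud-config-change"]))
```

Under this model a task is listed but skipped whenever the requested tag is absent from the play or role that wraps it, which is one possible reading of the symptom reported here.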
aleesshnaidm|rover, so - it looks like all the tasks do have tags ..13:39
aleeso they are listed but are still being skipped13:40
jistrjaosorior: i'm not sure if we support fresh deployment of Q OC with R UC, but we should support at least basic management of Q OC from R UC, i'm not sure if that broke too or if it's just fresh deploy which gets in trouble13:40
sshnaidm|roveralee, I need to know what's the problem to answer :)13:40
aleesshnaidm|rover, thanks -- let me give some context :)13:40
jistrjaosorior: anyway, trying to postpone this by going all-TLS and we'll see how far that gets us13:41
aleesshnaidm|rover, I'm trying to add a job to check password changes13:41
jaosoriorjistr: want to have a talk tomorrow to see how we can address this?13:41
aleesshnaidm|rover, https://review.openstack.org/#/c/567897/13:41
mcorneaalee: so checking the output I see that you now passed overcloud-config-change tag  to the multinode-overcloud-upgrade.yml playbook but that playbook doesn't contain the tag as it does for upgrade https://github.com/openstack/tripleo-quickstart-extras/blob/master/playbooks/multinode-overcloud-upgrade.yml#L3813:41
jistrjaosorior: yea sure13:41
jaosoriorjistr: lets do that, I'm sure we can come up with some solution13:41
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo master: Neutron wrappers: lookup for THT parameter  https://review.openstack.org/56673713:41
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo master: Neutron wrappers: lookup for THT parameter  https://review.openstack.org/56673713:42
aleemcornea, ah - looking ..13:42
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo master: Neutron wrappers: lookup for THT parameter  https://review.openstack.org/56673713:42
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Deploy Docker via Ansible and not Puppet  https://review.openstack.org/56137713:42
aleemcornea, ok - yeah - I was looking at tripleo-update as a guide -- looks like there is a separate file for that ..13:44
aleeplaybooks/multinode-overcloud-update.yml13:44
aleeguess I need a similar one for config_change13:45
openstackgerritJiri Stransky proposed openstack-infra/tripleo-ci master: Add keystone only upgrade jobs  https://review.openstack.org/55842513:45
mwhahahasshnaidm|rover: yea13:46
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add dry run option to toci_quickstart  https://review.openstack.org/56706013:46
EmilienMbnemec: not sure you saw my message from yesterday but now we have designate in the CI job, we need to test it via Tempest, you'll need to patch quickstart to run at least one test13:46
mcorneaalee: yeah, I think so, I'm not familiar with the upstream ci but from what I can tell if you get the role with the overcloud-config-change in such playbook then it should trigger the tripleo-upgrade(assuming the proper vars in tripleo-upgrade are set)13:46
*** bkopilov has joined #tripleo13:47
aleemcornea, yeah I see where the playbooks are defined in tocigate_test-oooq.sh13:47
aleecool - let me give it a try13:48
*** jfrancoa has quit IRC13:48
openstackgerritMerged openstack/tripleo-quickstart-extras master: Updated undercloud tempest skip list for Pike  https://review.openstack.org/56668613:49
openstackgerritMerged openstack/tripleo-upgrade master: Run Ceph upgrade before converge.  https://review.openstack.org/56628213:49
beaglesmwhahaha: regarding https://review.openstack.org/#/c/567641/ ...13:50
*** jfrancoa has joined #tripleo13:50
*** anilvenkata has quit IRC13:51
mwhahahabeagles: yea i went looking, it doesn't seem that we created a quota puppet provider. That being said, that seems to be something we might want to do as a different type of task13:51
mwhahahabeagles: unfortunately it's an awkward configuration because it spans all the services so it's not something we can just assign with like keystone though i think it belongs there more than in an octavia deployment task13:52
beaglesmwhahaha: yeah13:53
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade stable/queens: Run Ceph upgrade before converge.  https://review.openstack.org/56885213:53
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: undercloud: set OS_CACERT when TLS is used  https://review.openstack.org/56885313:54
dtantsurjaosorior: something like ^^^ ?13:54
*** udesale_ has joined #tripleo13:55
jaosoriordtantsur: nice!13:55
mwhahahabeagles: i assume the quota thing is kinda blocking for octavia. That being said, shouldn't it be in its own tenant outside of the ctrlplane stuff?13:55
dtantsurjaosorior: do we run any CI jobs with TLS enabled?13:55
*** moguimar has joined #tripleo13:55
jaosoriordtantsur: we do13:55
beaglesmwhahaha: yes and yes13:55
beaglesmwhahaha: we started adding support for a separate tenant for octavia but the timing/priorities didn't work out13:56
mwhahaha:/13:56
beaglestoo many fires - not enough firepeople13:57
* beagles thinks about that for a second13:57
mwhahahahttps://media.giphy.com/media/yr7n0u3qzO9nG/giphy.gif13:57
*** udesale has quit IRC13:58
beagles:)14:01
*** kambiz has quit IRC14:04
openstackgerritDaniel Alvarez proposed openstack/tripleo-heat-templates master: Fix missing parameters in OVN DVR environment files  https://review.openstack.org/56885614:04
Tengusee you tomorrow!14:05
*** kambiz has joined #tripleo14:05
*** moshele has quit IRC14:07
*** rajinir has joined #tripleo14:07
*** gyankum has quit IRC14:09
*** kambiz has quit IRC14:09
jaosoriorTengu: have a good one14:09
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097214:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154914:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177155114:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)14:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)14:10
openstackLaunchpad bug 1771551 in tripleo "Containers multinode jobs fails on stable pike because of pacemaker" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)14:10
openstackgerritDaniel Alvarez proposed openstack/tripleo-heat-templates stable/queens: Fix missing parameters in OVN DVR environment files  https://review.openstack.org/56885814:11
*** ykarel is now known as ykarel|away14:12
*** tiswanso_ has joined #tripleo14:13
*** kambiz has joined #tripleo14:13
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates master: Add acl to paths that are shared among related neutron processes  https://review.openstack.org/56765514:13
*** lblanchard1 has joined #tripleo14:14
EmilienMmyoung|ruck, weshay , mwhahaha : you probably know but introspection looks broken https://logs.rdoproject.org/20/567320/7/openstack-check/gate-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-master/Zea21e3ed07234ca09366f6b619c041c1/undercloud/home/jenkins/overcloud_prep_images.log.txt.gz#_2018-05-16_13_30_2714:15
*** dtrainor has joined #tripleo14:15
EmilienMhttps://bugs.launchpad.net/tripleo/+bug/1770972 I guess14:15
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)14:15
EmilienMand https://review.openstack.org/#/c/568838/ should help, ok14:16
derekhEmilienM: sshnaidm|rover has a potential patch being tested at the moment14:16
*** ykarel|away has quit IRC14:16
derekhyup14:16
*** tiswanso has quit IRC14:16
*** lblanchard has quit IRC14:16
weshayEmilienM, ya.. derekh and sshnaidm|rover have been on that14:16
EmilienMcool14:17
EmilienMmwhahaha: the gate is low, I wonder if we can land https://review.openstack.org/#/c/568347/ and https://review.openstack.org/#/c/568680/ *now* so this thing is done14:18
EmilienMmwhahaha: FWIW, it passed FS03514:18
EmilienMbut I'm ok to wait. I just think it's better to have it asap14:18
mwhahahayea that's fine14:18
EmilienMmcornea: will need your review on https://review.openstack.org/#/c/568680/ please14:18
mcorneaEmilienM: looks good, can I set workflow? I see the other one has it14:21
mwhahahamcornea: sure14:21
*** tiswanso_ has quit IRC14:22
*** tiswanso has joined #tripleo14:23
EmilienMmcornea: thx14:25
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Move unfencing to meta_params  https://review.openstack.org/56876914:26
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add dry run option to toci_quickstart  https://review.openstack.org/56706014:26
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: Revert "Add test_autoscaling tests to skip list"  https://review.openstack.org/56886014:26
*** lvdombrkr89 has joined #tripleo14:29
honzajtomasek: how is the network configuration work going?  do you have any wip code we could look at?14:30
jtomasekshardy, slagle: let me read back, I'll respond shortly14:31
jtomasekhonza: I am going to send out patches I currently have, so we align them with your networks listing, then we can identify and split work, sounds ok?14:31
beaglesmwhahaha: is non-containerized undercloud install supposed to work or is it in "don't" territory now?14:31
mwhahahabeagles: still supposed to work14:31
beaglesmwhahaha: okay14:32
*** lvdombrkr has quit IRC14:32
mwhahahabeagles: we still test it in ci as well. we haven't completely cut over14:32
beaglesmwhahaha: gotcha14:32
honzajtomasek: sounds good --- i just want to get started on it sooner rather than later because things always take longer than we think :)14:32
myoung|ruckEmilienM: aye...will talk about it now in CIX...but introspection is borked atm14:33
jtomasekhonza: yeah14:33
shardyjtomasek: ack thanks, not urgent but I wanted to sync up re the capabilities map for openshift14:33
jtomasekshardy: so currently we have openshift group in t-h-t capabilities map. I think that capabilities-map should define capabilities of the deployment plan, so if the plan is designed for openshift deployment, its capabilities-map should include only environments related to openshift deployment14:36
jtomasekshardy: problem to solve would be where to draw the line and split t-h-t14:36
*** dxiri has joined #tripleo14:36
shardyjtomasek: yeah, I wasn't sure if capabilities map is enough, or if we need some additional filtering?14:36
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: [WIP] Add CLI argument parser and YAML file parser  https://review.openstack.org/56793614:37
shardyjtomasek: For now I'm assuming all the environments will stay in t-h-t, but that we need to ensure only configuration related to openshift is displayed in an openshift plan14:38
jtomasekshardy: I think capabilities-map is enough. We should probably consider removing 'Other' group which is added in tripleo-common action and it lists all environments not included in capabilities-map. I think we should make things more strict -> what is not in capabilities-map is not usable (by UI)14:38
shardythen when we have that we can add the various configurations folks may want to use14:38
shardyjtomasek: Ok, is the "Other" group still useful for openstack deployments?14:39
shardymaybe we should make it configurable via a flag somewhere?14:39
shardye.g in the plan_environment perhaps?14:39
jtomasekshardy: it is basically a fallback for environments which have not been added to capa-map but should14:39
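[Editor's sketch] The 'Other' fallback described above amounts to a set difference between the environment files in a plan and the files referenced by the capabilities map. The sketch below assumes the map is already loaded into a dict with `topics`, `environment_groups`, and `environments` entries carrying a `file` key. The function name `other_environments` and the example file names are hypothetical, and this is not the tripleo-common implementation.

```python
def other_environments(capabilities_map, all_environment_files):
    """Return environment files not referenced by any group in the map, sorted."""
    listed = set()
    for topic in capabilities_map.get("topics", []):
        for group in topic.get("environment_groups", []):
            for env in group.get("environments", []):
                listed.add(env["file"])
    return sorted(set(all_environment_files) - listed)


# Hypothetical map with a single OpenShift group.
EXAMPLE_MAP = {
    "topics": [
        {
            "title": "OpenShift",
            "environment_groups": [
                {"environments": [{"file": "environments/openshift.yaml"}]},
            ],
        },
    ],
}

if __name__ == "__main__":
    plan_files = ["environments/openshift.yaml", "environments/example-a.yaml"]
    # Anything printed here would land in the 'Other' group.
    print(other_environments(EXAMPLE_MAP, plan_files))
```

Making the map strict, as proposed in the channel, would mean dropping whatever this function returns instead of showing it under 'Other'.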
jtomasekshardy:  we could add key to plan-environment.yaml...yeah14:39
jtomasekto specify capabilities-map file name14:39
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: [WIP] Implement --output-file to write the bash script  https://review.openstack.org/56828514:40
shardyjtomasek: Ah yeah that would be nice, so we can maintain separate files14:40
jtomasekso if we decide to keep everything in single t-h-t repository, we would include multiple capabilities-maps14:40
shardyand eventually even separate repos etc14:40
EmilienMbogdando: your patch to fix upgrade seems to work for me :)14:40
shardyjtomasek: yeah I was thinking keep the single repo but use a file structure so that they can easily be split in future if needed14:41
jtomasekshardy: sounds good to me14:41
openstackgerritAde Lee proposed openstack/tripleo-quickstart-extras master: Add playbook for overcloud-config-change  https://review.openstack.org/56886514:42
*** ykarel|away has joined #tripleo14:43
*** holser__ has quit IRC14:46
*** gbarros has joined #tripleo14:47
*** holser__ has joined #tripleo14:47
bogdandoEmilienM:  \o/14:48
openstackgerritLukas Bezdicka proposed openstack/tripleo-common stable/queens: Persist package update ansible logs  https://review.openstack.org/56585314:48
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add dry run option to toci_quickstart  https://review.openstack.org/56706014:49
*** ykarel|away is now known as ykarel14:49
jtomasekslagle: ok, I don't have a strong opinion, that's why I didn't -1, only benefit which comes to my mind is that in case user accidentally removes the deployment_status object, it would get automagically recovered, next time it tries to fetch the status. On the other hand it would make the workflow more complicated and user should not remove that object, as removing it basically equals to something like manually removing something from application14:51
jtomasek database:)14:51
openstackgerritAde Lee proposed openstack-infra/tripleo-ci master: Add password change job  https://review.openstack.org/56789714:52
*** gvrangan has joined #tripleo14:54
openstackgerritBogdan Dobrelya proposed openstack/python-tripleoclient master: Persist generated undercloud parameters t-h-t  https://review.openstack.org/56576414:54
d0ugalrbrady,apetrich,thrash,toure,jtomasek: Workflow Squad meeting in 4 mins. https://etherpad.openstack.org/p/tripleo-workflows-squad-status14:56
openstackgerritAlex Schultz proposed openstack/tripleo-common stable/pike: Add yum update to base  https://review.openstack.org/56829214:57
openstackgerritAlex Schultz proposed openstack/tripleo-common stable/pike: Run yum clean to reduce size of docker image layer  https://review.openstack.org/56851014:57
*** cylopez has quit IRC14:58
*** links has quit IRC14:58
EmilienMjaosorior: I'm upgrading undercloud from queens to rocky (containerized) and it fails on starting certmonger15:01
derekhsshnaidm|rover: I'm starting to think maybe the ramdisk size has reached some limit, fixing the yum clean might help it but maybe only because it reduces the size of the ramdisk a bit, I'll let you know if I find anything out15:02
sshnaidm|roverderekh, ack15:02
EmilienMjaosorior: http://ix.io/1axU15:05
*** jfrancoa has quit IRC15:05
EmilienMjaosorior: wrong link nevermind15:05
*** tiswanso_ has joined #tripleo15:06
EmilienMjaosorior: http://paste.openstack.org/show/721106/15:07
mwhahahaEmilienM: reboot15:07
mwhahahaEmilienM: that's a 7.4 upgraded to 7.5 w/o a reboot15:07
EmilienMwatttt15:07
mwhahahaalso, read your email15:08
EmilienMpff emails15:08
* mwhahaha has explained this about 4 times already15:08
EmilienMI needed a 5th :P15:08
*** jfrancoa has joined #tripleo15:08
mwhahahaEmilienM: Ok. REBOOT15:08
mwhahahadone15:08
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add dry run option to toci_quickstart  https://review.openstack.org/56706015:08
*** tiswanso has quit IRC15:09
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097215:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154915:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177155115:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)15:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)15:10
openstackLaunchpad bug 1771551 in tripleo "Containers multinode jobs fails on stable pike because of pacemaker" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)15:10
silehtjaosorior, I got it: https://review.openstack.org/#/c/568699/1/network/endpoints/endpoint_map.yaml@a10215:11
silehtjaosorior, the port change is missing15:11
silehtit should be 1397715:11
mwhahahaEmilienM: though it does show a deficiency somewhere (probably quickstart?) where we don't yum update & reboot on the undercloud host before installing15:12
EmilienMmwhahaha: I didn't use quickstart15:12
slaglejtomasek: ok. and as for plan_management.py vs deployment.py, i wasn't sure either, so i just picked plan_management15:12
mwhahahaEmilienM: ok well you should yum update & reboot before installing :D15:12
EmilienMmwhahaha: no15:13
mwhahahaEmilienM: but i think others are hitting this in quickstart15:13
EmilienMmwhahaha: yum update is run by a THT service15:13
mwhahahaEmilienM: is this in the upgrade?15:13
EmilienMafter services are properly stopped15:13
EmilienMmwhahaha: yes15:13
mwhahahaso that's the difference, that's the upgrade15:13
mwhahahaso for 7.4 to 7.5 a reboot has to happen15:13
mwhahahathanks rhel15:13
EmilienMwouat15:14
mwhahahathere is a bz open for this15:14
mwhahahathe problem is certmonger and dbus15:14
*** gvrangan has quit IRC15:14
mwhahahahttps://bugzilla.redhat.com/show_bug.cgi?id=156912215:14
openstackbugzilla.redhat.com bug 1569122 in instack-undercloud "Undercloud installation fails with "Execution of '/bin/getcert list' returned 1: Error org.freedesktop.DBus.Error.TimedOut"" [High,New] - Assigned to jslagle15:14
slaglewat15:16
*** radeks_ has quit IRC15:16
jtomasekslagle: I'll leave that decision on d0ugal or anyone else from workflows squad:)15:16
EmilienMso we could exclude dbus upgrade15:16
EmilienMbecause really I don't see how we can reboot in the middle of the containerized undercloud upgrade unless we re-do (again) the whole workflow15:17
d0ugalflorianf: Hey, can you take a look at these reviews? https://review.openstack.org/#/c/562296/ and https://review.openstack.org/#/c/562358/ (you already got the third one in the series)15:17
d0ugaljtomasek: What decision was that?15:18
florianfd0ugal: yup15:18
jtomasekd0ugal: https://review.openstack.org/#/c/564315/315:19
jtomasekd0ugal: whether get_plan_deployment_status workflow should live in plan_management.yaml or deployment.yaml workbooks15:20
EmilienMmwhahaha: otherwise we should have to change the doc, like: "yum update python-tripleoclient dbus; reboot" and run "openstack undercloud upgrade" when rebooted...15:20
mwhahahaEmilienM: it's likely this is only a 7.4 -> 7.5 issue15:20
mwhahahabut i'm not sure, it's a rhel upgrade problem15:21
d0ugaljtomasek: commenting.15:21
EmilienMmwhahaha: so in my case, dbus wasn't upgraded (I already deployed queens on 7.5)15:22
EmilienMmwhahaha: and certmonger failed to start15:22
*** masco has quit IRC15:22
EmilienMI'm trying again now15:22
*** radeks_ has joined #tripleo15:25
*** dparkes has quit IRC15:26
openstackgerritMerged openstack/tripleo-heat-templates master: Add ability to control Glance's enabled_import_methods  https://review.openstack.org/56766715:26
openstackgerritMerged openstack/tripleo-ui master: Add deployment status tracking infrastructure  https://review.openstack.org/55902115:26
openstackgerritMerged openstack/tripleo-ui master: Enable config-download deployment tracking  https://review.openstack.org/55902215:26
openstackgerritMerged openstack/tripleo-upgrade stable/queens: Run Ceph upgrade before converge.  https://review.openstack.org/56885215:26
openstackgerritMerged openstack/tripleo-upgrade master: Cleanup on oc_roles var  https://review.openstack.org/56868015:26
*** aufi has quit IRC15:27
openstackgerritMarios Andreou proposed openstack/python-tripleoclient stable/queens: Add .deployment.v1.deploy_on_servers to ffwd-upgrade prepare  https://review.openstack.org/56860415:28
EmilienMmwhahaha and others: https://review.openstack.org/568347 hasn't landed yet (tripleo-common) while https://review.openstack.org/568680 just landed15:28
*** skramaja has quit IRC15:28
EmilienMit means we'll have some errors in check for patches that are sent now until https://review.openstack.org/568347 lands and is built15:28
mwhahahaEmilienM: ... depends-on would have been nice i guess15:29
EmilienMmwhahaha: it had depends on15:29
mwhahahaEmilienM: also why is tripleo-upgrade not in the tripleo queue15:29
mwhahahaoh so it's backwards15:29
EmilienMmwhahaha: wes has a patch15:29
mwhahahaso we'll just have errors and there's not much we can do about it15:29
EmilienMit'll merge today15:29
EmilienMright and it's only this time15:29
EmilienMtripleo-upgrade is having real jobs and will be in gate with others15:29
EmilienMsee https://review.openstack.org/#/c/568733/15:30
*** cshastri has joined #tripleo15:30
mwhahahaEmilienM: mcornea had a good point on that, can we update it to only run on specific patches?15:30
EmilienMweshay: yeah15:30
EmilienM^ can we update it please? i just had the same thought15:31
EmilienMalso we don't need tripleo-ci-centos-7-undercloud-containers and tripleo-ci-centos-7-containers-multinode15:31
*** tiswanso_ has quit IRC15:32
*** quiquell is now known as quiquell|off15:32
weshayEmilienM, ya.. I agree we may not need multinode-containers kicking from this repo15:32
*** tiswanso has joined #tripleo15:32
EmilienMweshay: neither tripleo-ci-centos-7-undercloud-containers15:32
weshayright right15:33
EmilienMsave trees!15:33
weshayhowever I also thought we have this min criteria for all projects15:33
weshayso I went conservative there15:33
weshaybut if you guys agree.. I'll update to be just update/upgrade related15:33
weshayI'll make the update15:33
*** masco has joined #tripleo15:34
*** olap has quit IRC15:34
*** ramishra has quit IRC15:34
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: Add releases script pytest tests to tox.ini  https://review.openstack.org/56764915:35
rascamwhahaha, EmilienM, I've opened the original bug because I was hitting it on the ospphase0 OSP11 CI job, but *only* there, what do you mean with all versions? I'm testing the same exact approach on all the OSP releases and getting this just in 1115:36
jtomasekd0ugal: https://bugs.launchpad.net/tripleo/+bug/177161015:36
openstackLaunchpad bug 1771610 in tripleo "deployment_status.yaml swift object does not exist when plan is created" [High,Triaged]15:36
*** eck` is now known as eck`gone15:36
mwhahaharasca: it's not specific to osp1115:37
mwhahaharasca: there's something outside of the OSP bits that's causing it and it only started showing up with 7.515:37
rascamwhahaha, ok, but since I'm using the same exact procedure on the same exact env to test OSP10-11-12, why I'm hitting this just and systematically on 11?15:38
mwhahaharasca: that's a question for folks who know about certmonger :D15:38
rascamwhahaha, I'll add these info on the bug15:39
*** gyankum has joined #tripleo15:39
mwhahahathe underlying certmonger not starting is not an osp specific thing. i don't know how your env deploys 10/11/12, so maybe 10 is getting 7.4 and 11 gets 7.5?15:39
rascamwhahaha, nope, these envs are the *same*, I mean, with no difference15:41
rascamwhahaha, same way of provisioning the undercloud15:41
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: add container minimal check and gate  https://review.openstack.org/56873315:41
* mwhahaha shrugs15:42
mwhahahai'd have to look at it further. there's a lot of things that happen under the covers with various tooling that hide stuff15:42
mwhahahaso i tend not to believe anything :D15:42
openstackgerritwes hayutin proposed openstack/tripleo-upgrade master: DNM, test  https://review.openstack.org/56873215:43
weshayEmilienM, k.. updated 568732,315:45
weshayand only the update job is kicking..15:45
*** agurenko has quit IRC15:45
EmilienMcool15:46
*** etingof is now known as etingof|brb15:46
*** paramite_ has quit IRC15:46
trozethey guys when this is running: 2018-05-16 10:24:22,500 p=17826 u=mistral |  TASK [Start containers for step 4] *********************************************15:49
trozet2018-05-16 10:25:08,655 p=17826 u=mistral |  ok: [overcloud-controller-0] => {"censored": "the output has been hidden due to the fact that 'no_log: true' was specified for this result", "changed": false}15:49
trozethow do i go see what is actually being done?15:49
bandinirasca: the bottom line is "if you update dbus, you need to reboot the box"15:50
mwhahahatrozet: pretty sure it still shows up in the journal15:50
trozetmwhahaha: what is the no_log true thing, how do i enable that?15:50
*** leanderthal has quit IRC15:51
mwhahahatrozet: it's on the ansible play itself, and i don't think you want to because it may break other things. (excessive logging breaks mistral)15:51
trozetmwhahaha: yeah with the long blob into sql problem :)15:51
*** yprokule has quit IRC15:51
d0ugalDo we have a Mistral bug for this btw?15:51
mwhahahatrozet: slagle was working on capturing the output somewhere15:51
trozetmwhahaha: so where do you mean it shows up in journal? im tailing /var/log/messages on the overcloud-controller-0 node15:52
mwhahahad0ugal: i remember rbrady__ talking about it yesterday so maybe?15:52
rascabandini, mwhahaha, so the only conclusion is that just for osp-11 after configuring repo with rhos-release we get a dbus upgrade15:52
silehtmwhahaha, about https://bugs.launchpad.net/tripleo/+bug/1771435, the root cause is here: https://review.openstack.org/#/c/568699/1/network/endpoints/endpoint_map.yaml@a0102 in the offending patch the port is 8977 instead of 1397715:52
openstackLaunchpad bug 1771435 in tripleo "scenario001/002 failing on autoscaling with urllib3.exceptions.SSLError: [SSL: UNKNOWN_PROTOCOL] unknown protocol (_ssl.c:579)" [Critical,Fix released] - Assigned to Alex Schultz (alex-schultz)15:52
*** masco has quit IRC15:52
mwhahahatrozet: i thought it was there, but if we're not capturing it yet15:53
openstackgerritMehdi Abaakouk (sileht) proposed openstack/tripleo-heat-templates master: Revert "Revert "Change default endpoint map entries to use TLS""  https://review.openstack.org/56888715:53
trozetd0ugal: are you referring to the issue I was just talking about with sql/mistral?15:53
rascabandini, mwhahaha, this can explain why I'm hitting this particular behavior, and I can also verify it, let me check15:53
*** janki has quit IRC15:53
mwhahahasileht: ok so bad port, well i know jaosorior reverted the revert so you may want to check that patch15:53
silehtoh I was about to do the same15:53
*** bogdando has quit IRC15:53
mwhahahasileht: https://review.openstack.org/#/c/568736/15:53
bandinirasca: yes (see my comment#3 the update is right there)15:54
mwhahahasileht: doesn't look like he updated it though, feel free to comment/patch it. I'm sure he'll be ok with it15:54
d0ugaltrozet: yup!15:54
*** ykarel is now known as ykarel|away15:54
trozetmwhahaha: the only thing i see in journald is May 16 10:51:37 overcloud-controller-0.opnfvlf.org Keepalived_vrrp[18618]: /usr/bin/systemctl status haproxy.service exited with status 115:54
trozetMay 16 10:51:37 overcloud-controller-0.opnfvlf.org dockerd-current[3438]: /usr/bin/systemctl status haproxy.service exited with status 115:54
trozetd0ugal: well one instance of that was fixed by not storing the ansible output into sql...let me find the patch15:54
mwhahahatrozet: so we build the ansible bits in THT, so you could try switching the no_log to false in THT and rerunning the deploy15:54
openstackgerritMehdi Abaakouk (sileht) proposed openstack/tripleo-heat-templates master: Revert "Revert "Change default endpoint map entries to use TLS""  https://review.openstack.org/56873615:54
mwhahahait'll be in deploy-steps.j2 or something15:55
d0ugaltrozet: Right, I think I spotted that. Probably worth fixing (or at least tracking) in Mistral too.15:55
mwhahahajust grep for the task name in tht to find it15:55
trozetd0ugal: https://review.openstack.org/#/c/565900/15:55
*** tcw has quit IRC15:55
d0ugaltrozet: Thanks. I'll open a Mistral specific bug.15:55
silehtmwhahaha, jaosorior that's done15:55
shardytrozet: you can also try running the deploy steps manually via ansible playbook, and if needed change the tasks or add debug etc15:56
trozetd0ugal: I also just bumped the blob size for sql, so that it can accept larger pieces of data15:56
shardytrozet: https://hardysteven.blogspot.co.uk/2018/02/debugging-tripleo-revisited-heat.html shows how to do that via config download15:56
*** fragatina has quit IRC15:57
trozetshardy: thanks i havent seen this link before15:58
trozetshardy, mwhahaha: my problem is this thing just seems to hang at step 4, it doesnt fail15:58
trozetmwhahaha: so i guess i can take your suggestion and try to enable the log to see what is happening15:58
mwhahahatrozet: or kill it and run it by hand15:58
trozetmwhahaha: but just to be clear..this isnt a good solution for debugging this for a user right15:59
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: DNM: test ansible 2.5.1  https://review.openstack.org/56347115:59
thrashd0ugal: suggest perhaps trimming the output so that it contains the last X chars15:59
trozetmwhahaha: the expectation is this should not hang, and if it fails, produce an obvious error15:59
mwhahahatrozet: we're not the regular users, but yea15:59
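thrash's suggestion above (keep only the last X characters of the output before it is stored) could look roughly like the sketch below. The function name `trim_output` and the 4096-character default are hypothetical and are not taken from Mistral or tripleo-common.

```python
def trim_output(output: str, max_chars: int = 4096) -> str:
    """Return at most the last max_chars characters of output."""
    if max_chars <= 0:
        # Guard: output[-0:] would return the whole string, not an empty one.
        return ""
    if len(output) <= max_chars:
        return output
    # Keep the tail, since errors usually show up at the end of a run.
    return output[-max_chars:]
```

Keeping the tail rather than the head is a design choice: a failing run tends to print its error last, so the end of the output is the part most worth storing.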
openstackgerritRedHat RDO CI proposed openstack/tripleo-quickstart-extras master: GATE CHECK for quickstart-extras  https://review.openstack.org/56044516:00
*** rbowen_ has joined #tripleo16:00
*** udesale_ has quit IRC16:02
slaglemwhahaha: trozet : switching no_log:true on that task might not help you fwiw16:03
slagleansible does not stream output back to the console16:03
slagleit collects it all, then shows it16:03
mwhahahayea that's what i figured16:03
mwhahaharerunning it by hand will be better for debugging16:04
trozetmwhahaha, slagle: yeah so if it hangs on a task i get nothing :/16:04
slaglein the case where the task fails, we do in fact show the error, as there are specific follow up tasks to show the output if they previously failed16:04
mwhahahahanging tasks is not a new thing16:04
slagletrozet: that's a function of what the task is doing16:04
*** marios has quit IRC16:04
mwhahahaanything that can hang needs an external timeout16:04
mwhahahain puppet we had that, not sure if you have that with ansible bits now16:04
slagleeventually it ought to time out, and maybe you'll get some output16:05
mwhahahaso if you find where it's hanging, we need to ensure it has a timeout mechanism somewhere16:05
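The "external timeout" idea mwhahaha describes can be illustrated with a small wrapper that gives up on a command after a fixed time and returns whatever output was collected. This is only a sketch in plain Python; `run_with_timeout` is a hypothetical helper and is not how the TripleO deploy steps are implemented.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run cmd; return (returncode, output), or (None, partial output) on timeout."""
    try:
        proc = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # merge stderr so nothing is lost
            timeout=timeout_s,         # child is killed when this expires
            text=True,
        )
        return proc.returncode, proc.stdout
    except subprocess.TimeoutExpired as exc:
        out = exc.output or ""
        if isinstance(out, bytes):
            # Partial output may arrive as bytes even with text=True.
            out = out.decode(errors="replace")
        return None, out


if __name__ == "__main__":
    # A command that would otherwise hang for 30 seconds is cut off after 1.
    rc, out = run_with_timeout(
        [sys.executable, "-c", "import time; time.sleep(30)"], 1
    )
    print("timed out" if rc is None else f"exit code {rc}")
```

A `None` return code marks the timeout case, so a caller can tell "hung and was killed" apart from "ran and failed".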
trozetwell in my output I have16:05
trozet2018-05-16 10:24:22,500 p=17826 u=mistral |  TASK [Start containers for step 4] *********************************************16:05
trozet2018-05-16 10:25:08,655 p=17826 u=mistral |  ok: [overcloud-controller-0] => {"censored": "the output has been hidden due to the fact that 'no_log: true' was specified for this result", "changed": false}16:05
*** itlinux has joined #tripleo16:05
trozetso im guessing its hanging on the compute node...i havent looked at that let me go check16:05
*** rmascena has joined #tripleo16:05
*** rmascena is now known as raildo_16:05
*** masco has joined #tripleo16:06
*** saneax is now known as saneax-_-|AFK16:06
*** rbowen has quit IRC16:06
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: Mount /dev for chrooted environment  https://review.openstack.org/56883816:07
trozetmwhahaha, slagle: ah ha! looks like it is ceph-osd-run.sh running over and over16:07
mwhahahashort list for hanging process offenders: pacemaker, ceph16:07
itlinuxhello all. good morning first of all.16:07
trozetmwhahaha: https://paste.fedoraproject.org/paste/ey0dtVluXxsHE8Ien47EQA/raw16:07
itlinuxI have AD enabled and at times I get this error An unexpected error prevented the server from fulfilling your request. (HTTP 500)16:08
itlinuxthis happens only on one domain since I have two and the second is fine16:08
*** raildo has quit IRC16:08
mwhahahatrozet: yea that's a question for gfidente or fultonj16:08
* mwhahaha doesn't touch ceph unless he has to16:08
trozetmwhahaha: haha damnt16:09
Tenguwise thought :]16:09
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097216:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154916:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177155116:10
*** ooolpbot has quit IRC16:10
trozetmwhahaha: do you know if tripleo CI covers ceph?16:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)16:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)16:10
openstackLaunchpad bug 1771551 in tripleo "Containers multinode jobs fails on stable pike because of pacemaker" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)16:10
sshnaidm|roverweshay, I was watching  https://review.openstack.org/#/c/568838/ running :(16:10
mwhahahatrozet: yes16:10
mwhahahatrozet: scenario001/00416:10
trozetmwhahaha: ok maybe I can compare that16:10
trozetgfidente: are you around to help me debug this issue?16:10
*** eck`gone is now known as eck`16:10
sshnaidm|roverweshay, please don't rebase it, let's see if introspection passes16:10
*** ramishra has joined #tripleo16:11
*** fragatina has joined #tripleo16:12
weshaysshnaidm|rover, ah sorry16:20
weshayI saw it hit that undercloud issue16:20
weshayhttp://logs.openstack.org/38/568838/3/check/tripleo-ci-centos-7-undercloud-containers/7cd8799/logs/undercloud/home/zuul/undercloud_install.log.txt.gz#_2018-05-16_13_36_2416:20
*** lvdombrkr89 has quit IRC16:22
EmilienMbeagles: https://review.openstack.org/#/c/561377/ is passing now16:22
EmilienMwith https://review.openstack.org/#/c/566737/16:22
beagles\o/16:22
EmilienMmwhahaha, beagles : please review these 2 patches ^ thanks16:23
sshnaidm|roverweshay, i care only about rdo ci ovb jobs in that patch16:23
mwhahahaEmilienM: https://i.imgur.com/DKUR9Tk.png16:23
*** paramite_ has joined #tripleo16:23
weshaysshnaidm|rover, roger that16:24
*** dprince has quit IRC16:24
beagleslol16:24
sshnaidm|rovermwhahaha, weshay where do we install pacemaker from? for pike for example..16:25
sshnaidm|roverlast pike promotion was tonight, but it didn't help16:25
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci master: Add CLI argument parser and YAML file parser  https://review.openstack.org/56793616:26
*** psahoo has quit IRC16:26
*** jfrancoa has quit IRC16:26
*** florianf has quit IRC16:26
weshaysshnaidm|rover, https://buildlogs.centos.org/centos/7/cloud/x86_64/openstack-pike/16:27
sshnaidm|roverweshay, and which version do we need?16:28
mwhahahasshnaidm|rover: it's in the image build16:28
*** jfrancoa has joined #tripleo16:28
weshaysshnaidm|rover, so iirc the build had 1.1.1816:28
weshaythis has 1.1.1616:28
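The 1.1.16 versus 1.1.18 comparison above is easy to get wrong with plain string comparison (for example "1.1.9" sorts after "1.1.18" as text). A minimal sketch of a numeric comparison follows; `parse_version` and `is_older` are hypothetical helpers and assume purely dotted numeric versions, so they ignore RPM epochs and release suffixes.

```python
def parse_version(version: str) -> tuple:
    """Turn a dotted numeric version such as '1.1.18' into (1, 1, 18)."""
    return tuple(int(part) for part in version.split("."))


def is_older(installed: str, expected: str) -> bool:
    """True if installed is strictly lower than expected, compared numerically."""
    return parse_version(installed) < parse_version(expected)
```

For real package checks, the distribution's own RPM version comparison is the safer choice; this only shows why the comparison has to be numeric per component.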
sshnaidm|roverweshay, so buildlogs repo needs to be updated?16:29
mwhahahasshnaidm|rover: we haven't landed the yum update in the base container yet16:29
mwhahahait's still in gate16:29
sshnaidm|rovermwhahaha, it doesn't matter for pacemaker afaik16:29
mwhahahayes it should16:29
mwhahahaoh wait yea16:29
mwhahahathat should be installed in that container16:29
mwhahahaso that's the build from that container16:29
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci master: Add CLI argument parser and YAML file parser  https://review.openstack.org/56793616:30
*** mdnadeem_ has joined #tripleo16:30
mwhahahaunless we are installing pacemaker in the base container16:30
*** moshele has joined #tripleo16:30
*** mdnadeem has quit IRC16:31
*** holser__ has quit IRC16:31
weshaysshnaidm|rover, so in the container build.. if the centos base repo is used.. we should get 1.1.1816:31
weshayso apparently it's not?16:32
sshnaidm|roverweshay, yep, it's not..16:32
sshnaidm|roverweshay, but it worked for queens16:32
weshayhrm.. can we look at that?16:32
*** fragatina has quit IRC16:32
weshayhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-centos-7-pike-containers-build/946e18f16:33
chandankumarsshnaidm|rover: is the telemetry issue got fixed/16:34
chandankumar?16:34
sshnaidm|roverchandankumar, I think so, weshay has a reverting patch for tempest16:34
weshaywell.. mwhahaha does, and it's my understanding we'd rather see it fail in the gate and fix any issue immediately if there is one16:36
weshaythan to take it offline16:36
*** kopecmartin has quit IRC16:36
*** itlinux has quit IRC16:36
mwhahahait wasn't a telemetry fix anyway16:36
mwhahahawe don't disable that test16:36
weshayfor future reference sshnaidm|rover chandankumar  ^16:37
*** ccamacho has quit IRC16:37
chandankumarsshnaidm|rover: did you get a chance to look at tempest container log patch16:37
mwhahahathat's like disabling ping test when ping test is the only thing running16:37
*** panda is now known as panda|off16:37
sshnaidm|roverok, I will remember this specific test never to put in ignore list16:38
chandankumarwhat about adding a tag so that people cannot put it in skip list16:38
chandankumar?16:38
weshaysshnaidm|rover, chandankumar I'll add some doc text to the skip files.. to indicate what is and what is not appropriate at this moment16:38
sshnaidm|rovernever16:38
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci master: Add CLI argument parser and YAML file parser  https://review.openstack.org/56793616:39
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates master: undercloud: set OS_CACERT when TLS is used  https://review.openstack.org/56885316:40
* chandankumar headed home16:40
weshaysshnaidm|rover, I wonder if kolla overwrites the avail yum repos16:40
weshay2018-05-16 04:29:05.193 |  TASK [rdo-kolla-build : Fetch repo file] ***************************************16:40
weshayhttps://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-centos-7-pike-containers-build/946e18f/console.txt.gz16:41
*** ffiore has quit IRC16:41
*** tcw has joined #tripleo16:42
*** gvrangan has joined #tripleo16:42
*** dprince has joined #tripleo16:45
*** gkadam has quit IRC16:46
*** eck` is now known as eck`gone16:46
*** paramite_ has quit IRC16:46
sshnaidm|roverweshay, if it does, so only for pike16:47
weshaysshnaidm|rover, ya.. so.. fak.. I think I have to run this locally16:48
*** rbowen_ is now known as rbowen16:49
*** rbowen has quit IRC16:49
*** rbowen has joined #tripleo16:49
*** shardy has quit IRC16:50
*** eck`gone is now known as eck`16:51
*** rbowen has quit IRC16:51
*** rbowen has joined #tripleo16:51
sshnaidm|roverweshay, well, we have 1.1.18 http://logs.openstack.org/85/564285/9/check/tripleo-ci-centos-7-containers-multinode/1bbbd26/logs/subnode-2/var/log/extra/rpm-list.txt.gz16:52
weshaysshnaidm|rover, ya.. so I suspect kolla for some reason... I'll setup a reproducer16:53
*** links has joined #tripleo16:53
sshnaidm|roverweshay, so on image it's 1.1.18, how can I check what was in container?16:53
weshaysshnaidm|rover, ya.. /me looks16:53
weshayI think we *may* have it..  not 100% sure16:54
*** ramishra has quit IRC16:54
weshayre: a log of the version16:54
*** salmankhan has quit IRC16:54
*** radeks_ has quit IRC16:54
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates master: Revert "Switch public endpoints to use FQDNs by default"  https://review.openstack.org/56889916:54
weshaysshnaidm|rover, this looks right to me https://logs.rdoproject.org/openstack-periodic-24hr/periodic-tripleo-centos-7-pike-containers-build/946e18f/kolla/logs/rabbitmq.log16:55
weshaysshnaidm|rover, where did you see 1.1.16?16:55
sshnaidm|roverweshay, I didn't say I saw 1616:56
weshaysshnaidm|rover, k k16:56
weshaysshnaidm|rover, so what led you down this path?16:56
weshaywhat failure16:56
sshnaidm|roverweshay, but in repo link you pasted above it's only 1616:56
weshayright16:57
weshaydeps should be updated.. or not duplicated16:57
weshaywth16:57
weshaysshnaidm|rover, is pike now working in upstream?16:57
* weshay checks16:57
sshnaidm|roverweshay, the failure seems exactly the same we had in queens and master and which was solved by pacemaker16:57
sshnaidm|roverweshay, no16:57
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates master: Revert "Switch public endpoints to use FQDNs by default"  https://review.openstack.org/56889916:57
*** fragatina has joined #tripleo16:58
weshaysshnaidm|rover, so you've confirmed after the promotion, an upstream job with latest promoted bits fails the same way16:58
*** fragatina has quit IRC16:58
*** raildo_ is now known as raildo16:58
*** dtantsur is now known as dtantsur|afk16:58
weshaysshnaidm|rover, seems that way :) https://review.openstack.org/#/c/564285/16:59
sshnaidm|roverweshay, see https://bugs.launchpad.net/tripleo/+bug/177155116:59
openstackLaunchpad bug 1771551 in tripleo "Containers multinode jobs fails on stable pike because of pacemaker" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)16:59
*** mdnadeem_ has quit IRC16:59
sshnaidm|roverweshay, Michele checked in container and it's 16 there17:00
*** ykarel|away has quit IRC17:00
*** fragatina has joined #tripleo17:01
*** derekh has quit IRC17:01
*** fragatina has quit IRC17:02
*** fragatina has joined #tripleo17:02
weshaysshnaidm|rover, afaict.. at least in the latest hubbot job17:03
weshay2018-05-16 09:45:33 | - imagename: docker.io/tripleopike/centos-binary-rabbitmq:d52ad67500aacdb4c2a1321363bfe87de4e6b518_88c9954e17:03
weshaywhich is n-1 on promotions17:03
weshayhttps://hub.docker.com/r/tripleopike/centos-binary-rabbitmq/tags/17:03
weshayshould be pulling 1ba7734082acaef6e95d489e4c32cea52aa92c4c_de76e10817:03
* weshay looks at your bug17:03
*** shreshtha has quit IRC17:03
weshaysshnaidm|rover, your bug is also referencing a job that uses the old n-1 promoted container17:04
weshayhttp://logs.openstack.org/98/564698/2/check/tripleo-ci-centos-7-containers-multinode/22c050e/logs/undercloud/home/zuul/overcloud_prep_containers.log.txt.gz17:04
weshaysshnaidm|rover, c what I mean17:06
weshaysshnaidm|rover, can you put up a dnm patch on tht pike17:06
weshayand see if it persists?17:06
weshayor myoung|ruck17:06
myoung|rucksure, just caught up...had made a sammich17:07
sshnaidm|roverweshay, hmm.. then how does gate check run on old hash??17:08
myoung|ruckhttps://buildlogs.centos.org/centos/7/cloud/x86_64/openstack-pike/ has pacemaker 1.1.1617:08
weshaymyoung|ruck, yes we know, however centos base has 1.1.1817:09
*** gvrangan has quit IRC17:09
weshaysshnaidm|rover, are you sure it kicked after the promotion?17:09
myoung|ruckyes...so shouldn't we have the deps repo updated to match upstream?17:09
myoung|ruckbase17:09
weshayimho the duplicate should be removed17:09
myoung|ruckkolla is prob finding it from https://trunk.rdoproject.org/centos7-pike/delorean-deps.repo17:10
sshnaidm|roverweshay, last run 14:16 today17:10
weshaymyoung|ruck, read through the comments again17:10
weshayin irc17:10
*** jcoufal_ has joined #tripleo17:10
sshnaidm|roverweshay, ok, I have pike patches to merge, let's check on them: https://review.openstack.org/#/c/568151/ https://review.openstack.org/#/c/568292/17:11
*** gvrangan has joined #tripleo17:11
*** jpena is now known as jpena|off17:11
*** beagles is now known as beagles|afk17:11
EmilienMsshnaidm|rover, dmsimard|off: what's the progress in ara integration?17:11
*** trown is now known as trown|lunch17:11
EmilienMI thought it would be easy and we just need to configure a callback plugin17:11
sshnaidm|roverEmilienM, sorry, no progress yet :(17:12
*** itlinux has joined #tripleo17:12
sshnaidm|roverEmilienM, when installing ara seems like it breaks ansible run, no idea why17:12
weshaysshnaidm|rover, seems to be working in pike now http://zuul.openstack.org/stream.html?uuid=b4799642a64e49fa9e339d335a9f4f72&logfile=console.log17:12
openstackgerritAlex Schultz proposed openstack/python-tripleoclient master: Make standalone role name configurable  https://review.openstack.org/56837817:13
sshnaidm|roverEmilienM, but I'll get back to this asap conditions allow..17:13
weshaysshnaidm|rover, we can probably close the bug17:13
*** jcoufal has quit IRC17:13
sshnaidm|roverweshay, let's wait for results17:13
sshnaidm|roverweshay, but if it's so,  1 problem less *phew17:14
openstackgerritAlex Schultz proposed openstack/python-tripleoclient master: Update HostnameMap generation  https://review.openstack.org/56795117:14
*** psachin has quit IRC17:14
weshaysshnaidm|rover, :)17:14
weshayI got 99 problems, but pike aint one?17:14
sshnaidm|roverweshay, so only queens overcloud timeout remains17:14
EmilienMsshnaidm|rover: we can probably ask dmsimard|off to give a hand17:14
weshaysshnaidm|rover, across the board?17:14
sshnaidm|roverEmilienM, I need to improve my debug skills, need to see why ansible fails to start - where can I see that? in mistral logs..?17:15
trozetmwhahaha: I figured out the problem: https://github.com/ceph/ceph-ansible/issues/259817:15
sshnaidm|roverweshay, from urgent ones17:15
EmilienMsshnaidm|rover: right, in /var/lib/mistral you should see ansible logs17:15
sshnaidm|roverweshay, because it blocks queens17:15
mwhahahatrozet: HEH17:16
sshnaidm|roverEmilienM, I don't have this folder when it fails17:16
*** links has quit IRC17:16
EmilienMsshnaidm|rover: on the overcloud17:16
EmilienMerr17:16
trozetmwhahaha: so in Apex we create a loop device, but i was creating an ext4 partition on it...so thats why no osd17:16
EmilienMon the undercloud sorry17:16
EmilienMsshnaidm|rover: it depends which ansible run you're talking about17:16
trozetmwhahaha: so testing it now without creating a partition, and will also try to submit a fix for this in ceph-ansible to have better checking17:16
mwhahahatrozet: you and your fake devices17:17
sshnaidm|roverEmilienM, well, last time I tried - overcloud failed and this folder didn't exist, so I made a conclusion that ansible just didn't start at all..17:17
trozetmwhahaha: haha i used to just use a directory in the old puppet-ceph, but in ceph-ansible thats not allowed, so now I create the persistent loop device17:17
*** gbarros has quit IRC17:17
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci master: WIP: use ara for ansible deploy  https://review.openstack.org/56507917:19
openstackgerritSagi Shnaidman proposed openstack/tripleo-common master: WIP: ara with ansible deploy  https://review.openstack.org/56507717:19
sshnaidm|roverEmilienM, I'll check it again..17:19
weshaysshnaidm|rover, just to make sure I understand what you are saying17:19
weshaytripleo-ci-centos-7-containers-multinodeFAILURE in 2h 58m 24s17:19
weshayin https://review.openstack.org/#/c/567224/17:19
weshaythat job is the issue?17:20
weshayfor queens17:20
sshnaidm|roverweshay, yes, https://bugs.launchpad.net/tripleo/+bug/177154917:20
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)17:20
weshaysshnaidm|rover, ok.. by urgent.. you meant alert17:21
weshayk17:21
weshaythanks17:21
*** etingof|brb is now known as etingof17:21
weshaysshnaidm|rover, ah fak17:22
openstackgerritMerged openstack/tripleo-common master: (cleanup) remove usage of role_name  https://review.openstack.org/56834717:22
weshaythat is timing out since | 2018-05-15 02:39 |17:22
weshaymyoung|ruck, ^17:22
weshaymaybe before that17:23
weshayso queens is blocked17:23
sshnaidm|rovermyoung|ruck, weshay, btw, master containers build fails17:23
weshayfak17:23
openstackgerritMichael Chapman proposed openstack/tripleo-heat-templates master: Add OPNFV scenario environment  https://review.openstack.org/48690517:23
sshnaidm|roverfrom yesterday17:23
mwhahahaso i noticed the kolla build job has been failing, have we looked into that yet?17:24
sshnaidm|rovermyoung|ruck, do we have a bug about master containers build fails?17:24
aleemcornea, a little progress it seems -- any idea why this is happening though?  http://logs.openstack.org/97/567897/5/experimental/tripleo-ci-centos-7-scenario000-multinode-oooq-container-password-changes/4463bc5/logs/quickstart_install.txt.gz17:24
michchaphey guys, I'm trying to add a container job for ODL, but it doesn't seem to run the container image prepare script so the DockerOpendaylightApiImage var never gets set in heat, is there something I'm likely missing?17:24
aleemcornea, Fatal: [undercloud]: FAILED! => {"changed": false, "failed": true, "msg": "Source /home/zuul/overcloud_deploy.sh not found"}17:24
mcorneaalee: yes! something familiar :) give me a sec17:25
myoung|ruckwill check...and open if not17:25
*** amoralej is now known as amoralej|off17:25
*** tesseract has quit IRC17:27
*** beagles|afk is now known as beagesl17:27
*** beagesl is now known as beagles17:27
mcorneaalee: add this at the top of your play: https://github.com/openstack/tripleo-upgrade/blob/master/tasks/fast-forward-upgrade/create-prepare-scripts.yaml#L1-L417:27
weshaysshnaidm|rover, hrm... https://review.rdoproject.org/grafana/dashboard/db/tripleo-ci?orgId=1&var-pipeline=All&var-branch=queens&var-cloud=All&var-type=All&var-jobtype=All17:27
myoung|ruckweshay, sshnaidm|rover, #908 is running now, checking logs for preview17:28
aleemcornea, cool17:28
myoung|ruckprevious17:28
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: docker: cleanup update tasks  https://review.openstack.org/56871517:28
sshnaidm|roverweshay, yeah?17:29
sshnaidm|rovermyoung|ruck, weshay, seems like something with registry, 500, 504 errors17:29
* myoung|ruck is looking here: https://review.rdoproject.org/jenkins/job/periodic-tripleo-centos-7-master-containers-build/907/consoleFull atm17:29
sshnaidm|rovermyoung|ruck, weshay gotta run, I'll look tomorrow..17:29
myoung|ruck^^ last fail17:29
*** sshnaidm|rover is now known as sshnaidm|off17:29
jaosoriorsileht: hey! thanks for looking into it!17:31
myoung|ruckweshay, sshnaidm|off, details will land there --> https://bugs.launchpad.net/tripleo/+bug/177163417:31
openstackLaunchpad bug 1771634 in tripleo "periodic: master container build is failing" [Critical,Triaged]17:31
*** pkovar has quit IRC17:32
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Move unfencing to meta_params  https://review.openstack.org/56876917:32
openstackgerritMatthew Thode proposed openstack/diskimage-builder master: uncap networkx  https://review.openstack.org/56891017:33
*** gbarros has joined #tripleo17:34
*** hjensas has joined #tripleo17:34
openstackgerritMichael Chapman proposed openstack/tripleo-quickstart master: Updates OpenDaylight feature set 31  https://review.openstack.org/50087217:38
*** cshastri has quit IRC17:38
*** jfrancoa has quit IRC17:41
*** rh-jelabarre has quit IRC17:41
*** jfrancoa has joined #tripleo17:42
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: WIP Use the non-fqdn name when creating stonith levels  https://review.openstack.org/56891317:43
openstackgerritMerged openstack/tripleo-heat-templates master: FFU Set NetworkDeploymentActions CREATE,UPDATE for ffwd-upgrade prepare  https://review.openstack.org/56727017:44
weshaywhat is octavia-housekeeping?17:44
mwhahahaweshay: it's an octavia service17:45
mwhahahaweshay: why17:45
* mwhahaha throws things at beagles 17:45
weshayit's failing to build17:45
mwhahahalink to logs?17:45
beagleswhat?17:45
weshayhttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-centos-7-master-containers-build/6e68284/console.txt.gz17:45
weshaynot easy to parse17:45
mwhahahaRFE less crappy logging :D17:46
weshayya.. maybe it's more than just that one.. but I was not familiar w/ it17:46
mwhahahausually when it fails to build it's a lack of package17:46
beagleswow that is hard to sort out17:46
openstackgerritAde Lee proposed openstack/tripleo-upgrade master: Add config_change role  https://review.openstack.org/56730017:47
mwhahahaweshay: do we not capture the kolla logs separately? those are easier to parse17:47
weshaylooks like infra https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-centos-7-master-containers-build/6e68284/kolla/logs/octavia-housekeeping.log17:47
mwhahahaERROR:kolla.common.utils:neutron-l3-agent Failed with status: error17:47
weshayERROR:kolla.common.utils.octavia-housekeeping:received unexpected HTTP status: 500 Internal Server Error17:47
*** gvrangan has quit IRC17:47
mwhahahaINFO:kolla.common.utils.neutron-l3-agent:Trying to push the image17:48
mwhahahaERROR:kolla.common.utils.neutron-l3-agent:received unexpected HTTP status: 504 Gateway Time-out17:48
weshayya17:48
mwhahahalooks like issues pushing local containers17:48
mwhahahanot code/packaging related17:48
weshayya17:48
weshaysorry17:48
weshaymwhahaha, you shouldn't throw things at people man17:48
weshayyou can poke someone's eye out17:48
* mwhahaha gently tosses things at weshay 17:49
weshaythat's nice17:49
* beagles wears protective gear17:49
mwhahahaand by things i mean a cactus17:49
beagleslol17:49
weshaymwhahaha, you promised17:49
mwhahahalunch next week?17:49
mwhahahai'll bring a cactus17:49
weshaywhile all these suckers are at summit17:49
mwhahahai'm sure the mrs would love a trip to ikea17:49
weshaysure17:49
weshayhell ya17:50
weshayapproved.. swedish meatballs and cactus.. hrm17:50
mwhahaha:D17:50
weshayrasca, ^17:50
*** gfidente is now known as gfidente|afk17:53
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Add basics for standalone node  https://review.openstack.org/56641917:53
openstackgerritMerged openstack/tripleo-quickstart-extras master: Run containers update only for required packages  https://review.openstack.org/56755017:55
openstackgerritMerged openstack-infra/tripleo-ci master: Revert "temp workaround to bring ci gates back online"  https://review.openstack.org/56834117:55
silehtjaosorior, you're welcome17:56
myoung|ruckmwhahaha, weshay, https://bugs.launchpad.net/tripleo/+bug/1771634 is up to date with current status, I see you already found the 504/500 so I'll not respam here.  #908 is building now...that's the next data point17:58
openstackLaunchpad bug 1771634 in tripleo "periodic: master container build is failing" [Critical,Triaged] - Assigned to Matt Young (halcyondude)17:58
weshaymyoung|ruck, it's been running red for how many jobs?17:58
myoung|ruckmwhahaha: weshay: indeed the logs are kind of hard to read...I've been pulling them local, swapping \n for newlines.  the individual kolla container build logs are easier to digest.  links in LP17:59
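The "swapping \n for newlines" trick above can be sketched roughly as follows. This is only an illustration, assuming the console log embeds the two literal characters `\n` inside long lines; the helper names `unescape_newlines` and `error_lines` are hypothetical, and the sample text reuses kolla log lines quoted earlier in this channel.

```python
def unescape_newlines(raw):
    """Turn literal backslash-n sequences into real line breaks."""
    return raw.replace("\\n", "\n")


def error_lines(raw):
    """Return only the lines that start with ERROR: after unescaping."""
    return [
        line
        for line in unescape_newlines(raw).splitlines()
        if line.startswith("ERROR:")
    ]


# Sample built from log lines pasted in the channel, joined by a literal "\n".
sample = (
    "INFO:kolla.common.utils.neutron-l3-agent:Trying to push the image\\n"
    "ERROR:kolla.common.utils.neutron-l3-agent:"
    "received unexpected HTTP status: 504 Gateway Time-out"
)

for line in error_lines(sample):
    print(line)
```

Filtering on the `ERROR:` prefix is just one way to cut the noise; the per-container kolla logs linked in the bug remain the easier source to read.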
myoung|ruckweshay: I'm going thru that now, https://bugs.launchpad.net/tripleo/+bug/1771634/comments/117:59
openstackLaunchpad bug 1771634 in tripleo "periodic: master container build is failing" [Critical,Triaged] - Assigned to Matt Young (halcyondude)17:59
weshayk17:59
myoung|ruckweshay: 901 was last success18:00
myoung|ruckyesterday18:00
slagleI think we killed CI again18:04
mwhahahanoooooo18:04
slagleEmilienM: since we merged https://review.openstack.org/#/c/568343/, everything is broken until we get an updated Mistral container containing the new tripleo-common18:05
slaglesince the inventory script will run from within the mistral container18:05
mwhahahai thought we reverted the update thing18:05
mwhahahaso we should be getting updates18:05
mwhahahanow18:05
slaglehow often does that happen?18:05
mwhahahaat run time18:05
mwhahahait was off but just landed18:05
* mwhahaha goes and finds the review18:06
mwhahahahttps://review.openstack.org/#/c/568341/18:06
slagleok18:06
mwhahahathat just landed18:06
mwhahahawith everything else18:06
slaglethanks18:06
mwhahahawe knew there would be some out of sync stuff but any jobs going forward as of 11mins ago should be ok18:06
mwhahaha:D18:06
mwhahahajuggling flaming chainsaws18:06
slaglei should have just rechecked instead of trying to investigate :-P18:07
bandinilol18:07
weshaymwhahaha, that is a great description18:07
mwhahahahttps://www.youtube.com/watch?v=G8OTcY0iegI18:07
* weshay hopes for carnage18:08
*** moshele has quit IRC18:08
mwhahahatoo much talking, not enough juggling18:08
mwhahaha3:3018:08
*** olap has joined #tripleo18:09
weshayjust one?18:09
weshayah18:09
weshayI think EmilienM went to school w/ that guy18:09
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097218:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154918:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163418:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)18:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)18:10
openstackLaunchpad bug 1771634 in tripleo "periodic: master container build is failing" [Critical,Triaged] - Assigned to Matt Young (halcyondude)18:10
weshaythis guy has it going on https://www.youtube.com/watch?v=OoJW-_OeFtw18:10
mwhahahatiny chainsaws18:10
mwhahahaacceptable use of flask18:11
weshaymyoung|ruck, running a recreate of queens multinode-containers, will send a tmate when it's ready18:11
*** jfrancoa has quit IRC18:12
*** akrivoka has quit IRC18:12
*** rwsu has quit IRC18:13
myoung|ruckweshay: ack cool.  just went thru 905, 906 (previous 2 fails) - so far it's random failures, all when attempting to push.  905 failed a new/different way than 906/907.  https://bugs.launchpad.net/tripleo/+bug/1771634/comments/318:14
openstackLaunchpad bug 1771634 in tripleo "periodic: master container build is failing" [Critical,Triaged] - Assigned to Matt Young (halcyondude)18:14
myoung|ruckweshay, mwhahaha, dmsimard|off: do we know what kind of HW we're running RDO container registry on?18:16
mwhahahano idea18:17
dmsimard|offmyoung|ruck: it's a virtual machine on RDO Cloud backed by a Ceph volume18:17
myoung|ruckthis smells like we're swamping the recv'r18:17
dmsimard|offmyoung|ruck: not sure what you want to know18:17
mwhahahamyoung|ruck: is it the rdo container registry or the local docker instance in the build18:17
dmsimard|offThe ceph storage on RDO cloud is notoriously slow18:17
myoung|ruckdmsimard|off: it's looking like we're getting 500/504 and/or timeouts when pushing containers from the promotion jobs18:17
dmsimard|offhas anything changed recently ?18:18
myoung|ruckmwhahaha: still unravelling the kolla build layers...i had assumed we're pushing the new image to rdo registry, now i'm self-doubting.  rather than debug in quiet land trying to run fast/transparent ;)18:20
mwhahahamyoung|ruck: yea that's fine, i think it gets pushed to the local docker instance first, i would also need to look at that18:21
* myoung|ruck gets into console.registry and watches 90818:21
dmsimard|offmyoung|ruck: Let me try something to see if it helps.18:22
myoung|ruckdmsimard|off: mwhahaha might have a very good point...i was assuming that "INFO:kolla.common.utils.octavia-housekeeping:Trying to push the image" meant --> RDO18:23
mwhahahaMay 15 23:12:31 upstream-centos-7-rdo-cloud-tripleo-174252 dockerd-current[1584]: time="2018-05-15T23:12:31.837209370Z" level=warning msg="failed to upload schema2 manifest: received unexpected HTTP status: 504 Gateway Time-out - falling back to schema1"18:23
mwhahahawhatever that means18:23
dmsimard|offdo we know if that's on the local or remote registry ?18:24
dmsimard|offlike is it the container build job ?18:24
*** gyankum has quit IRC18:24
mwhahahayea it is the build job but i don't know when containers are pushed18:24
mwhahahalike are they all built locally then pushed at the end18:24
mwhahahaor are they pushed as they are built18:24
mwhahahahttps://logs.rdoproject.org/openstack-periodic/periodic-tripleo-centos-7-master-containers-build/551e606/undercloud/var/log/journal.txt.gz#_May_15_23_25_2518:24
mwhahahathough a 504 would be odd to hit locally18:25
mwhahahasounds more like a remote error18:25
myoung|ruckit appears that kolla spins up 16 threads (per conf file in our case)18:26
myoung|ruckit builds, and pushes as they are built18:27
myoung|rucke.g. https://console.registry.rdoproject.org/registry#/images/tripleomaster/centos-binary-aodh-base:tripleo-ci-testing landed 17 mins ago from currently running buile 90818:27
myoung|ruckbuild18:27
myoung|ruck^^ https://review.rdoproject.org/jenkins/job/periodic-tripleo-centos-7-master-containers-build/908/console18:28
myoung|ruckhave gone thru a few previous failures, so far it's random which container gets whacked18:29
myoung|ruck(aside) I am persistently annoyed at how ansible hides output until everything is done.  combined with not being on the actual node it makes diagnosis == log diving...  17:42:51 TASK [rdo-kolla-build : Build and push images] --> {spinny-wheel-thingy} :)18:30
* myoung|ruck switches the "whine selector" to the "SHADDUP" position18:31
myoung|ruckdmsimard|off: is it possible to get a dump from server side of registry, do we have the server logs anywhere findable/parsable?18:32
*** rbowen_ has joined #tripleo18:33
*** rbowen has quit IRC18:33
*** rbowen_ has quit IRC18:33
*** rbowen has joined #tripleo18:33
*** radeks has joined #tripleo18:34
eric-youngI've got a fairly simple patch if someone wants to review it... https://review.openstack.org/#/c/563914/18:35
weshaymwhahaha, EmilienM chem, matbu updated the tripleo-upgrade gate https://review.openstack.org/#/c/568733/18:35
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: DNM Testing releases with zuul change  https://review.openstack.org/56892818:35
*** salmankhan has joined #tripleo18:36
openstackgerritRonelle Landy proposed openstack-infra/tripleo-ci master: DNM Testing releases with zuul change  https://review.openstack.org/56892818:38
*** salmankhan has quit IRC18:41
weshaydmsimard|off, myoung|ruck heh.. of course it works this time18:42
weshay:)18:42
myoung|ruckmwhahaha, dmsimard|off: so bug has details for all the fails since 901, mixture of 504, 500, and read timeouts18:42
*** atoth has quit IRC18:42
myoung|ruckweshay: fakme.  i mean...that's great!  908 ======== BUILD CONTAINERS IMAGES COMPLETED18:43
myoung|ruckdmsimard|off: did you tweak anything?  (e.g. [14:22:20] <dmsimard|off> myoung|ruck: Let me try something to see if it helps. )18:43
* myoung|ruck attempts to actually eat that (now 2 hour old) sandwich lolz18:43
dmsimard|offmyoung|ruck: I have not yet18:43
*** trown|lunch is now known as trown18:44
myoung|ruckok so we got lucky then...would expect we'll keep hitting this...just by the #'s18:44
dmsimard|offmyoung|ruck: what I'm doing right now is properly deleting the old namespaces (like master, because we're using tripleomaster now)18:44
dmsimard|offlike I was explaining to weshay recently18:44
dmsimard|offsome of the kolla images have upwards of 30 layers and when the docker client does a push, it needs to query the registry to know if it needs to push each layer or not -- the registry does a lookup in the filesystem and returns a 404 or a 200 depending if the layer is there or not so the client knows whether to push or not18:45
dmsimard|offwhen you're pushing 125+ images with so many layers, it's very expensive on a storage volume that is already slow to begin with18:45
dmsimard|offso the general idea is to keep as few images/tags on the registry as we can, hence the regular pruning18:46
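A rough sketch of the lookup cost described above, assuming one existence check per layer per image and no de-duplication of shared layers. This is a toy model, not the docker client or the registry API; `plan_push` and `total_layer_lookups` are hypothetical names, and the figures (125 images, about 30 layers) are the ones quoted in the channel.

```python
def plan_push(image_layers, registry_layers):
    """Split an image's layers into (already_present, must_upload).

    Models the per-layer check: a layer found on the registry (the 200
    case) is skipped, a layer not found (the 404 case) is uploaded.
    """
    present = [layer for layer in image_layers if layer in registry_layers]
    missing = [layer for layer in image_layers if layer not in registry_layers]
    return present, missing


def total_layer_lookups(num_images, layers_per_image):
    """Upper bound on existence checks if every layer is queried once per image."""
    return num_images * layers_per_image


# Illustrative layer names only; real layers are identified by digests.
registry = {"base", "openstack-base"}
image = ["base", "openstack-base", "aodh-base"]
present, missing = plan_push(image, registry)
print("skip:", present, "upload:", missing)

# 125 images with about 30 layers each, as mentioned in the channel.
print(total_layer_lookups(125, 30))  # 3750
```

Every one of those checks lands on the registry's storage backend, which is why fewer images and tags on a slow volume should mean cheaper pushes.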
myoung|ruckdmsimard|off: makes total sense18:49
myoung|ruckdmsimard|off: "docker image prune" ?18:51
*** etingof is now known as etingof|afk18:51
openstackgerritRonelle Landy proposed openstack/python-tripleoclient master: DNM Testing releases script with zuul change  https://review.openstack.org/56893118:53
dmsimard|offmyoung|ruck: no18:53
openstackgerritMichele Baldessari proposed openstack/puppet-pacemaker master: WIP do not create stonith constraint location when 1-node cluster  https://review.openstack.org/56893218:54
myoung|ruckmwhahaha, weshay, dmsimard|off confirmed we're seeing this in queens as well: https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-centos-7-queens-containers-build/0b2dc67/kolla/logs/neutron-l3-agent.log18:55
myoung|rucksame mo18:55
weshayya.. I suspect it has nothing to do w/ the release18:56
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Add missing UndercloudUpgrade to environment  https://review.openstack.org/56893318:58
myoung|ruckweshay: aye18:59
* myoung|ruck will brb18:59
*** eck` is now known as eck`gone18:59
*** eck`gone is now known as eck`19:00
*** cameron_chuang has joined #tripleo19:00
trozetmwhahaha: so the ceph issue i fixed...now still hit the infinite loop in step 4...the nova secret container seems to open a virsh interactive shell, that's why it hangs19:00
*** moshele has joined #tripleo19:00
trozetmwhahaha, EmilienM: https://paste.fedoraproject.org/paste/cGmL09MYyBA4T0fRDAMhWQ/raw19:01
*** olap has quit IRC19:02
*** fragatin_ has joined #tripleo19:05
*** abishop_ has joined #tripleo19:05
*** abishop has quit IRC19:06
*** fragatin_ has quit IRC19:07
*** fragatin_ has joined #tripleo19:07
*** fragatina has quit IRC19:07
*** fragatin_ has quit IRC19:09
openstackgerritJames Slagle proposed openstack/tripleo-common master: Set deployment_status from config_download_deploy  https://review.openstack.org/56695319:10
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add workflow for plan deployment status  https://review.openstack.org/56431519:10
openstackgerritJames Slagle proposed openstack/tripleo-common master: Ansible json error callback plugin  https://review.openstack.org/56693819:10
openstackgerritJames Slagle proposed openstack/tripleo-common master: Workflow and action for deployment failures  https://review.openstack.org/56731819:10
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097219:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154919:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163419:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)19:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)19:10
openstackLaunchpad bug 1771634 in tripleo "periodic: container build jobs are failing when pushing to rdo registry (500, 504, read timeout)" [Critical,Triaged] - Assigned to Matt Young (halcyondude)19:10
*** fragatina has joined #tripleo19:13
*** mcornea has quit IRC19:13
*** gbarros has quit IRC19:14
*** mcornea has joined #tripleo19:14
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Add basics for standalone node  https://review.openstack.org/56641919:16
*** tosky has quit IRC19:20
dmsimard|offmyoung|ruck: the tags for master/queens/pike have been deleted, pruning the images now.19:21
* myoung|ruck is taking bets on reclaimed space size and peers at mwhahaha and weshay19:21
mwhahaha5 megs?19:22
dmsimard|offmyoung|ruck: we rotate about 500GB worth of images every two weeks or so19:22
mwhahaha640k is all we should ever need19:22
myoung|ruck640! my god man.  64k baby.  64k19:22
cameron_chuangHi all, when I deploy pike TripleO it fails with msg "Error: Evaluation Error: Error while evaluating a Function Call, Could not find data item oslo_messaging_rpc_password in any Hiera data file and no default supplied at /etc/puppet/modules/tripleo/manifests/profile/base/ceilometer.pp:78:30 on node overcloud-novacompute-0.localdomain". Where do I need to fix the config?19:23
myoung|ruckmwhahaha: https://st.depositphotos.com/1021561/3891/i/950/depositphotos_38919889-stock-photo-three-keys-keyboard-binary-layout.jpg19:23
dmsimard|offmyoung|ruck: ~4TB of traffic in about a month http://paste.openstack.org/show/721125/19:24
*** tosky has joined #tripleo19:24
dmsimard|offmyoung|ruck: but really what's hurting us is the slow I/O for which I have no solution19:24
dmsimard|offhttps://review.rdoproject.org/grafana/?orgId=1&var-datasource=default&var-server=registry.rdoproject.org.rdocloud&var-inter=$__auto_interval&from=now%2FM&to=now19:24
* myoung|ruck puts serious hat on for a sec19:24
mwhahahawhere does one acquire a serious hat19:24
mwhahahadid serious cat sell it to you19:25
myoung|ruckdmsimard|off: are there tweakable params to increase timeouts for lengthy i/o operations on the docker registry side?  so even with slow i/o we don't time out and 500/504/faceplant but just take longer to complete the push?19:25
mwhahahai'm assuming the registry software is not that tunable19:26
dmsimard|offmyoung|ruck: we're talking about timeouts but we're not even sure what's the real issue19:26
dmsimard|offmyoung|ruck: let's do housekeeping first -- delete the cruft and etc, then see if things improve19:26
myoung|ruckthis seems to hit (at least from the builds analyzed so far) 1-2 containers per job.  Or I'm curious if it's worse...but since a bunch of container layers were pushed by the previous job (with the same hash/tag being used...as was the case on the string of master jobs) there's a "try, try again" going on...where we don't push layers already pushed (previously)19:26
dmsimard|offwe also need to update openshift at some point19:26
myoung|ruckdmsimard|off: ack, I don't have cycles today to dive in, but is the docker registry log anywhere we can get at it?19:31
weshaydang19:31
weshayDO NOT PISS OFF THE EVILIEN19:31
dmsimard|offmyoung|ruck: folks from the infrastructure core team can access it on a need basis19:32
myoung|ruck... ok.  curious what --max-concurrent-uploads is set to and if it would help19:32
trozetmyoung|ruck: hey, i think we have another problem with host/container package mismatch19:34
myoung|ruckand what's in the registry logs behind the 500's19:34
trozetmyoung|ruck: i suspect there is also an issue with libvirt packages, for the containerized libvirt19:34
gouthamrhi mwhahaha, i reported a bug on tripleo that i plan on fixing, post filing, i see a message saying that i need to be added to the LP group to manipulate the LP fields..19:35
mwhahahagouthamr: i can also triage it. You should be able to assign it to yourself at least19:35
myoung|rucktrozet: could you please update https://bugs.launchpad.net/tripleo/+bug/1771602 with the details of what you're seeing, or if it's already in LP link it to that RFE tracker?19:35
openstackLaunchpad bug 1771602 in tripleo "RFE: detect and warn when package versions in bare metal vs. container don't match" [Medium,Triaged]19:35
mwhahahagouthamr: if you aren't planning on doing a bunch of tripleo bug work, i don't think it's necessary to get yourself added to LP groups. Also we're working on moving off of LP19:36
gouthamrmwhahaha: thanks, here it is: https://bugs.launchpad.net/tripleo/+bug/1771656 added my conclusions on the report and assigned it to myself19:36
openstackLaunchpad bug 1771656 in tripleo "[manila] Dell/EMC backends require value for share_backend_name " [Undecided,New] - Assigned to Goutham Pacha Ravi (gouthamr)19:36
trozetmyoung|ruck: yeah.  I need to confirm my suspicion first, but will do19:36
mwhahahagouthamr: done19:36
myoung|rucktrozet: thanks!19:37
trozetmyoung|ruck: if i kill the nova libvirt container, start libvirtd on the host, and do the command its all good19:39
gouthamrmwhahaha: nice, thank you.. i am a newbie here, and will be working with the storage projects (mainly manila) with abishop_19:39
trozetmyoung|ruck: i think this is a blocker...im not sure how easy it is going to be to get these versions to match..19:40
trozetmyoung|ruck: pacemaker you can workaround..but libvirt is tied to qemu/kernel19:40
openstackgerritGoutham Pacha Ravi proposed openstack/puppet-tripleo master: Remove share_backend_name from Dell-EMC manila backends  https://review.openstack.org/56894519:44
myoung|rucktrozet: is it an actual inside/outside container libvirt package version mismatc?19:46
myoung|ruckmismatch*19:46
trozetmyoung|ruck: yeah19:47
* myoung|ruck nods19:47
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: execute the build / install of zuul changes in undercloud-upgrade  https://review.openstack.org/56894619:47
trozetmyoung|ruck: well the weird part is in the container if i do rpm -qa | grep libvirt theres nothing listed..19:47
trozetmyoung|ruck: but if i do virsh --version it shows me the version is 3.219:47
trozetmyoung|ruck: and version on host is 3.919:47
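The kind of check the RFE in bug 1771602 asks for could look roughly like this. It is only a sketch under stated assumptions: versions are plain dotted numbers such as the 3.2 and 3.9 reported above (real RPM version strings are more involved), the version maps are collected by some other means, and `parse_version` and `find_mismatches` are hypothetical helpers.

```python
def parse_version(version):
    """Parse a simple dotted version like '3.9' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def find_mismatches(host, container):
    """Return {name: (host_version, container_version)} where they differ.

    Only components present on both sides are compared.
    """
    mismatches = {}
    for name in sorted(set(host) & set(container)):
        if parse_version(host[name]) != parse_version(container[name]):
            mismatches[name] = (host[name], container[name])
    return mismatches


# Values from the channel: virsh reports 3.2 in the container, 3.9 on the host.
host_versions = {"libvirt": "3.9"}
container_versions = {"libvirt": "3.2"}

for name, (on_host, in_container) in find_mismatches(
    host_versions, container_versions
).items():
    print(f"WARNING: {name} host={on_host} container={in_container}")
```

Taking the version from the tool itself rather than from the package database matters here, since `rpm -qa` listed no libvirt package inside the container.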
*** wolverineav has quit IRC19:52
*** slaweq has quit IRC19:52
openstackgerritRonelle Landy proposed openstack/python-tripleoclient master: DNM Testing releases script with zuul change  https://review.openstack.org/56893119:52
*** slaweq has joined #tripleo19:52
*** holser__ has joined #tripleo19:53
*** dparkes has joined #tripleo19:54
*** jcoufal has joined #tripleo19:56
*** slaweq_ has joined #tripleo19:57
*** slaweq has quit IRC19:57
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: Actually print the error during deployment fail  https://review.openstack.org/56870719:59
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: overcloud plan deployment status  https://review.openstack.org/56434119:59
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: overcloud plan deployment failures  https://review.openstack.org/56867319:59
*** jcoufal_ has quit IRC19:59
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates master: Add ability to pre-assign IPs by role on ctlplane  https://review.openstack.org/56850520:00
*** zshi has quit IRC20:09
*** ooolpbot has joined #tripleo20:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097220:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154920:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163420:10
*** ooolpbot has quit IRC20:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)20:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)20:10
openstackLaunchpad bug 1771634 in tripleo "periodic: container build jobs are failing when pushing to rdo registry (500, 504, read timeout)" [Critical,Triaged] - Assigned to Matt Young (halcyondude)20:10
*** pchavva has quit IRC20:10
myoung|rucktrozet: thanks for the update/details of libvirt pain in https://bugs.launchpad.net/tripleo/+bug/177160220:11
openstackLaunchpad bug 1771602 in tripleo "RFE: detect and warn when package versions in bare metal vs. container don't match" [Medium,Triaged]20:11
*** liverpooler has quit IRC20:11
*** dougbtv_ has joined #tripleo20:11
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci master: Add CLI argument parser and YAML file parser  https://review.openstack.org/56793620:15
*** moshele has quit IRC20:17
*** dougbtv_ has quit IRC20:17
openstackgerritNir Magnezi proposed openstack/tripleo-common master: Make lb-mgmt-subnet a class B subnet  https://review.openstack.org/56808920:19
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Add basics for standalone node  https://review.openstack.org/56641920:19
openstackgerritNir Magnezi proposed openstack/tripleo-heat-templates master: Make lb-mgmt-subnet a class B subnet  https://review.openstack.org/56813820:21
*** asbishop has joined #tripleo20:24
openstackgerritJames Slagle proposed openstack/tripleo-common master: Ignore errors when checking result of previous deployments  https://review.openstack.org/56895520:24
mwhahahaweshay, myoung|ruck: is scenario000 broken20:25
*** gbarros has joined #tripleo20:26
* myoung|ruck looks with his third eye...and the other 2 as well20:26
*** abishop_ has quit IRC20:27
mwhahahaEmilienM: http://logs.openstack.org/51/567951/3/check/tripleo-ci-centos-7-undercloud-containers/c88f50b/logs/undercloud/home/zuul/undercloud_install.log.txt.gz#_2018-05-16_18_00_35 that's a special error20:27
EmilienMlooking20:28
*** dprince has quit IRC20:28
mwhahahaI assume it's https://review.openstack.org/#/c/567951/3/tripleoclient/v1/tripleo_deploy.py@26420:30
*** tiswanso has quit IRC20:31
*** raildo has quit IRC20:31
EmilienMmwhahaha: something with  _set_roles_file, one sec20:32
mwhahahamaybe a bad rebase from the tht_render thing20:33
weshaymwhahaha, last two failed, however today it's at 95%20:33
myoung|ruckmwhahaha: time flew, i have a hard stop -2 min ago, but back in 80 min.  a quick look at http://cistatus.tripleo.org/#tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates shows a few check jobs have hit an issue...before i dash (in -3min now) are you looking at a specific patch?20:33
mwhahahamyoung|ruck: just noticed two check failures on patches that didn't touch overcloud20:33
mwhahahait can wait until later20:33
weshay"msg": "Overcloud minor update execution step failed..."}20:33
myoung|ruckmwhahaha: k will look when back20:34
mwhahahayea when i poked at it the ansible said no hosts20:34
weshayI swear I'll never have an update/upgrade job voting20:34
EmilienMmwhahaha: I wonder if we need to return roles_data = None if there is no roles_file20:35
EmilienMwell you return "return self.roles_data" anyway so it's not it20:35
weshaymwhahaha, it properly fails on | 568736,220:36
weshaythe other failure is on rlandy's patch but it shouldn't fail there20:37
weshayyou are looking at https://review.openstack.org/56894620:37
mwhahahahttps://review.openstack.org/#/c/568378/20:37
mwhahahahttps://review.openstack.org/#/c/567951/20:38
mwhahahathose two20:38
mwhahahaneither touches overcloud bits20:38
weshayya.. I suspect https://review.openstack.org/#/c/568680/20:40
* weshay runs20:40
EmilienMso20:41
*** radeks has quit IRC20:41
EmilienMhttps://review.openstack.org/#/c/568347/ needs to be in the mistral container20:41
EmilienMwhich will happen when we have a promotion20:41
mwhahahano should be fine now because we're updating containers20:42
mwhahahawe reverted those bits20:42
mwhahahaor are we not checking that in the 000 job20:42
EmilienMwe don't update undercloud containers when undercloud is containerized20:43
EmilienMwhich is the case of fs00120:43
mwhahahais fs001 used by scenario000?20:43
EmilienMno no20:43
* mwhahaha left his magic decoder ring at home20:43
EmilienM:D20:43
*** moshele has joined #tripleo20:44
*** ansmith has quit IRC20:44
*** gbarros has quit IRC20:45
openstackgerritMarius Cornea proposed openstack/tripleo-upgrade master: DNM: Stop openstack services before undercloud upgrade  https://review.openstack.org/56866720:45
*** moshele has quit IRC20:46
*** rbowen has quit IRC20:47
stevebakermorning20:48
EmilienMyo20:48
*** holser__ has quit IRC20:51
*** wolverineav has joined #tripleo20:54
*** salmankhan has joined #tripleo20:56
*** gfidente|afk has quit IRC20:57
*** slaweq_ has quit IRC20:59
*** slaweq has joined #tripleo20:59
*** lblanchard1 has quit IRC21:00
*** salmankhan has quit IRC21:01
openstackgerritAlex Schultz proposed openstack/python-tripleoclient master: Update HostnameMap generation  https://review.openstack.org/56795121:01
weshaymwhahaha, so.. what to do w/ scen00021:02
mwhahahafigure out how it broke21:02
weshayk21:02
weshaythe only repo that was not gated was tripleo-upgrade21:02
weshaybut obviously we still could have missed something21:02
mwhahahaweshay: oh and switch it to non-voting21:04
*** trown is now known as trown|outtypewww21:05
*** mcornea has quit IRC21:05
slaglehmm, guess my revert https://review.openstack.org/#/c/559926 won't pass without some other change also reverted21:05
* mwhahaha digs up the non-voting patch for scenario00021:06
*** jcoufal_ has joined #tripleo21:07
*** dxiri_ has joined #tripleo21:07
openstackgerritAlex Schultz proposed openstack-infra/tripleo-ci master: Revert "Revert "Disable voting on tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates""  https://review.openstack.org/56896221:08
*** dxiri has quit IRC21:09
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097221:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154921:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163421:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)21:10
*** jcoufal has quit IRC21:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)21:10
openstackLaunchpad bug 1771634 in tripleo "periodic: container build jobs are failing when pushing to rdo registry (500, 504, read timeout)" [Critical,Triaged] - Assigned to Matt Young (halcyondude)21:10
weshaymwhahaha, lovely21:12
*** tiswanso has joined #tripleo21:12
weshaymwhahaha, while ur in the revert kinda mood21:13
weshayanything come to mind re: multinode-containers for queens.. timing out... looks like the deployment never starts21:13
*** itlinux has quit IRC21:13
weshayI have a local recreate.. networking looks ok21:13
*** Goneri has quit IRC21:14
*** olap has joined #tripleo21:16
*** tiswanso has quit IRC21:16
*** olap has quit IRC21:20
*** asbishop has quit IRC21:21
mwhahahameh we moved it21:28
* mwhahaha goes and digs it back up21:28
openstackgerritAlex Schultz proposed openstack-infra/tripleo-ci master: Revert "Revert "Disable voting on tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates""  https://review.openstack.org/56896221:29
mwhahahathere we go21:30
* mwhahaha wanders off for a bit21:30
*** slaweq has quit IRC21:33
*** slaweq has joined #tripleo21:34
openstackgerritAlex Schultz proposed openstack-infra/tripleo-ci master: Revert "Revert "Disable voting on tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates""  https://review.openstack.org/56896221:34
mwhahahaI created a bug https://bugs.launchpad.net/tripleo/+bug/177168621:34
openstackLaunchpad bug 1771686 in tripleo "tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates failing on update because of no hosts" [Critical,Triaged]21:34
openstackgerritAde Lee proposed openstack/tripleo-upgrade master: Add config_change role  https://review.openstack.org/56730021:36
*** agopi has joined #tripleo21:38
*** ansmith has joined #tripleo21:38
*** d0ugal_ has joined #tripleo21:40
*** d0ugal has quit IRC21:41
openstackgerritJames Slagle proposed openstack/tripleo-common master: Revert "TLS by default for the overcloud"  https://review.openstack.org/56896421:45
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates master: Revert "Switch public endpoints to use FQDNs by default"  https://review.openstack.org/56889921:45
*** itlinux has joined #tripleo21:51
*** leitan has joined #tripleo21:55
*** leitan has quit IRC22:01
*** leitan has joined #tripleo22:01
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Support networkx 2.0  https://review.openstack.org/50652422:03
*** wolverineav has quit IRC22:05
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097222:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154922:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163422:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177168622:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)22:10
openstackLaunchpad bug 1771634 in tripleo "periodic: container build jobs are failing when pushing to rdo registry (500, 504, read timeout)" [Critical,Triaged] - Assigned to Matt Young (halcyondude)22:10
openstackLaunchpad bug 1771686 in tripleo "tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates failing on update because of no hosts" [Critical,Triaged]22:10
*** ssbarnea_ has quit IRC22:14
*** pliu has quit IRC22:14
*** pabelanger has quit IRC22:15
*** aputtur has quit IRC22:15
*** haleyb has quit IRC22:15
*** hewbrocca_afk has quit IRC22:15
*** mburned has quit IRC22:16
*** rasca has quit IRC22:16
*** rnoriega has quit IRC22:16
*** markmc has quit IRC22:16
*** jjoyce has quit IRC22:16
*** akrzos has quit IRC22:16
*** lhinds has quit IRC22:16
*** faceman has quit IRC22:16
*** jschlueter has quit IRC22:17
*** myoung|ruck has quit IRC22:17
*** weshay has quit IRC22:17
*** slaweq has quit IRC22:21
*** aputtur has joined #tripleo22:22
*** hewbrocca_afk has joined #tripleo22:22
*** mburned has joined #tripleo22:22
*** weshay has joined #tripleo22:22
*** dxiri_ has quit IRC22:25
*** rcernin has joined #tripleo22:25
*** dxiri has joined #tripleo22:25
*** weshay has quit IRC22:26
*** hewbrocca_afk has quit IRC22:26
*** mburned has quit IRC22:27
*** aputtur has quit IRC22:27
*** rlandy is now known as rlandy|bbl22:28
*** rajinir has quit IRC22:28
*** andreaf has quit IRC22:29
*** andreaf has joined #tripleo22:29
*** itlinux has quit IRC22:33
*** jschlueter has joined #tripleo22:34
*** weshay has joined #tripleo22:34
*** pabelanger has joined #tripleo22:34
*** mburned has joined #tripleo22:35
*** faceman has joined #tripleo22:35
*** myoung has joined #tripleo22:35
*** akrzos has joined #tripleo22:35
*** pliu has joined #tripleo22:35
*** lhinds has joined #tripleo22:35
*** rasca has joined #tripleo22:35
*** aputtur has joined #tripleo22:36
*** jjoyce has joined #tripleo22:36
*** agopi has quit IRC22:37
*** rnoriega has joined #tripleo22:37
*** haleyb has joined #tripleo22:37
*** agopi has joined #tripleo22:38
*** hewbrocca_afk has joined #tripleo22:39
*** d0ugal__ has joined #tripleo22:40
*** markmc has joined #tripleo22:41
*** d0ugal_ has quit IRC22:41
*** dougbtv_ has joined #tripleo22:47
openstackgerritMerged openstack/tripleo-validations master: Add validation for checking roles against flavors  https://review.openstack.org/56229622:52
openstackgerritMerged openstack/python-tripleoclient master: Fix hiera data override file writing  https://review.openstack.org/56881822:59
openstackgerritMerged openstack/tripleo-validations stable/pike: Remove unused tox_install.sh  https://review.openstack.org/56727222:59
openstackgerritMerged openstack/tripleo-validations stable/pike: Validate that there should not be XFS volumes with ftype=0  https://review.openstack.org/56469822:59
openstackgerritMerged openstack/tripleo-quickstart-extras master: Populate /etc/yum/vars/contentdir  https://review.openstack.org/56870122:59
*** dougbtv_ has quit IRC23:00
openstackgerritMerged openstack-infra/tripleo-ci master: Add python script to dynamically compose releases  https://review.openstack.org/56752123:05
*** ooolpbot has joined #tripleo23:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177097223:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177154923:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177163423:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177168623:10
openstackLaunchpad bug 1770972 in tripleo "CI: Images introspection fails in OVB jobs" [Critical,Triaged] - Assigned to Derek Higgins (derekh)23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177169223:10
*** ooolpbot has quit IRC23:10
openstackLaunchpad bug 1771549 in tripleo "Containers multinode jobs fails on stable queens with overcloud deploy timeout" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)23:10
openstackLaunchpad bug 1771634 in tripleo "periodic: container build jobs are failing when pushing to rdo registry (500, 504, read timeout)" [Critical,Triaged] - Assigned to Matt Young (halcyondude)23:10
openstackLaunchpad bug 1771686 in tripleo "tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates failing on update because of no hosts" [Critical,Triaged]23:10
openstackLaunchpad bug 1771692 in tripleo "hubbot check jobs are timing out on OC deploy" [Critical,Triaged]23:10
*** slaweq has joined #tripleo23:10
*** pmannidi has joined #tripleo23:14
*** slaweq has quit IRC23:15
*** lhinds has quit IRC23:16
*** pliu has quit IRC23:16
*** rnoriega has quit IRC23:16
*** mburned has quit IRC23:16
*** pabelanger has quit IRC23:17
*** markmc has quit IRC23:17
*** weshay has quit IRC23:17
*** hewbrocca_afk has quit IRC23:17
*** aputtur has quit IRC23:17
*** rasca has quit IRC23:17
*** akrzos has quit IRC23:17
*** myoung has quit IRC23:17
*** jschlueter has quit IRC23:18
*** haleyb has quit IRC23:18
*** faceman has quit IRC23:18
*** jjoyce has quit IRC23:18
*** leitan has quit IRC23:19
*** nyechiel_ has quit IRC23:20
*** aputtur has joined #tripleo23:20
*** pabelanger has joined #tripleo23:20
*** rnoriega has joined #tripleo23:21
*** weshay has joined #tripleo23:21
*** rasca has joined #tripleo23:21
*** haleyb has joined #tripleo23:21
*** akrzos has joined #tripleo23:21
*** pliu has joined #tripleo23:21
*** myoung has joined #tripleo23:23
*** mburned has joined #tripleo23:23
*** faceman has joined #tripleo23:23
*** jschlueter has joined #tripleo23:23
*** lhinds has joined #tripleo23:23
*** markmc has joined #tripleo23:24
*** jjoyce has joined #tripleo23:24
*** hewbrocca_afk has joined #tripleo23:24
*** gvrangan has joined #tripleo23:34
*** tosky has quit IRC23:34
*** jcoufal has joined #tripleo23:42
*** moshele has joined #tripleo23:44
*** jcoufal_ has quit IRC23:46
*** jcoufal has quit IRC23:47
mwhahahaweshay: looks like tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates is ok, there was just a gap of broken stuff23:48
* mwhahaha abandons the revert23:48
mwhahahafrom ~16:02 to 19:30 it was failing23:48
mwhahahait's been green since about 19:16 or so23:48
*** olap has joined #tripleo23:51
*** dxiri has quit IRC23:52
*** dxiri has joined #tripleo23:53
*** olap has quit IRC23:56
*** myoung is now known as myoung|ruck23:56
*** dxiri has quit IRC23:57
