Thursday, 2018-07-19

*** med_ has quit IRC00:07
jtcressyCan I force-delete a stack? I cant get this thing to delete no matter how many times I try.00:07
jtcressythere are no instances in "openstack server list" but it still fails saying that It cant delete an instance. Why does it keep trying to delete an instance that doesn't exist??? wtf!00:08
*** toure is now known as toure|gone00:09
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332500:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216500:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243800:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)00:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)00:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]00:10
openstackgerritSagi Shnaidman proposed openstack/tripleo-common master: Support for ARA report for ansible playbooks in deploy  https://review.openstack.org/56507700:10
openstackgerritSagi Shnaidman proposed openstack/python-tripleoclient master: Support ARA report tracking from command line  https://review.openstack.org/58379900:12
openstackgerritSagi Shnaidman proposed openstack/tripleo-common master: Support for ARA report for ansible playbooks in deploy  https://review.openstack.org/56507700:12
*** thrash is now known as thrash|g0ne00:13
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Collect overcloud statistics with ARA  https://review.openstack.org/57846200:15
jtcressyOn every resource in my stack after attempting deletion: "Stack DELETE cancelled"00:16
*** moshele has joined #tripleo00:16
jtcressywhy? wtf?00:16
*** honza has joined #tripleo00:20
*** honza is now known as Guest7015400:20
*** Guest70154 is now known as honza_00:21
*** itlinux_ has joined #tripleo00:22
*** itlinux has quit IRC00:23
*** jtcressy has quit IRC00:25
*** jtcressy has joined #tripleo00:26
jtcressymwhahaha: is there any way for me to forcefully delete a failed heat stack? it keeps instantly failing every time I try to delete it00:26
jtcressy"openstack stack failures list overcloud" shows me a list of resources that ALL say "DELETE aborted (user triggered cancel)". I triggered no such thing. I keep running "openstack stack delete overcloud -y" repeatedly to no avail.00:27
*** lblanchard has quit IRC00:28
jillrjtcressy: he's on pto.  there is not a stack force delete though,00:28
jillrone way I've seen that sort of thing, if any resources were deleted manually the stack can get into a state where it can't cascade through all the substacks and resources correctly, and you end up with an undeleteable stack.00:29
jtcressyI did not delete any of these resources manually. I've only been running the stack delete command and nothing else00:30
jillryou can attempt database surgery to identify what's in a bad state and nudge things along00:30
jillrk, that's just one way I've seen it, as a example.00:30
jtcressyeach time i check what the failures are it seems to be different resources. I cant pin it down.00:30
jillrit's probably fastest and easier to redeploy the undercloud if this is a test/poc cloud,00:30
jillror a great opp to learn heat troubleshooting, depending on how you want to look at it?  :)00:31
jtcressyI cant even begin to understand why it would cancel all of these deletions. I cant find a single resource that doesn't say "cancelled"00:31
jillrheat logs might help, or you can run --debug with your openstack client cmd00:32
jtcressydoes this make any sense? https://hastebin.com/raw/orovuwehaj00:34
jillrcouple bugs that could be related: https://bugzilla.redhat.com/show_bug.cgi?id=1568578, https://bugzilla.redhat.com/show_bug.cgi?id=157138400:37
openstackbugzilla.redhat.com bug 1568578 in rhosp-director "Deleting the OC stack occasionally fails" [Unspecified,Closed: duplicate] - Assigned to rhos-maint00:37
openstackbugzilla.redhat.com bug 1571384 in openstack-ironic "libvirt errors are causing virtualbmc power operations to fail, resulting in failed deployments when using virtualbmc" [High,Closed: duplicate] - Assigned to ietingof00:37
jillris vbmc working for your OC nodes?00:37
jtcressyvbmc?00:37
jillrare you deploying with vms?00:38
jtcressyno i'm on bare metal00:38
jtcressyall R710's with iDRAC 600:38
jillrok, s/vbmc/ipmi then00:38
jtcressyheat isn't trying to delete the instances or anything... it keeps getting stuck on the resources I show in that hastebin above.00:39
jtcressyI dont think it's a nova issue.00:40
jillrthe nested stacks are intertwined with each other.  you're going to need to trace out what resource it's trying to act on when it fails, and what caused that action to fail.00:42
jillrdebug logs should be helpful, so you can see what api call is being made when it happens00:42
jtcressyI hit a dead end on this resource: "| ServiceChain  | eeb871e8-c3c6-4d7a-90bd-4e56da023d32 | OS::Heat::ResourceChain | DELETE_FAILED   | 2018-07-18T23:37:37Z |"00:45
jtcressy"openstack stack resource list eeb871e8-c3c6-4d7a-90bd-4e56da023d32" gives me no output.00:45
*** artom has quit IRC00:49
jtcressyOk.... so i guess running "openstack stack delete overcloud -y" repeatedly eventually WILL delete the stack. I just had to attempt it over 150 times over the course of an hour or so.00:49
*** itlinux_ has quit IRC00:49
jtcressyopenstack stack list now comes up empty00:49
jtcressymaybe I can write a brute-forcing script that will repeat this until the stack is deleted. it will be handy next time this happens.00:50
*** noslzzp has quit IRC00:58
*** mburned has quit IRC01:04
*** haleyb has quit IRC01:05
*** jtcressy has quit IRC01:06
*** noslzzp has joined #tripleo01:10
*** ooolpbot has joined #tripleo01:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332501:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)01:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216501:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243801:10
*** ooolpbot has quit IRC01:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)01:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]01:10
*** haleyb has joined #tripleo01:13
*** mburned has joined #tripleo01:13
*** itlinux has joined #tripleo01:17
*** mrsoul` has joined #tripleo01:21
*** mrsoul_ has joined #tripleo01:21
*** Petersingh has joined #tripleo01:21
*** mrsoul has quit IRC01:23
*** mschuppert has quit IRC01:24
*** med_ has joined #tripleo01:35
*** med_ has quit IRC01:35
*** med_ has joined #tripleo01:35
*** Petersingh is now known as Petersingh|afk01:39
*** agopi has joined #tripleo01:42
*** rbrady has quit IRC01:45
*** yamahata has quit IRC01:46
EmilienMcan someone look at https://review.openstack.org/#/c/569153/ please01:51
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Remove --use-heat usage, as it's deprecated  https://review.openstack.org/58153401:51
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Check container health as part of the deploy  https://review.openstack.org/56915301:52
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates master: Limit deploy health checks to paunch managed ones  https://review.openstack.org/58152901:52
*** mcornea has quit IRC01:53
*** ramishra has joined #tripleo01:54
*** lblanchard has joined #tripleo01:54
*** Petersingh|afk is now known as Petersingh01:54
openstackgerritMerged openstack/puppet-tripleo stable/queens: remove scenario005 from experimental  https://review.openstack.org/58368501:57
*** ramishra has quit IRC02:01
*** jaganathan has joined #tripleo02:06
*** mdnadeem has joined #tripleo02:10
*** ooolpbot has joined #tripleo02:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332502:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)02:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216502:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243802:10
*** ooolpbot has quit IRC02:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)02:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]02:10
*** agopi has quit IRC02:17
*** itlinux has quit IRC02:19
*** shreshtha has quit IRC02:27
*** ramishra has joined #tripleo02:29
openstackgerritTuan Do Anh proposed openstack/tripleo-common master: Fix typo of function naming conventions in parameters.py  https://review.openstack.org/58117802:35
*** moshele has quit IRC02:36
*** psachin` has joined #tripleo02:37
*** lblanchard has quit IRC02:43
*** ramishra has quit IRC02:46
*** jaganathan has quit IRC02:48
openstackgerritMerged openstack-infra/tripleo-ci master: Read featureset variable as string value  https://review.openstack.org/58302202:59
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Set common vars at vars/common.yaml  https://review.openstack.org/58288503:08
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take featureset out of TOCI_JOBTYPE  https://review.openstack.org/58238403:08
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take environment_type out of TOCI_JOBTYPE  https://review.openstack.org/58238503:08
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take nodes out of TOCI_JOBTYPE  https://review.openstack.org/58238603:08
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take periodic and dryrun out of TOCI_JOBTYPE  https://review.openstack.org/58238703:08
*** ooolpbot has joined #tripleo03:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332503:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)03:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216503:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243803:10
*** ooolpbot has quit IRC03:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)03:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]03:10
*** Petersingh is now known as Petersingh|afk03:18
*** psahoo has joined #tripleo03:56
*** Petersingh|afk is now known as Petersingh03:58
*** udesale has joined #tripleo04:02
*** links has joined #tripleo04:08
*** Haresh has joined #tripleo04:09
*** ooolpbot has joined #tripleo04:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332504:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)04:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216504:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243804:10
*** ooolpbot has quit IRC04:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)04:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]04:10
*** holser_ has joined #tripleo04:11
*** med_ has quit IRC04:11
*** pdeore has joined #tripleo04:17
*** pdeore has quit IRC04:17
*** rh-jelabarre has quit IRC04:19
*** ccamacho has quit IRC04:21
*** khyr0n has joined #tripleo04:22
*** eck` is now known as eck`gone04:23
*** pcaruana has joined #tripleo04:23
*** ramishra has joined #tripleo04:28
*** karthiks has quit IRC04:28
*** shreshtha has joined #tripleo04:31
*** pcaruana has quit IRC04:34
*** Haresh has quit IRC04:39
*** ykarel has joined #tripleo04:49
*** mdnadeem has quit IRC04:55
*** mdnadeem has joined #tripleo04:56
*** skramaja has joined #tripleo04:57
*** pdeore has joined #tripleo04:59
*** mpjetta has quit IRC05:03
*** holser_ has quit IRC05:04
*** mdnadeem has quit IRC05:06
*** ooolpbot has joined #tripleo05:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332505:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)05:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216505:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243805:10
*** ooolpbot has quit IRC05:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)05:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]05:10
*** Petersingh is now known as Petersingh|bomga05:15
*** mpjetta has joined #tripleo05:21
*** rrubins has quit IRC05:22
*** moshele has joined #tripleo05:23
*** ccamacho has joined #tripleo05:25
*** quiquell|off is now known as quiquell05:26
*** moshele has quit IRC05:27
*** flwang has quit IRC05:29
*** mnasiadka_ has joined #tripleo05:29
*** mnasiadka has quit IRC05:29
*** NobodyCam has quit IRC05:29
*** mnasiadka_ is now known as mnasiadka05:29
*** NobodyCam has joined #tripleo05:29
*** mgkwill_ has joined #tripleo05:30
*** gregwork_ has joined #tripleo05:30
*** mwhahaha has quit IRC05:30
*** portdirect has quit IRC05:30
*** mgkwill has quit IRC05:30
*** mwhahaha has joined #tripleo05:30
*** mgkwill_ is now known as mgkwill05:30
*** portdirect has joined #tripleo05:30
*** colonwq has quit IRC05:31
*** morazi has quit IRC05:31
*** hamzy has quit IRC05:31
*** gregwork has quit IRC05:31
*** gregwork_ is now known as gregwork05:31
*** portdirect is now known as Guest7295205:32
*** hamzy has joined #tripleo05:36
*** mdnadeem has joined #tripleo05:37
openstackgerritQuique Llorente proposed openstack/python-tripleoclient master: [WIP] Learn --with-ara  https://review.openstack.org/58386105:37
*** rrubins has joined #tripleo05:38
openstackgerritQuique Llorente proposed openstack/tripleo-quickstart master: [DNM] Use --with-ara for featureset010  https://review.openstack.org/58355705:40
*** colonwq has joined #tripleo05:45
*** morazi has joined #tripleo05:45
*** flwang has joined #tripleo05:47
*** threestrands has quit IRC05:49
*** cshastri has joined #tripleo05:50
*** dparkes has joined #tripleo05:55
*** janki has joined #tripleo05:56
*** mdnadeem has quit IRC05:57
*** ratailor has joined #tripleo05:57
*** noslzzp has quit IRC05:59
*** mdnadeem has joined #tripleo05:59
*** hamdyk has joined #tripleo06:08
*** threestrands has joined #tripleo06:09
*** threestrands has quit IRC06:09
*** threestrands has joined #tripleo06:09
*** ooolpbot has joined #tripleo06:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION06:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332506:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216506:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243806:10
*** ooolpbot has quit IRC06:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)06:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)06:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]06:10
*** paramite has joined #tripleo06:11
*** jtcressy has joined #tripleo06:14
*** agopi has joined #tripleo06:17
bandiniSimple backport if anyone is around https://review.openstack.org/58310706:20
*** udesale_ has joined #tripleo06:23
*** udesale has quit IRC06:25
*** gkadam has joined #tripleo06:26
*** paramite has quit IRC06:27
*** paramite has joined #tripleo06:28
*** agurenko has joined #tripleo06:30
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates master: Enable logging to stdout/stderr in memcached  https://review.openstack.org/58334406:31
*** jtcressy has quit IRC06:32
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Take periodic and dryrun out of TOCI_JOBTYPE  https://review.openstack.org/58238706:32
openstackgerritMerged openstack/tripleo-heat-templates master: Check container health as part of the deploy  https://review.openstack.org/56915306:33
*** ffiore has joined #tripleo06:35
*** jfrancoa has joined #tripleo06:36
*** threestrands has quit IRC06:36
*** pcaruana has joined #tripleo06:37
*** gkadam has quit IRC06:38
*** ramishra has quit IRC06:40
*** ramishra has joined #tripleo06:41
*** yprokule has joined #tripleo06:41
*** paramite has quit IRC06:43
*** moshele has joined #tripleo06:44
*** assassin has joined #tripleo06:48
*** Petersingh|bomga is now known as Petersingh06:48
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: WIP use openshift-ansible container instead of RPMs  https://review.openstack.org/58386806:50
*** udesale__ has joined #tripleo06:51
*** aufi has joined #tripleo06:52
*** mrunge_ has joined #tripleo06:53
*** udesale_ has quit IRC06:54
*** holser_ has joined #tripleo06:54
*** aufi_ has joined #tripleo06:54
*** mrunge has quit IRC06:55
*** paramite has joined #tripleo06:57
*** khyr0n has quit IRC06:58
*** aufi has quit IRC06:58
*** brault has joined #tripleo06:58
*** mrsoul_ is now known as mschuppert06:59
*** cylopez has joined #tripleo06:59
*** cylopez has left #tripleo06:59
*** nyechiel has joined #tripleo07:00
*** moshele has quit IRC07:01
openstackgerritwaleed mousa proposed openstack/puppet-tripleo master: Adding support for VF LAG in SR-IOV  https://review.openstack.org/55841107:01
*** peereb has joined #tripleo07:02
*** bogdando has joined #tripleo07:02
*** openstackgerrit has quit IRC07:04
*** sileht has quit IRC07:04
*** sileht has joined #tripleo07:07
*** dbecker has joined #tripleo07:09
*** amoralej|off is now known as amoralej07:09
*** ramishra has quit IRC07:09
*** ooolpbot has joined #tripleo07:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION07:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332507:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216507:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243807:10
*** ooolpbot has quit IRC07:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)07:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)07:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]07:10
*** paramite has quit IRC07:10
*** ramishra has joined #tripleo07:11
*** shardy has joined #tripleo07:13
*** shardy has quit IRC07:13
*** shardy has joined #tripleo07:13
*** openstackgerrit has joined #tripleo07:13
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: tripleo.sh --repo-setup update ceph to luminous and remove older  https://review.openstack.org/58354707:13
*** paramite has joined #tripleo07:13
*** gfidente has joined #tripleo07:21
*** gfidente has quit IRC07:21
*** gfidente has joined #tripleo07:21
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: tripleo.sh --repo-setup update ceph to luminous and remove older  https://review.openstack.org/58354707:21
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates master: Fix HostnameMap lookup - replace str_replace with yaql  https://review.openstack.org/58247507:24
*** yprokule_ has joined #tripleo07:24
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Move toci_quickstart variables to yaml  https://review.openstack.org/58246607:26
*** yprokule has quit IRC07:27
*** yprokule_ is now known as yprokule07:27
*** ykarel is now known as ykarel|lunch07:27
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: [DNM] To test sprint16 toci refactoring  https://review.openstack.org/58387407:28
*** janki has quit IRC07:29
*** rcernin has quit IRC07:29
*** lvdombrkr has joined #tripleo07:30
*** avivgt|lunch has joined #tripleo07:31
*** noslzzp has joined #tripleo07:33
*** tosky has joined #tripleo07:37
*** florianf has joined #tripleo07:42
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: [WIP] - mistra_engine container added /usr/share volume  https://review.openstack.org/58387707:44
*** janki has joined #tripleo07:44
openstackgerritTuan Do Anh proposed openstack/tripleo-common master: fix tox python3 overrides  https://review.openstack.org/57982707:46
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: tripleo.sh --repo-setup update ceph to luminous and remove older  https://review.openstack.org/58354707:50
*** Petersingh is now known as Petersingh|lunch07:50
*** yprokule_ has joined #tripleo07:52
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: Use global ansible.cfg for nodes-uuid playbook  https://review.openstack.org/58355207:53
*** kopecmartin has joined #tripleo07:53
*** yprokule has quit IRC07:55
*** yprokule_ is now known as yprokule07:55
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: [WIP] - mistra_engine container added /usr/share volume  https://review.openstack.org/58387708:03
*** dtantsur|afk is now known as dtantsur08:07
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332508:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216508:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243808:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)08:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)08:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]08:10
*** gkadam has joined #tripleo08:11
*** gkadam is now known as gkadam-brb08:12
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Enable support for running refstack tests in TQE  https://review.openstack.org/57071908:15
*** pmannidi has quit IRC08:18
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: Use global ansible.cfg for nodes-uuid playbook  https://review.openstack.org/58355208:20
*** derekh has joined #tripleo08:23
*** dtantsur is now known as dtantsur|bbl08:24
*** holser_ has quit IRC08:25
*** ykarel|lunch is now known as ykarel08:30
shardyd0ugal: Hey, can you give any tips on how to trace mistral logs from an action error back to which workflow/task was running the action?08:30
shardyd0ugal: http://logs.openstack.org/53/574753/21/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates/516a10d/logs/undercloud/var/log/containers/mistral/executor.log.txt.gz#_2018-07-18_11_04_12_33608:30
openstackgerritCédric Jeanneret proposed openstack/puppet-tripleo master: Corrected vrrp script for haproxy status  https://review.openstack.org/58388608:30
shardyhere I can see an action failed because it 404'd getting plan-environment.yaml from swift08:30
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Move toci_quickstart variables to yaml  https://review.openstack.org/58246608:30
shardyd0ugal: but it's not clear which workflow/task caused this (so I can figure out why the update workflow lost the plan-environment we copy/create during the initial deploy)08:31
shardyit'd be cool if there was a tree/forest view option which we could dump into the CI logs, so you can see the exact state of the workflow graph08:31
flaper87bogdando: https://review.openstack.org/#/c/583238/ <- you sure it's this patch fault?08:32
flaper87the CI switched to containerized undercloud and now scenario009 is broken08:33
flaper87:(08:33
flaper87want to merge these patches asap08:33
flaper87shardy: mandre https://review.openstack.org/#/c/583238/ pls08:33
bogdandoflaper87: I don't know how to debug that zuul breakage (08:33
bogdandosyntax*08:33
bogdandoflaper87: check experimental fails on that patch08:34
flaper87doh, I just noticed08:34
flaper87T_T08:34
d0ugalshardy: Looking. Maybe we can add that script I wrote to CI, even if it doesn't land into Mistral yet08:34
shardyflaper87: sure but bogdando is right, there's some zuul syntax error badness in the subsequent patches in that series08:35
* shardy tries to understand why08:35
flaper87shardy: uyeah, I just noticed what he meant08:35
flaper87T_T08:35
flaper87sorry for the noise08:35
* flaper87 checks08:36
shardyflaper87: cool, I don't really see what's wrong, the list additions seem reasonable08:36
shardy"Job tripleo-ci-centos-7-scenario005-multinode-oooq not defined"08:36
shardyhmm08:36
flaper87I don't think it's this patche's fault08:36
flaper87lemme re run experimental08:36
flaper87maybe some job definition coming from somewhere else08:37
shardyyeah08:37
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: Run scenario009 for more services  https://review.openstack.org/58323808:37
flaper87rebased it and re-run check experimental08:37
*** Petersingh|lunch is now known as Petersingh08:37
flaper87lets see08:37
openstackgerritCarlos Goncalves proposed openstack/tripleo-heat-templates master: Add scenario010 for testing Octavia  https://review.openstack.org/51833108:37
openstackgerritLuigi Toscano proposed openstack/tripleo-heat-templates master: WIP Deploy Sahara with unversioned endpoints  https://review.openstack.org/58389008:38
mandreflaper87: yeah likely not the patch's fault08:38
d0ugalshardy: The action exectuion ID is a few lines above the error in the logs action_ex_id08:38
d0ugalshardy: http://logs.openstack.org/53/574753/21/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates/516a10d/logs/undercloud/var/log/containers/mistral/executor.log.txt.gz#_2018-07-18_11_04_12_30508:38
mandreit's just that experimental has a tripleo-ci-centos-7-scenario005-multinode-oooq job that is probably never defined08:39
d0ugalshardy: grepping the engine log for that ID finds the workflow trace08:39
d0ugalshardy: http://logs.openstack.org/53/574753/21/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates/516a10d/logs/undercloud/var/log/containers/mistral/engine.log.txt.gz#_2018-07-18_11_04_12_36908:39
shardyd0ugal: ah, thanks, I was looking for the ID in [req-79f774cb-9c70-4499-ba1b-2f92c3b19f71 fa0efb24f6194b28aa741bb374178745 0b39e2fc1663472f9b045854a4770f8308:40
d0ugalshardy: yeah, those IDs are confusing. I don't understand them. I need to figure that out.08:40
shardyd0ugal: yeah be interested to know if you find out, I assumed one of the second IDs was the action execution08:41
*** tesseract has joined #tripleo08:41
shardyOk so tripleo.plan_management.v1.update_deployment_plan calls tripleo.parameters.generate_passwords which blows up because plan-environment is missing08:42
shardyd0ugal: do you happen to know how we maintain the plan-environment.yaml over updates to the plan?08:42
mandreflaper87, shardy, bogdando: tripleo-ci-centos-7-scenario005-multinode-oooq definition was removed in https://review.openstack.org/#/c/581376/08:42
shardyI was expecting it to persist because it's got e.g all the passwords etc in it08:42
shardynot get re-generated every update08:43
flaper87mandre: just rebased my patch08:43
flaper87it should work now08:43
flaper87let's see08:43
bogdandoflaper87: well spotted! thanks)08:43
shardymandre: aha, good catch!08:43
d0ugalshardy: I don't fully remember, but I don't think it is re-generated. I think that workflow will generate the passwords if they are missing08:44
d0ugalI can't think of a reason why it wouldn't exist at that point however.08:44
shardyd0ugal: Ok I'll need to modify the update workflow then - how do we avoid changing the passwords, are those stored somewhere else as well as the plan-environment?08:44
mandreflaper87: still, we need to remove scenario005 from experimental, the job doesn't exist anymore08:45
mandrei'll submit a patch08:45
shardyd0ugal: I assume part of the update cleans the old plan contents, but I incorrectly assumed we saved the plan-environment somewhere along the line08:45
flaper87mandre: wait08:45
flaper87let me do it08:45
flaper87just to avoid a bunch of rebases08:45
d0ugalshardy: Good question. You are testing my memory :)08:46
flaper87also, why wasn't it removed from the experimental queue when the job was removed?08:46
shardyhehe08:46
flaper87there probably is a patch for that already08:46
* shardy passes d0ugal a coffee ;)08:46
flaper87mandre: ^08:46
* shardy looks at the code to figure it out08:46
flaper87mandre: https://review.openstack.org/#/c/583680/08:46
*** agurenko has quit IRC08:46
flaper87there's a patch for it already08:46
*** pradk has joined #tripleo08:47
*** holser_ has joined #tripleo08:48
d0ugalshardy: I think they are only stored in user-environment and generated if missing?08:49
mandreweshay must wonder what happened to his patch ;)08:49
d0ugalshardy: but at one point we did store a copy of what we generted in another location - but I can't find that now.08:49
d0ugalshardy: that was back when we stored everything in Mistral envs08:49
openstackgerritJose Luis Franco proposed openstack-infra/tripleo-ci master: Collect /tmp/ansible-mistral-action into CI job logs.  https://review.openstack.org/58389608:49
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: Run scenario009 for more services  https://review.openstack.org/58323808:49
*** kopecmartin has quit IRC08:49
d0ugalshardy: https://github.com/openstack/tripleo-common/blob/master/tripleo_common/actions/parameters.py#L285-L31508:50
shardyd0ugal: hmm, but there's a huge "passwords" block in the plan-environment08:50
shardyhttps://github.com/openstack/tripleo-common/blob/master/tripleo_common/utils/passwords.py#L5008:50
*** kopecmartin has joined #tripleo08:50
*** agurenko has joined #tripleo08:50
shardyd0ugal: yeah, so it looks like we delete it, then grab any existing passwords from the heat environment08:50
shardysigh.  That kinda breaks my selction of a plan-sample :(08:51
d0ugalshardy: yeah, that was done for the upgrade case (upgrading from no initial plan)08:51
d0ugalI don't know if we still need that, since everyone should have a plan now?08:51
shardyd0ugal: I think we're relying on that, because by the time you update the plan, the old plan-environment is gone08:51
shardybut, luckily, we still have the data in heat08:51
d0ugaloh08:52
shardyI'll have to test to confirm that though08:52
d0ugalshardy: I don't see where the old environment is removed?08:52
d0ugalI see it being loaded and updated, but I might be missing something08:52
openstackgerritMartin André proposed openstack/tripleo-quickstart-extras master: Restrict undercloud resolvers to IPv4 addresses  https://review.openstack.org/58330208:52
openstackgerritDamien Ciabrini proposed openstack/puppet-tripleo master: Prevent triggering firewall actions while configuring HA services  https://review.openstack.org/58364808:54
*** udesale_ has joined #tripleo08:54
shardyTODO(d0ugal): We need to put a more robust strategy in place here to handle updating plans.08:54
shardyhttps://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/v1/overcloud_deploy.py#L37308:54
shardy;)08:54
shardyIIRC we purge the entire container in the client08:54
* shardy looks for where08:54
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: tripleo.sh --repo-setup update ceph to luminous and remove older  https://review.openstack.org/58354708:55
d0ugalshardy: ah, I forgot that tripleoclient dove behind the API. gah08:55
shardyhttps://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/workflows/plan_management.py#L20008:55
openstackgerritwaleed mousa proposed openstack/puppet-tripleo master: Adding support for VF LAG in SR-IOV for Mellanox interfaces  https://review.openstack.org/55841108:56
shardyd0ugal: yeah, I guess for now we could special-case it to save the plan-environment and capabilities-map, but it'd probably be better to have a workflow that does this08:56
d0ugalyup08:56
d0ugalshardy: we should really try and sort out the tripleoclient mess in Stein :)08:56
*** udesale__ has quit IRC08:57
*** agurenko has quit IRC08:57
shardyd0ugal: yeah agreed, but it's going to be risky and a lot of work08:57
d0ugalIndeed08:57
d0ugalbut it just keeps getting worse and causing extra work.08:57
shardymaybe at the PTG we can figure out the steps and attempt it incrementally08:57
shardyyeah08:57
flaper87bogdando: you can now remove your -1 https://review.openstack.org/#/c/583238/ :D08:57
*** salmankhan has joined #tripleo08:57
shardyOk mystery solved - thanks for your help working through it! :)08:57
bogdandooops, I thought it's gone after rebase, flaper8708:58
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Bind mount mistral state for external deployments  https://review.openstack.org/58313608:58
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: W/a kubespray vault install failure  https://review.openstack.org/58313208:58
openstackgerritMarios Andreou proposed openstack-infra/tripleo-ci master: tripleo.sh --repo-setup update ceph to luminous and remove older  https://review.openstack.org/58354708:59
d0ugalshardy: np08:59
*** agurenko has joined #tripleo09:00
mandrebogdando: do you have an idea why scenario009 ran with containerized undercloud for https://review.openstack.org/#/c/583302/ ? I was kinda under the impression we haven't made the switch yet09:02
*** cmyster has quit IRC09:02
bogdandomandre: defaults switched09:02
bogdandoin the client, openstack undercloud install now deploys containerized, --use-heat=False would deploy instack09:03
bogdandohttps://review.openstack.org/#/c/581534/09:03
bogdandomandre: ^^09:03
gfidentesshnaidm|rover if I was marios09:03
gfidenteI'd have killed you on the comment09:03
gfidenteabout erase vs remove09:03
bogdandoand the switch was in https://review.openstack.org/#/c/57621809:04
*** nyechiel has quit IRC09:04
mandrebogdando: ahh, thanks that explains it09:04
sshnaidm|rovergfidente, it's nit, not reason for -109:04
shardybogdando: has much testing been done of upgrading from an instack installed undercloud to the containerized one?09:05
shardyI ask because I tried it yesterday and it didn't work, but that was possibly due to other issues in my environment09:05
bogdandoshardy: that's using instack still09:05
*** Petersingh is now known as Petersingh|afk09:05
gfidentesshnaidm|rover ah sorry, what is the reason for the -1 ?09:06
shardybogdando: I mean we switched the defaults - if someone upgrades then runs "openstack undercloud install" or "openstack undercloud upgrade", the heat based container stuff will run09:06
bogdandoshardy: oh, do you mean UC upgrade, not OC?09:06
shardywhat happens then?09:06
shardybogdando: yeah09:06
sshnaidm|rovergfidente, If I was gfidente, I would talk to people before killing them09:06
bogdandoit's been tested in upstream instack-to-cont-upgrade job for a few months09:06
gfidentesshnaidm|rover oh dear09:07
gfidenteI wasn't thinking about killing you for real09:07
shardybogdando: Ok good to hear, hopefully my issues were a one-off then09:07
gfidenteand if you have something to say to -1, better say it in gerrit so others know09:07
bogdandoshardy: since May 4 , https://trello.com/c/nFbky9Uk/5-upgrade-support-from-instack-undercloud09:07
sshnaidm|rovergfidente, please read comments more carefully09:07
gfidentebut anyway, I hope it's obvious I was joking09:07
gfidentesorry if it wasn't09:07
bogdandohttps://review.openstack.org/#/c/553633/ introced that job09:08
bogdandointroduced09:08
gfidentesshnaidm|rover ok about comments09:08
*** panda|off is now known as panda09:08
gfidenteI see you wrote this "How is that related to tripleo.sh?"09:08
gfidentein a change which is making a change to tripleo.sh09:08
gfidenteI might be overlooking something09:08
gfidentewhat was the real meaning of your comment?09:08
bogdandojfrancoa: hi, so it failed now http://logs.openstack.org/15/583515/2/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/0c803dc/logs/undercloud/home/zuul/undercloud_install.log.txt.gz#_2018-07-18_17_35_00 while was passing the previous patchset09:09
bogdandohow come?.. :(09:09
*** ooolpbot has joined #tripleo09:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION09:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332509:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216509:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243809:10
*** ooolpbot has quit IRC09:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)09:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)09:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]09:10
jfrancoabogdando: I only changed the flag you commented in the patch, instead of adding --use-heat, set containerized_undercloud to true. "openstack undercloud install" should be now the same as "openstack undercloud install --use-heat" right?09:11
bogdandocomparing to http://logs.openstack.org/47/465047/13/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/0507766/ , jfrancoa , let's find out09:11
jfrancoabogdando: that's the only difference09:11
bogdandoI mean the instack job09:11
bogdandoin the original patch09:11
*** holser_ has quit IRC09:11
*** Petersingh|afk is now known as Petersingh09:12
bogdandojfrancoa: never mind, I messed it up09:12
bogdandohte links09:12
bogdandoso the passed instack and overcloud upgrades job was http://logs.openstack.org/47/465047/13/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/0507766/logs/undercloud/home/zuul/undercloud_install.log.txt.gz09:14
bogdandoand failed for the minor refactor on 14 (09:14
*** derekh has quit IRC09:14
*** kopecmartin has quit IRC09:15
bogdandooh, it passed http://logs.openstack.org/47/465047/14/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/3f333a1/ . Ugh, so confusing :D so that's the containerized UC patch failed on the 2nd run09:15
bogdandojfrancoa: yes, you're right wrt that --use-heat change09:15
mariosgfidente: sshnaidm|rover peace :) brothers be cool09:16
mariosgfidente: sshnaidm|rover its all good09:17
mariosgfidente: sshnaidm|rover we are discussing it in oooq too09:17
jfrancoabogdando: yes, I added that modification. but currently by default we install containerized undercloud, right? so --use-heat and not using it should be the same. Or am I missing something?09:17
*** udesale__ has joined #tripleo09:17
*** derekh has joined #tripleo09:17
jfrancoabogdando: ok, I got it. https://review.openstack.org/#/c/583515/2/config/general_config/featureset051.yml@10709:18
bogdandojfrancoa: comparig http://logs.openstack.org/15/583515/1/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/698ee97/logs/undercloud/home/zuul/install-undercloud.log.txt.gz to http://logs.openstack.org/15/583515/2/check/tripleo-ci-centos-7-scenario000-multinode-oooq-container-upgrades/67b9d01/logs/undercloud/home/zuul/install-undercloud.log.txt.gz I can see the deploy command changed, which is not expected09:19
jfrancoabogdando: it should be now "{{ undercloud_templates_path }}/ci/common/net-config-simple-bridge.yaml"09:19
jfrancoabogdando: I am going to change it09:19
bogdandoI can see there is also wrong hiera file09:19
chkumar|rucksileht: Please have a look at this failure https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/legacy-periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset016-master/12091db/logs/tempest.html.gz09:19
*** kopecmartin has joined #tripleo09:19
openstackgerritRabi Mishra proposed openstack/puppet-tripleo master: Check for neutron_plugin_ml2_ansible service when including plugin  https://review.openstack.org/58390009:19
bogdandothe classic one is for instack and should not be used with cont UC09:19
bogdandoseems like a little mess in quickstart left after the defaults switched09:20
*** udesale_ has quit IRC09:20
bogdandoEmilienM: ^^09:20
silehtchkumar|ruck, sure09:20
openstackgerritJose Luis Franco proposed openstack/tripleo-quickstart master: Enable containerized undercloud in scenario000-upgrades.  https://review.openstack.org/58351509:21
jfrancoabogdando: let's see if it passes now ^^^09:21
bogdandojfrancoa: right, well spotted. I need another change bundled with the main patch to alter all overcloud_templates_path to undercloud_09:21
bogdandoin featuresets09:21
jfrancoabogdando: yes, right. we need to change all those references too09:22
*** athomas has quit IRC09:23
sshnaidm|rovergfidente, instead of killing somebody, maybe you can help moire with that bug - do you know where does it happen? jobs, logs..?09:24
gfidentesshnaidm|rover sorry I waste my days killing people09:26
gfidentesshnaidm|rover are you serious?09:26
sshnaidm|rovergfidente, about bug? yes09:30
ccamachohey folks!!!09:30
ccamachohttps://www.youtube.com/watch?v=ZbZSe6N_BXs09:30
ccamachoHappy!09:30
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Set common vars at vars/common.yaml  https://review.openstack.org/58288509:30
ccamacho<3 love for all09:30
ccamachogfidente sshnaidm|rover marios :*09:31
ccamachojfrancoa not for you09:31
ccamachogive me some review love man!09:31
gfidentesshnaidm|rover so to be honest I remeber something close https://review.openstack.org/#/c/576597/09:31
gfidentesshnaidm|rover but I am definitely not oooq expert so I am not sure what can be interferring with that stuff09:31
openstackgerritQuique Llorente proposed openstack/tripleo-heat-templates master: [DNM] Test review  https://review.openstack.org/58317909:32
sshnaidm|rovergfidente, that's oooq patch, according to patch the problem is in infra code09:33
sshnaidm|rovergfidente, *according to bug09:33
gfidentesshnaidm|rover yeah I remember infra pre-installed ceph repos09:33
silehtchkumar|ruck, you can safely recheck, it's a test is not expecting the setup to be so slow09:34
* marios hugs ccamacho and a tree09:34
gfidentesshnaidm|rover this is why tripleo.sh and oooq were removing them I think09:34
chkumar|rucksileht: I will wait for next run then, thanks :-)09:34
silehtccamacho, I proposed a patch to our tempest plugin, the be remove this race when the setup is very slow09:34
mariosccamacho: which review?09:35
ccamachomarios xD09:35
ccamachomaybe this one?09:35
ccamachohttps://review.openstack.org/#/c/581054/09:35
ccamachosimple simple09:35
*** afazekas|pto is now known as afazekas09:39
*** suuuper has joined #tripleo09:41
openstackgerritCédric Jeanneret proposed openstack/tripleo-specs master: Validation Framework specifications.  https://review.openstack.org/58347509:42
Tenguccamacho: hello! any way to get your feedback for this spec? -^^  :)09:43
Tenguas you're apparently working on validations :)09:44
ccamachoTengu yeah I have it right now opened, have a lot of feedback :)09:44
Tenguccamacho: cool! :)09:44
gfidentesshnaidm|rover marios so regarding the bug, I think one of oooq or tripleo.sh is meant to remove the pre-existing centos-release packages09:45
gfidentesshnaidm|rover marios and in that context the tripleo.sh change looks sane to me\09:45
*** shardy has quit IRC09:46
sri_shardy, quick question, in my overcloud deployment instead of using ovs_bonds I've configre linux_bonds with vlans, http://paste.openstack.org/show/726266/, is linux_bond's works out of the box in os-net-config ? is there anything we need to be aware of09:47
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Set common vars at vars/common.yaml  https://review.openstack.org/58288509:47
*** kopecmartin has quit IRC09:47
*** kopecmartin has joined #tripleo09:48
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Take featureset out of TOCI_JOBTYPE  https://review.openstack.org/58238409:48
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Take environment_type out of TOCI_JOBTYPE  https://review.openstack.org/58238509:48
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Take nodes out of TOCI_JOBTYPE  https://review.openstack.org/58238609:48
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Take periodic and dryrun out of TOCI_JOBTYPE  https://review.openstack.org/58238709:49
openstackgerritQuique Llorente proposed openstack-infra/tripleo-ci master: Move toci_quickstart variables to yaml  https://review.openstack.org/58246609:49
mariosccamacho: ack09:50
rnoriegagfidente, hi! don't want to interrupt, just a quick question. What is the tag used for ceph containers in tripleo ci? latest? latest-luminous?09:51
rnoriegagfidente, there is also another one called: build-master-XXX-centos-709:51
rnoriegait's a bit confusing... :-\09:51
gfidenternoriega hey, we pin to known working versions09:55
gfidentelocation depends on the tripleo version, are you asking 'master' ?09:55
*** chem has joined #tripleo09:56
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart-extras master: WIP: do-not-review bundle v2  https://review.openstack.org/58357410:00
*** honza_ is now known as honza10:01
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: use undercloud registry for ceph_namespace in overcloud image prepare  https://review.openstack.org/58160710:01
openstackgerritMerged openstack/puppet-tripleo master: rate limit iptables logging  https://review.openstack.org/58174810:02
gfidentechkumar|ruck regarding https://review.openstack.org/#/c/581607 , is the image actually copied into the undercloud registry?10:03
*** kopecmartin has quit IRC10:04
*** Petersingh is now known as Petersingh|away10:05
*** peereb has quit IRC10:06
*** Petersingh|away has quit IRC10:06
*** kopecmartin has joined #tripleo10:07
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332510:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216510:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243810:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)10:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)10:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]10:10
Tenguflorianf and or ccamacho: maybe I can talk about the "validation framework" during one of your team meeting? So that we can catch interest of the majority, and get good feedback and make a great thing together? :)10:14
florianfTengu: Sounds good. I haven't commented on the spec yet though. Can't do it right now, but later today.10:16
Tenguflorianf: fine for me. When are your meetings?10:17
*** dhill_ has quit IRC10:17
Tengu(and where ;)10:17
*** v1a4 has joined #tripleo10:17
florianfTengu: next one is on monday. but I'm not gonna be present because pto10:19
*** leanderthal has joined #tripleo10:19
Tenguflorianf: ok. Maybe it would be good to wait the next one if you're back, so that we can get some first comment/reviews?10:19
florianfTengu: I'm away next week, so maybe the Monday after that could be good. Plus, this will give others some time to comment. I'll find out if there is anything else scheduled.10:21
Tenguflorianf: great! it's on IRC I get? which channel?10:22
florianfTengu: nope, bluejeans10:22
Tenguof, ok. Care to add me to the event (calendar, whatever)?10:22
rnoriegagfidente, sorry, I was afk. Yes, asking about master...10:23
ccamachowe have our upstream meeting tomorrow Tengu florianf10:23
ccamachomaybe might be a good place10:23
florianfTengu: sure, I'll talk to our tc who has all the calendar powers ;-)10:23
Tenguccamacho: as a first catch, depending on the hour, yep10:23
Tenguflorianf: perfect :).10:24
Tenguccamacho: have you any details? I won't take long anyway, just pointing to the spec so that ppl can at least know about it, read it, and comment out :)10:24
Tenguhmm. *do you have any details - better.10:24
gfidenternoriega here https://github.com/openstack/tripleo-common/blob/master/tripleo_common/image/kolla_builder.py#L3610:27
gfidenteend up in https://github.com/openstack/tripleo-common/blob/master/container-images/overcloud_containers.yaml#L10610:27
*** nyechiel has joined #tripleo10:27
gfidenteci probably uses its own images.yaml though as it customizes prepare10:27
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: Minor updates of pre-provisioned envrionments.  https://review.openstack.org/58391310:29
Tenguso. I'll be back in a while, taking a dip in the lake. need some fresh water :).10:30
rnoriegagfidente, I see, thanks!10:30
gfidenternoriega I guess older tags are not visible in docker.io10:30
gfidentefrom the web interface10:30
*** iranzo has joined #tripleo10:30
*** iranzo has joined #tripleo10:30
ccamachoMeeting to share & talk about Upgrade in upstream and in CI Invite folks from other DFG to share, raise issues : https://etherpad.openstack.org/p/tripleo-upgrade-squad-meeting10:30
gfidentewe should test and bump the version if newer works10:30
jfrancoaccamacho: here you have the upgrades upstream meeting etherpad https://etherpad.openstack.org/p/tripleo-upgrade-squad-meeting10:30
ccamachotomorrow 3:30 CET10:30
ccamachojfrancoa thanks!10:30
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: Minor updates of pre-provisioned envrionments.  https://review.openstack.org/58391310:31
Tenguccamacho: cool, I should be available. Adding the topic.10:31
chkumar|ruckgfidente: I think during undercloud install they might be pulled from upstream to undercloud registery so thought to use it10:31
chkumar|ruckgfidente:  https://review.openstack.org/#/c/549216/ was added earlier it from undercloud registery but removed10:32
rnoriegagfidente, if there is a mapping between openstack version - ceph version. Why not using latest-$ceph_version ??10:32
rnoriegagfidente, at development cycle, of course.10:33
gfidenternoriega basically because neither ceph-container nor ceph-ansible upstream releases are tested (yet?) with tripleo10:35
gfidenternoriega and they broke more frequently than we wanted10:35
gfidenternoriega so while in theory I agree, mapping to latest is a good idea10:35
gfidenternoriega in practice that broke the entire tripleo ci because of issues that nobody in tripleo could work on10:35
gfidenternoriega hence we decided to pin to known working versions and advance them only after they are tested working10:35
rnoriegagfidente, I see, alright.10:36
gfidenternoriega we have DNM submissions to test newer versions of both10:36
rnoriegagfidente, just wanted to understand the pipeline. This is for OPNFV Apex (tripleo) where we use the tag: build-master-luminous-centos-710:36
gfidentehttps://review.openstack.org/#/c/501987/ and https://review.openstack.org/#/c/562213/10:36
rnoriegagfidente, and people are asking about why not using new container images... and not 8 months old ones...10:37
gfidenternoriega do you need to override our pin for particular reasons?10:37
gfidenternoriega yeah we could bump up the tags if the tests pass10:37
*** moshele has joined #tripleo10:37
gfidenternoriega we can try now10:37
rnoriegagfidente, usually, the OPNFV community dictates which versions are meant to be included in a release...10:37
gfidentewhan version of ceph?10:38
rnoriegagfidente, like, Openstack Queens + OpenDaylight Oxygen + Ceph Luminous... etc10:38
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart-extras master: make reproducer bash syntax more portable  https://review.openstack.org/58101210:38
rnoriegagfidente, I'm not sure, I have to ask, but I think it's luminous...10:38
gfidentewell, we obviously point to luminous, the question is what version of luminous10:38
gfidentebut all tags include luminous10:38
rnoriegagfidente, mmmm... specific version, that I don't know.10:39
gfidenteright, so I think you should stick with the tag we tested10:39
rnoriegagfidente, ok!10:39
rnoriegagfidente, thanks for the tip! :-)10:39
gfidentebut we can at the same time bump up v3.0.3 to v3.0.610:40
gfidenteif it passes tripleo/ci10:40
gfidenteI'll add you to the submission which tests this10:40
*** ramishra has quit IRC10:40
gfidentenote that 3.0.3 is the version of the *container image*10:40
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart-extras master: manage-stack: add env variables in info gathering  https://review.openstack.org/58391610:40
gfidentenot the version of ceph itself10:40
*** kopecmartin has quit IRC10:41
rnoriegagfidente, yes please, include me. Thanks!10:41
openstackgerritYurii Prokulevych proposed openstack/tripleo-upgrade master: Adjust templating for upgrade scripts.  https://review.openstack.org/58391710:41
gfidenternoriega and v3.0.3 is not 8 months old, but it's dated apr 17th10:42
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: DO-NOT-MERGE Test new ceph-container builds  https://review.openstack.org/56221310:42
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: DO-NOT-MERGE Test new ceph-container builds  https://review.openstack.org/56221310:42
rnoriegagfidente, I meant the build-master-luminous-centos: https://hub.docker.com/r/ceph/daemon/builds/10:42
gfidenternoriega oh that one is probably an old tag we don't use anyway10:43
gfidentenote I added you to https://review.openstack.org/#/c/562213/ which is testing 3.0.610:43
gfidente22 days old10:43
rnoriegagfidente, isn't there a "generic" tag that points to the version you pick? like current-rdo that points to a hash-tag10:44
rnoriegagfidente, even if it's just pinned10:44
gfidenternoriega you mean why we don't pin last-known-working version in ceph-container repo vs trpleo repo?10:44
rnoriegagfidente, yep10:45
gfidenternoriega right10:45
gfidenternoriega there isn't because ceph last known working is not tested with tripleo10:45
gfidenternoriega so it is in tripleo that we maintain what is known to work with tripleo10:45
gfidentesame reason why we don't point to -latest10:46
rnoriegagfidente, well, make sense.10:46
gfidenternoriega but this is indeed interesting topic10:46
gfidenteand applies to ceph-ansible as well10:46
gfidentewe discussed a while ago with weshay if/how we could gate ceph-ansible and ceph-container changes with a tripleo job10:46
rnoriegagfidente, consuming external components from tripleo perspective increases complexity10:46
ccamachotENGO DONE :)10:46
rnoriegagfidente, not complaining :-)10:46
ccamachoTengu done ** wrong caps10:46
gfidenternoriega I think we can offer infrastructure where to run the tripleo job with "pending" ceph-ansible and ceph-container code10:47
gfidenternoriega but the mechanics of setting up in github triggers for zuul on pull requests are not resolved yet10:47
gfidenternoriega so right now ceph continues to test both ceph-ansible and ceph-container with their test suite10:48
rnoriegagfidente, maybe OPNFV could be a good place to test it without breaking the whole tripleo CI. We'd have to discuss it with trozet10:48
gfidenternoriega and tripleo/ci picks them up for testing what is considered stable10:48
gfidente*when considered10:48
rnoriegagfidente, ok10:48
gfidenteexcept they not always are :D10:48
rnoriegahahha10:48
gfidenternoriega sure yes10:48
*** links has quit IRC10:49
gfidenternoriega to be honest10:50
gfidentecurrent approach revealed to be much more stable for tripleo devs10:50
gfidentewe rarely had outages due to breakages in ceph components10:50
gfidentewhen we had any, it was likely misconfig or stuff we could fix in tripleo, but not breakages in a ceph component outside our direct control10:50
gfidentewhich in the past (around pike) was more the case instead10:51
rnoriegagfidente, I see10:51
rnoriegagfidente, reading your blogpost about ceph-container, ceph-ansible and openstack :-)10:52
*** jfrancoa is now known as jfrancoa|lunch10:52
gfidenternoriega yeah that is the surface10:52
gfidenteof the whole thing10:52
rnoriegagfidente, going for lunch now, thanks for the insights! :-)10:53
gfidenternoriega cool and nice if we can share knowledge about all this stuff10:53
openstackgerritJuan Badia Payno proposed openstack/tripleo-heat-templates master: mistral_engine container added /usr/share volume  https://review.openstack.org/58387710:53
*** amoralej is now known as amoralej|lunch10:54
*** ukalifon has joined #tripleo10:56
*** salmankhan has quit IRC10:57
*** salmankhan has joined #tripleo10:58
*** dhill_ has joined #tripleo10:58
*** agurenko has quit IRC11:02
*** agurenko has joined #tripleo11:05
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332511:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216511:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243811:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)11:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)11:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]11:10
openstackgerritLuigi Toscano proposed openstack/tripleo-heat-templates master: WIP Deploy Sahara with unversioned endpoints  https://review.openstack.org/58389011:10
*** links has joined #tripleo11:12
*** morazi has quit IRC11:17
*** quiquell is now known as quiquell|lunch11:21
*** udesale__ has quit IRC11:28
*** agopi is now known as agopi|brb11:32
*** abishop has joined #tripleo11:33
*** kopecmartin has joined #tripleo11:34
*** pchavva has joined #tripleo11:35
*** rh-jelabarre has joined #tripleo11:37
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart master: include domain name into clouds.yaml  https://review.openstack.org/58254611:38
openstackgerritSagi Shnaidman proposed openstack/tripleo-common master: Support for ARA report for ansible playbooks in deploy  https://review.openstack.org/56507711:39
*** moguimar has joined #tripleo11:42
*** med_ has joined #tripleo11:47
*** med_ has quit IRC11:47
*** med_ has joined #tripleo11:47
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Enable support for running refstack tests in TQE  https://review.openstack.org/57071911:48
Tenguccamacho: great, thanks :).11:49
*** shardy has joined #tripleo11:50
*** lblanchard has joined #tripleo11:51
Tenguccamacho: thanks a lot for your feedback - will do some corrections for the nits, and will try to reflect your thoughts for the rest :).11:54
*** pdeore has quit IRC11:54
openstackgerritClark Chen proposed openstack/tripleo-ha-utils master: Filter "Starting" and "Stopping" keywords when deleting resources Sometimes the some resource can be in Starting status and resourceid will return "Starting", which will fail to uninstall  https://review.openstack.org/58393911:55
*** mcornea has joined #tripleo11:55
*** moguimar has quit IRC11:57
ccamachoTengu! Awesome but they are most suggestions :) we can speak about it in the upstream call :)11:57
ccamachothank you for the spec proposal11:57
Tenguccamacho: ok, we can see that tomorrow then :).11:57
Tenguwill just push the nits part, because, well, nits.11:58
*** athomas has joined #tripleo11:58
openstackgerritCédric Jeanneret proposed openstack/tripleo-specs master: Validation Framework specifications.  https://review.openstack.org/58347511:58
Tenguso no need to review, no real change -^11:58
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: Move to openshift-ansible 3.10  https://review.openstack.org/58249512:00
*** shreshtha has quit IRC12:00
chkumar|ruckmandre: https://review.openstack.org/#/c/583940/12:01
chkumar|ruckregarding tempest user in a container12:01
*** dtantsur|bbl is now known as dtantsur12:02
*** agopi|brb has quit IRC12:02
*** trown|outtypewww is now known as trown12:03
*** thrash|g0ne is now known as thrash12:04
*** amoralej|lunch is now known as amoralej12:04
trownchkumar|ruck: I was able to reproduce that issue with scenario009 ... havent got to the bottom of it yet12:04
*** jfrancoa|lunch is now known as jfrancoa12:04
chkumar|ruckarxcruz: it is related to this bug https://bugzilla.redhat.com/show_bug.cgi?id=160317612:05
openstackbugzilla.redhat.com bug 1603176 in rhosp-director "[OSP14][Containerized Undercloud] tempest_init_logs docker container exited with exited code!=0 "chown: invalid user: 'tempest:tempest'"" [Low,New] - Assigned to rhos-maint12:05
chkumar|ruckarxcruz: it is not related to scenario002 issue12:06
chkumar|rucktrown: great12:06
arxcruzchkumar|ruck: but we use this docker image right?12:07
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: Add fs055 to run refstack tests  https://review.openstack.org/57088412:07
openstackgerritChandan Kumar proposed openstack-infra/tripleo-ci master: Add FS055 job as experimental to run refstack tests  https://review.openstack.org/57089212:07
chkumar|ruckarxcruz: yes, but adding temepst user will be used only when --user tempest flag is passed with docker run12:08
chkumar|ruckarxcruz: it does not affect the tempest container12:09
trownmandre: have you seen any issue where mistral is failing to access files in /usr/share/ansible/openshift-ansible? scenario009 job is failing on it, and even testing with flaper87 patch that bumps openshift-ansible version I hit the same thing12:09
arxcruzchkumar|ruck: please add these comments on the patch and i'll change my vote12:09
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart-extras master: ovb-manage: save generated idnum to yaml file  https://review.openstack.org/58394412:09
*** leanderthal has quit IRC12:10
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332512:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216512:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178243812:10
*** ooolpbot has quit IRC12:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)12:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)12:10
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]12:10
flaper87trown: that's fixed with bogdando patch12:10
flaper87trown: https://review.openstack.org/#/c/583136/12:10
flaper87trown: basically, the new containerized mistral doesn't have the osa playbooks installed in it12:11
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart-extras master: Add pauch to the includepkgs  https://review.openstack.org/57922312:11
flaper87we have to bindmount them from the base os12:11
trownflaper87: ah makes sense12:11
EmilienMhellow12:12
flaper87EmilienM: hellow tow youw toow12:12
openstackgerritJohn Trowbridge proposed openstack/tripleo-heat-templates master: Add secondary DNS server to disable-unbound environment  https://review.openstack.org/58216412:12
*** pdeore has joined #tripleo12:12
EmilienMflaper87: :)12:12
flaper87trown: feel free to base your patch on top of https://review.openstack.org/#/c/583136/ instead of mine12:13
flaper87trown: we can merge yours before mine lands12:13
trownflaper87: I am using it to test 3.10 though12:13
flaper87trown: oh, nvm then12:13
flaper87:)12:13
trownflaper87: my patch is not actually needed by CI, so kind of lower priority... It may not even be needed by most people locally12:14
trownmy ISP just doesnt like 1.1.1.112:14
chkumar|ruckflaper87: ah the same patch is passed by mandre last night to me, need to test that12:14
*** morazi has joined #tripleo12:19
flaper87trown: oh, mmh, silly isp12:20
openstackgerritMarios Andreou proposed openstack/tripleo-quickstart-extras master: WIP - Adds new bootstrap-subnodes role instead of tripleo.sh  https://review.openstack.org/58102612:20
*** yrabl has joined #tripleo12:21
*** holser_ has joined #tripleo12:22
openstackgerritJames Slagle proposed openstack/tripleo-common stable/queens: Use hostnames in inventory  https://review.openstack.org/58328512:22
openstackgerritJames Slagle proposed openstack/tripleo-common stable/queens: Fix dynamic inventory  https://review.openstack.org/58328612:22
openstackgerritJames Slagle proposed openstack/tripleo-common stable/queens: Include 'tripleo_role_name' in the inventory  https://review.openstack.org/58328712:22
*** marrusl has quit IRC12:22
openstackgerritJames Slagle proposed openstack/tripleo-common stable/queens: Remove role_data from inventory  https://review.openstack.org/58394912:22
*** holser_ has quit IRC12:24
openstackgerritJames Slagle proposed openstack/tripleo-common master: git integration for GetOvercloudConfig action  https://review.openstack.org/57963412:24
openstackgerritJames Slagle proposed openstack/tripleo-common master: Use /var/lib/mistral/<plan-name> as config-download dir  https://review.openstack.org/57963512:24
openstackgerritJames Slagle proposed openstack/tripleo-common master: Update failures listing to use latest ansible-errors.json location  https://review.openstack.org/58329312:24
*** raildo has joined #tripleo12:24
*** holser_ has joined #tripleo12:25
*** yrabl is now known as liverpooler12:25
*** cmyster has joined #tripleo12:25
*** cmyster has joined #tripleo12:25
*** tzumainn has joined #tripleo12:25
chkumar|ruckmandre: regarding tempest tht changes, i think it is ok https://github.com/openstack/tripleo-heat-templates/blob/master/docker/services/tempest.yaml#L53 ?12:27
slagleEmilienM: d0ugal : could I have another look at this series: https://review.openstack.org/#/q/topic:bug/1779093+(status:open+OR+status:merged)12:28
openstackgerritBogdan Dobrelya proposed openstack/tripleo-quickstart master: Use undercloud templates path for UC deployments  https://review.openstack.org/58395012:30
*** ratailor has quit IRC12:33
bogdandojfrancoa: https://review.openstack.org/#/c/583950/12:33
EmilienMslagle: ack12:33
bogdandoplease use that topic for your fs51 fixes12:34
openstackgerritCédric Jeanneret proposed openstack/puppet-tripleo master: Corrected vrrp script for haproxy status  https://review.openstack.org/58388612:35
*** moshele has quit IRC12:35
*** ssbarnea1 has quit IRC12:36
bogdandotrozet, janki, dsneddon, bfournie: hi, PTAL rebased https://review.openstack.org/#/c/575122/ backport12:38
openstackgerritCédric Jeanneret proposed openstack/puppet-tripleo master: Corrected vrrp script for haproxy status  https://review.openstack.org/58388612:39
*** ssbarnea has joined #tripleo12:39
*** v1a4 has quit IRC12:40
*** rlandy has joined #tripleo12:40
*** Guest72952 is now known as portdirect12:41
*** edmondsw has joined #tripleo12:41
*** noslzzp has quit IRC12:45
*** noslzzp has joined #tripleo12:45
*** quiquell|lunch is now known as quiquell12:45
*** ramishra has joined #tripleo12:55
*** eck`gone is now known as eck`12:55
*** iranzo is now known as iranzo|AFK12:56
weshaylog server is down12:56
weshayjobs will fail12:56
*** weshay changes topic to "Welcome to Rocky. CI status: http://logs.openstack.org is down RED RED RED | https://docs.openstack.org/tripleo-docs/latest"12:57
weshayEmilienM, ^12:57
dpeacockflorianf: EmilienM: https://review.openstack.org/#/c/577397/ is ready for you again please :-)12:57
EmilienMweshay: no... did you report to infra?12:57
weshay-openstackstatus/#openstack-infra- NOTICE: logs.openstack.org is offline, causing POST_FAILURE results from Zuul. Cause and resolution timeframe currently unknown.12:57
dpeacockI really wanna get this merged since M3 is looming and I'd like this to make it.12:57
weshaythey are on it12:57
EmilienMah it's from infra12:57
EmilienMok, thanks for the headsup12:57
EmilienMdpeacock: ack12:58
*** psahoo has quit IRC12:58
-openstackstatus- NOTICE: logs.openstack.org is offline, causing POST_FAILURE results from Zuul. Cause and resolution timeframe currently unknown.12:59
*** ChanServ changes topic to "logs.openstack.org is offline, causing POST_FAILURE results from Zuul. Cause and resolution timeframe currently unknown."12:59
*** moshele has joined #tripleo13:00
honzaEmilienM: weshay: could i get your help with https://bugs.launchpad.net/tripleo/+bug/1782438 ? tripleo-ui is broken on oooq master13:02
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [Critical,New]13:03
*** iranzo|AFK is now known as iranzo13:03
EmilienMdpeacock: no -1 but a comment13:03
honzaEmilienM: weshay: i'm happy to do the work, i just don't really know where to look13:03
EmilienMdpeacock: I'm happy to iterate later though, just let me know what you prefer;13:03
EmilienMhonza: looking now, one sec13:03
dpeacockEmilienM: thnaks13:03
arxcruzEmilienM: https://review.openstack.org/#/c/583659/ fix https://bugs.launchpad.net/tripleo/+bug/1773325 :)13:03
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)13:03
dpeacock*thanks13:03
EmilienMarxcruz: +A, thx for the fix13:04
EmilienMhonza: it's really weird, I've been testing the UI for a while now and everything works fine13:04
EmilienMbut I don't use quickstart.13:04
EmilienMit shouldn't matter, tbh13:04
honzaEmilienM: there's the problem13:05
dpeacockEmilienM: lol scope creep - ok this might delay things - let me look into it - I was hoping to get *something* through soon - I'll come back to you13:05
dpeacock:-)13:05
EmilienMoh actually yes it matters13:05
honzaIs minimal.yml being exercised by ci anywhere?13:05
EmilienMdpeacock: I'm happy to +2 it now, if you say we can do it later13:05
weshayhonza, /me looks13:05
EmilienMdpeacock: I've +2-ed, but I want someone from validation to make a final review for the code structure. florianf / gchamoul at least13:06
mandrechkumar|ruck: how are you running the tempest container?13:07
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: Use global ansible.cfg for nodes-uuid playbook  https://review.openstack.org/58355213:07
mandrechkumar|ruck: the container you want to run with the 'tempest' user13:07
chkumar|ruckmandre: https://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/validate-tempest/templates/run-tempest.sh.j2#L4713:08
weshaychkumar|ruck, sshnaidm|rover can you guys help honza ?13:08
chkumar|ruckmandre: on doing enable_tempest to true in undercloud.conf it pulls the container on undercloud, and create /var/log/container tempest13:08
weshayhttps://bugs.launchpad.net/tripleo/+bug/178243813:08
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [High,Triaged]13:08
mandrechkumar|ruck: this is where you need to add the '--user tempest' to the docker command13:08
dpeacockEmilienM: Thank you Sir; I'll make a followup patch with your suggestion right after13:09
EmilienMdpeacock: cool cool13:09
chkumar|ruckmandre: yes, but waiting for kolla patch to merge13:09
openstackgerritCarlos Camacho proposed openstack/tripleo-common master: WIP - Group sudoers by aliases  https://review.openstack.org/58395613:09
EmilienMdpeacock: credits to Mr Alex for the code https://review.openstack.org/#/c/569153/25/common/deploy-steps-tasks.yaml13:09
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332513:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216513:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)13:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)13:10
dpeacockEmilienM: Excellent!13:10
mandrechkumar|ruck: you don't have to wait for kolla patch to merge - in the kolla patch you set the default user for the image, but you can force the user in the run-tempest.sh script if you want to test it13:10
chkumar|ruckmandre: sure13:10
chkumar|ruckmandre: I will prepare a patch for the same13:10
*** moshele has quit IRC13:10
mandreIIUC this is what arxcruz was asking for13:11
mandrechkumar|ruck: what is the issue with scenario002?13:11
chkumar|ruckmandre: container check script does not check for package update in tempest container13:12
*** pradk has quit IRC13:12
chkumar|ruckmandre: waiting for logs.openstack.org to up, I will explain the issue13:12
mandreok so it's unrelated to the user change, right?13:12
*** jtomasek has joined #tripleo13:13
chkumar|ruckmandre: yup13:13
chkumar|ruckmandre: that is totally a different story13:13
arxcruzmandre: according chkumar|ruck is unrelated, and i'm trusting on his word13:13
*** dprince has joined #tripleo13:14
*** agopi has joined #tripleo13:15
*** udesale has joined #tripleo13:18
*** toure|gone is now known as toure13:19
mandrechkumar|ruck: ah the issue is caused by the tempest container not having the latest package?13:20
*** artom has joined #tripleo13:20
chkumar|ruckmandre: yup13:20
chkumar|ruckmandre: so on this one https://review.openstack.org/573220 i have added a depends on patch from python-temepstconf13:21
chkumar|rucktempest container should be updated with python-temepstconf quickstart dlrn build package but it is not happening13:22
chkumar|rucki am not sure where is the gotcha13:22
openstackgerritTim Rozet proposed openstack/puppet-tripleo stable/queens: Remove table 17 from OVS OF pipeline sync  https://review.openstack.org/58300913:23
openstackgerritTim Rozet proposed openstack/puppet-tripleo stable/queens: Updates OpenDaylight HA Proxy backend check  https://review.openstack.org/58179013:24
mandrechkumar|ruck: do you store the tempest image in the local registry on the undercloud? or are you pulling it via the 'docker run' command in run-tempest.sh13:27
chkumar|ruckmandre: it is getting pulled in the local registery13:28
chkumar|ruckmandre: https://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/validate-tempest/defaults/main.yml#L2613:28
chkumar|ruckwhich is referneced to local registery13:28
*** mjturek has joined #tripleo13:29
chkumar|ruckmandre: might be I am doing something wrong, there13:30
yolanda_hi, good afternoon. I have a question.. i'm trying to deploy tripleo, queens, but without external network.13:32
yolanda_and i keep getting an error, it keeps asking me about ExternalNetName parameter13:32
yolanda_what shall i provide there? i don't have external routable network in my setup13:32
slaglequiquell: sshnaidm|rover : do you know you're both working similar approaches with https://review.openstack.org/#/c/583536/ and https://review.openstack.org/#/c/565077/13:34
sshnaidm|roverslagle, yes :)13:34
slagleok13:35
*** iranzo is now known as iranzo|AFK13:36
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: User tempest user with tempest container  https://review.openstack.org/58396113:36
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart-extras master: Use tempest user with tempest container  https://review.openstack.org/58396113:37
*** iranzo|AFK is now known as iranzo13:38
*** moguimar has joined #tripleo13:39
*** bfournie has quit IRC13:42
*** ChanServ changes topic to "Welcome to Rocky. CI status: http://logs.openstack.org is down RED RED RED | https://docs.openstack.org/tripleo-docs/latest"13:43
-openstackstatus- NOTICE: logs.openstack.org is back on-line. Changes with "POST_FAILURE" job results should be rechecked.13:43
trownflaper87: I am seeing the webconsole pod failing to schedule with 3.10 ... is that a familiar issue?13:46
*** suuuper has quit IRC13:46
*** suuuper has joined #tripleo13:46
*** ccamacho has quit IRC13:49
*** ccamacho1 has joined #tripleo13:49
*** nyechiel has quit IRC13:51
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates master: Enable deep_compare of pcmk resources by default  https://review.openstack.org/58141613:52
*** raildo has quit IRC13:53
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Make sure that stonith state is enforced before attempting a scaleup  https://review.openstack.org/58252113:54
*** raildo has joined #tripleo13:54
*** iranzo is now known as iranzo|AFK13:54
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Make sure that stonith state is enforced before attempting a scaleup  https://review.openstack.org/58252113:55
flaper87trown: when that one fails is prob because the nodes are not ready13:55
trownflaper87: hmm so a bit racy?13:55
flaper87get in the master node, run oc get pods and see if the pods there are not scheduled13:55
flaper87trown: no, more likely a miss labeled node (or at least that was my problem)13:55
flaper87trown: are you seeing this in your local env?13:56
trownflaper87: ya13:56
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart-extras master: avoid Inappropriate ioctl for device with pause module  https://review.openstack.org/58396513:56
flaper87trown: I can help looking into it13:56
trownflaper87: ah ya all the pods are in Pending13:56
openstackgerritSagi Shnaidman proposed openstack/python-tripleoclient master: Support ARA report tracking from command line  https://review.openstack.org/58379913:57
*** iranzo|AFK is now known as iranzo13:57
flaper87trown: so, likely what I said. I can help looking into this if you give me access. Make sure you have nodes labeled as infra13:58
flaper87trown: oc get node $NODE13:58
*** moguimar has quit IRC13:59
*** gfidente has quit IRC14:00
*** gfidente has joined #tripleo14:01
*** gfidente has quit IRC14:01
*** gfidente has joined #tripleo14:01
trownflaper87: I am looking at ROLES there? only compute14:02
*** morazi has quit IRC14:08
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332514:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216514:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)14:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)14:10
*** bkopilov has quit IRC14:10
*** cshastri has quit IRC14:12
*** morazi has joined #tripleo14:13
trownflaper87: is this openshift_node_group_name supposed to end up as "node-config-infra" http://paste.openstack.org/show/726276/14:14
*** janki has quit IRC14:15
*** paramite has quit IRC14:16
*** janki has joined #tripleo14:16
openstackgerritDerek Higgins proposed openstack/tripleo-quickstart master: [WIP] Add featureset 054 - overcloud baremetal+ansible-ml2  https://review.openstack.org/57960114:17
openstackgerritDerek Higgins proposed openstack/tripleo-heat-templates master: [WIP] Add scenario 012 - overlcoud baremetal+ansible-ml2  https://review.openstack.org/57960314:18
*** bfournie has joined #tripleo14:19
*** mdnadeem has quit IRC14:25
*** pdeore has quit IRC14:26
*** jfrancoa has quit IRC14:28
*** med_ has quit IRC14:29
honzaEmilienM: weshay: our ovb configs in oooq-extras aren't set up for containerized undercloud; should they be?14:30
honzachkumar|ruck: sshnaidm|rover any ideas on https://bugs.launchpad.net/tripleo/+bug/178243814:30
openstackLaunchpad bug 1782438 in tripleo "tripleo ui endpoints misconfigured in containerized undercloud" [High,Triaged]14:30
*** hamdyk has quit IRC14:31
*** quiquell is now known as quiquell|off14:32
sshnaidm|roverhonza, well, need to look more, not familiar with that part..14:33
sshnaidm|roverhonza, can you point me exactly where quickstart populates this ip?14:33
*** pradk has joined #tripleo14:33
*** rbrady has joined #tripleo14:34
*** rbrady has quit IRC14:34
*** rbrady has joined #tripleo14:34
*** paramite has joined #tripleo14:36
*** mdnadeem has joined #tripleo14:37
Tenguflorianf: do you have a couple of minutes for a quick question on a validation?14:39
honzasshnaidm|rover: all i know is in the bug; the --public-virtual-ip value passed to 'tripleo deploy' is 192.168.24.2 which can't be accessed from outside the virthost.  Setting --public-virtual-ip to the virthost's public ip makes undercloud deploy fail (network issue)14:40
honzasshnaidm|rover: i don't have the precise error message anymore, unfortunately14:41
*** dparkes has quit IRC14:41
*** arxcruz has quit IRC14:41
flaper87trown: sorry, had to go afk for a bit14:41
flaper87back now14:41
trownflaper87: no worries14:41
honzasshnaidm|rover: i'm re-running the script now14:42
sshnaidm|roverhonza, can you paste in bug your reproducing steps?14:42
honzasshnaidm|rover: done14:43
*** arxcruz has joined #tripleo14:44
*** bogdando has quit IRC14:44
*** leanderthal has joined #tripleo14:46
*** jfrancoa has joined #tripleo14:46
*** yprokule has quit IRC14:47
*** morazi has quit IRC14:48
*** morazi has joined #tripleo14:54
openstackgerritMarios Andreou proposed openstack/tripleo-quickstart-extras master: Adds reproducer check+exit+warning dependencies - virtualenv+others  https://review.openstack.org/57808114:55
*** janki has quit IRC14:55
*** ratailor has joined #tripleo14:56
owalshEmilienM, slagle, dprince: if you get a chance would really appreciate reviews on https://review.openstack.org/57785514:56
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: Move to openshift-ansible 3.10  https://review.openstack.org/58249514:58
openstackgerritFlavio Percoco proposed openstack/tripleo-heat-templates master: WIP use openshift-ansible container instead of RPMs  https://review.openstack.org/58386814:58
openstackgerritJohn Trowbridge proposed openstack/tripleo-heat-templates master: Add secondary DNS server to disable-unbound environment  https://review.openstack.org/58216414:58
*** leanderthal has quit IRC15:02
*** dtantsur is now known as dtantsur|afk15:02
*** ykarel is now known as ykarel|away15:04
slagleowalsh: i'll take a look15:04
owalshslagle: thanks15:04
*** weshay changes topic to "Welcome to Rocky. CI status: GREEN | https://docs.openstack.org/tripleo-docs/latest"15:06
*** jfrancoa has quit IRC15:08
EmilienMowalsh: ack15:08
*** jfrancoa has joined #tripleo15:08
openstackgerritSorin Sbarnea proposed openstack/tripleo-quickstart-extras master: Run bashate via pre-commit  https://review.openstack.org/58398415:09
EmilienMowalsh: ouch, it'll take time15:09
EmilienMowalsh: I wasn't a fan of having these scripts in THT...15:10
weshayEmilienM, just confirming.. this is what you want to see w/ the new healthchecks in the undercloud15:10
weshayhttp://logs.openstack.org/55/573255/4/gate/tripleo-ci-centos-7-containers-multinode/91fb9e6/logs/undercloud/home/zuul/undercloud_install.log.txt.gz#_2018-07-19_14_42_2115:10
EmilienMowalsh: but I don't have anything better in my mind now15:10
openstackgerritBob Fournier proposed openstack/tripleo-common stable/queens: ensure unique ironic node ID with UCS driver  https://review.openstack.org/58398515:10
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332515:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178216515:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)15:10
openstackLaunchpad bug 1782165 in tripleo "ntp servers blocked. Can't deploy undercloud with oooq due to wrong NTP configuration" [Critical,Incomplete] - Assigned to wes hayutin (weshayutin)15:10
owalshEmilienM: yea, there were always a quick hack that turned out to be far too useful :-)15:10
EmilienMweshay: yes, this is Alex's patch that is in effect :D15:10
*** ccamacho1 has quit IRC15:10
weshayk15:10
EmilienMweshay: we need to get someone from nova (owalsh?) to look at this one15:10
EmilienMbut the container isn't healthy in this job15:11
*** leanderthal has joined #tripleo15:11
EmilienMweshay: is it in all jobs or randomly?15:11
* EmilienM in a meeting, can't look now15:11
* owalsh in a meeting too15:11
*** pcaruana has quit IRC15:12
weshayarxcruz++15:12
openstackgerritRyan Brady proposed openstack/tripleo-common master: Makes sorting environments with capabilities-map optional  https://review.openstack.org/58223315:16
openstackgerritChandan Kumar proposed openstack/tripleo-quickstart master: Add fs055 to run refstack tests  https://review.openstack.org/57088415:16
*** mrunge_ is now known as mrunge15:16
florianfTengu: yup. still there?15:20
Tenguflorianf: yep :)15:20
*** iranzo is now known as iranzo|AFK15:20
Tenguflorianf: I was wondering if you have and thought about weshay comment on this change: https://review.openstack.org/#/c/582917/15:20
slagleEmilienM: owalsh : i'm not a fan of the script in tht either, but it seems reasonable for now I suppose. as we move towards ansible roles and we can move the deploy tasks out of tht, maybe the script(s) could get moved into the role15:20
EmilienM+1 with slagle15:21
EmilienMowalsh: how does that sound to you?15:21
Tenguflorianf: was about to post a question on that topic on openstack-dev anyway. but as you're on validations, maybe you have a first word :).15:21
*** gkadam-brb is now known as gkadam15:21
dpeacockflorianf: hey - any chance you are ready to +2 https://review.openstack.org/#/c/577397/ please?15:23
*** iranzo|AFK has quit IRC15:24
*** sshnaidm|rover is now known as sshnaidm|afk15:24
florianfdpeacock: yes almost. It looks fine, but I want to give it one real test run at least. I'm finishing one thing in my dev environment and then I'm ready.15:25
dpeacockflorianf: of course - thank you - let me know if you need anything - I'm writing up a doc which I haven't submitted yet.  It requires the inventory.yaml generated by tripleo-ansible-inventory which is archived in the homedir.15:26
dpeacockflorianf: it's in the undercloud-install-<timestamp>.tar.bz2 file.15:28
openstackgerritMarios Andreou proposed openstack/tripleo-quickstart-extras master: Adds reproducer check+exit+warning dependencies - virtualenv+others  https://review.openstack.org/57808115:28
EmilienMowalsh: +2 with comment. I'll let slagle approve maybe15:30
florianfdpeacock: the doc requires the inventory file?15:30
EmilienMweshay: is it in all jobs or randomly?15:31
dpeacockflorianf: the validation playbook does; it needs the ctlplane ip15:31
florianfdpeacock: right.15:32
dpeacockAfter thinking about how to get it, Emilien and I settled on grabbing it from the inventory which was provided during the deployment.15:32
owalshEmilienM: thanks, ack that's that plan15:32
EmilienMowalsh: cool.15:32
weshayEmilienM, was in a gate failure15:33
EmilienMweshay: damn.15:33
dpeacockthere may be a better way we can think of, and without trying to get too far ahead, I'm going to talk to the dev list people to see if anyone objects to making the inventory file persist on undercloud instead of being archived after deployment (which is the current state)15:33
dpeacockflorianf: ^15:33
EmilienMweshay: let's prepare a logstash query to see how frequent we have this one, give me a sec15:33
weshayk15:33
EmilienMweshay: build_name: *tripleo-ci* AND build_status: FAILURE AND message: "Up 2 minutes (unhealthy)" and message: "centos-binary-nova-api:current-tripleo-updated"15:36
*** lvdombrkr has quit IRC15:36
florianfdpeacock: if the validation is run through mistral, it already has access to the inventory information15:36
EmilienMweshay: 7 hits today15:36
EmilienMnot much but still not good15:36
EmilienMowalsh: can you help please? (maybe after your meeting if you still have time)15:37
EmilienMweshay: do we have a bug already?15:37
owalshEmilienM: yes, will take a look after15:37
EmilienMweshay: https://prnt.sc/k8hrjr15:38
EmilienMowalsh: thx15:38
weshayEmilienM, no.. I'll make one15:38
EmilienMk15:38
EmilienMowalsh: first look, it seems like the containers looks healthy after a while: http://logs.openstack.org/55/573255/4/gate/tripleo-ci-centos-7-containers-multinode/91fb9e6/logs/undercloud/var/log/extra/docker/containers/nova_api/docker_info.log.txt.gz15:40
*** aufi_ has quit IRC15:40
EmilienMso maybe it's just nova-api being too long15:40
*** holser__ has joined #tripleo15:40
EmilienMand we need to increase the timeout in alex's patch: https://review.openstack.org/#/c/569153/25/common/deploy-steps-tasks.yaml15:40
owalshEmilienM: ah, was about to say ^^^ maybe need to bump this in CI15:41
EmilienM:(15:41
EmilienMretry driven deployment15:41
*** ykarel|away has quit IRC15:41
EmilienMweshay: please add the query in the bug report, for the record.15:43
weshaysure good idea15:44
*** holser_ has quit IRC15:44
*** udesale has quit IRC15:46
*** shreshtha has joined #tripleo15:46
*** ramishra has quit IRC15:47
*** mdnadeem has quit IRC15:48
openstackgerritJames Slagle proposed openstack/tripleo-docs master: Update docs for /var/lib/mistral/<plan name>  https://review.openstack.org/58400415:48
*** jfrancoa has quit IRC15:48
weshayhttps://bugs.launchpad.net/tripleo/+bug/178259815:49
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged]15:49
florianfdpeacock: hmm. the inventory doesn't contain any info on the undercloud's ctlplane ip in my environment15:50
dpeacockflorianf: Did you deploy a containerized undercloud?15:51
dpeacockflorianf: on my system:-15:52
dpeacock[vagrant@undercloud ~]$ grep ctlplane_ip undercloud-ansible-roQcHV/inventory.yaml15:52
dpeacock      ctlplane_ip: 192.168.24.115:52
*** skramaja has quit IRC15:52
*** gfidente has quit IRC15:53
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart-extras master: Include minimal Browbeat playbook in baremetal playbook  https://review.openstack.org/58148815:54
*** leanderthal has quit IRC15:55
EmilienMowalsh: I assigned https://bugs.launchpad.net/tripleo/+bug/1782598 to you15:57
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)15:57
*** gfidente has joined #tripleo15:57
*** gfidente has quit IRC15:57
*** gfidente has joined #tripleo15:57
*** gfidente^2nd has joined #tripleo15:57
owalshEmilienM: ack, seems apache just took a couple of minutes to spawn the 4 nova_api procs15:58
*** kopecmartin has quit IRC16:01
florianfdpeacock: ok, that's probably it then.16:01
dpeacockflorianf: yeah this is only for containerized undercloud checks16:01
florianfdpeacock: I know. But I wonder: should there be some checks if the variable exists? because if it doesn't the validation will break (not just fail).16:02
*** ukalifon has quit IRC16:04
*** gfidente^2nd has quit IRC16:05
dpeacockSure that sounds reasonable.16:05
florianfdpeacock: I added a comment to the patch16:06
dpeacockflorianf: much appreciated - thank you Sir :-)16:07
florianfdpeacock: thank *you*! :)16:07
*** avivgt|lunch has quit IRC16:08
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332516:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259816:10
*** ooolpbot has quit IRC16:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)16:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)16:10
*** shardy has quit IRC16:11
*** sshnaidm|afk is now known as sshnaidm|rover16:11
chkumar|ruckEmilienM: weshay sshnaidm|rover https://review.openstack.org/#/c/581607/ please have a look at this one16:12
*** athomas has quit IRC16:12
owalshlarsks: hey, around? got a question re healthchecks16:13
larsksowalsh: I'm around, although it's been a while since I looked at health checks...16:14
owalshlarsks: wondering if we can alter the start period16:14
sshnaidm|roverchkumar|ruck, you need the patch to pass CI before review16:14
sshnaidm|roverchkumar|ruck, it has legit job failures16:15
weshaythanks chkumar|ruck16:15
larsksowalsh: by "can", do you mean "is it possible" or "is it advisable"?16:15
owalshlarsks: I think it's necessary... wondering if it's possible16:15
owalshseems to be an option here - https://docs.docker.com/engine/reference/builder/#healthcheck16:16
sshnaidm|roverweshay, recheck won't help16:16
weshayok16:16
chkumar|rucksshnaidm|rover: I need to find out how to add it to undercloud16:16
dpeacockflorianf: actually having checked I think this is implicitly taken care of - please see the three scenarios I just put in this paste and let me know if this addresses your concerns: http://paste.openstack.org/show/726283/16:16
larsksowalsh: I believe the health checks are put in place by the 'paunch' tool, so the real question is if paunch exposes an option for that.  Let me see if I can answer that...16:17
owalshlarsks: ah, no https://github.com/openstack/paunch/blob/master/paunch/builder/compose1.py#L18516:17
sshnaidm|roverchkumar|ruck, ok, so this patch is WIP so far, it's not ready for review16:17
larsksowalsh: well, there you go :). Looks like a pretty simple patch, though.16:18
owalshlarsks: ack thanks16:18
owalshEmilienM: think we need to drop the healthcheck patch... needs to consider the interval/retires options https://github.com/openstack/paunch/blob/master/paunch/builder/compose1.py#L18516:19
*** karthiks has joined #tripleo16:19
*** janki has joined #tripleo16:20
*** agopi is now known as agopi|food16:21
sshnaidm|roverslagle, can you please take a look at https://review.openstack.org/#/c/565077/ ? I don't understand what to do on mistral workflows side16:21
owalshEmilienM: default is 3 retires and 30s interval before it marks containers unhealthy16:22
owalshso 60s timeouts defintely ain't right16:22
*** amoralej is now known as amoralej|off16:23
*** jtcressy has joined #tripleo16:24
*** agopi|food is now known as agopi16:25
owalshlarsks: *sigh* looks like that's a new docker option16:27
larsksThe inevitable march of progress...16:27
*** ratailor has quit IRC16:28
*** moshele has joined #tripleo16:29
*** noslzzp has quit IRC16:31
florianfdpeacock: the output will look different when the validation is run through mistral16:34
florianfdpeacock: because it uses a custom plugin to format the output: https://github.com/openstack/tripleo-validations/blob/master/validations/callback_plugins/validation_output.py16:34
*** gfidente is now known as gfidente|afk16:34
florianfdpeacock: which doesn't pick up debug tasks etc.16:35
*** dtrainor has quit IRC16:35
florianfdpeacock: (you can check directly if you run the validation from the repository root because it contains an ansible.cfg file that sets the validation_output plugin)16:35
*** ffiore has quit IRC16:36
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: ensure pip deps are at the latest version  https://review.openstack.org/58373616:36
dpeacockflorianf: ok - my apologies - I haven't tried through mistral yet - is there an example command to run it all?16:36
*** dtrainor has joined #tripleo16:38
florianfdpeacock: no need to apologize at all! that's not exactly obvious... The easiest way to check the output as it would look when run through mistral is to cd into the validations repo root (where an ansible.cfg is located) and run the validation from there16:38
*** dbecker has quit IRC16:39
florianfdpeacock: I added another comment as a reminder...16:42
*** suuuper has quit IRC16:42
jtcressySo I once got the tripleo validations to run properly before... but most of the time the pre-deployment validation fail because "Warning! The validation did not run on any host." Is this a common issue?16:43
*** agopi is now known as agopi|food16:44
*** pchavva has quit IRC16:45
dpeacockflorianf: sorry - one thing I am missing is pretty crucial - what is the actual command to run this?16:46
jtcressyflorianf: just noticed you were talking about validations in the above messages, but I didnt catch the first half of the convo.16:46
jtcressydpeacock: I just used this from the docs a few minutes ago to spawn validations in mistral: openstack workflow execution create tripleo.validations.v1.run_groups '{"group_names": ["pre-deployment"]}'16:47
dpeacockjtcressy: thank you :-)16:49
jtcressyhowever, unrelated to that issue, I get "Warning! The validation did not run on any host." on every validation it tries to run. These were working once, but I don't know what's different between then and now.16:49
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add sample designate environment for ha  https://review.openstack.org/58402616:51
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Use absolute paths in enable-designate environments  https://review.openstack.org/58402716:51
dpeacockflorianf: for dev purposes - how do I actually run the validation from the repo root?16:55
*** tesseract has quit IRC16:56
dpeacockliterally just ansible-playbook ..?  Or is there something else special?16:56
dpeacockflorianf: ok nevermind - I answered my own question - thanks16:57
dpeacockok right - characterized the problem now - moving on :-)16:58
florianfdpeacock: sorry, back.16:59
dpeacockSorry for the stream of consciousness folks16:59
dpeacockflorianf: it's all good - I have everything I need to get going now :-)16:59
*** thrash is now known as thrash|biab16:59
florianfdpeacock: ok, cool! :-)17:00
florianfjtcressy: the "did not run on any hosts" error usually appears if you run a validation that's supposed to run on the overcloud without having an overcloud deployed17:01
jtcressythese are the "pre-deployment" validations though.17:01
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: ensure pip deps are at the latest version  https://review.openstack.org/58373617:01
jtcressye.g. the advanced format validation fails with this error and it does not need a deployed overcloud to run.17:02
florianfjtcressy: yeah, that shouldn't happen.17:02
*** owalsh is now known as owalsh_biab17:02
jtcressyI looked through a few logs in /var/log/mistral but cant find anything relevant yet17:02
florianfjtcressy: what command did you use to run a single validation?17:02
jtcressyI used this to run all of the pre-deployment validations: "openstack workflow execution create tripleo.validations.v1.run_groups '{"group_names": ["pre-deployment"]}'"17:03
*** links has quit IRC17:03
sri_dprince, dsneddon: hi quick quastion I've confiugred bonds in my overcloud with LACP, but from my switch side  not learning mac address for the bonds17:03
jtcressyflorianf: I can also try to start them individually from the UI but they still fail with the same error.17:03
*** pradk has quit IRC17:04
*** brault has quit IRC17:04
*** derekh has quit IRC17:04
florianfjtcressy: can you post the output of `python /usr/bin/tripleo-ansible-inventory --list` somewhere?17:04
*** brault has joined #tripleo17:04
florianf*paste17:04
*** pradk has joined #tripleo17:05
jtcressyflorianf: https://hastebin.com/raw/daquwakigo17:06
jtcressyi have no stack deployed, but I do have a plan defined.17:06
florianfjtcressy: do you use a stack/plan with a different name than overcloud?17:06
jtcressyno17:06
*** bdodd has quit IRC17:06
*** trown is now known as trown|lunch17:08
florianfjtcressy: what happens if you add `--plan overcloud` to the command?17:08
*** brault has quit IRC17:09
jtcressyflorianf: see the second half of the hastebin.17:09
*** gkadam has quit IRC17:09
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332517:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226717:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259817:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)17:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]17:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)17:10
*** moshele has quit IRC17:10
florianfjtcressy: what if you run this: openstack action execution run tripleo.plan.list17:11
*** holser__ has quit IRC17:11
jtcressyflorianf: output: `{"result": ["overcloud"]}`17:11
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates master: Use global ansible.cfg for nodes-uuid playbook  https://review.openstack.org/58355217:12
*** psachin` has quit IRC17:13
*** edmondsw has quit IRC17:13
*** artom has quit IRC17:13
florianfjtcressy: ok, this is strange. does this output anything: `echo $TRIPLEO_PLAN_NAME`17:14
*** weshay changes topic to "Welcome to Rocky. CI status: GREEN, OVB RED due to nodepool nodefailure | https://docs.openstack.org/tripleo-docs/latest"17:14
jtcressyNo, it does not.17:14
jtcressyi did `source stackrc` a while ago btw in case that envvar is supposed to be populated from there.17:15
florianfjtcressy: no, it's not. but if it was set to something else than overcloud (for whatever reason -- I just wanted to rule it out) it would have explain the result17:15
florianf*explained17:16
dsneddonsri_, You have the switch side configured for LACP as well?17:16
jtcressyflorianf: is it ok for this var to be empty or undefined? or does it _need_ to be "overcloud"?17:16
sri_yes, I mena my network admin told me17:16
florianfjtcressy: yes it is17:16
sri_and also when i craete a linux bonds "BONDING_MASTER" this not part of the bond config17:17
florianfjtcressy: the inventory will fall back to overcloud if it isn't set.17:17
sri_dsneddon, ^^ is that a deal breaker ?17:18
*** edmondsw has joined #tripleo17:18
dsneddonsri_, I'm not sure I understand what you mean when you say and also when i craete a linux bonds "BONDING_MASTER" this not part of the bond config"17:18
jtcressyflorianf: iirc the validations were running properly on pike, but i'm not sure. it could've also been queens. I've gone through a few re-creations of my undercloud node over the past few weeks.17:18
sri_dsneddon, instead of ovs_bond I've created a linux_bond17:18
florianfjtcressy: I've tested it with a fairly recent master17:19
EmilienMowalsh_biab: back. Ok so IIUC, we need to consider interval/retries options in paunch, but they are in a too-recent version of Docker so we can't use it now, is that correct?17:19
florianfjtcressy: but not completely up to date17:20
sri_dsneddon, in a linux bond that parameter needed or is it optionl : https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/6/html/deployment_guide/sec-configuring_a_vlan_over_a_bond17:20
dsneddonsri_, That's not a problem, but you will have to use a different format for the bonding_options. You can reuse the BondInterfaceOvsOptions parameter, and set "bonding_options: {get_param: BondInterfaceOvsOptions}" in the NIC config.17:20
jtcressyflorianf: I'm definitely not working from master. I'm sticking to specific releases.17:20
owalsh_biabEmilienM: no, we have intervals/retries but no start_period (which would be useful in this case)17:20
*** artom has joined #tripleo17:20
jtcressyflorianf: and I am currently on queens fyi17:21
owalsh_biabEmilienM: but looking at https://docs.docker.com/engine/reference/builder/#healthcheck ...17:21
dsneddonsri_, so instead of, for instance "bond_mode=balance-tcp lacp=active", you would use "mode=4" for Linux bond options17:21
florianfjtcressy: I have a queens env. let me double check real quick17:21
sri_dsneddon, my nic config: http://paste.openstack.org/show/726295/17:21
owalsh_biabEmilienM: think we will have to just check after the final step and give it enough time to run at least 3 healthchecks (or query the retry/interval to get how long we need to wait)17:22
dsneddonsri_, That looks correct17:22
EmilienMowalsh_biab: that's not really "step driven deployment" :-/17:23
EmilienMI think the goal was to stop the deployment if a container would fail to run17:23
EmilienMto provide quick feedback to the deployer17:23
owalsh_biabyea, but healthchecks need to fail $retry times17:24
sri_dsneddon, my network admin saying something must be wrong from my side,  I just wanted clarify things from my side17:24
dsneddonsri_, There are some inspection commands that will tell you if you are getting LLDP from the switch. This would allow you to confirm that the ports are plugged into the correct ports on the switch.17:24
dsneddonsri_, You can use "openstack baremetal introspection interface list" and "openstack baremetal introspection interface show" commands17:25
dsneddonsri_, Those commands are run on the undercloud with stackrc authentication file.17:25
EmilienMowalsh_biab: why nova-api takes so much time also, and not other containers?17:25
sri_dsneddon, cool, let me try17:25
*** khyr0n has joined #tripleo17:26
slagleEmilienM: bandini : can you review this https://review.openstack.org/#/c/583017, since you reviewed the child17:27
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take featureset out of TOCI_JOBTYPE  https://review.openstack.org/58238417:27
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take environment_type out of TOCI_JOBTYPE  https://review.openstack.org/58238517:27
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take nodes out of TOCI_JOBTYPE  https://review.openstack.org/58238617:27
openstackgerritRafael Folco proposed openstack-infra/tripleo-ci master: Take periodic and dryrun out of TOCI_JOBTYPE  https://review.openstack.org/58238717:27
slaglemerging anything in t-p-e is broken without the first one17:27
owalsh_biabEmilienM: see https://bugs.launchpad.net/tripleo/+bug/1782598, request timestamps are very unstable. I'd guess load is high at that time17:27
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)17:27
EmilienMslagle: ack, looking now.17:28
EmilienMowalsh_biab: so do you suggest to disable healthcheck for now and see if it was the only container? and in the meantime figure out how to solve this problem17:29
EmilienMowalsh_biab: or total revert of alex's patch?17:29
owalsh_biabEmilienM: total revert would be safest I think17:30
* owalsh_biab afk for a bit17:31
EmilienMmhh17:31
* EmilienM not really happy17:31
jtcressyIs there a minimum storage requirement on overcloud nodes? I just noticed that some of my nodes which have 270GB disk space are deployed via nova/ironic just fine, however nodes which have 135GB disk space outright fail with some sort of block storage error that does a horrible job at describing the true problem.17:32
cmysterjtcressy: as in regular nodes or some specifics needs? IIRC 135GB is way more than the minimal17:35
jtcressyjust regular nodes. Currently all my nodes with 135GB storage are set to compute. I have a single 270GB node set to compute that actually succeeds in deployment. (control and ceph-storage nodes all have 270GB as well)17:36
jtcressythe error also occurs in nova/ironic17:36
*** agopi|food is now known as agopi17:36
cmysterjtcressy: could you run df -h on the nodes that failed?17:37
jtcressyheat requests instance -> nova fails to build instance and never even tells ironic to deploy the server. It just outright fails. It feels like it's failing some sort of check but i don't know what.17:37
jtcressycmyster: cant run df -h if it never powers on the failed nodes17:37
jtcressyalso the disks should be presumed clean on nodes in the "available" provision state17:38
cmysterjtcressy: oh, you said failed to deploy, I assumed it at least passed some stages17:38
jtcressycmyster: nope. nova outright refuses to build the instance almost as quickly as heat requested it.17:39
cmystercould be a thing for ironic team to have a look at...17:39
*** mdnadeem has joined #tripleo17:39
jtcressyi'm on queens release btw. no in-dev stuff going on. ;)17:39
cmysterso queens, trying to deploy with tripleo?17:39
openstackgerritJames Slagle proposed openstack/tripleo-common stable/queens: Include 'tripleo_role_name' in the inventory  https://review.openstack.org/58328717:39
jtcressycmyster: yes. tripleo queens.17:40
jtcressycmyster: here's the brief output in the tripleo UI about the failure: resources.NovaCompute: Went to status ERROR due to "Message: Build of instance edc6a7a4-170c-4cf4-8d00-8f4f03587b77 aborted: Failure prepping block device., Code: 500"17:41
*** yamahata has joined #tripleo17:41
*** pchavva has joined #tripleo17:41
sri_dsneddon, if you have min can you please take look at this http://paste.openstack.org/show/726299/17:42
cmysterjtcressy: introspection passed ?17:42
jtcressyYup17:42
*** chem has quit IRC17:43
jtcressymy instackenv.json actually has old values for the hard drive sizes (270GB) and the introspection updated it to the actual storage size of 135GB. (I moved disks around a few days ago to consolidate what I have into dedicated ceph nodes)17:43
cmysterI re,e,ber seeing that issue17:44
cmysterbut where17:44
cmysterimpi?17:44
cmysterhmmm17:44
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart-extras master: ovb-manage: Allow the use of localhost as undercloud part of the stack  https://review.openstack.org/58404017:45
dsneddonsri_, It's possible that eth2-5 are attached to a switch that is not running LLDP (Link-Layer Discovery Protocol) on those ports.17:45
sri_dsneddon, http://paste.openstack.org/show/726300/17:47
dsneddonsri_, Are eth0/1 attached to a different switch than eth2-5? It definitely looks like LLDP is not running on the 2-5 switchports.17:50
sri_dsneddon, let me find out17:51
pabelangerhave a baremetal question for DIB, anybody in tripleo deal with that before? I'm trying to understand why we need to extract  kernel and initial ramdisk into separate images: http://git.openstack.org/cgit/openstack/diskimage-builder/tree/diskimage_builder/elements/baremetal17:51
*** gfidente|afk has quit IRC17:56
slaglepretty sure it was because that was how Ironic originally required it. it didn't always support whole disk images17:57
*** sshnaidm|rover is now known as sshnaidm|off17:57
pabelangerslagle: in tripleo, are you still doing kernel and ramdisk images or whole disk images17:58
jangutteralso, I don't think dib can generate gpt whole-disk image quite yet, so for EFI boot, I _think_ it's still required. Happy to be proven wrong.17:58
pabelangerhow does it work in OVB jobs today?17:59
pabelangerjangutter: if needed, I think we can find somebody to add it, ianw comes to mind17:59
jangutterpabelanger: I think there's already a task running for it.18:00
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Add sample designate environment for ha  https://review.openstack.org/58402618:00
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates master: Use absolute paths in enable-designate environments  https://review.openstack.org/58402718:00
sri_dsneddon, I've asked my network-admin/boss about switchs he told me "don't get into it, just do what i said"18:00
jangutterpabelanger: https://bugzilla.redhat.com/show_bug.cgi?id=148855718:00
openstackbugzilla.redhat.com bug 1488557 in diskimage-builder "[RFE] diskimage-builder support whole disk images with UEFI whole disk image support for overcloud nodes" [Unspecified,On_dev] - Assigned to yroblamo18:00
sri_dsneddon, very sorry for wasting your time18:00
pabelangerrlandy: weshay: panda: maybe you can answer OVB question above about kernel and ramdisk images18:00
sri_dsneddon, and thank you for your time :)18:01
jangutternear as I can figure, EFI boot _likes_ having the kernel and ramdisk broken out.18:01
bnemecpabelanger: I don't know for sure what ci is doing these days, but the default image build is still using split kernel and ramdisk so ci _should_ be doing that too.18:01
pabelangerjangutter: http://git.openstack.org/cgit/openstack-infra/project-config/tree/nodepool/nb03.openstack.org.yaml#n72 is how we are doing efi images in nodepool, but really don't know much how it works, ianw drove that18:02
*** janki has quit IRC18:03
pabelangerbnemec: okay, that helps. If we added baremetal into nodepool, which I'm just planning now, we still want to use the 3 images?18:03
*** edmondsw has quit IRC18:03
jangutterpabelanger: heh, the "vm" element in my setup expressly set mbr(msdos) partition layout, while EFI kinda requires GPT.18:03
bnemecpabelanger: I would say yes.18:03
pabelangerbnemec: perfect, thanks!18:04
bnemecEven if we changed the default, older releases are still using the split images.18:04
pabelanger++18:04
florianfjtcressy: I think I found something re failing validations: https://review.openstack.org/#/c/565201/318:06
dsneddonsri_, I checked with the engineer who wrote the code for the LLDP data collection, and he said that it also would show that output for eth2-5 if no cable was plugged in to the port. I think that's unlikely, though, since os-net-config would have detected that and thrown an error during deployment.18:06
florianfjtcressy: It's been backported to queens18:06
florianfjtcressy: I can investigate further tomorrwo18:06
florianf*tomorrow18:07
jtcressyflorianf: sounds good.18:07
florianfjtcressy: thanks for the hint18:07
slaglepabelanger: we use the split image with kernel, ramdisk, and a partition image18:08
pabelangerslagle: ack, where can I look to see how that is built today for CI?18:08
slaglequickstart18:09
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332518:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226718:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259818:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)18:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]18:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)18:10
sri_dsneddon, yes cable is conneted in all ports, and os-net-config also runs without any errors, are you saying is it still LLDP issue ?18:10
sri_dsneddon, my deplyment failed trying to ping one of the vlans18:12
*** florianf has quit IRC18:12
*** trown|lunch is now known as trown18:13
sri_dsneddon, this part [u'1000BASE-T fdx'] need show up all Interface right ?18:15
*** salmankhan has quit IRC18:16
*** moshele has joined #tripleo18:16
jtcressySo i'm getting this error regardless of the node's hard drive size now: `Build of instance 5d3fe517-24ac-429a-af98-4289a0a00353 aborted: Failure prepping block device.`18:17
*** thrash|biab is now known as thrash18:17
jtcressythis is ONLY happening on compute nodes. ceph-storage and control are deployed just fine.18:17
openstackgerritAlan Bishop proposed openstack/puppet-tripleo stable/queens: [Ocata,Pike,Queens-Only] Fix Cinder's Netapp backend  https://review.openstack.org/58373418:18
*** moshele has quit IRC18:19
*** panda is now known as panda|off18:20
jtcressyFAILED node: https://hastebin.com/qinofavugo.py18:20
jtcressySuccessful (currently bullding) node: https://hastebin.com/giputuyane.rb18:21
radezEmilienM: that issue we were looking at the other day ended up being a puppet-tripleo issue: https://review.openstack.org/#/c/583900/18:21
radezif you get a min to look at it a review would be welcome :)18:21
EmilienMradez: it's a way to fix this, indeed18:21
EmilienMI'm happy to merge this.18:21
radezEmilienM++ thx!18:22
openstackgerritJohn Trowbridge proposed openstack/tripleo-heat-templates master: Add secondary DNS server to disable-unbound environment  https://review.openstack.org/58216418:22
openstackgerritJohn Trowbridge proposed openstack/tripleo-heat-templates master: Move to openshift-ansible 3.10  https://review.openstack.org/58249518:23
openstackgerritJohn Trowbridge proposed openstack/tripleo-heat-templates master: WIP use openshift-ansible container instead of RPMs  https://review.openstack.org/58386818:23
jtcressyThis is one of the bug reports mentioning my problem but it says it's been fixed and backported to queens? How come I still experience this problem if it was "fixed"?18:24
jtcressyhttps://bugs.launchpad.net/tripleo/+bug/174967118:24
openstackLaunchpad bug 1749671 in tripleo "Overcloud installation fails with "Failure prepping block device." error" [Critical,Fix released] - Assigned to Harald Jensås (harald-jensas)18:24
pabelangerokay, looking more at tripleo-quickstart, is ironic-python-agent seems to be the images for ironic?18:24
*** artom_ has joined #tripleo18:25
jtcressymy current node list: https://hastebin.com/qasikekunu.rb18:27
dsneddonsri_, No, I don't think there is neccessarily an issue with the lack of LLDP data. I was just pointing out that since the switch isn't running LLDP, we won't get any useful troubleshooting data out of the "openstack baremetal introspection interface list|show" commands for those ports.18:27
jtcressythe singular novacompute node still errors out with the "Failure prepping block device" error.18:28
jtcressyAnyone here specialize in nova/ironic?18:28
*** artom has quit IRC18:29
*** med_ has joined #tripleo18:29
*** med_ has quit IRC18:29
*** med_ has joined #tripleo18:29
*** marrusl has joined #tripleo18:29
dsneddonsri_, But without LLDP, the switch is completely a black box.18:29
dsneddonsri_, A few other things to look at. You can run "cat /proc/net/bonding/bond" on the overcloud nodes, which will give you the status of the bond and LACP.18:31
dsneddonsri_, Oops, I meant "cat /proc/net/bonding/bond1"18:31
*** edmondsw has joined #tripleo18:31
*** agurenko has quit IRC18:31
*** mdnadeem has quit IRC18:35
dsneddonsri_, Other thing that could be causing issues: incorrect VLAN trunking configuration on the bond on the switch. This can't be detected by os-net-config, and so the bond gets set up but no traffic flows across the VLANs.18:36
dsneddonsri_, Another thing could be incorrect cabling, so the ports on the switch are actually connected to different servers, rather than all bond slaves being attached to the same server.18:37
dsneddonsri_, Finally, another problem could be native (untagged) vs. trunked (tagged) VLANs. These VLANs should be trunked so they will be tagged on both ends.18:38
dsneddonsri_, You also want to make sure that the VLAN IDs are set correctly. StorageNetworkVlanID and InternalApiNetworkVlanID need to be set correctly in your network-environment.yaml (or set correctly in network_data.yaml if you are using a very recent TripleO version).18:39
*** akhilaki has joined #tripleo18:40
openstackgerritTom Barron proposed openstack/tripleo-heat-templates master: Update manila environment file names  https://review.openstack.org/58370518:46
*** shreshtha has quit IRC18:48
*** bdodd has joined #tripleo18:48
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart-extras master: DNM - Adding patch for reproducer test  https://review.openstack.org/58406518:51
*** moshele has joined #tripleo18:59
*** moshele has quit IRC18:59
*** rpioso|afk is now known as rpioso19:02
*** sri__ has joined #tripleo19:02
*** itlinux has joined #tripleo19:06
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332519:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226719:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259819:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)19:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]19:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)19:10
sri__dsneddon, understood, i will look into all possible scenarios you mentioned, again thank you very much for your help19:11
*** tosky has quit IRC19:16
*** brault has joined #tripleo19:21
*** med_ has quit IRC19:23
*** brault has quit IRC19:25
*** dbecker has joined #tripleo19:25
jtcressySo I've tracked down my block device configuration problem from earlier to a problem between Heat and Nova...19:34
jtcressyIt seems Heat thinks a node exists by a particular UUID and tries to tell nova to deploy using said UUID, but that UUID does not exist as a node in ironic!!!! WTF?19:35
jtcressywhere does heat pull in a list of baremetal nodes from? does it cache a list in its own database? because it is most certainly stale data if it thinks a node still exists after it is LONG gone.19:36
jtcressyin `/var/log/nova/nova-compute` grepping for "ERROR" I find a LOT of 404 errors when nova tries to fetch info for a bare metal node by the aforementioned UUID. I dont know where it's getting this from, but I need to know how to get rid of it so it selects new baremetal nodes and stops using stale UUIDs19:37
jtcressymy deployments are going nowhere with this, as I cannot deploy any compute nodes because of this bizarre problem.19:38
jtcressy2018-07-19 13:12:53.604 10611 ERROR nova.virt.ironic.driver [req-cd5bd1c3-6cc8-4f27-a59a-832cb4649c0d 101ee8edb5b749e9ac95f5ee15333f4d 29ed7702e21e4480a317eb8b03bab387 - default default] [instance: dff12360-ae5e-49f3-bb52-76d3929a05a8] Error preparing deploy for instance dff12360-ae5e-49f3-bb52-76d3929a05a8 on baremetal node a6caba27-c4c0-4e6b-9b92-2fe65fd87410.: NotFound: Node a6caba27-c4c0-4e6b-9b92-2fe65fd87410 could not be found. (HTTP19:41
jtcressy 404)19:41
jtcressyDoes anyone know what might be wrong?19:46
*** noslzzp has joined #tripleo19:49
*** liverpooler has quit IRC19:58
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add override_ansible_cfg  https://review.openstack.org/58408719:59
*** myoung is now known as myoung|biab20:00
*** holser_ has joined #tripleo20:05
openstackgerritwes hayutin proposed openstack/tripleo-quickstart-extras master: update default logging to match upstream  https://review.openstack.org/58408820:06
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add override_ansible_cfg  https://review.openstack.org/58408720:06
*** ooolpbot has joined #tripleo20:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION20:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332520:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226720:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259820:10
*** ooolpbot has quit IRC20:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)20:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]20:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)20:10
*** holser_ has quit IRC20:15
*** holser_ has joined #tripleo20:15
*** dbecker has quit IRC20:15
*** sri__ has quit IRC20:22
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: devmode.sh has been upgraded  https://review.openstack.org/58409720:25
*** dprince has quit IRC20:25
*** pchavva has quit IRC20:28
*** artom_ is now known as artom20:29
*** akhilaki_ has joined #tripleo20:31
*** akhilaki has quit IRC20:33
*** akhilaki has joined #tripleo20:37
*** akhilaki_ has quit IRC20:39
trozetweshay: its a miracle scenario008 passed on queens: https://review.openstack.org/#/c/581790/20:40
trozetEmilienM: ^ so i think we are good now on https://review.openstack.org/#/c/581791/ when you have a minute20:41
EmilienMtrozet: good20:41
*** jroll has joined #tripleo20:42
weshayEmilienM++20:43
jrollEmilienM: I like your last email :)20:43
EmilienMjroll: because it has "edge" in the subject? i know it's how I get people to read my garbage :P20:43
jrollha20:43
jrollbecause it's similar to what I'm working on recently :P20:44
EmilienMjroll: nice, tell me more20:44
jrollEmilienM: not much to say, central DC control plane with remote compute nodes20:45
EmilienMjroll: come help me \o/20:45
jrollEmilienM: well, we aren't tripleo users right now, but this is compelling :)20:45
EmilienMjroll: oh, what do you use?20:46
jrollEmilienM: some homegrown chef stuff at the moment, but this is a new project. have been evaluating OSA for now, mostly because we use a lot of ansible elsewhere20:47
EmilienMjroll: come use tripleo20:48
EmilienMwe are ansible based :D20:48
jrollheh20:48
jrollwait, you are? TIL20:49
EmilienMjroll: mostly. We use a bit of Puppet still for config managment, but most of the orchestration is now done by Ansible.20:49
jrollneat.20:49
EmilienMjroll: https://docs.openstack.org/tripleo-docs/latest/install/advanced_deployment/ansible_config_download.html20:50
jrollthanks, will read up20:50
EmilienMjroll: and we're now using a bunch of ansible roles to deploy our services.20:50
jrollEmilienM: neat, will check it out20:52
EmilienMjroll: i'll save you time, come use tripleo ;-)20:52
* jroll sends EmilienM's boss a letter to promote him to sales20:52
*** lblanchard has quit IRC20:53
jrollEmilienM: I'm hoping to eventually have: 1 API endpoint which connects to X cells, which each control Y sites with compute nodes20:53
jrollor something like that20:53
EmilienMahah no I'm not sales20:53
*** akhilaki has quit IRC20:56
*** abishop has quit IRC20:56
*** raildo has quit IRC20:58
*** raildo has joined #tripleo20:58
*** lifeless has quit IRC21:01
*** akhilaki has joined #tripleo21:02
*** agopi has quit IRC21:05
*** bugzy_ has quit IRC21:05
*** lifeless has joined #tripleo21:06
*** raildo has quit IRC21:06
*** trown is now known as trown|outtypewww21:07
openstackgerritRonelle Landy proposed openstack/tripleo-quickstart-extras master: Include minimal Browbeat playbook in baremetal playbook  https://review.openstack.org/58148821:08
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332521:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226721:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259821:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)21:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]21:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,Triaged] - Assigned to Oliver Walsh (owalsh)21:10
*** slaweq has quit IRC21:10
*** holser_ has quit IRC21:12
*** pradk has quit IRC21:12
EmilienMowalsh_biab: are you going to propose the revert? I'm more in favor of disabling the healthcheck for nova api21:12
EmilienMI'll check that later /me afk21:12
*** bugzy has joined #tripleo21:14
owalsh_biabEmilienM: don't see why nova_api would be the only container affected.... none of the health checks failed, I expect they just timed out21:15
*** myoung|biab is now known as myoung21:16
*** jtcressy has quit IRC21:16
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: Give healthchecks time to stablize before failing the deployment  https://review.openstack.org/58411921:26
*** slaweq has joined #tripleo21:29
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: Give healthchecks time to stabilize before failing the deployment  https://review.openstack.org/58411921:29
*** bfournie has quit IRC21:29
*** owalsh_biab is now known as owalsh21:31
*** akhilaki_ has joined #tripleo21:37
openstackgerritJames Slagle proposed openstack/python-tripleoclient master: Add --override-ansible-cfg  https://review.openstack.org/58412121:38
*** gbarros has joined #tripleo21:41
*** paramite has quit IRC21:42
*** jtcressy has joined #tripleo21:44
jtcressyAnyone have a guide on heat database surgery? Things are *very* broken on my undercloud and I think it's heat's fault.21:45
jtcressyIt keeps trying to deploy a node that doesn't exist instead of picking from my list of current nodes.21:45
jtcressyI've rebooted the undercloud, deleted my plan, refreshed everything and it still has this problem.21:46
openstackgerritJames Slagle proposed openstack/tripleo-docs master: Document --override-ansible-cfg  https://review.openstack.org/58412521:49
*** dtrainor has quit IRC21:51
*** hamzy has quit IRC21:53
*** hamzy has joined #tripleo21:53
openstackgerritJames Slagle proposed openstack/tripleo-common master: Add override_ansible_cfg  https://review.openstack.org/58408721:54
*** gbarros has quit IRC21:56
*** dtrainor has joined #tripleo21:56
*** gbarros has joined #tripleo21:56
*** rcernin has joined #tripleo21:58
*** brault has joined #tripleo21:59
*** jtomasek has quit IRC22:00
*** brault has quit IRC22:03
*** mcornea has quit IRC22:04
openstackgerritMerged openstack/tripleo-heat-templates master: Improve nova statedir ownership logic  https://review.openstack.org/57785522:08
openstackgerritMerged openstack/tripleo-puppet-elements master: Update test-requirements.txt  https://review.openstack.org/58301722:08
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332522:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226722:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259822:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)22:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]22:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,In progress] - Assigned to Oliver Walsh (owalsh)22:10
*** slaweq has quit IRC22:17
*** gbarros has quit IRC22:20
*** gbarros has joined #tripleo22:20
*** ipsecguy has quit IRC22:22
*** itlinux has quit IRC22:22
*** jtcressy has quit IRC22:23
*** jtcressy has joined #tripleo22:26
*** jtcressy has quit IRC22:27
*** edmondsw has quit IRC22:30
*** mjturek has quit IRC22:31
*** edmondsw has joined #tripleo22:31
*** jtcressy has joined #tripleo22:33
*** ipsecguy has joined #tripleo22:34
*** edmondsw has quit IRC22:35
*** gbarros has quit IRC22:41
*** itlinux has joined #tripleo22:42
*** rcernin_ has joined #tripleo22:47
openstackgerritBen Nemec proposed openstack/python-tripleoclient master: Move ironic http boot reno to the correct section  https://review.openstack.org/58415422:48
openstackgerritMerged openstack/puppet-tripleo master: Check for neutron_plugin_ml2_ansible service when including plugin  https://review.openstack.org/58390022:48
openstackgerritMerged openstack/tripleo-heat-templates master: remove scenario005 from experimental  https://review.openstack.org/58368022:48
openstackgerritMerged openstack/tripleo-heat-templates master: Run scenario009 for more services  https://review.openstack.org/58323822:48
*** rcernin has quit IRC22:49
*** lblanchard has joined #tripleo22:58
*** rlandy is now known as rlandy|bbl22:59
*** noslzzp has quit IRC22:59
*** tzumainn has quit IRC23:02
*** ooolpbot has joined #tripleo23:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION23:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/177332523:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178226723:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/178259823:10
*** ooolpbot has quit IRC23:10
openstackLaunchpad bug 1773325 in tripleo "tempest.api.object_storage.test_object_services is failing on scenario002" [Critical,In progress] - Assigned to Arx Cruz (arxcruz)23:10
openstackLaunchpad bug 1782267 in tripleo "Stderr: u'Set Chassis Power Control to Up/On failed: Command not supported in present state\n': ProcessExecutionError: Unexpected error while running command." [Critical,Triaged]23:10
openstackLaunchpad bug 1782598 in tripleo "container health check fails in step 5 on centos-binary-nova-api" [Critical,In progress] - Assigned to Oliver Walsh (owalsh)23:10
*** slaweq has joined #tripleo23:11
*** dhill_ has quit IRC23:14
*** slaweq has quit IRC23:16
*** bfournie has joined #tripleo23:18
*** gbarros has joined #tripleo23:19
*** gbarros has quit IRC23:23
*** gbarros has joined #tripleo23:24
openstackgerritArx Cruz proposed openstack/tripleo-quickstart master: Let's tempestconf tool handle swift related conf  https://review.openstack.org/57322023:31
pabelangerpanda|off: rlandy|bbl: weshay: So, here is a very basic example of how we can get a bmc node from nodepool: https://review.rdoproject.org/r/14768/ looking at tripleo-ci, that seems to be the only thing we do so the image before booting it.23:33
pabelangernext step would be looking at working bmc-template node and seeing what networking is setup, SSH account, etc23:34
pabelangerSSH key can be generated at runtime, like we do with devstack multinode jobs23:34
*** gbarros has quit IRC23:34
pabelangernetworking, more tricky as there are provider networks23:34
pabelangerhowever, one idea would be to generate overlay networks, but not sure how that would look with ironic bits23:35
pabelangerpanda|off: rlandy|bbl: weshay: I think I'd like to learn more about the ipxe-boot image next, looking at http://git.openstack.org/cgit/openstack-infra/tripleo-ci/tree/scripts/prepare-ovb-cloud.sh looks straight forward to create the image23:41
*** khyr0n has quit IRC23:45
*** pmannidi has joined #tripleo23:49
*** rpioso is now known as rpioso|afk23:51
jtcressyTIL If I remove nodes from my undercloud, they will linger in the `compute_nodes` table in the `nova` database and will cause heat/nova/ironic to fail deploying new nodes. I had to delete 43 lines of stale node data from that table. I'm beginning another deploy now and hopefully this will let me get past the problem i've been experiencing for the past two days. More details about this upon request!23:52
*** akhilaki__ has joined #tripleo23:59

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!