Friday, 2017-03-24

*** flepied has quit IRC00:05
*** dparkes has quit IRC00:08
*** ooolpbot has joined #tripleo00:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION00:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468100:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477000:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495500:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517400:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]00:10
*** ooolpbot has quit IRC00:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]00:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)00:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]00:10
*** pkovar has joined #tripleo00:14
dmsimardrlandy|afk: I'm back, did you manage to find anything ?00:16
openstackgerritMerged openstack/diskimage-builder master: Typo fix: curent => current  https://review.openstack.org/44896600:16
*** Goneri has quit IRC00:17
*** Goneri has joined #tripleo00:19
*** tbonds has quit IRC00:19
*** flepied has joined #tripleo00:25
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: WIP, make script work in CI and interactive  https://review.openstack.org/44937000:30
*** limao has joined #tripleo00:30
*** rlandy|afk is now known as rlandy00:36
*** tbonds has joined #tripleo00:55
openstackgerritMerged openstack/diskimage-builder master: functests: skip qcow2 generically but add specific test  https://review.openstack.org/44883701:07
*** japestinho has quit IRC01:46
*** japestinho has joined #tripleo01:47
*** rlandy has quit IRC01:51
*** apevec has quit IRC01:56
*** nyechiel has joined #tripleo02:11
*** dmacpher-afk is now known as dmacpher02:20
*** pkovar has quit IRC02:27
*** fzdarsky_ has joined #tripleo02:29
*** fzdarsky|afk has quit IRC02:32
*** gkadam has joined #tripleo02:35
*** limao has quit IRC02:36
*** limao_ has joined #tripleo02:36
*** tbarron has quit IRC02:38
*** yamahata has quit IRC02:43
*** tbarron has joined #tripleo02:48
*** nyechiel has quit IRC02:58
*** ramishra has joined #tripleo03:12
openstackgerritMerged openstack/puppet-pacemaker master: Update test-requirements.txt  https://review.openstack.org/44895303:25
*** psahoo has joined #tripleo03:29
*** gbarros has quit IRC03:47
*** yamahata has joined #tripleo03:47
*** limao_ has quit IRC04:09
*** limao has joined #tripleo04:10
*** limao has quit IRC04:14
*** links has joined #tripleo04:20
*** ratailor has joined #tripleo04:40
*** limao has joined #tripleo04:43
openstackgerritMerged openstack/diskimage-builder master: Use correct Ubuntu distro url on non-x86 arches  https://review.openstack.org/44884804:43
*** skramaja has joined #tripleo04:44
*** janki has joined #tripleo04:53
*** radeks has joined #tripleo04:55
*** udesale has joined #tripleo04:57
*** fragatin_ has joined #tripleo05:01
*** fragati__ has joined #tripleo05:03
*** fragatina has quit IRC05:05
*** fragatin_ has quit IRC05:06
*** fragati__ has quit IRC05:07
*** udesale has quit IRC05:11
*** udesale__ has joined #tripleo05:11
*** pgadiya has joined #tripleo05:18
*** udesale__ has quit IRC05:20
*** udesale has joined #tripleo05:20
*** udesale has quit IRC05:21
*** fragatina has joined #tripleo05:21
*** udesale has joined #tripleo05:22
*** fragatina has quit IRC05:25
*** prateek has joined #tripleo05:31
*** fragatina has joined #tripleo05:33
*** fragatina has quit IRC05:33
*** fragatina has joined #tripleo05:34
*** masco has joined #tripleo05:34
*** masco has quit IRC05:51
*** mdnadeem has joined #tripleo05:53
*** yprokule has joined #tripleo05:53
*** iranzo has joined #tripleo06:00
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Create PReP boot partition for PPC  https://review.openstack.org/44773906:10
*** aufi has joined #tripleo06:18
*** lmiccini has joined #tripleo06:41
*** dparkes has joined #tripleo06:47
*** karimb has joined #tripleo06:51
*** karimb has quit IRC06:51
bandinimorning07:06
*** ealcaniz has joined #tripleo07:13
cschwedeGood Morning! This fix needs only a +A https://review.openstack.org/#/c/448208/ - would be great so I can propose a backport to stable/ocata soon07:14
bandinicschwede: done07:15
cschwedebandini: \o/ thx!07:15
openstackgerritMichele Baldessari proposed openstack/puppet-tripleo master: Qpid dispatch router puppet profile  https://review.openstack.org/42571007:15
bandinichem: https://review.openstack.org/#/c/446893/ is this one okay for you?07:19
*** pmannidi has quit IRC07:21
openstackgerritLuke Hinds proposed openstack/tripleo-heat-templates master: SSHD Service extensions  https://review.openstack.org/44462207:22
*** pmannidi has joined #tripleo07:28
openstackgerritLuke Hinds proposed openstack/tripleo-heat-templates master: SSHD Service extensions  https://review.openstack.org/44462207:30
*** tesseract has joined #tripleo07:31
*** pmannidi has quit IRC07:32
chembandini: looking07:39
*** jprovazn has joined #tripleo07:40
bandinichem: good morning, sir!07:43
chembandini: good morning07:44
matbubandini: chem bonjour07:44
bandinibonjour matbu!07:44
chemmatbu: buongiorno07:45
*** jaosorior has joined #tripleo07:55
*** cylopez has joined #tripleo07:55
*** ccamacho has joined #tripleo07:58
*** zzzeek has quit IRC08:00
openstackgerritAdriano Petrich proposed openstack/tripleo-common master: add caching the GetParametersAction  https://review.openstack.org/44422008:00
*** zzzeek has joined #tripleo08:01
*** percevalbot has quit IRC08:02
*** bogdando has joined #tripleo08:03
*** jlinkes has joined #tripleo08:05
d0ugalHow is CI today?08:07
*** yamahata has quit IRC08:07
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart master: Download overcloud_release rpm for mixed upgrade  https://review.openstack.org/44934908:08
*** shardy_afk is now known as shardy08:09
*** florianf has joined #tripleo08:09
*** ooolpbot has joined #tripleo08:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION08:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468108:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477008:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495508:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517408:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]08:10
*** ooolpbot has quit IRC08:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]08:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)08:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]08:10
jaosoriord0ugal: I guess that's the answer ^^08:10
d0ugaljaosorior: yikes08:11
*** percevalbot has joined #tripleo08:11
* d0ugal attempts to understand them08:11
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart master: Download overcloud_release rpm for mixed upgrade  https://review.openstack.org/44934908:16
ccamachojaosorior d0ugal :( CI is having a big Influenza shoot..08:16
d0ugalccamacho: no kidding, I am finding it harder to help as I am less familiar with quickstart too08:17
d0ugalI need to get up to speed.08:17
jaosoriorccamacho: yeah dude, it's been a rough week08:17
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates master: WIP DO NOT MERGE Move rabbitmq behind haproxy  https://review.openstack.org/39011408:18
d0ugalIt would be awesome if somebody did a "debugging quickstart CI deep dive" - or maybe something like this exists?08:18
bandinid0ugal: +108:18
jaosoriornot that I know of08:18
jaosoriorI'm acquainted with the classic bash-based CI08:18
jaosoriorbut with this quickstart move I'm pretty lost08:18
remix_tjhi guys, i've a failing gate saying this: http://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/logs/undercloud/home/jenkins/overcloud_validate.log.txt.gz08:19
d0ugaljaosorior: yeah, same, I was never that good at understanding/helping but now I am terrible lol08:19
bandiniremix_tj: looking08:19
ccamacho:( ya, with oooq, e.g. if you use --config it won't append your yaml.. that option is not working.. :P And I don't know why I have to launch the deploy command twice to get it working :P08:19
bogdandoo/08:19
ccamacho+1 to that deep dive session08:19
bogdandoplease merge https://review.openstack.org/#/c/444308/08:19
shardy+1 also, that is a good idea08:19
d0ugal /cc trown|outtypewww ^ :)08:19
bogdandoand https://review.openstack.org/#/c/445883/ please08:20
jaosoriord0ugal: wouldn't the overcloudrc be generated by a mistral workflow nowadays?08:20
remix_tjbandini: ansible play says that's a single failure http://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/console.html#_2017-03-24_07_52_56_11184708:20
d0ugaljaosorior: yeah, it is08:20
bogdandoone more https://review.openstack.org/#/c/448294/ . thank you!08:20
*** janki has quit IRC08:20
jaosoriord0ugal: remix_tj is seeing a successful overcloud deploy, and when validation is tried out, it fails cause it doesn't find the overcloudrc :/08:21
jaosoriorremix_tj: what's the patch?08:21
remix_tjhttps://review.openstack.org/44688708:21
*** mhenkel has quit IRC08:21
ccamachobogdando, check the last command for #/c/44829408:21
ccamachosorry ^ 44588308:21
bandiniremix_tj: hohum that's a weird one08:22
remix_tjI've been rechecking this patch and all its backports since the 17th, CI was quite weird this week08:22
bandinimaybe d0ugal can help as to why overcloudrc is missing08:22
bogdandoccamacho: missed that, thanks!08:22
jaosoriord0ugal: what's the name of the workflow? we should find it having been executed in the mistral logs, right? http://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/logs/undercloud/var/log/mistral/08:22
* d0ugal looks at the lods08:22
d0ugallogs*08:23
d0ugaljaosorior: it isn't a workflow, just an action call08:23
*** amoralej|off is now known as amoralej08:23
bogdandoand these two require some review from oooq folks please https://review.openstack.org/#/c/447409/ https://review.openstack.org/#/c/447000/08:23
d0ugaland it is called...08:23
*** mhenkel has joined #tripleo08:23
d0ugaljaosorior: https://github.com/openstack/tripleo-common/blob/master/setup.cfg#L7108:23
d0ugaljaosorior: I don't see the action mentioned in the logs at all08:24
jaosoriorfunky08:24
jaosoriormandre, jistr: Hey guys, could you check these out https://review.openstack.org/#/q/status:open+project:openstack/tripleo-heat-templates+branch:master+topic:keystone-fernet-docker ?08:25
d0ugalhttp://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/logs/undercloud/var/log/mistral/executor.log.txt.gz#_2017-03-24_07_21_51_06108:26
d0ugalThat is the last log entry matching "tripleo_common"08:26
d0ugalso nothing in Mistral is started after the deploy.08:26
d0ugalWhere can I see the CLI output?08:27
*** leanderthal|afk is now known as leanderthal08:27
*** leanderthal is now known as leanderthal|afk08:28
openstackgerritBogdan Dobrelya proposed openstack/tripleo-heat-templates master: Rework container volumes as hostpath mounts  https://review.openstack.org/44851008:29
jaosoriord0ugal: http://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz08:29
d0ugaljaosorior: weird, that doesn't get to the end of the deploy command.08:31
d0ugalWhere is https://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/v1/overcloud_deploy.py#L1170-L117108:31
jaosoriorbut it passed O_o08:31
d0ugaloh man, I have a theory08:32
* d0ugal digs08:32
bogdandoneed some help with solving this puzzle http://logs.openstack.org/43/448543/3/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/9ece507/console.html (RC of failure?)08:33
bogdandoso overcloud-validate had failed, where are logs?..08:34
*** social has quit IRC08:34
jaosoriord0ugal: in the same directory as the deploy logs I showed you08:35
d0ugaljaosorior: was that for bogdando ?08:35
*** stendulker has joined #tripleo08:35
jaosoriord0ugal: http://logs.openstack.org/87/446887/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/cf58322/logs/undercloud/home/jenkins/overcloud_validate.log.txt.gz08:35
bandinibogdando: http://logs.openstack.org/43/448543/3/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/9ece507/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz08:35
jaosoriord0ugal: oh. In weechat both your handles are green. Got confused hahaha08:35
d0ugal:)08:36
bandinibogdando: the reason it went to do the validate even though the deploy failed is bug 167495508:36
openstackbug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] https://launchpad.net/bugs/1674955 - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)08:36
bogdandobandini: how are those related? so I could find it next time?08:36
bogdandoI mean, how could I follow the build log to find it08:36
jaosoriorbogdando: when is that run from?08:36
jaosoriorbogdando: the ssl_depth issue should have been fixed already08:36
bandinibogdando: well I am not super expert after the oooq move, but in general those commands are run from the undercloud, so the undercloud/home/jenkins folder is the one with the most interesting logs most of the time08:37
bogdandojaosorior: https://review.openstack.org/#/c/448543/3 I'm only learning how to find failures in CI builds, not doing the RCA yet )08:37
bogdandoso I need to know how to locate that failed check log08:37
jaosoriorRCA?08:37
bogdandou, root cause08:37
bogdandoum*08:38
jaosoriorI need coffee08:38
bogdandoand a=analysis :)08:38
bogdandobandini: oh, thank you!08:38
bandinibogdando: so now that we know the deployment failed (as opposed to the validation), I usually hop on the nodes that failed and check /var/log/messages08:39
bandinibogdando: in your case the culprit is http://logs.openstack.org/43/448543/3/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/9ece507/logs/subnode-1/var/log/messages.txt.gz#_Mar_22_13_34_0808:39
bogdandoI need to raise my kibana unarmed fighting skill08:40
bandiniahahahah08:40
bogdandobtw, in fuel I used to search for errors with some grep and perl magic08:40
*** milan has joined #tripleo08:40
bogdandoit served well for years08:41
bogdandoneed to find something similar here as well08:41
bogdandohttps://github.com/bogdando/fuel-log-parse :)08:41
bandiniyeah something along those lines would be useful for ooo as well08:42
bogdandoit looks ugly but works08:42
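The grep-style triage bogdando describes could be sketched roughly as below. This is not fuel-log-parse or any TripleO tool: the keyword list and the `find_error_lines` / `count_by_keyword` helpers are hypothetical, and the patterns would need tuning for real CI logs.

```python
import re
from collections import Counter

# Hypothetical keyword list; adjust it for the logs being inspected.
ERROR_PATTERN = re.compile(r"\b(ERROR|CRITICAL|Traceback|FAILED)\b")


def find_error_lines(lines):
    """Return (line_number, text) pairs for lines matching ERROR_PATTERN."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if ERROR_PATTERN.search(line)
    ]


def count_by_keyword(lines):
    """Count matching lines per keyword (first keyword found on each line)."""
    counts = Counter()
    for line in lines:
        match = ERROR_PATTERN.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    # Made-up sample lines, not taken from any real job.
    sample = [
        "INFO starting step\n",
        "ERROR step failed\n",
        "Traceback (most recent call last):\n",
    ]
    for number, text in find_error_lines(sample):
        print(number, text)
```

Feeding it the lines of a downloaded log file (for example via `open(path)`) gives a quick list of places to start reading, plus rough counts per keyword.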
d0ugalI am 100% confused. Something causes the deploy command to exit early, but with a status code 0.08:42
bandiniwe used to have a postci.txt that contained the proper error most of the time, I think we are working on reinstantiating it in oooq08:42
bandiniiirc I saw some bug/review flying by about it08:43
jaosoriord0ugal: maybe this https://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/v1/overcloud_deploy.py#L1147 ?08:43
jaosoriorbandini: postci still exists, it's just in a different directory08:43
d0ugaljaosorior: I checked that, it should be false - it isn't passed at least.08:43
jaosoriordafuq08:43
d0ugaljaosorior: but we should maybe add a print for that branch, just to be sure08:43
bandinijaosorior: oh!?08:43
*** jpena|off is now known as jpena08:44
jaosoriorbandini: http://logs.openstack.org/43/448543/3/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/9ece507/logs/undercloud/var/log/postci.txt.gz08:44
jaosoriorit's in /var/log/ in the undercloud08:44
bandinioh blimey08:44
bandinijaosorior: thanks, that will save me some time!08:44
jaosoriorI was also very confused and started ranting about it the other day08:44
jaosoriorso I was pointed to it08:45
bandinieheh well good to know!08:46
jaosorioranyway, I think it would still be better to move that from there, to the place where it was before08:46
jaosoriorit's a bit more intuitive08:46
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: Default --update-plan-only to False  https://review.openstack.org/44949108:46
bogdandobandini: is it possible to get all build artifacts in a tarball?08:46
bogdandoto grep logs locally, or yes, kibana stuff is good (but slow I'm afraid)08:47
jaosoriord0ugal: not sure if the default=False is really necessary08:47
jaosoriord0ugal: but the print sure helps08:47
bandinibogdando: dunno tbh. you mean just the env files?08:47
bogdandothis should rather be asked of the infra folks... but maybe someone ate that dog already! :)08:47
d0ugaljaosorior: agreed, but I figured explicit is better etc. :)08:47
bandinibogdando: I guess it can all be reconstructed by looking at the logs08:47
bandinibut yeah having it all in a single place would be nice08:48
bogdandobandini: yeah, my concern is to grab everything the failed job has produced and inspect it locally with perl magic08:48
bandiniyeah08:49
bogdandothen to add yet another file to my repo :)08:49
bandinieheh08:49
bogdandoso maybe there is some url to fetch all...08:49
* bogdando gonna ask infra folks08:50
bogdandowget recursive to the rescue08:51
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: Apply setfiles on all mountpoints  https://review.openstack.org/44707608:52
shardybogdando: sec, there's a script that does that in tripleo-ci08:56
bogdandoshardy: right in time, thank you! :)08:56
shardyhttps://github.com/openstack-infra/tripleo-ci/blob/master/scripts/getthelogs08:57
*** athomas has joined #tripleo08:57
*** jpena is now known as jpena|off08:57
shardybogdando: that may help ^^ ?08:57
bogdandoyes, thanks08:57
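For fetching a job's logs for local inspection, one possible first step is listing the links in a directory index page. The sketch below is not the getthelogs script and does not download anything; it assumes the log server returns a plain HTML directory listing, and `LinkCollector` and `list_log_links` are hypothetical names.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs for the relative links in an index page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Skip sort links ("?..."), absolute paths and parent-directory links.
        if href and not href.startswith(("?", "/", "..")):
            self.links.append(urljoin(self.base_url, href))


def list_log_links(index_html, base_url):
    """Return the child URLs found in one directory index page."""
    collector = LinkCollector(base_url)
    collector.feed(index_html)
    return collector.links
```

Links ending in `/` would be subdirectories to visit in turn; the rest are files that could be fetched and then grepped locally.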
*** zoli|gone is now known as zoli08:58
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart master: Download overcloud_release rpm for mixed upgrade  https://review.openstack.org/44934908:58
*** milan has quit IRC09:00
*** mcornea has joined #tripleo09:03
*** jpena|off is now known as jpena09:04
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-ui master: Imported Translations from Zanata  https://review.openstack.org/44950709:06
bogdandoshardy: hm, not sure if that works: "while in the tmp directory run "getthelogs" with no params09:09
bogdandoto download any log files you hadn't previously downloaded"09:09
shardybogdando: Hmm, OK I've not used it in a while, I was just aware it existed09:10
shardyI don't see where but perhaps it's been broken by the move to quickstart and/or multinode jobs09:10
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: WIP: Add lvm management to diskimage-builder  https://review.openstack.org/44440309:11
*** jlinkes has quit IRC09:12
*** suuuper has joined #tripleo09:12
bogdandofolks, is elastic-recheck still in use?09:13
bogdandoI've read about that, it looks cool09:13
bogdandobut f.e. I can't find the aforementioned bug in the https://git.openstack.org/cgit/openstack-infra/elastic-recheck/tree/queries list09:13
bogdandohttps://www.elastic.co/blog/openstack-elastic-recheck-powered-elk-stack09:13
shardyhttp://status.openstack.org/elastic-recheck/09:13
shardybogdando: I think so, but we've not really been making use of it for TripleO09:13
bogdandoI see. But the thing is just great, nothing to say more09:14
*** udesale has quit IRC09:15
shardybogdando: yeah, I think it could be useful, we've discussed it in the past, but nobody has had time to really push on getting regular queries added for bugs09:15
bogdandohehe. This could be some bot using LP tags :) a nice exercise09:16
bogdandoso someone only has to merge them09:16
bogdandopatches09:16
shardybogdando: yeah, I think the problem is/was that it takes human analysis of logs to figure out the query and propose it09:16
shardywhat would be really cool is to have a special comment format, so when raising a bug you could do e.g "ER_QUERY=foo"09:17
shardyor something09:17
bogdandoyeah, you're right. At google, for example, they consider that things went really bad if SREs ever have to read logs.09:17
shardyso maybe a bot could help there09:17
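The special comment format shardy suggests could be picked up by a bot along these lines. This is only a sketch of the idea: the `ER_QUERY=` marker comes from the suggestion above, but no such convention is known to exist in Launchpad or elastic-recheck, and `extract_queries` is a hypothetical helper.

```python
import re

# Hypothetical marker: one "ER_QUERY=<query>" per line in the bug description.
MARKER = re.compile(r"^\s*ER_QUERY=(.+?)\s*$", re.MULTILINE)


def extract_queries(bug_description):
    """Return every query string tagged with ER_QUERY= in a bug description."""
    return MARKER.findall(bug_description)


if __name__ == "__main__":
    # Made-up bug text for illustration only.
    text = 'Deploy passes but validation fails.\nER_QUERY=message:"overcloudrc"\n'
    print(extract_queries(text))
```

A bot could then turn each extracted string into a proposed elastic-recheck query for a human to review and merge.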
bogdandoit eats time09:18
shardybogdando: yeah, our CI is not really at google levels of automation at this point ;)09:18
bogdandowe will make it so!09:18
bogdandoand even better09:18
shardyIt's something to aim for, certainly ;)09:18
shardyOne issue we have in TripleO is that we're not in the gate of every other project, so we deal with a large number of regressions, which makes automated analysis/recovery harder09:19
*** limao has quit IRC09:21
bogdandoright. Although having a dummy bot that greps for errors and submits patches to elastic-recheck could perhaps help to offload ppl here. At least we will have frequencies for each type of error09:24
bogdandocuz each failure will be linked to some bug, IIUC how it works09:24
bogdandoso it will be semi automatic. A person puts a tag to a known bug and here it is - each related error is linked to that bug09:25
bogdandobut it's easy to say, not to do...09:26
jaosoriormandre: by the way, the bind-mount patch depends on the one setting fernet as the default09:26
jaosoriorso if you want you could try those two together09:26
bandinid0ugal, remix_tj: fwiw I just spotted another place where we fail the validate due to no overcloudrc but deploy seemed to have succeeded http://logs.openstack.org/19/447319/2/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/5ea42f4/logs/undercloud/home/jenkins/overcloud_validate.log.txt.gz09:26
d0ugalbandini: thanks09:27
d0ugalapetrich: ^ another one.09:27
bandinid0ugal: I'll start by opening a bug to track this09:27
mandrejaosorior: yep, I'm testing the series09:27
remix_tjbandini: yep, i wait for other news09:28
d0ugalbandini: good idea. I am still looking into it. The weirdest bug I have seen in a while :)09:28
bandinid0ugal: gotta love fridays ;)09:28
d0ugalhaha09:28
apetrichbandini, best bugs best days09:28
bandinilol09:28
*** lucas-afk is now known as lucasagomes09:29
remix_tjI suggest you stop touching things on fridays after 3pm local time. Here, touching anything after that time leads to an interesting weekend09:29
*** flepied has quit IRC09:29
*** dbecker has joined #tripleo09:30
remix_tj(anyway, I still can't start my oooq environment)09:31
bandinid0ugal, apetrich: https://bugs.launchpad.net/tripleo/+bug/1675709 to track it09:31
openstackLaunchpad bug 1675709 in tripleo "deploy succeeded but no overcloudrc was generated" [High,Triaged]09:31
d0ugalbandini: I was hoping the title would be "Weirdest bug in a while" :P09:32
d0ugalthanks!09:32
bandiniahahah09:33
apetrichd0ugal, ok linking my bug to that09:33
*** derekh has joined #tripleo09:33
*** jrist has quit IRC09:34
openstackgerritYurii Prokulevych proposed openstack/tripleo-heat-templates master: Run cluster check on nodes configured in wsrep_cluster_address.  https://review.openstack.org/44915409:35
openstackgerritThomas Herve proposed openstack/tripleo-heat-templates master: Remove yaql call when building logging_groups  https://review.openstack.org/44760509:36
openstackgerritKarthik S proposed openstack/puppet-tripleo master: vhostuser socket dir shall be created for vhostuserclient mode  https://review.openstack.org/44953009:38
*** janki has joined #tripleo09:38
*** deadnull has joined #tripleo09:43
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo master: Ensure directory exists for certificates for httpd  https://review.openstack.org/44953609:44
*** akrivoka has joined #tripleo09:45
*** fzdarsky_ is now known as fzdarsky09:46
*** snecklifter has joined #tripleo09:48
*** gkadam has quit IRC09:51
*** skramaja_ has joined #tripleo09:51
*** ckyriakidou has joined #tripleo09:53
*** dixiaoli has joined #tripleo09:53
*** skramaja has quit IRC09:54
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: WIP: Add lvm management to diskimage-builder  https://review.openstack.org/44440309:54
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: Use stevedore for module config of block device  https://review.openstack.org/44709009:54
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: Refactor: block-device filesystem creation, mount and fstab  https://review.openstack.org/44458609:54
openstackgerritLuke Hinds proposed openstack/tripleo-specs master: blueprint for TCP Wrapper Service  https://review.openstack.org/44121109:57
*** flepied has joined #tripleo09:59
openstackgerritKarthik S proposed openstack/puppet-tripleo master: vhostuser socket dir shall be created for vhostuserclient mode  https://review.openstack.org/44953010:00
openstackgerritMerged openstack/puppet-tripleo stable/ocata: Fixes issues with raising mysql file limit  https://review.openstack.org/44751510:01
openstackgerritLuke Hinds proposed openstack/tripleo-specs master: blueprint for TCP Wrapper Service  https://review.openstack.org/44121110:02
*** karthiks is now known as karthiks_afk10:02
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: Refactor: block-device filesystem creation, mount and fstab  https://review.openstack.org/44458610:02
*** pcaruana has joined #tripleo10:05
openstackgerritLuke Hinds proposed openstack/tripleo-heat-templates master: Extends audit serivce  https://review.openstack.org/44480410:07
d0ugalDo we have a CI run that passed recently?10:07
d0ugaloh, cistatus. oops10:08
*** Vijayendra has quit IRC10:09
d0ugalbandini, apetrich: I think this is the problem: http://logs.openstack.org/19/447319/2/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/5ea42f4/logs/undercloud/var/log/mistral/engine.log.txt.gz#_2017-03-24_07_57_11_45510:09
d0ugalI don't know what it means10:09
*** salmankhan has joined #tripleo10:09
d0ugalbut I think it is related to these timeouts10:09
d0ugalI just checked a couple of jobs that passed, they don't have these.10:09
*** skramaja_ is now known as skramaja10:10
d0ugalso my guess is that tripleoclient calls the overcloudrc action and it just gets stuck.10:10
*** ooolpbot has joined #tripleo10:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION10:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468110:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477010:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495510:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517410:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]10:10
*** ooolpbot has quit IRC10:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]10:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)10:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]10:10
d0ugal.... I don't know why it doesn't error tho'10:10
bandinid0ugal: ha, interesting10:13
apetrichd0ugal, I'm looking at the other fail example that I have and I see those timeouts also10:14
shardyjtomasek: Hi!  Sorry, another tripleo-ui related question :)10:15
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: WIP: Add lvm management to diskimage-builder  https://review.openstack.org/44440310:15
shardyjtomasek: in tripleoclient, we have a validation which ensures NtpServer is set when ControllerCount > 110:15
shardyI don't see that in tripleo-common, does the UI do that same validation?10:15
d0ugalbandini, apetrich: I actually saw them in my environment the other day - but thought it was a one-off10:16
d0ugalbandini, apetrich: re-creating it at the moment, so maybe I'll hit it again.10:16
bandiniack10:18
openstackgerritAlfredo Moralejo proposed openstack-infra/tripleo-ci master: [DNM] Adding yum debug in oooq playbook  https://review.openstack.org/44954810:20
matbucan someone add workflow on https://review.openstack.org/#/c/448274/10:25
matbuplz10:25
*** social has joined #tripleo10:26
*** zoli is now known as zoli|lunch10:27
openstackgerritBogdan Dobrelya proposed openstack-infra/tripleo-ci master: Adapt getthelogs UX for more use cases  https://review.openstack.org/44955210:27
bogdandobandini, shardy: https://review.openstack.org/44955210:28
jtomasekshardy: not really afaik10:29
openstackgerritSteven Hardy proposed openstack/tripleo-image-elements master: 51-hosts fails if given lots of changes  https://review.openstack.org/44919810:30
bogdandomandre: ^ ^10:30
jtomasekshardy: how is that validation run in tripleoclient?10:31
shardyhttps://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/v1/overcloud_deploy.py#L22710:32
shardyjtomasek: it just looks at the merged environment before calling heat10:32
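A minimal sketch of the kind of check shardy describes, assuming the merged environment has already been reduced to a flat dict of parameters. The function name and the error wording are hypothetical; this is not the actual tripleoclient code.

```python
def check_ntp_server(parameters):
    """Return a list of error strings; an empty list means the check passed.

    `parameters` is assumed to be a flat dict of merged parameter values
    (hypothetical input shape, for illustration only).
    """
    controller_count = int(parameters.get("ControllerCount", 1))
    ntp_server = parameters.get("NtpServer")
    # An unset value, an empty string, or an empty list all count as "not set".
    if controller_count > 1 and not ntp_server:
        return ["NtpServer must be set when ControllerCount > 1"]
    return []
```

Because the check only reads two values from the plan data, it needs nothing beyond the parameters themselves, which is the point made about not needing ansible for it.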
shardyjtomasek: I'd like to remove that, but I think we need to add something similar to either t-h-t or tripleo-common first10:33
jtomasekshardy: I could see it as a normal pre-deployment validation in tripleo-validations.10:33
shardyjtomasek: can't those happen before the plan is created?10:33
jaosoriormandre: yay, thanks for checking it out10:34
shardyjtomasek: but yeah, we need to find a better place for this, I want to remove it from tripleoclient10:34
jtomasekshardy: good question, tripleo-validations probably have access just to plan in swift10:34
shardyjtomasek: Ok, I think I'll leave it here for now, with a FIXME so we can work out removing it later10:35
jaosoriorbandini: sshnaidm|off posted a patch to get back postci.log :D10:35
jaosoriorbandini: https://review.openstack.org/#/c/448135/10:35
shardyjtomasek: what I would like is a plan update workflow, which includes any validation required on the data10:35
jtomasekshardy: ack, I think having it as pre-deployment validation in tripleo-validations should be fine. Filing it as bug would be helpful I think10:35
shardyjtomasek: which could include specific parameters like this, but also e.g dependencies described in the capabilities map10:35
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci master: DO NOT MERGE: Testing novajoin authtoken  https://review.openstack.org/44634810:36
shardyjtomasek: Ok, FWIW I don't really see the value in having ansible do this, I'd probably prefer something in tripleo-common10:36
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci master: DO NOT MERGE: Testing ensure dir for httpd certs  https://review.openstack.org/44634810:36
shardybut I'll raise a bug and we can discuss it, thanks!10:36
jtomasekshardy: so that plan update workflow would get a list of validations as input?10:37
jtomasekshardy: oh, I see, the validation would be just part of the plan update workflow...10:38
shardyjtomasek: No, it'd get a list of environment files, then a validation would automatically run that fails the plan update if the data is bad10:38
shardyjtomasek: yeah10:38
shardyjtomasek: vs the action we currently have that just enables environments10:38
*** tosky has joined #tripleo10:38
jtomasekshardy: in case of the parameters you mentioned, we would want the update to happen anyway because user just selects environments but he is still able to update the parameters later10:40
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: Change the directory for httpd certs/keys to be service-specific  https://review.openstack.org/44955810:40
shardyjtomasek: I think it'd still be the same plan_update workflow, but perhaps we'd have a validate=False option10:41
jtomasekshardy: that is why I see  it fit as separate pre-deployment validation. Although I am aware it is a bit complicated for tripleoclient workflow as it does everything at once and10:41
*** nyechiel has joined #tripleo10:41
shardyjtomasek: I think we need to split it into two workflows, one updates the plan (including all operations related to environment files, and that includes parameters)10:41
shardythe other just deploys the plan10:41
shardyright now we have those two steps mixed up, despite making them separate in both Ux's10:42
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci master: DO NOT MERGE: Testing change dir for httpd certs/keys  https://review.openstack.org/44956010:42
jtomasekshardy: yes, that's what UI does anyway10:42
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: WIP - Add upgrade_batch_tasks to neutron-l3-agent  https://review.openstack.org/44549410:42
openstackgerritJiri Stransky proposed openstack/tripleo-quickstart-extras master: Upgrade to containerized overcloud  https://review.openstack.org/44857610:43
jtomasekshardy: only reason why I mention tripleo-validations is that we already have this concept and introducing different kind of validations makes things more confusing10:43
shardyjtomasek: sure, well FWIW I think all tripleo validations should be described in terms of mistral workflows, even if those then run ansible10:44
jtomasekshardy: it would be possible to run tripleo-validation workflow as a subworkflow of plan update workflow btw.10:44
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Clone projects if they aren't cloned by ZUUL  https://review.openstack.org/44956210:44
shardyjtomasek: I don't want to be forced to run mistral->ansible->swift when I can just do mistral->swift10:45
jtomasekshardy: +110:45
jtomasekshardy: agree10:45
*** gaurangt has joined #tripleo10:45
shardyjtomasek: Ok, thanks, I guess we have some work to do here, I'll raise a bug to track it10:46
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart-extras master: Clone projects if they aren't cloned by ZUUL  https://review.openstack.org/44956210:46
jtomasekshardy: thanks for bringing this up10:46
jtomasekshardy: btw. is there progress on flattening parameters in tripleo-common? I am just curious. It is all fine if it is deferred due to low priority:)10:47
shardyjtomasek: I think skramaja was planning to work on it, but I'm not aware of any patches yet10:48
florianfshardy, jtomasek: I think we should be careful not to spread validation logic over multiple projects.10:48
shardyjtomasek: I suspect we can do it with yaql in a mistral workflow10:48
jtomasekshardy: ok10:49
shardyflorianf: I agree, but I think tripleo-validations started as kind of a "bolt on" addition to tripleo10:49
shardyI'd like to see the validations properly integrated into our architecture10:49
shardyand we can't do that if every kind of validation must be done only via ansible10:50
*** athomas has quit IRC10:50
d0ugalshardy: can you share the bug with me when you open it?10:50
* skramaja reading through10:50
shardyIMHO mistral makes a much more flexible integration point, and it fits well with our current architecture10:50
shardyeven if a bunch of validations still use ansible10:50
*** jprovazn has quit IRC10:51
d0ugalMakes sense. Ansible would be overkill for some simple validations10:51
shardyyeah, all I want is to check two values in the plan10:51
jtomasekflorianf:, shardy: so we have means to run tripleo-validations as part of mistral workflow. next good step could be that a tripleo-validation could run any code, not just ansible10:51
d0ugalshardy: Have you seen these? https://review.openstack.org/#/q/topic:validations-in-workflows10:51
*** athomas has joined #tripleo10:51
jtomasek(I am not entirely sure how it is implemented atm)10:51
*** athomas has quit IRC10:52
florianfjtomasek: We already have a bunch of validations that extend ansible via custom ansible modules.10:52
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: Apply setfiles on all mountpoints  https://review.openstack.org/44707610:52
shardyd0ugal: Ah, no, cool, yeah that's exactly what I'm talking about :)10:52
shardyflorianf: so, we already have a split implementation, mistral workflows, custom mistral actions, ansible playbooks and custom ansible modules10:53
* shardy sighs10:53
shardyoh well, we'll have to work out ways to rationalize that over time I guess10:53
florianfshardy, d0ugal: Yeah, it seems overkill. How about validations contributors who want a single place to see what's already being validated, plus a straightforward way to contribute new ones?10:53
d0ugalshardy: they are ports of the validations in tripleoclient, I think thrash|g0ne plans to move them to tripleo-validations eventually - but maybe that isn't needed.10:53
*** [1]cdearborn has joined #tripleo10:54
florianfshardy, d0ugal: I asked thrash|g0ne to put in that comment :-)10:54
d0ugalaha10:54
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: Bind mount directories that contain the key/certs for keystone  https://review.openstack.org/44956910:55
florianf(the TODO to port the tripleoclient validations to ansible validations.)10:55
florianfSo I'm not advocating using ansible for every single validation, but keeping validation logic in a single place10:55
d0ugalbandini, apetrich: my local env is currently being flooded with timeouts... so I am expecting the CLI command to fail (but not lol) any second.10:55
shardyflorianf: sure, I think that's a good idea, but it still means we've got two layers of validations, e.g pure mistral and mistral driving ansible10:56
bandinid0ugal: ah nice that you can reproduce it!10:56
shardyflorianf: maybe that's OK and we just need to maintain the mistral workflows in the tripleo-validations repo10:56
florianfOne problem we'll already see with the legacy tripleoclient validations that are about to land in tripleo-common: They will not show up in the list of validations (in the UI for instance).10:57
apetrichd0ugal, seeing the same here. I'm not seeing floods of it but some10:57
d0ugalbandini: dang, it actually completed properly - I have an overcloudrc but the mistral log is still displaying these timeouts every couple of seconds - so I think something is wrong.10:57
skramajashardy: jtomasek yes. i started with flatten params, but couldn't progress because of ovs2.6 issues.. i think i should be able to focus on it next week. will keep you posted. as of now, i got the parameters and resources, but couldn't get the service name though, as it is in output.10:57
bandinid0ugal: oh damn :/10:57
d0ugalapetrich: I am tailing all three mistral logs - when a timeout happens in one it seems to happen in them all, makes the rate seem higher :)10:57
*** athomas has joined #tripleo10:58
d0ugalshardy, florianf - we could add custom Mistral actions to tripleo-validations.10:58
apetrichd0ugal, bandini mine about to finish.10:58
d0ugalthere is no reason they all need to be in tripleo-common - but that could be confusing for different reasons.10:58
jtomasekskramaja: ok, let me know if you need anything, It would be great to make sure the result matches the implementation we currently have in UI10:59
shardyflorianf: ack, yeah that sounds like something we can and should solve, e.g by enabling pure mistral and mistral+ansible validations to be discovered via a mistral workflow10:59
florianfshardy: Sure, ansible shouldn't be the hammer to hit every nail. But I like it as an easy way to contribute new validations for outsiders.10:59
shardyskramaja: ack, thanks - I don't think it's super urgent, but let me know if you need any help :)10:59
skramajasure shardy jtomasek11:00
florianfshardy: And that discovery workflow should live in -common? or -validations?11:00
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci master: DO NOT MERGE: Testing TLS-everywhere with keystone container  https://review.openstack.org/44957011:01
shardyflorianf: right now it'll be in tripleo-common, but it sounds like a discussion about moving the mistral pieces into the tripleo-validations repo is worthwhile11:01
florianfshardy: ack.11:01
*** jlinkes has joined #tripleo11:02
*** dixiaoli has quit IRC11:03
florianfshardy: Not sure if I'm putting too much emphasis on potential contributors here. But I always thought it's nice that the tripleo-validations can be run independently from a simple validations repo checkout. I have absolutely no idea, of course, how many user-contributed validations we can expect in the future. But the ability to easily develop new ones seems like an asset to me, which we might not want to make more complicated than necessary.11:05
*** nyechiel has quit IRC11:06
*** stendulker has quit IRC11:06
*** thrash|g0ne is now known as thrash11:07
*** ooolpbot has joined #tripleo11:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION11:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468111:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477011:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495511:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517411:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]11:10
*** ooolpbot has quit IRC11:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]11:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)11:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]11:10
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: WIP: O->N Upgrade, make sure all nova placement parameter properly set.  https://review.openstack.org/44957211:13
chemmarios: hey, do you think this makes sense: https://review.openstack.org/449572 ?11:15
*** tvignaud has quit IRC11:16
chemmarios:or am i missing something ?11:16
d0ugalapetrich: how did yours turn out?11:18
*** dtantsur|afk is now known as dtantsur11:18
*** udesale has joined #tripleo11:19
chemmcornea: the quick patch https://review.openstack.org/449572 (owalsh) ...11:19
dtantsurfolks, could you please approve 2 easy backports with CI passed and 1x +2? https://review.openstack.org/#/c/448028/ and https://review.openstack.org/#/c/448029/11:19
marioschem: hey, looking11:19
* dtantsur needs these landed before his PTO next week..11:19
chemmarios: sorry for the lack of explanation.  It causes this error 2017-03-24 10:54:48.707 12174 ERROR nova.compute.manager MissingRequiredOptions: Auth plugin requires parameters which were not given: auth_url on the compute node11:20
chemmarios: and we hope that it's the root of another bz for nova upgrade :)11:20
marioschem: yeah is ok, so i am the one that added that crudini stuff there it is meant to go to the non controllers right i mean wherever the manual upgrade will run11:21
marioschem: and i think we originally had a restart11:21
mcorneachem: thanks, will test it11:21
marioschem: will comment on the review11:21
chemmarios: oki, thanks.11:21
*** udesale has quit IRC11:21
marioschem: btw had a call with jakub and he had some more info/ideas/fix for package (neutron-*) but for now i added mask/unmask at https://review.openstack.org/#/c/445494/5/puppet/services/neutron-ovs-agent.yaml not sure if it will help but a possibility11:22
marioschem: thanks for the idea (you mentioned disable i think mask does what we need)11:22
chemmarios: this is not run during the manual step, it's done during the delivery of the script11:22
marioschem: yeah it is run during delivery but it is only done for those nodes that will be manually upgraded11:23
thrashshardy: d0ugal florianf jtomasek so, reading back (and not sure I got everything), I see the validations doing multiple things... The stuff that's in tripleo-validations are doing "hardware" checks it seems. Things that ansible would be good at, since it has the facts there.11:23
chemmarios: oki, we are on the same page then :)11:23
thrashThe validations being done in the client and soon in a mistral workflow are more surrounding the deployment.11:24
chemmarios: so many variations of systemctl, didn't know the mask command11:24
marioschem: yeah neither did i i found it today :)11:27
jaosoriorsshnaidm|off: seems some ha jobs are failing cause they couldn't get a test-env environment :/11:27
EmilienMhello11:28
*** tvignaud has joined #tripleo11:29
shardythrash: Hey, yeah that's the way I see it too, and I don't think the ansible approach is needed for the validations only concerned with the plan contents vs the actual nodes11:29
jaosoriorEmilienM: hey dude, for the TLS-everywhere work that involves getting it to work with containers I opened another blueprint: https://blueprints.launchpad.net/tripleo/+spec/tls-via-certmonger-containers if you have time to check it out.11:30
EmilienMjaosorior: ack11:32
owalshchem: so all of the crudini command should have been within the if statement?11:34
jaosoriord0ugal: damn dude, saw the missing overcloudrc error again: http://logs.openstack.org/48/446348/4/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/d4ea9f3/11:35
chemowalsh: well like this we won't have the auth_url error anymore, hope that it will fix the migration too11:35
owalshchem: how about the ROLE= line?11:35
d0ugaljaosorior: ugh, there is something seriously wrong here11:37
d0ugaljaosorior: I've put all I've learned on the bug: https://bugs.launchpad.net/tripleo/+bug/167570911:37
openstackLaunchpad bug 1675709 in tripleo "deploy succeeded but no overcloudrc was generated" [High,Triaged]11:37
chemowalsh: it's unrelated, it's kind of a hack to pass the current ROLE name down to the written script11:37
chemowalsh: but you're right, it belongs outside of the if11:37
chemowalsh: thanks11:37
*** tvignaud has quit IRC11:37
d0ugaljaosorior: yours is slightly different which is interesting.11:37
*** pkovar has joined #tripleo11:38
jaosoriorwhat the hell man11:38
jaosoriorfunky11:38
owalshchem: ack, yea, thats what I meant, assumed it should be outside11:38
jtomasekthrash: I agree ansible is not necessary there, although what we probably should do is keep validations in single place and make sure they use same api and are provided by same api11:38
jaosoriord0ugal: well, if the print statement you added doesn't appear, then it might be another issue.11:38
*** pgadiya has quit IRC11:38
jtomasekthrash: so clients are able to list the validations as well as access validation results11:39
openstackgerritTom Barron proposed openstack/tripleo-heat-templates stable/ocata: Configure horizon to use keystone v2  https://review.openstack.org/44902711:40
apetrichd0ugal, btw mine passed the timeouts happened but early on. not close to the finish11:40
jaosoriord0ugal: DUDE, your commit failed with the same issue11:40
d0ugaljaosorior: I don't think that print statement will appear tbh11:40
jaosoriord0ugal: and the print didn't appear there.11:40
d0ugaljaosorior: woah11:40
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: N->O Upgrade, make sure all nova placement parameter properly set.  https://review.openstack.org/44957211:40
d0ugaljaosorior: yeah, I didn't think it would be that. but good to eliminate it11:40
apetrichd0ugal, I lie. they are there close to the finish11:40
*** bfournie has quit IRC11:41
openstackgerritBogdan Dobrelya proposed openstack-infra/tripleo-ci master: Adapt getthelogs UX for more use cases  https://review.openstack.org/44955211:43
d0ugalapetrich, jaosorior: I am running out of ideas :(11:43
d0ugalit is almost like somebody snuck a sys.exit(0) in there somewhere lol11:43
jaosoriord0ugal: would it be that there's an Exception going on somewhere that cliff is not setting up as fatal?11:44
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: N->O Upgrade, make sure all nova placement parameter properly set.  https://review.openstack.org/44957211:44
florianfshardy, thrash, jtomasek: Maybe the question isn't so much if we use ansible or something else, but: What is our interface to list and execute validations? atm ansible is a common interface for all, which unifies listing/execution and fact-gathering. Of course the drawback is that it's overkill for simple stuff. But can we really keep simple stuff simple if we want a unified API for all validations?11:44
d0ugaljaosorior: Good question. I'll test for that.11:44
thrashflorianf: it's extreme overkill for quite a bit. And I'm not sure that we should be calling both of these "validations" tbh11:45
openstackgerritBogdan Dobrelya proposed openstack-infra/tripleo-ci master: Adapt getthelogs UX for more use cases  https://review.openstack.org/44955211:45
*** abishop has joined #tripleo11:46
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-quickstart-extras master: DO NOT MERGE: Setting debug for overcloud deploy  https://review.openstack.org/44958011:46
jaosoriord0ugal: ^^11:46
thrashas shardy said... There's really no point to go mistral -> ansible -> swift when we can go mistral -> swift.11:46
thrashflorianf: perhaps the proper way to do this would be to move the validations workbook and associated actions into tripleo-validations.11:47
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates master: DO NOT MERGE: Trying to replicate missing overcloudrc  https://review.openstack.org/44958211:47
jaosoriord0ugal: ^^11:47
thrashand that all validations, whether they are in ansible or directly in mistral, should have a workflow.11:47
*** abishop has quit IRC11:48
florianfthrash: Right, so you mean we should distinguish between simple plan checks and validations?11:48
thrashflorianf: similar to how I did it.11:48
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: TESTING print any exceptions  https://review.openstack.org/44958311:48
d0ugaljaosorior: nice. this too ^11:48
thrashflorianf: that's one way of looking at it, yes.11:48
*** abishop has joined #tripleo11:48
jaosoriord0ugal: dude! good idea!11:48
d0ugaljaosorior: and now we wait a bit :)11:49
*** tbonds has quit IRC11:49
d0ugaljaosorior: meanwhile I am trying to break my deploy locally with no luck... the one time you want tripleo to fail it doesn't :-D11:49
jaosoriorhaha dammit!11:49
*** abishop has quit IRC11:49
thrashflorianf: it's likely that we can convert some of those ansible modules into mistral actions (and they can probably have dual citizenship)11:49
*** abishop has joined #tripleo11:50
*** abishop has quit IRC11:51
*** tvignaud has joined #tripleo11:51
*** abishop has joined #tripleo11:51
*** zoli|lunch is now known as zoli11:51
thrashflorianf: shardy but first step right now should be getting them out of tripleoclient...11:52
*** abishop has quit IRC11:52
thrashflorianf: shardy and my preference for things coming out of tripleoclient should go directly to tripleo-common.11:52
EmilienMjaosorior: * Separate the certificate requests from the puppet files of the services11:52
*** abishop has joined #tripleo11:52
florianfthrash: oh. like a library of validation logic that can be a mistral action or be run on some host via ansible.11:52
EmilienMjaosorior: what does it mean exactly?11:52
thrashflorianf: yeah...11:54
thrashIf you want to run it from ansible, fine. Here's the core logic.11:54
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: TESTING is it any different without the websocket events?  https://review.openstack.org/44958511:54
thrashflorianf: then the ansible modules become thin wrappers just to make them modules.11:54
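A rough sketch of the "shared core logic, thin wrappers" idea, under stated assumptions: the check itself is a plain function, and two small adapters shape its result for different callers. All names and both result shapes are hypothetical; neither adapter uses the real Mistral or Ansible APIs.

```python
def check_node_count(available, requested):
    """Core logic: plain Python, no framework imports.

    Returns a list of error strings (empty when the check passes).
    """
    if requested > available:
        return ["Not enough nodes: requested %d, available %d"
                % (requested, available)]
    return []


def as_action_result(errors):
    """Hypothetical shape for a result returned by a workflow action."""
    return {"status": "FAILED" if errors else "SUCCESS", "errors": errors}


def as_module_result(errors):
    """Hypothetical shape for a result returned by a module-style wrapper."""
    return {"failed": bool(errors), "msg": "; ".join(errors)}
```

With this split, each wrapper would stay a few lines long and the check could be unit-tested without either framework installed.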
jaosoriorEmilienM: https://review.openstack.org/#/c/444891/11:54
jaosoriorEmilienM: so instead, the requests are done here https://github.com/openstack/puppet-tripleo/blob/master/manifests/profile/base/certmonger_user.pp11:55
EmilienMjaosorior: ah, nice11:55
jaosoriorEmilienM: that's still done on the baremetal node though.11:55
jtomasekthrash, florianf: I think so far the only problem is with listing validations, as that's tied to ansible (AFAIK). If we were able to change this, then all we need to do is keep the validation workflow execution output matching a certain format, so we can get validation results11:55
*** rbrady-afk is now known as rbrady11:55
jaosoriorEmilienM: hopefully at some point we can containerize all the stuff that's needed. So there would be one container that has the credentials to the CA and does the requests11:56
*** jpena is now known as jpena|lunch11:56
jaosoriorEmilienM: so in that sense, this is moving in the right direction as well11:56
thrashjtomasek: I think that should be a relatively easy problem to solve.11:56
thrashjtomasek: you call a mistral workflow to list them, right?11:56
*** jayg|g0n3 is now known as jayg11:57
jtomasekthrash: we're calling mistral actions: VALIDATIONS_LIST: 'tripleo.validations.list_validations',11:57
jtomasek  VALIDATIONS_RUN: 'tripleo.validations.v1.run_validation',11:57
jtomasek  VALIDATIONS_RUN_GROUPS: 'tripleo.validations.v1.run_groups'11:57
thrashjtomasek: just out of curiosity, why are y'all calling actions directly, and not workflows?11:58
*** tvignaud has quit IRC11:58
jtomasekthrash: calling action is a request->response thing which is much faster/simpler for GUI to consume -  it is basically ordinary API call11:58
*** salmankhan has quit IRC11:59
jtomasekthrash: operations which don't need to be wrapped in workflow are faster to call as actions11:59
thrashjtomasek: but calling workflows at some point is ok, right?12:00
thrash:D12:00
jtomasekthrash: yes, it is12:00
thrashjtomasek: ok... Just making sure. :D12:00
jtomasekthrash: :)12:00
openstackgerritChristian Schwede proposed openstack/tripleo-quickstart-extras master: Use subjectAltName in self-generated SSL certs  https://review.openstack.org/44958812:00
*** abishop has quit IRC12:00
thrashjtomasek: anyway... I think we can have validation discovery from mistral too...12:01
*** udesale has joined #tripleo12:01
jaosoriorcschwede: nice!12:01
*** salmankhan has joined #tripleo12:01
thrashjtomasek: we'd just namespace the validation workflows.12:02
jtomasekthrash: yes12:02
*** abishop has joined #tripleo12:02
*** adarazs is now known as adarazs_lunch12:02
jtomasekthrash: so each validation would basically be separate workflow?12:02
thrashjtomasek: then you can make a direct call to list the workflows, right?12:02
jtomasekyes12:02
*** links has quit IRC12:02
thrashjtomasek: ideally, yes.12:02
florianfthrash, jtomasek: Currently most (if not all) tripleo-validations are things that potentially run on multiple hosts. So we need ansible for this. So the question is probably more: Do these "simple" deployment checks qualify as validations and do we in fact want them available/executable in clients like the UI.12:03
thrashjtomasek: so you'd ask mistral for 'give me all workflows named "tripleo.validations.v1.impl.*"'12:03
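A small sketch of discovery by namespace, assuming the client already holds a plain list of workflow names (how that list is fetched from mistral is left out here). The default pattern is the one thrash quotes; the function name and the sample names in the usage note are hypothetical.

```python
from fnmatch import fnmatchcase


def validation_workflows(names, pattern="tripleo.validations.v1.impl.*"):
    """Return the workflow names that fall under the validation namespace.

    `names` is any iterable of workflow name strings; matching is
    case-sensitive and uses shell-style wildcards.
    """
    return sorted(name for name in names if fnmatchcase(name, pattern))
```

For example, given `["tripleo.validations.v1.impl.check_ntp", "tripleo.validations.v1.run_validation"]`, only the first name matches, since the second is outside the `impl` namespace.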
thrashflorianf: we do want them available/executable from the clients.12:03
thrashflorianf: shardy unless we can somehow move these into a heat template.12:03
thrashwhich I'm not sure that would even work.12:04
thrashflorianf: but I definitely see the demarcation there...12:04
thrashflorianf: if need to run across multiple hosts, absolutely that makes sense to be in ansible.12:04
jtomasekthrash: well primarily those are going to be triggered from other workflows as subworkflows12:04
shardyflorianf: I think it's two different types of validation, one is check if the environment is broken, one is did the user do something wrong and the plan data is broken (or e.g there aren't enough nodes, or there's a profile mismatch, or whatever)12:04
*** links has joined #tripleo12:05
shardythrash: some things are already validated via constraints in the heat templates, but really we want to do this earlier12:05
thrashshardy: right... We aren't checking the nodes themselves for errors.12:05
thrashshardy: ack12:05
shardythrash: perhaps if the plan create workflow did a stack preview we could do that12:05
shardythrash: but yeah, and there's a bunch of other stuff like checking ironic node tagging against flavors etc12:06
shardywhich we can't do in heat, so I like the approach you've already taken with the mistral based validations12:06
shardymaybe we can solve this debate by just namespacing them "plan_validation" ;)12:06
thrashshardy: ack. I really think we just need a rename. :) Stop calling both of these things "validations"12:07
jtomasekin every case, all the validations are workflows which most usually run as subworkflow for other workflow12:07
openstackgerritMarius Cornea proposed openstack/tripleo-heat-templates master: WIP: Stop openstack-nova-compute during nova-ironic upgrade  https://review.openstack.org/44959612:07
jtomasekthat is why I think it would be nice if they shared the same format12:07
jaosoriord0ugal: mistral creates the overcloudrc files. But does it write them in the filesystem as well? or is that tripleoclient?12:07
d0ugaljaosorior: that is still in tripleoclient (mistral doesn't know where or what the users home dir is)12:08
jaosoriorok12:08
d0ugaljaosorior: https://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/utils.py#L6612:08
*** salmankhan has quit IRC12:08
thrashjtomasek: what is that format? Are you talking about the stdout/stderr thing?12:08
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: WIP - Add upgrade_batch_tasks to neutron-l3-agent  https://review.openstack.org/44549412:08
d0ugaljaosorior: which is called from..... https://github.com/openstack/python-tripleoclient/blob/36b6b09fb307399458a9bfadef497cbcae35f3c4/tripleoclient/v1/overcloud_deploy.py#L116012:09
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: WIP - Add upgrade_batch_tasks to neutron-l3-agent  https://review.openstack.org/44549412:09
jtomasekthrash: yes mostly, + the api to be able to list them. But if we agree that those plan_validations are something that is not supposed to be listed and run from client, but solely as part of certain workflow, then fine12:10
jaosoriord0ugal: thanks dude; makes sense12:10
thrashjtomasek: that doesn't work for the checks that I'm doing... We have errors and warnings. Errors are things that must be fixed. Warnings are just that. Warnings that something *could* go wrong should you proceed.12:10
*** ooolpbot has joined #tripleo12:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION12:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468112:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477012:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495512:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517412:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]12:10
*** ooolpbot has quit IRC12:10
jaosoriord0ugal: somehow I really think there's an exception somewhere that's being ignored.12:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]12:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)12:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]12:10
jtomasekthrash: ack12:10
*** vpickard_ is now known as vpickard12:11
thrashjtomasek: yeah... it probably should just be a part of the deploy workflow anyway, since that's where they were in the client.12:11
openstackgerritMarius Cornea proposed openstack/tripleo-heat-templates master: WIP: Stop openstack-nova-compute during nova-ironic upgrade  https://review.openstack.org/44959612:11
thrashalthough some could probably be done when uploading the plan.12:11
jaosoriorEmilienM: could I get your opinion on this patch? https://review.openstack.org/#/c/449536/12:11
jtomasekthrash: so in case that validation fails, whole workflow fails and that validation populates output of the workflow, so client can display error12:11
shardythrash: yeah I think many of these can be done when creating/updating the plan12:11
thrashjtomasek: yes12:12
*** tvignaud has joined #tripleo12:12
shardyso we'd have a plan_update workflow which can optionally run plan_validations, which can return errors and warnings12:12
shardythe thing we're missing is the top-level workflow to wire it all together I think12:12
thrashshardy: exactly. And if each check is a workflow, that should be easy to integrate into the plan update/create workflows.12:12
d0ugaljaosorior: yeah, hopefully - that would be the "best" outcome12:12
shardythrash: cool, yeah exactly :)12:13
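The shape discussed above (each check is its own unit, checks return errors and warnings, and a plan update can optionally run them) can be sketched roughly as follows. This is a minimal illustration in plain Python, not Mistral workflow code; the names ValidationResult, check_node_count, check_profiles, update_plan and the plan dictionary keys are all hypothetical and do not come from tripleo-common.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ValidationResult:
    """Outcome of one plan validation: errors block, warnings do not."""
    errors: List[str] = field(default_factory=list)
    warnings: List[str] = field(default_factory=list)


def check_node_count(plan: Dict) -> ValidationResult:
    # Hypothetical check: too few available nodes is treated as an error.
    result = ValidationResult()
    needed = sum(plan.get("role_counts", {}).values())
    available = plan.get("available_nodes", 0)
    if needed > available:
        result.errors.append(
            f"plan needs {needed} nodes but only {available} are available")
    return result


def check_profiles(plan: Dict) -> ValidationResult:
    # Hypothetical check: untagged nodes only produce a warning.
    result = ValidationResult()
    untagged = plan.get("untagged_nodes", 0)
    if untagged:
        result.warnings.append(f"{untagged} nodes have no profile tag")
    return result


# Each validation is a separate callable, mirroring "each check is a workflow".
PLAN_VALIDATIONS: List[Callable[[Dict], ValidationResult]] = [
    check_node_count,
    check_profiles,
]


def update_plan(plan: Dict, run_validations: bool = True) -> Dict:
    """Stand-in for a plan update step that optionally runs validations."""
    combined = ValidationResult()
    if run_validations:
        for check in PLAN_VALIDATIONS:
            outcome = check(plan)
            combined.errors.extend(outcome.errors)
            combined.warnings.extend(outcome.warnings)
    # Any error fails the whole step; warnings are reported but do not fail it.
    status = "FAILED" if combined.errors else "SUCCESS"
    return {
        "status": status,
        "errors": combined.errors,
        "warnings": combined.warnings,
    }
```

In this sketch a client would only need to look at the returned status, errors and warnings, which matches the idea that a failed validation populates the output of the parent workflow.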
EmilienMjaosorior: sure12:14
jtomasekshardy: by top level workflow you mean something that wraps plan_update workflow (e.g. deploy workflow?)12:14
*** lucasagomes is now known as lucas-hungry12:14
*** nyechiel has joined #tripleo12:14
EmilienMjaosorior: can you have multiple certificate requests with the same cert_dir?12:15
shardyjtomasek: I guess we can enhance update_deployment_plan to do it12:15
shardyjtomasek: the problem is that doesn't accept all data associated with the plan atm, e.g the list of environments etc, so we end up having to create the plan, then do stuff, then do validation12:15
EmilienMjaosorior: because if yes, you'll have a duplicated resource in your catalog, unless you use ensure_resource from puppetlabs-stdlib12:15
jaosoriorEmilienM: oh shit.12:15
jaosoriorI'll use ensure_resource then12:16
jaosoriorEmilienM: thanks man!12:16
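The duplicate-resource concern can be illustrated with a small Python analogy. This is not Puppet code and not the puppetlabs-stdlib implementation: Catalog, declare and the example directory path are hypothetical, and treating mismatched parameters as an error is an assumption of this sketch. It only shows why two certificate requests sharing one directory collide with a plain declaration but not with an "ensure" style one.

```python
class DuplicateDeclarationError(Exception):
    """Raised when the same resource is declared twice."""


class Catalog:
    """Toy stand-in for a catalog keyed by (resource type, title)."""

    def __init__(self):
        self.resources = {}

    def declare(self, rtype, title, **params):
        # A plain declaration fails when the resource already exists.
        key = (rtype, title)
        if key in self.resources:
            raise DuplicateDeclarationError(f"{rtype}[{title}] already declared")
        self.resources[key] = params

    def ensure_resource(self, rtype, title, **params):
        # Declare the resource only if it is not in the catalog yet.
        key = (rtype, title)
        if key not in self.resources:
            self.resources[key] = params
        elif self.resources[key] != params:
            # Assumption of this sketch: conflicting parameters are an error.
            raise DuplicateDeclarationError(
                f"{rtype}[{title}] already declared with other parameters")


# Two certificate requests that share one (hypothetical) directory.
catalog = Catalog()
for _request in ("httpd-internal_api", "httpd-external"):
    catalog.ensure_resource("file", "/etc/example/certs", ensure="directory")
```

With declare in place of ensure_resource, the second loop iteration would raise, which is the failure mode raised in the review above.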
shardyjtomasek: I'm just saying we can perhaps integrate the steps a little better in future12:16
jtomasekok12:16
florianfthrash, jtomasek, shardy: but if those plan_validations make the create/update workflows fail, do we really want/need them to be separately executable from a client?12:16
*** bfournie has joined #tripleo12:16
jtomasekflorianf: yes, we don't need that - that is why it is different from 'ansible validations we have' (IIUC)12:17
shardyflorianf: probably not, but it's still useful to define each validation as a discrete workflow12:17
shardyflorianf: maybe we just have a way to say they're internal vs something you click on in the UI12:17
*** ratailor has quit IRC12:17
*** dougbtv_ has joined #tripleo12:18
*** tvignaud has quit IRC12:18
* shardy votes for plan_validation namespace :)12:18
* florianf agrees with shardy 12:18
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo master: Ensure directory exists for certificates for httpd  https://review.openstack.org/44953612:19
jaosoriorEmilienM: what about this? ^^12:19
EmilienMjaosorior: better12:20
*** katkapilatova has joined #tripleo12:20
jaosoriorEmilienM: nice catch. Thanks for the feedback.12:21
EmilienMjaosorior: anytime12:21
openstackgerritPradeep Kilambi proposed openstack/instack-undercloud master: Run ceilometer-upgrade for gnocchi conditionally  https://review.openstack.org/44876212:21
EmilienMjaosorior: it would be nice to have a tls everywhere testing in the multinode jobs, would it be possible?12:22
*** nyechiel has quit IRC12:23
*** katkapilatova has left #tripleo12:24
jaosoriorEmilienM: I gotta investigate.12:24
*** katkapilatova has joined #tripleo12:24
EmilienMjaosorior: even if we have to deploy a service on the undercloud12:24
jaosoriorEmilienM: so, deploying the service in the undercloud is not really an issue.12:24
jaosoriorEmilienM: there are two deterrents currently.12:24
*** jprovazn has joined #tripleo12:24
*** d0ugal has quit IRC12:24
jaosoriorEmilienM: 1. We need to deploy the CA somewhere, so it would require another node that gets set up before the undercloud (this one we can probably address and is not a big deal)12:25
*** dprince has joined #tripleo12:25
jaosorior2. multinode jobs use DeployedServer resources from heat, and currently it's not possible to set metadata for these; so the kerberos principals would have issues there.12:25
EmilienMjaosorior: I se12:26
EmilienMsee*12:26
*** liverpooler has quit IRC12:26
jaosoriorEmilienM: I tried adding that https://review.openstack.org/#/c/422868/ but apparently we can't assure that a DeployedServer will come from OpenStack12:26
*** dprince has quit IRC12:26
jaosoriorso that's the main blocker at the moment.12:26
*** liverpooler has joined #tripleo12:26
*** dprince has joined #tripleo12:26
EmilienMjaosorior: ok thanks... we'll have to figure out something later probably12:27
jaosoriorEmilienM: I really need to dig more into DeployedServer and come up with a solution for this before we can do this.12:27
jaosoriorEmilienM: so meanwhile, we currently are pretty tied to OVB12:27
thrashflorianf: jtomasek the ansible validations are selectable because they don't necessarily apply to every deployment, right?12:29
*** janki has quit IRC12:30
*** tvignaud has joined #tripleo12:31
*** shardy is now known as shardy_lunch12:31
snecklifterjprovazn: hello, what are the chances of getting https://bugs.launchpad.net/tripleo/+bug/1644784 as a backport to Newton?12:31
openstackLaunchpad bug 1644784 in tripleo "Support deploying of Manila / CephFS with managed Ceph" [Medium,Fix released] - Assigned to Jan Provaznik (jan-provaznik)12:31
jtomasekthrash: they should, they are meant to persistently warn the user that something is wrong with their setup and that they should not deploy before those issues are fixed12:32
florianfthrash: They all belong to a group, so in principle they should be run in every deployment.12:32
florianfthrash: You can run them separately so you can restart a failed one after you made changes for instance.12:33
*** morazi has joined #tripleo12:33
*** pkovar has quit IRC12:35
*** pkovar has joined #tripleo12:35
*** flepied has quit IRC12:36
openstackgerritMerged openstack/tripleo-docs master: Switch trunk/cbs/buildlogs to use https  https://review.openstack.org/44804412:38
*** trown|outtypewww is now known as trown12:39
jprovaznsnecklifter: hello, I think that chances are minimal, it was a new feature consisting of about 7 patches across multiple repos12:39
snecklifterjprovazn: ack, thanks for responding12:39
jprovaznsnecklifter: np12:40
snecklifterjprovazn: oh and thanks for implementing it in Ocata too!12:41
jprovaznsnecklifter: it was gfidente who added ceph mds support :)12:41
*** udesale has quit IRC12:42
jprovaznsnecklifter: I helped with some manila specific things12:42
snecklifterjprovazn: a true team effort \o/12:42
jprovaznsnecklifter: anyway, you are welcome :)12:42
*** rbowen has quit IRC12:42
*** dougbtv_ has quit IRC12:42
*** rbowen has joined #tripleo12:44
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: WIP: N->O upgrade, blanks ipv6 rules before activating it.  https://review.openstack.org/44961312:45
*** dougbtv_ has joined #tripleo12:45
*** dougbtv_ is now known as dougbtv|laptop12:46
chemmatbu: bandini could you cross check -> https://review.openstack.org/#/c/449613/12:47
chemmatbu: bandini it's about the firewall ipv612:48
*** psahoo has quit IRC12:48
chemmatbu: bandini I've just coded that without running it once, so beware :)12:48
openstackgerritJiri Stransky proposed openstack/tripleo-quickstart-extras master: Upgrade to containerized overcloud  https://review.openstack.org/44857612:49
*** eck` is now known as eck`gone12:49
chemdsneddon: hi, we have an issue with upgrade with this https://github.com/openstack/tripleo-heat-templates/commit/a3f03eb307797ac5eef1251b9252e642db326e0712:50
chemdsneddon: basically what do you think would be the best course of action to migrate those parameters from the previous to the new version ?12:51
*** thrash is now known as thrash|brb12:51
*** rlandy has joined #tripleo12:52
*** flepied has joined #tripleo12:54
*** tzumainn has joined #tripleo12:56
*** ramishra has quit IRC12:56
matbuchem: /me looks12:56
rlandydmsimard: hi - 1450 didn't help - we got an undercloud install failure on the third run12:58
dmsimardrlandy: what did it fail on ?12:58
*** jpena|lunch is now known as jpena13:03
*** tvignaud has quit IRC13:03
*** flepied has quit IRC13:03
*** adarazs_lunch is now known as adarazs13:04
*** milan has joined #tripleo13:04
openstackgerritBogdan Dobrelya proposed openstack-infra/tripleo-ci master: Adapt getthelogs UX for more use cases  https://review.openstack.org/44955213:09
*** ooolpbot has joined #tripleo13:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION13:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468113:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477013:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495513:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517413:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]13:10
*** ooolpbot has quit IRC13:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]13:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)13:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]13:10
*** lblanchard has joined #tripleo13:13
*** thrash|brb is now known as thrash13:13
*** jcoufal has joined #tripleo13:15
*** tvignaud has joined #tripleo13:16
*** shardy_lunch is now known as shardy13:17
*** flepied has joined #tripleo13:17
*** zoli is now known as zoli|afk-FBCI13:19
openstackgerritCarlos Camacho proposed openstack/tripleo-ui master: Add favicon icons  https://review.openstack.org/42011113:20
openstackgerritCarlos Camacho proposed openstack/tripleo-ui master: Add favicon icons  https://review.openstack.org/42011113:21
*** jrist has joined #tripleo13:22
openstackgerritMerged openstack/tripleo-quickstart-extras master: Switch trunk/cbs/buildlogs to use https  https://review.openstack.org/44803713:25
openstackgerritOliver Walsh proposed openstack/tripleo-common master: Add MigrationSshKey to generated passwords  https://review.openstack.org/44923913:26
*** lucas-hungry is now known as lucasagomes13:27
EmilienMowalsh: ^ can you add a release note please?13:30
owalshEmilienM: ack13:31
*** zoli|afk-FBCI is now known as zoli13:32
*** mburned is now known as mburned_out13:33
*** mburned_out is now known as mburned13:33
*** lucasagomes is now known as lucas-brb13:35
*** prateek has quit IRC13:36
bandinichem: ack will look shortly13:39
chemmatbu: bandini thanks13:40
*** jkilpatr has quit IRC13:44
*** jcoufal_ has joined #tripleo13:44
*** d0ugal has joined #tripleo13:45
*** jcoufal has quit IRC13:46
*** skramaja has quit IRC13:48
d0ugaljaosorior: lol, so my debugging patches both passed >_<13:48
jaosorioragh!!13:49
jaosoriord0ugal: do you think it's due to the patches that get several messages at once?13:49
d0ugaljaosorior: no, because none of that CLI work has landed yet.13:49
d0ugaljaosorior: well, other than the initial code that isn't used - so I don't think so, but maybe worth looking into13:50
*** gbarros has joined #tripleo13:50
*** salmankhan has joined #tripleo13:51
jaosoriorI see13:51
*** links has quit IRC13:53
openstackgerritCarlos Camacho proposed openstack/tripleo-ui master: Add favicon icons  https://review.openstack.org/42011113:53
*** sshnaidm|off has quit IRC13:54
openstackgerritGael Chamoulaud proposed openstack/tripleo-quickstart-extras master: Add blank newline at the end of file  https://review.openstack.org/44902813:56
openstackgerritSteven Hardy proposed openstack/python-tripleoclient master: Don't track added_files in deploy environment processing  https://review.openstack.org/44604513:57
openstackgerritSteven Hardy proposed openstack/python-tripleoclient master: Move clients into class constructor  https://review.openstack.org/44963313:57
bandinichem: I have added a comment14:00
remix_tjhttp://logs.openstack.org/16/447416/1/gate/gate-tripleo-ci-centos-7-multinode-upgrades/9848094/console.html#_2017-03-23_22_23_51_587577 how do i troubleshoot this?14:00
dmsimardrlandy: I didn't hear back earlier, what did the third undercloud install fail on ?14:01
*** lucas-brb is now known as lucasagomes14:01
openstackgerritLuke Hinds proposed openstack/tripleo-heat-templates master: Adds service for managing securetty  https://review.openstack.org/44915314:02
rlandydmsimard: sorry - instance got cleaned up - got another job with logs going now - will see14:02
jristakrivoka: I think your import/export stuff is getting close to merging14:03
chembandini: thanks14:03
bandiniremix_tj: I usually go on the nodes themselves (like http://logs.openstack.org/16/447416/1/gate/gate-tripleo-ci-centos-7-multinode-upgrades/9848094/logs/subnode-2/var/log/messages in that case)14:03
jristwho else do we need to poke?14:03
akrivokajrist: it's been "close" for ages14:03
akrivokajrist: any tripleo core14:03
jristk14:03
jristd0ugal: ^14:03
jrist:)14:03
d0ugaljrist: link?14:04
akrivokad0ugal: https://review.openstack.org/#/c/414169/14:04
akrivokathat's the one we really need to land first, all the other dependent ones already have +2s14:04
d0ugalakrivoka: thanks. I feel like I've +2ed this in the past...14:04
jristd0ugal: (ノ◕ヮ◕)ノ*:・゚✧14:04
d0ugalhuh, I've not even reviewed it14:04
d0ugalI'm sure I've looked at it before... :/14:05
* jrist cries for argentina14:05
dmsimardjrist: wow that is a magical emoji14:05
jristdmsimard: yesssss14:05
jristdmsimard: do you want a kawaii emoji?14:05
dmsimardkawaiiiiiiiii14:05
remix_tjbandini: 15.184.64.1 is not pingable. Recheck is enough?14:06
*** mdnadeem has quit IRC14:06
jrist\(^○^)人(^○^)/14:06
jristhi 5!14:06
bandiniremix_tj: I *think* so, yes14:06
bandinii am also getting a bunch of odd failures on multinode jobs atm14:06
slagledid you run that emoji by the foundation first?14:06
slaglenot sure it fits with the existing branding rules14:06
dmsimardburn14:07
akrivokalol14:07
jristlol14:07
jristslagle: welcome back!14:07
dmsimardok </friday>14:07
jristfine, different high five ┏(^0^)┛┗(^0^) ┓14:07
amoralejrlandy, are you using patched versions to get debug info if it fails?, that'd be nice14:08
jaosoriorEmilienM: can you check this one out https://review.openstack.org/#/c/447953/ ?14:08
EmilienMsure14:08
jtomasekakrivoka: how many of your patches are still waiting to merge14:09
jtomasek?14:09
EmilienMjaosorior: have you seen dtantsur's comment?14:09
jaosoriorEmilienM: no14:09
openstackgerritLuke Hinds proposed openstack/puppet-tripleo master: Adds service for managing securetty  https://review.openstack.org/44914814:09
*** ealcaniz has quit IRC14:09
EmilienMjaosorior: I can approve this one and you follow up with the domain params14:09
*** ooolpbot has joined #tripleo14:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION14:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468114:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477014:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495514:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517414:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]14:10
*** ooolpbot has quit IRC14:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]14:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)14:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]14:10
akrivokajtomasek: https://review.openstack.org/#/c/414169/  https://review.openstack.org/#/c/422789/  https://review.openstack.org/#/c/425858/  https://review.openstack.org/#/c/437676/14:10
rlandyamoralej: I am not getting extra debug info - but we collect a decent log14:10
akrivokajtomasek: but the bottleneck is only the first one, all the other ones have +2s (and depend on the first one)14:10
jtomasekakrivoka: thanks14:10
jaosoriorEmilienM: we'll do that for every service's auth14:11
jaosoriorEmilienM: I'm starting to think we should just have a hiera parameter that contains the domain14:11
jaosoriorEmilienM: and do some overriding of the keystone resource14:11
amoralejrlandy, it'd be fantastic if we could get debug info by setting an additional environment variable, i'm trying it in https://review.openstack.org/#/c/44954814:11
EmilienMjaosorior: yes14:11
d0ugalakrivoka: Does this patch handle upgrades?14:12
EmilienMjaosorior: approving your patch now and I'll let you update the rest if needed14:12
rlandyamoralej: sure - on next run14:12
*** psahoo has joined #tripleo14:12
d0ugalakrivoka: i.e. I have a plan in Mistral, I upgrade to Pike, will it move the plan to Swift?14:12
*** psahoo is now known as psahoo|away14:12
amoralejthanks14:12
jaosoriorEmilienM: thanks14:12
jaosoriorEmilienM: oh... wait, can you check the depends-on for that patch?14:12
d0ugalakrivoka: oh wait, I see a comment about this already. /me continues reading14:12
EmilienMjaosorior: approved14:13
jaosoriorthanks dude14:13
*** noslzzp has quit IRC14:17
*** noslzzp has joined #tripleo14:17
*** jbadiapa has quit IRC14:18
dprincejistr: I rebuilt Ironic-conductor's container and tried it w/ the new heat templates. It seems to be restarted due to sudoers issue...14:19
bogdandobandini, shardy: folks who fight CI for errors (EmilienM, mwhahaha ?), I improved the getthelogs tool PTAL https://review.openstack.org/#/c/449552/. Also I gave examples for my custom parser script https://github.com/bogdando/fuel-log-parse#examples-for-tripleo-ci-openstack-infra-logs. The given example finds that 'ssl_depth' error we'd discussed above easily.14:19
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: DO NOT MERGE: difference without websockets?  https://review.openstack.org/44958514:20
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: DO NOT MERGE: print any exceptions  https://review.openstack.org/44958314:20
dprincejistr: just wondering if you ever saw any of this with the new 'host_prep_task' stuff for this service...14:20
bogdandoofc you don't need any of this if you're a kibana guru :)14:20
jistrdprince: i'm deploying a full e2e upgrade from BM to containerized right now, will check when it's upgraded14:20
EmilienMbogdando: ok i'll look shortly14:20
dprincejistr: could be a regression somewhere else. Just wondering what might have changed14:21
jistrdprince: do you have the exact error message?14:21
bogdandoEmilienM: I used to use that tool to find all nasty HA bugs and race conditions in fuel. Now it seems applicable to ooo as well14:21
jistrdprince: i know kolla has quite often custom sudoers to allow their entrypoint scripts to sudo for very specific things but nothing else14:21
*** jmelvin has joined #tripleo14:21
*** amoralej is now known as amoralej|lunch14:21
bogdandoif you think it is usable (please try it) I'll make an announcement perhaps14:22
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud master: Set default domain for all keystone users  https://review.openstack.org/44964414:22
jaosoriorEmilienM: what about this? ^^14:22
bandinibogdando: nice. I will take a look14:23
tbarronEmilienM: jaosorior I got better results on https://review.openstack.org/#/c/448614 and think we can go with that approach.14:23
*** bnemec is now known as beekneemech14:23
jaosoriortbarron: nice! thanks for the update14:24
bogdandoI believe in the scope of a single job investigation, it outperforms ELK/Kibana by speed and UX perhaps14:24
tbarronjaosorior: ty!14:24
dprincejistr: weird. It looks like sudo -E kolla_set_configs is failing for Ironic14:25
dprincejistr: wonder what changed14:25
bogdandobut for the full world few, we really ought to adopt some bots for elastic-recheck!14:25
dprincejistr: you removed the 'user: root' in your patch I think....14:25
*** chlong has joined #tripleo14:25
*** eck`gone is now known as eck`14:25
dprincejistr: this was required for it to functionally work I think. It is probably a bug on the container but I think we'll need it set for it to functionally work in the meantime14:26
bogdandooops, I mean world view*14:26
*** pkovar has quit IRC14:26
jistrdprince: i removed the full container that has done `mkdir`s, which had user: root, but we don't need that container anymore, as the dirs are managed by host_prep_tasks now. https://github.com/openstack/tripleo-heat-templates/commit/1a4ece16cea40075fe7332ed048b9c289b3ff42414:28
jistrdprince: as far as the normal ironic container goes, i don't think there was user: root to begin with14:28
openstackgerritAlex Schultz proposed openstack/tripleo-specs master: Create patch abandonment policy  https://review.openstack.org/44933214:28
dprincejistr: yeah, weird. I see that now14:29
*** yprokule has quit IRC14:30
jistrhttps://github.com/openstack/kolla/commit/5752c7eb0b1f9c5978dd4e9271ded346cea231e014:31
jistrhttps://github.com/openstack/kolla/commit/05c0d6998bdda155c925d563b75ac353303f93ff14:31
*** pkovar has joined #tripleo14:31
jistrdprince: ^ these do sth with sudoers, could be related maybe... i think the ironic images we have in dockerhub are quite old now, so this may have gone unnoticed for a while14:31
mwhahahabandini: you may need to -A+A to unstick https://review.openstack.org/#/c/445479/14:31
d0ugalakrivoka: reviewed14:32
dprincejistr: exactly, I just rebuilt them. I was aiming to push new ones today fwiw....14:32
dprincejistr: it seems to be kolla_set_configs that fails though. Very weird14:33
*** jbadiapa has joined #tripleo14:33
bandinimwhahaha: oh just did. will that be enough?14:33
dprincejistr: the https://github.com/openstack/kolla/commit/5752c7eb0b1f9c5978dd4e9271ded346cea231e0 is almost certainly related to what I'm seeing14:34
mwhahahabandini: it should, sometimes when it doesn't start gating it's because the notification of +A gets lost14:34
bandiniooh14:34
bandinidid not know that14:34
dprincejistr: I'll file a bug and try to fix it14:35
dprincejistr: sorry I pinged about your commit. I saw 'user: root' got removed there and it confused me :)14:35
jistrdprince: np :) and thanks for looking into that issue14:35
dmsimardrlandy: any info yet ?14:36
mwhahahabandini: oh looks like it's been stuck in the gate for 12+ hours on status.openstack.org14:36
mwhahahabandini: so we just need to be patient14:37
*** morazi has quit IRC14:37
rlandydmsimard: this time, passed14:37
rlandyI am still running with 1450 mtu14:37
rlandybut afaict - not much difference14:37
bandinimwhahaha: I hope it is the golden pass, it's been 10days of rechecks :)14:37
dmsimardrlandy: ok, let us know if you see a failure :(14:37
akrivokad0ugal: thanks!14:37
jistrdprince: interesting. the commits only touch files in /etc/sudoers.d, not /etc/sudoers itself (where the kolla_set_configs lives). I wonder if perhaps one of the files in /etc/sudoers.d/ is malformed, preventing sudo from working in general.14:38
rlandydmsimard: sure - spinning all the time - and I enabled full log collection now - so it should be easier to share14:38
*** udesale has joined #tripleo14:38
jistrdprince: s/where the kolla_set_configs lives/where the kolla_set_configs rule lives/14:38
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: WIP, make script work in CI and interactive  https://review.openstack.org/44937014:39
dprincejistr: could be14:39
openstackgerritPradeep Kilambi proposed openstack/instack-undercloud master: Run ceilometer-upgrade for gnocchi conditionally  https://review.openstack.org/44876214:41
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: WIP: SSH known_hosts config  https://review.openstack.org/44966014:41
openstackgerritmathieu bultel proposed openstack/tripleo-quickstart-extras master: Allow complex upgrade deployment for N to O  https://review.openstack.org/43959814:43
pradkEmilienM, can you review this https://review.openstack.org/#/c/448762/9 .. i confirmed this is working locally (added some notes as well)14:44
pradkblocking qe :(14:44
beekneemechHeads up: the compute node hosting tripleo.org is up for reboot.  The site will be down for a little while until that node is back up.14:45
pradkmwhahaha, when you're around ^^ .. finally, works :)14:46
mwhahahapradk: ok i'll take a look14:47
pradkty sir14:47
*** fragatina has quit IRC14:47
chemowalsh: he again, ... so we still have the issue. It boils down to "vif_type=binding_failed"14:51
chemowalsh: does it ring a bell ?14:51
jaosoriordtantsur: well, I'm not sure if it's relevant to add a release note for the keystone domain bits, since it's not something that the users can really see. We do use the default keystone domain that comes out of the box anyway14:51
jaosoriorin that patch14:51
*** morazi has joined #tripleo14:51
dtantsurjaosorior, well, it enabled keystone v3 support, so it's kinda a feature14:52
dtantsuror a fix for keystone v314:52
jaosoriorit doesn't enable it yet14:52
dtantsurwell, I think it mostly does not work because of missing domain14:53
jaosoriordtantsur: this one enables it https://review.openstack.org/#/c/446752/14:53
dtantsurassuming we already switched from versioned auth_url14:53
jaosoriordtantsur: we haven't switched14:53
*** fragatina has joined #tripleo14:53
jaosoriordtantsur: so actually, that commit doesn't really do much until we actually make the switch14:53
dtantsurjaosorior, then this change does not enable it. stackrc is unrelated to authtoken configuration14:53
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: WIP: N->O upgrade, blanks ipv6 rules before activating it.  https://review.openstack.org/44961314:54
dtantsurok, if we use versioned auth_url still (I thought EmilienM has fixed it), then indeed this change does not affect keystone v3 support14:54
jaosorioruhm...14:54
jaosoriorwait up14:54
jaosoriordtantsur: I think you're right14:54
dtantsurlet's wait for the CI, I guess, and see what it actually uses14:55
jaosoriorright14:55
*** jbadiapa has quit IRC14:55
*** jbadiapa has joined #tripleo14:56
EmilienMtbarron: excellent, I'll look at it14:57
tbarronEmilienM: ty!14:57
openstackgerritJiri Stransky proposed openstack/tripleo-quickstart-extras master: Upgrade to containerized overcloud  https://review.openstack.org/44857614:57
EmilienMjaosorior: dtantsur is right about release note14:57
*** jkilpatr has joined #tripleo14:58
*** prateek has joined #tripleo14:58
jaosoriorEmilienM: why? we're using the default and it's not something the deployer cares about at all. Unless we make it configurable.14:58
jaosoriorEmilienM: the release notes are getting very confusing and they should be useful for deployers14:58
EmilienMjaosorior: ok14:59
jaosoriorEmilienM: at least that's the way I see it.14:59
EmilienMtbarron: so we can move forward with https://review.openstack.org/#/c/448614/ ?14:59
EmilienMtbarron: last call? :)14:59
pradkis there any nifty script to clean up undercloud install ?14:59
tbarronEmilienM: yeah, go for it14:59
EmilienMtbarron: approving14:59
tbarronEmilienM: thanks15:00
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: WIP, make script work in CI and interactive  https://review.openstack.org/44937015:00
*** prateek has quit IRC15:01
*** prateek has joined #tripleo15:02
chemowalsh: this is certainly an openvswitch issue:15:02
jaosoriord0ugal: I've been still seeing that error every once in a while. For some reason it's not constant.15:02
jaosoriord0ugal: what was the LP bug again?15:02
openstackgerritPradeep Kilambi proposed openstack/instack-undercloud master: Run ceilometer-upgrade for gnocchi conditionally  https://review.openstack.org/44876215:02
jristcan we get a +a on https://review.openstack.org/#/c/414169/ plz15:03
chem2017-03-24T15:02:55.075Z|03596|rconn|WARN|br-tun<->tcp:127.0.0.1:6633: connection failed (Connection refused)15:03
chem2017-03-24T15:02:55.075Z|03597|rconn|WARN|br-int<->tcp:127.0.0.1:6633: connection failed (Connection refused)15:03
chem2017-03-24T15:02:55.075Z|03598|rconn|WARN|br-ex<->tcp:127.0.0.1:6633: connection failed (Connection refused)15:03
chem2017-03-24T15:02:55.075Z|03599|rconn|WARN|br-infra<->tcp:127.0.0.1:6633: connection failed (Connection refused)15:03
jristhas a +2 and 3 +1s15:03
chemowalsh: ^15:03
chemmcornea: ^15:03
mcorneachem: yeah, I think it's my workaround, I shouldn't apply it on computes15:04
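When many such warnings scroll by, it can help to summarise which bridges are failing to reach which controller target. The following is a minimal sketch that assumes the log lines have exactly the pipe-separated shape pasted above; the regular expression and the failed_connections helper are illustrative and are not part of Open vSwitch or TripleO.

```python
import re
from collections import Counter

# Assumed line shape, taken from the lines pasted in the channel:
# <timestamp>|<sequence>|<module>|<level>|<bridge><-><target>: connection failed (<reason>)
LINE_RE = re.compile(
    r"^(?P<ts>\S+)\|(?P<seq>\d+)\|(?P<module>\w+)\|(?P<level>\w+)\|"
    r"(?P<bridge>[\w-]+)<->(?P<target>\S+): connection failed "
    r"\((?P<reason>[^)]*)\)$"
)


def failed_connections(lines):
    """Count 'connection failed' warnings per (bridge, target) pair."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            counts[(match["bridge"], match["target"])] += 1
    return counts


sample = [
    "2017-03-24T15:02:55.075Z|03596|rconn|WARN|br-tun<->tcp:127.0.0.1:6633: connection failed (Connection refused)",
    "2017-03-24T15:02:55.075Z|03597|rconn|WARN|br-int<->tcp:127.0.0.1:6633: connection failed (Connection refused)",
    "2017-03-24T15:02:55.075Z|03598|rconn|WARN|br-ex<->tcp:127.0.0.1:6633: connection failed (Connection refused)",
    "2017-03-24T15:02:55.075Z|03599|rconn|WARN|br-infra<->tcp:127.0.0.1:6633: connection failed (Connection refused)",
]
summary = failed_connections(sample)
```

Here all four bridges report a refused connection to the same local target, which is consistent with the controller endpoint on 127.0.0.1:6633 not listening rather than with a problem on one bridge.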
*** gbarros has quit IRC15:04
*** gbarros has joined #tripleo15:05
d0ugaljaosorior: https://bugs.launchpad.net/tripleo/+bug/167570915:07
openstackLaunchpad bug 1675709 in tripleo "deploy succeeded but no overcloudrc was generated" [High,Triaged]15:07
jaosoriord0ugal: I added the alert tag15:07
d0ugaljaosorior: cool, good idea.15:08
chembandini: about the ipv6 firewall stuff, do you think it's at the right step and place ?15:08
d0ugaljaosorior: I am never sure when to use that.15:08
jaosoriord0ugal: I just thought the bug was annoying enough to merit that :P15:09
bandinichem: good question. let me relook at the steps15:09
d0ugaljaosorior: Yup, that's for sure - more eyes may help too15:09
chembandini: I mean it doesn't strike you as completely idiotic :)15:09
*** morazi has quit IRC15:09
jaosoriord0ugal: just popped again here http://logs.openstack.org/58/449558/1/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/2ff609a/logs/undercloud/home/jenkins/overcloud_validate.log.txt.gz15:10
jaosorior* popped up15:10
*** ooolpbot has joined #tripleo15:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION15:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468115:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477015:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495515:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517415:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]15:10
*** ooolpbot has quit IRC15:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]15:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)15:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]15:10
bandinichem: with my small brain, just about everything not done by me strikes me as awesome and amazing ;)15:10
d0ugaljaosorior: http://logs.openstack.org/83/449583/2/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/ed030c5/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2017-03-24_15_02_12_68426215:10
d0ugaljaosorior: and go down a bit to the traceback!15:10
d0ugaljaosorior: despite the fact that I re-raise it, it is still a status_code 015:11
openstackgerritRob Crittenden proposed openstack/tripleo-docs master: Small fixups for the TLS everywhere documentation  https://review.openstack.org/44967215:11
jaosoriorduuuuude15:11
jaosoriorwow15:12
jaosoriorwhat the hell15:12
chembandini:haha15:12
d0ugaljaosorior: Maybe RuntimeError is a bad choice? https://github.com/openstack/python-tripleoclient/blob/master/tripleoclient/exceptions.py#L3515:12
*** jprovazn has quit IRC15:12
d0ugalapetrich: ^15:13
jaosoriord0ugal: why would it be skipping RuntimeErrors?15:13
d0ugaljaosorior: No idea.15:13
jaosoriorwell, let's try switching it to Exception then15:13
d0ugaljaosorior: Maybe the thing that checks the status_code is broken?15:13
d0ugalI don't know where/how that happens15:14
apetrichd0ugal, oooohhhh15:14
trowndobson: jaosorior I see the problem15:14
trownd0ugal: rather15:14
trownunping dobson15:14
trownfalse && status_code=0 || status_code=$?15:14
trownthat statement gives exit code of 015:14
d0ugaltrown: oh, lol15:15
trownwait... that is what we want there though... because we need to not fail ansible at that spot15:15
trownthe statement above does set status_code to 115:15
d0ugaltrown: oh, so the status_code=0 doesn't mean the command returned 0?15:16
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: DO NOT MERGE: Are RuntimeErrors handled badly?  https://review.openstack.org/44967615:16
trownd0ugal: well ya if status_code=0 after that it means the command returned 0 exit code15:17
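A minimal sketch of the shell idiom trown quotes above, for anyone reading the log later. It assumes bash and uses `false` as a stand-in for the failing deploy command; the variable name `list_exit` is made up for illustration and is not taken from the quickstart scripts. It shows that the whole `&& ... ||` list exits 0 (so the caller does not abort at that spot) while `status_code` still records the command's own exit code.

```shell
#!/bin/bash
# `false` stands in for a deploy command that fails with exit code 1.
false && status_code=0 || status_code=$?
list_exit=$?   # exit status of the whole && / || list (hypothetical helper variable)

# The list itself succeeds because the final assignment succeeds,
# but status_code holds the exit code of the failed command.
echo "status_code=${status_code} list_exit=${list_exit}"

# Same idiom with a command that succeeds: status_code stays 0.
true && ok_code=0 || ok_code=$?
echo "ok_code=${ok_code}"
```

So a status_code of 0 after this line would mean the wrapped command itself returned 0, which matches trown's last remark.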
*** gaurangt has quit IRC15:19
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates master: Add upgrade tasks for gnocchi container services  https://review.openstack.org/44562715:19
bandinichem: looks good to me, I added a comment15:21
beekneemechtripleo.org is back up15:21
jistrweshay, matbu: so i've just had a full successful e2e upgrade from non-containerized master to containerized master with this https://review.openstack.org/#/c/448576/15:22
jistrweshay, matbu: however, i'd like to ask you for some help, if you have bandwidth for it (e.g. next week or so) with actually getting CI to run it15:22
*** morazi has joined #tripleo15:22
openstackgerritNuman Siddique proposed openstack/puppet-tripleo master: Pacemaker support for OVN DB servers  https://review.openstack.org/37227415:23
jistrthere seems to be multiple ways to run CI jobs, from my understanding of reading the current toci repo15:23
jistrand i don't mean just tripleo.sh vs OOOQ. IIUC there's actually multiple different approaches to trigger jobs via OOOQ15:23
*** gkadam has joined #tripleo15:24
jistrso i would definitely appreciate some assistance in figuring this out (even if i figured it out, i may not figure out a way which is in sync with what's the intended direction of TOCI in following weeks/months)15:25
jaosoriorAlright, I'm off. Have a good weekend everyone!15:25
openstackgerritEmilien Macchi proposed openstack/tripleo-docs master: Basic structure of TripleO Deployment Guide  https://review.openstack.org/44968415:26
jistrafter we've got CI to actually execute this, we can figure out how to make it do ocata->master rather than master->master, and perhaps add converge etc.15:26
*** udesale has quit IRC15:27
jistrthough on my machine which didn't do anything else, it took 2.5 hours to run the whole thing (without converge or pingtest)15:27
jistrso it might be quite tight w/r/t job runtime15:28
jistrweshay, matbu ^15:28
shardyjistr: I think we'll have to optimize this before it'll run in CI15:28
shardyjistr: when I was testing local composable upgrades the CI runtimes were considerably longer15:28
shardyso I suspect this will be the same/similar15:28
jistrshardy: right, most likely. But i'd quite like the next step to be the CI at least attempting to execute what we have. It's quite difficult to judge how far we're from that.15:29
shardythat said we can still be getting a WIP/experimental job to quantify that :)15:29
jistre.g. multinode OOOQ has its own playbook that cannot do upgrade so far15:29
shardyjistr: ack, yeah sounds good15:29
jistrhttps://github.com/openstack-infra/tripleo-ci/blob/master/scripts/quickstart/multinode-playbook.yml15:29
jistrso i expect there's quite some work ahead to at least get the CI to red :D15:29
jistrand then we can figure out how to make it green :))15:29
EmilienMadarazs: https://review.openstack.org/#/c/448511/15:29
shardylol15:29
shardyprobably true :)15:29
EmilienMadarazs: Sagi is not here today, can you update it please? so we can merge it today.15:29
adarazsEmilienM: okay.15:30
*** jaosorior has quit IRC15:30
jistrflaper87, mandre: ^^ fyi re upgrades CI status15:31
openstackgerritMarius Cornea proposed openstack/tripleo-heat-templates master: Stop openstack-nova-compute during nova-ironic upgrade  https://review.openstack.org/44959615:31
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates master: Add ceilometer ipmi agent  https://review.openstack.org/43044715:32
trozetmwhahaha: when you have a minute, can you please review: https://review.openstack.org/#/c/448827/15:32
EmilienMadarazs: on #openstack-infra, pabelanger is helping to promote some tripleo gate jobs in zuul15:32
*** athomas has quit IRC15:33
EmilienMadarazs: and he asked me for the patches we want to merge ASAP. Please look if I missed some15:33
flaper87jistr: thanks15:33
EmilienMadarazs: http://eavesdrop.openstack.org/irclogs/%23openstack-infra/latest.log.html (look at the end)15:34
adarazsEmilienM: looking15:34
*** dparkes has quit IRC15:34
*** jkilpatr has quit IRC15:35
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: WIP: SSH known_hosts config  https://review.openstack.org/44966015:36
openstackgerritEmilien Macchi proposed openstack/tripleo-docs master: Basic structure of TripleO Deployment Guide  https://review.openstack.org/44968415:36
*** aufi has quit IRC15:41
adarazsEmilienM: I think those should be enough, and of course this repo one I just -1'd :)15:41
EmilienMadarazs: yeah please update it so we can approve it and land it today15:42
adarazsdamn, that reponame is already used in there before...15:42
*** prateek has quit IRC15:42
openstackgerrityolanda.robla proposed openstack/tripleo-common master: WIP: Add creation of security hardened images  https://review.openstack.org/44852815:44
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: DO NOT MERGE: Are RuntimeErrors handled badly?  https://review.openstack.org/44967615:45
EmilienMadarazs, trown : I'm talking with Paul Belanger now about our situation and the fact that patches don't merge15:45
EmilienMI'm proposing to force-merge some patches without waiting for zuul15:46
EmilienMthat would be outstanding15:46
d0ugalsounds dangerous :)15:46
EmilienMI know15:46
d0ugalbut obviously useful15:46
EmilienMbut jobs are failing a lot15:46
adarazsit's non working gates vs. probably non working gates, so I think we should take our chances :P15:46
openstackgerrityolanda.robla proposed openstack/tripleo-image-elements master: Add overcloud-secure-block-device element  https://review.openstack.org/44912215:46
EmilienMgimme a list of patches you want to merge and I'll see what I can do15:47
d0ugaladarazs: lol, I like our chances!15:47
*** social has quit IRC15:48
EmilienMtrown, adarazs: please help me to set gerrit topic to tripleo/outstanding for the patches we want to land today15:48
owalshchem: ack15:48
*** yamahata has joined #tripleo15:48
*** flepied has quit IRC15:49
adarazsEmilienM: you know if we're so much in crunch mode we could submit https://review.openstack.org/448511 as is, I was even unsure if I should -1 for these.15:49
adarazsI'll give it a +2 and +w and see what happens.15:49
dmsimardEmilienM: the patches in the buildlogs bug are probably worth looking at ( current repo issue and  https )15:50
dmsimardThey're contributing to the gate instability situation.15:50
EmilienMdmsimard: yes15:51
EmilienMcan someone review https://review.openstack.org/#/c/448041/2 please ?15:51
EmilienMat least +2, so we can approve later15:51
*** d0ugal has quit IRC15:52
*** amoralej|lunch is now known as amoralej15:52
* adarazs still can't +2 there.15:53
*** gbarros has quit IRC15:55
*** gbarros has joined #tripleo15:55
jristtrying again: can we get a +A? https://review.openstack.org/#/c/414169/ - EmilienM?15:56
EmilienMjrist: bad time now15:56
jristsorry :(15:57
EmilienMI'm trying to get CI back15:57
jristEmilienM: I understand. anything I can do to help?15:57
EmilienMjrist: and we have plenty of reviewers here, please don't ping me15:57
EmilienMjrist: don't ping me for reviews15:57
jristEmilienM: noted. sorry15:57
EmilienMthat will help me to focus15:57
shardyjrist: I'll take a look, I've reviewed that one before15:57
jristshardy: thanks yeah you already +1'd15:58
openstackgerritAttila Darazs proposed openstack/tripleo-quickstart-extras master: GATE CHECK  https://review.openstack.org/43027716:00
*** pcaruana has quit IRC16:03
*** udesale has joined #tripleo16:04
EmilienMI would gently ask folks to not recheck patches from now on16:07
EmilienMpabelanger and I are working on merging critical patches in tripleo that would help to have a more stable CI16:07
EmilienMthe queue in zuul is huge16:08
jristEmilienM: noted. thanks16:08
jristhttp://img2.wikia.nocookie.net/__cb20140425031646/villains/images/f/fe/Vlcsnap-2014-04-24-23h13m48s25.png16:08
EmilienMthat's /me now :D16:09
*** udesale has quit IRC16:09
*** pabelanger has joined #tripleo16:09
*** udesale has joined #tripleo16:09
*** ooolpbot has joined #tripleo16:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION16:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468116:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477016:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495516:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517416:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]16:10
*** ooolpbot has quit IRC16:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]16:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)16:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]16:10
*** trown is now known as trown|lunch16:10
*** d0ugal has joined #tripleo16:14
*** Goneri has quit IRC16:15
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: WIP: SSH known_hosts config  https://review.openstack.org/44966016:16
openstackgerrityolanda.robla proposed openstack/tripleo-common master: Add creation of security hardened images  https://review.openstack.org/44852816:17
*** thrash is now known as thrash|f00dz16:18
*** sshnaidm|off has joined #tripleo16:19
*** skramaja has joined #tripleo16:19
EmilienMI had to -2 some patches to kill them from zuul and free some resources16:19
EmilienMI'll remove -2 before EOD16:20
*** d0ugal has quit IRC16:22
akrivokashardy: many thanks16:23
*** jzimnowoda has quit IRC16:26
*** jzim has quit IRC16:27
* adarazs off, have a nice weekend folks!16:29
openstackgerritLuke Hinds proposed openstack/tripleo-heat-templates master: Adds service for managing securetty  https://review.openstack.org/44915316:32
*** rlandy is now known as rlandy|brb16:32
openstackgerritLuke Hinds proposed openstack/puppet-tripleo master: Adds service for managing securetty  https://review.openstack.org/44914816:33
shadowerUI people: how do you set up your undercloud these days? Still a custom script or is it quickstart now?16:33
* shadower has a 4 month-old setup that's beyond broken16:34
shadowerjrist, florianf ^16:34
jristhehe16:34
jristtriple OHHHHHH q16:35
jristis the recommended way16:35
jristoooq16:35
shadoweroooqay16:35
jrist:)16:35
mwhahahalots of crying and hoping it works16:35
mwhahahathat's how i deploy16:35
jristhahaha16:35
shardylol :)16:35
jristmwhahaha: in the instructions it specifically said weeping16:36
jristpreferably in the corner in a fetal position16:36
mwhahahayes it is the most comfortable way16:36
*** bogdando has quit IRC16:36
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates master: WIP - Add upgrade_batch_tasks to neutron-l3-agent  https://review.openstack.org/44549416:37
deadnullim running into a validation error on ocata deploy, maybe someone can point me in the right direction... "unexpected keyword argument 'user_domain_id'" - https://gist.github.com/rvalente/e1ecde794b393b6d3f90d97a13aac4cb16:38
florianfshadower: last time I still used instack-virt-setup because oooq didn't work for me immediately and I didn't have the time to look into it16:39
shadowerhm okay16:39
shadowerthanks16:39
openstackgerritAdam Harwell proposed openstack/diskimage-builder master: Use DIB_PYTHON_VERSION to run commands  https://review.openstack.org/44972116:39
openstackgerritAlex Schultz proposed openstack/tripleo-heat-templates master: Run update after RHEL registration  https://review.openstack.org/44972416:42
*** thrash|f00dz is now known as thrash16:44
*** fragatina has quit IRC16:46
*** jlinkes has quit IRC16:47
*** salmankhan has quit IRC16:48
chemgood week end people16:52
*** rwsu has quit IRC16:57
*** yamahata has quit IRC17:00
*** rwsu has joined #tripleo17:00
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates master: WIP: N->O upgrade, blanks ipv6 rules before activating it.  https://review.openstack.org/44961317:02
openstackgerrityolanda.robla proposed openstack/diskimage-builder master: WIP: Add lvm management to diskimage-builder  https://review.openstack.org/44440317:03
*** udesale has quit IRC17:06
weshayjistr, nice man.. yes I think we can get you help :)17:10
*** ooolpbot has joined #tripleo17:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION17:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468117:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477017:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495517:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517417:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]17:10
*** ooolpbot has quit IRC17:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]17:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)17:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]17:10
*** rlandy|brb is now known as rlandy17:11
jistrweshay: great, thanks :)17:11
*** lmiccini has quit IRC17:12
weshayrlandy, my local deploy w/ the script worked.. full deploy + validate17:13
weshayjistr, I would have thought that needed much more code17:13
*** zoli is now known as zoli|gone17:13
weshayimpressvie17:13
weshayimpressive17:13
jistrwell it's just master->master, no converge, no pingtest17:14
jistrjust the most crucial stuff17:14
*** cylopez has quit IRC17:14
jistra lot of logic was already in place for reuse17:14
jistrlike prep-containers, and the upgrade skeleton17:15
jistrmainly i'm concerned about how will we get CI to run the upgrade on multinode, which will be new17:15
morazijistr, awesome!17:15
*** mcornea has quit IRC17:15
jistrand about the run time in general17:15
openstackgerritDougal Matthews proposed openstack/python-tripleoclient master: DO NOT MERGE: Are RuntimeErrors handled badly  https://review.openstack.org/44967617:15
morazijistr, (not about the concern, but about getting it all to work)17:16
jistrhaha yea i understood17:16
jistrthanks :)17:16
*** d0ugal has joined #tripleo17:18
weshayrlandy, I'd like to update the internal ovb osp gate to also take changes to quickstart17:18
rlandyweshay: ok - sure17:21
jristshadower: any luck?17:22
rlandyweshay; we run very little from quickstart there though17:22
jristI'm genuinely curious if that is the best way17:22
*** dtantsur is now known as dtantsur|pto17:22
weshayrlandy, I'm going to only trigger w/ changes to the ovb scripts17:22
shadowerjrist: just started quickstart and keeping the fingers crossed17:22
jristshadower: nice let me know how that goes :)17:23
rlandyok17:23
*** suuuper has quit IRC17:24
fultonjdprince: would you mind dropping a comment into https://review.openstack.org/#/c/387631 ?17:25
fultonjdprince: i know we had talked about it tuesday but if you are unable to comment on it, then can you recommend someone else from the containers squad who might be a good person to review it? thanks.17:26
dprincefultonj: sure, I've got it up17:26
fultonjthanks17:26
dprincefultonj: will comment on the spec too17:26
dprincefultonj: the SPEC has been on my todo. Sorry for being slow17:27
fultonjdprince: no problem, i'm glad it's on your list. thanks17:27
*** flepied has joined #tripleo17:28
openstackgerritLuke Hinds proposed openstack/puppet-tripleo master: Adds service for managing securetty  https://review.openstack.org/44914817:29
*** jmelvin has quit IRC17:33
weshayrlandy, did the port quotas get lifted?17:34
*** trown|lunch is now known as trown17:34
bkerotrown, adarazs: Could I get some eyes on https://review.openstack.org/#/c/446934/ and https://review.openstack.org/#/c/447142/ ?17:36
*** yamahata has joined #tripleo17:36
weshaytrown, re: ovb I thought I'd try to reuse the full-deploy-ovb.sh script for something that devs would use.  It's a little limited due to backwards compat until everything is updated and merged. https://review.openstack.org/#/c/449370/3/ci-scripts/full-deploy-ovb.sh17:37
trownbkero we need some patch to tht stable/ocata to test https://review.openstack.org/447142 actually works17:38
trownbkero: so a dummy tht patch that depends on ^17:38
bkerotrown: https://review.openstack.org/#/c/447714/17:38
bkeroThere's newton17:38
bkerotrown: https://review.openstack.org/#/c/446865/17:38
bkeroThere's ocata17:38
trownbkero: sweet17:38
*** psahoo_ has joined #tripleo17:39
trownthanks17:39
*** psahoo|away has quit IRC17:39
*** derekh has quit IRC17:40
openstackgerritBen Kero proposed openstack/tripleo-quickstart-extras master: Refactor the toci-vxlan-networking script to be more correct  https://review.openstack.org/44693417:40
*** milan has quit IRC17:41
bkero^ That's just changing the commit message17:41
trownbkero: reviewing that now, sorry it has taken me so long17:43
bkeroNo worries17:43
*** lucasagomes is now known as lucas-afk17:44
*** akrivoka has quit IRC17:46
pradkEmilienM, mwhahaha, this is passing ci https://review.openstack.org/#/c/448762/ .. can we get it in please17:47
*** dparkes has joined #tripleo17:48
*** tesseract has quit IRC17:50
*** psahoo_ has quit IRC17:50
EmilienMpradk: I'll approve it once our zuul queue is low again17:51
EmilienMwe're still trying to land CI patches17:51
trowncascading failures17:53
trownthats the name of my new harsh noise band17:53
pradkEmilienM, sure, we just need this soon to unblock 11 testing .. if we can get this merged and into ocata by this weekend i'm cool17:54
EmilienMpradk: no worries, it will merge today17:55
*** jpena is now known as jpena|off17:55
trownthat seems a bit optimistic :P17:55
openstackgerritMerged openstack-infra/tripleo-ci master: Only force http for repos that allow it  https://review.openstack.org/44818717:57
openstackgerritMerged openstack/tripleo-quickstart-extras master: Exit with error code when overcloud wasn't deployed  https://review.openstack.org/44854117:58
EmilienMwoot17:58
EmilienM2 down17:58
*** jcoufal has joined #tripleo17:58
trownfinally17:59
trownthough I am not sure either of those fixes the root issue with DLRN17:59
dmsimardone at a time, there's a couple outstanding issues18:00
*** ckyriakidou has quit IRC18:00
*** skramaja has quit IRC18:00
*** jcoufal__ has joined #tripleo18:00
*** jcoufal_ has quit IRC18:01
openstackgerritBen Kero proposed openstack/tripleo-quickstart-extras master: Refactor the toci-vxlan-networking script to be more correct  https://review.openstack.org/44693418:02
bkerotrown: dealt with all your referenced issues :)18:02
trownbkero: awesome thanks18:02
pabelangerya, looks like things are catching up18:02
pabelangerat least on gate pipeline18:02
bkeroIt's worth noting that ovb already had a default variable called 'mtu'18:02
bkeroSo we could simply use that, or use vxlan_mtu18:03
*** jcoufal has quit IRC18:03
EmilienMbeekneemech: pabelanger told me rh1 doesn't have capacity to handle gate-tripleo-ci-centos-7-ovb-containers-oooq-nv in check pipeline anymore. Thoughts?18:03
pabelangercheck-tripleo pipeline, is another story.  You likely want to remove gate-tripleo-ci-centos-7-ovb-containers-oooq-nv from tripleo-check pipeline. tripleo-test-cloud-rh1 does not have the capacity to keep up now with demand18:03
beekneemechEmilienM: I said that at the PTG and people went ahead and added the job anyway.18:04
pabelangersome quick math shows me, anything above 55 items in the pipeline, will take 24 hours to drain18:04
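A rough back-of-the-envelope version of that drain estimate, as a sketch only. It takes the two figures quoted in the channel (55 items, 24 hours) and assumes the queue drains at a constant rate, which real Zuul throughput will not do exactly; the variable names are made up for illustration.

```shell
#!/bin/bash
# Figures quoted in the channel: roughly 55 queued items, roughly 24 hours to drain.
queue_items=55
drain_hours=24

# Assumption: constant drain rate. Integer division, so the result is rounded down.
minutes_per_item=$(( drain_hours * 60 / queue_items ))

echo "about ${minutes_per_item} minutes of wall-clock time per queued item"
```

Under that assumption every extra change added to the pipeline costs on the order of half an hour of backlog, which is why cutting jobs is being discussed.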
beekneemechYeah, we've been 100% utilized all week, and were the same for the last 3 days of last week.18:04
beekneemechexperimental jobs haven't run in like 3 days18:05
EmilienMdprince: ^ any thoughts on this?18:05
pabelangerbeekneemech: right, need to scale back the jobs to help drain the pipelines18:06
dprinceEmilienM: if our aim is to deliver containers in Pike I think we pick another job perhaps18:06
*** sshnaidm|off has quit IRC18:06
*** axisys has quit IRC18:06
EmilienMdprince: multinode probably18:07
dprinceEmilienM: and also, make the containers overcloud job run faster. Is tempest still running there?18:07
EmilienMdprince: not afaik18:07
dprinceEmilienM: okay, mandre must have disabled it.18:07
pabelangerdo you need ovb for the container stuff?18:07
pabelangeror can that be on public clouds18:07
dprincepabelanger: ideally we would be testing it end to end18:08
dprincepabelanger: somewhere...18:08
dprincewe should cut a job18:08
dprincepabelanger: I support you in this18:08
dprinceEmilienM: unclear to me if containers is the job to cut. Perhaps we cut something else? Or make it periodic18:09
dprinceEmilienM: or make something else that isn't being worked on everyday a periodic job18:09
*** ooolpbot has joined #tripleo18:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION18:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468118:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477018:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495518:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517418:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]18:10
*** ooolpbot has quit IRC18:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]18:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)18:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]18:10
dprincebeekneemech, EmilienM if jobs are pegged we have to cut something. We should do this quickly and start an email thread about what to keep with our given resources18:10
pabelangerYa, I think you are at the point to make the tough call on what to scale back on, until you get more OVB resources18:12
bkerotrown: ok, retriggered all the test jobs. Depending on how nodepool is feeling today we'll see if the last change didn't break anything in about 1.5hr.18:13
*** gkadam has quit IRC18:15
EmilienMbeekneemech, dprince: could we move non-ha features into ha?18:16
EmilienMso we keep ovb-ha, ovb-updates and ovb-containers18:16
dprinceEmilienM: fine by me18:16
*** jkilpatr has joined #tripleo18:16
beekneemechEmilienM: Some, but a big part of the reason for three jobs was testing all three major net archs - sans net-iso, ipv4 net-iso, and ipv6 net-iso.18:17
*** eck` is now known as eck`gone18:17
EmilienMbeekneemech: the "sans net-iso" might be skipped?18:17
*** d0ugal has quit IRC18:17
dsneddonchem, How's the stuff going with the IPv6 FixedIPs? I haven't tested the external loadbalancer/IPv6 combo, but I think you are on the right track going from the *Vip to the *FixedIPs parameters.18:18
beekneemechEmilienM: It's what developers are most likely to use, so if it breaks a) it blocks a bunch of people and b) it probably gets fixed quickly.  So that might be an acceptable compromise.18:19
EmilienMbeekneemech: I would give it a try until it breaks us next time and see how we can do18:19
beekneemechThe other problem we have is that we can't switch to containers until containers is passing consistently.18:19
beekneemechAnd preferably not using the full 2:45 every run.18:20
beekneemechTrading jobs is still going to hurt our capacity if we swap a 100 minute nonha job for a 160 minute containers job.18:21
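To put a number on that trade, here is a small sketch using only the two runtimes mentioned above (100 minutes for nonha, 160 minutes for containers). It assumes both jobs occupy the same amount of OVB hardware per run, which the log does not state; variable names are illustrative.

```shell
#!/bin/bash
# Runtimes quoted in the channel, in minutes.
nonha_minutes=100
containers_minutes=160

# Assumption: both jobs hold the same set of nodes for their whole runtime,
# so cloud time consumed scales with job duration.
extra_percent=$(( (containers_minutes - nonha_minutes) * 100 / nonha_minutes ))

echo "each swapped run uses ${extra_percent}% more cloud time than the job it replaces"
```

On those assumptions a one-for-one swap makes the capacity problem worse rather than better, unless the containers job gets faster.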
*** dtrainor has quit IRC18:21
chemdsneddon: not sure what you're referring to, I'm mostly interested in a working solution to https://bugzilla.redhat.com/show_bug.cgi?id=143075718:21
openstackbugzilla.redhat.com bug 1430757 in rhosp-director "OSP10 -> OSP11 upgrade fails for deployments using external loadbalancer" [Urgent,Assigned] - Assigned to sathlang18:21
*** iranzo has quit IRC18:21
chemdsneddon: If you could have a look at what could be the best course of action that would be great18:22
chemdsneddon: I must go now :)18:22
dsneddonchem, I mist have misunderstood the IRC log. I'll comment on this BZ.18:22
dsneddonmust have18:22
dsneddonchem, What's in -e ~/openstack_deployment/environments/external-lb.yaml18:24
EmilienMbeekneemech: I agree18:24
openstackgerritJustin Kilpatrick proposed openstack/tripleo-quickstart-extras master: Fix introspection with retries edge case  https://review.openstack.org/43794618:24
dsneddonchem, And why are you importing both that and environments/external-loadbalancer-vip.yaml?18:24
*** chlong has quit IRC18:24
chemdsneddon: I'm not :) it's a bug report I didn't do it.  The patch shown inside the bz is kind of pretty clear, but if mcornea missed something just add it to the bz18:25
*** chem is now known as chem_off18:25
dsneddonchem_off, I found the environment files linked... thanks.18:26
*** jmelvin has joined #tripleo18:26
EmilienMdprince: could you help on this topic please? you're focused on containers. It would be great to transform the words ^ into a patch in tripleo-ci asap, our CI issues are super serious18:27
dprinceEmilienM: yes18:27
*** jcoufal has joined #tripleo18:29
*** jcoufal__ has quit IRC18:31
*** sshnaidm|off has joined #tripleo18:34
openstackgerritBen Kero proposed openstack/tripleo-quickstart-extras master: Refactor the toci-vxlan-networking script to be more correct  https://review.openstack.org/44693418:38
*** dtrainor has joined #tripleo18:38
EmilienMfyi all: I'm -2 all patches in gate, abandon & restore them, to clear the queue and let ci patches to land in priority18:40
*** tosky has quit IRC18:43
dprinceEmilienM: ack, do it18:44
weshaytrown, I think we have validated rlandy's ovb changes18:45
weshayshould be good to go there18:45
trownweshay: what do you mean?18:46
EmilienMpabelanger: all done, the queue is low now18:46
weshaytrown, the two changes listed in https://review.openstack.org/#/c/449370/18:46
*** dtrainor has quit IRC18:46
weshaylines 154 15518:46
pabelangerEmilienM: okay, so lets not enqueue anything more18:46
pabelangeruntil we land the current service18:47
pabelangerseries*18:47
EmilienMok18:47
pabelangerthen, we can enqueue and promote again if needed18:47
EmilienMpabelanger: what if one of the patches we need fails again?18:48
pabelangerEmilienM: enqueue / promote18:48
EmilienMpabelanger: what about asking fungi to force-merge them18:48
pabelangerlets see18:49
pabelangerif they still fail now, we have some other gate issue18:49
pabelangerbecause all fixes should be in gate18:49
dprincebeekneemech: containers runs without network iso ATM so it should be okay I think right?18:49
dprincebeekneemech: it would cover that case still...18:49
EmilienMdprince: if beekneemech is ok with that, one of the first actions is to see what ovb-nonha does and migrate it to ovb-ha (except the net-iso thing, where ovb-ha will keep current config) - and retire ovb-nonha18:50
dprinceEmilienM: ack18:50
*** jmelvin has quit IRC18:50
beekneemechOh, we have another problem: http://git.openstack.org/cgit/openstack-infra/tripleo-ci/tree/README.rst#n9818:51
beekneemechha is the only job type that doesn't use ceph18:51
EmilienMwell, let's use ceph...18:51
*** jmelvin has joined #tripleo18:51
EmilienMor maybe keep updates with ceph18:52
beekneemechAlso, it's going to break our existing ssl certs.18:52
beekneemech(which can be fixed, but something to be aware of)18:52
beekneemechEmilienM: No, opposite problem.  We'll _only_ be testing with ceph if we merge ha and nonha.18:52
trozetEmilienM: whats going on with my patch?18:52
*** dtrainor has joined #tripleo18:52
beekneemechIf containers doesn't use ceph then that's probably okay though.18:52
EmilienMtrozet: please read the backlog here...18:53
EmilienMtrozet: "+EmilienM | fyi all: I'm -2 all patches in gate, abandon & restore them, to clear the queue and let ci patches to land in priority"18:53
* beekneemech badly needs lunch18:53
trozetEmilienM: oh ok18:53
*** jmelvin has quit IRC18:56
*** jmelvin has joined #tripleo18:56
fungiEmilienM: fwiw it doesn't need to be my call to bypass ci votes. any of our infra-root gerrit admins have that ability. however if we bypass testing to merge something which makes your jobs even more broken, that's not a great situation so one we tend to be very cautious about18:57
*** jcoufal_ has joined #tripleo18:57
*** hexo has quit IRC18:57
pabelanger EmilienM: fungi: Ya, that is the scenario I am trying to avoid honestly18:58
*** hexo has joined #tripleo18:58
pabelangerI think promote / enqueue is a good option currently18:58
openstackgerritwes hayutin proposed openstack/tripleo-quickstart master: WIP, make script work in CI and interactive  https://review.openstack.org/44937018:58
EmilienMfungi: I agree with you. Though the situation is outstanding for us atm18:58
*** jcoufal has quit IRC18:58
*** sshnaidm|off has quit IRC18:59
fungithe idea with pre-merge gating is that it should hopefully prevent you from merging changes which completely break your jobs, so if we bypass the gate we lose that assurance18:59
fungiand increase the chances that we'll need to force another change through to fix the previous one19:00
pabelangerYa, I've heard too many horror stories of force merges gone bad19:00
EmilienMfungi: the patches we want to land already passed check jobs (and failed randomly in gate)19:00
pabelangerEmilienM: the issue however is, none of the other patches in check have tested against that patch.19:01
pabelangerthat's why promote / enqueue in gate is powerful19:01
fungior depends-on19:01
pabelangerit is a dependent pipeline, so every patch behind it gets said patch19:01
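The dependent-pipeline behaviour described above can be sketched roughly as follows. This is only an illustration of the idea that each change in the gate queue is tested on top of every change ahead of it; the function name and the queue contents are hypothetical, and this is not Zuul's actual implementation.

```python
def speculative_states(gate_queue):
    """For each change in a dependent (gate) queue, list what it is tested with.

    Assumption for illustration: a change is tested together with every
    change ahead of it in the queue, in queue order.
    """
    states = []
    for position, change in enumerate(gate_queue):
        # Everything up to and including this change is part of its test state.
        states.append((change, list(gate_queue[:position + 1])))
    return states


# Hypothetical queue, reusing change numbers mentioned in this log.
queue = ["449023", "448041", "448511"]
for change, tested_with in speculative_states(queue):
    print(change, "is tested with", tested_with)
```

Promoting a change to the head of such a queue means every change behind it is tested with it, which is the property being relied on here.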
EmilienMthis week was a "horror story" for tripleo CI; really, things can't be more bad19:02
EmilienMworse even19:02
*** pkovar has quit IRC19:02
pabelangerI only found out how bad things were today19:02
pabelangerand was able to jump in and help19:02
pabelangerright now, we are about 1h9min from the tripleo change queue for gate being empty19:03
pabelangerlet's see what fails19:03
EmilienMpabelanger: http://tripleo.org/cistatus.html - a nice overview to see how red was our week19:04
*** jmelvin has quit IRC19:05
*** ooolpbot has joined #tripleo19:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION19:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167468119:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167477019:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167495519:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/167517419:10
openstackLaunchpad bug 1674681 in tripleo "buildlogs.centos.org CDN issues" [Critical,Triaged]19:10
*** ooolpbot has quit IRC19:10
openstackLaunchpad bug 1674770 in tripleo "Update timeout too long in CI" [Critical,Triaged]19:10
openstackLaunchpad bug 1674955 in tripleo "oooq now believes that overcloud deploy succeeded even though it failed" [Critical,Triaged] - Assigned to Sagi (Sergey) Shnaidman (sshnaidm)19:10
openstackLaunchpad bug 1675174 in tripleo "Timeout passed to overcloud deploy not effective" [Critical,Triaged]19:10
*** eck`gone is now known as eck`19:15
dmsimardso far so good... http://i.imgur.com/iwMsRiB.png19:16
pabelangeralmost19:19
pabelangerhttp://logs.openstack.org/41/448041/2/gate/gate-tripleo-ci-centos-7-undercloud-oooq/3f7a9c8/logs/undercloud/home/jenkins/install_packages.sh.log.txt.gz19:19
pabelangerthat just failed19:19
pabelanger[Errno 14] curl#35 - "TCP connection reset by peer"19:19
EmilienMhttps://trunk.rdoproject.org/centos7/current/puppet-mysql-3.10.1-0.20170324145036.b9a0a55.el7.centos.noarch.rpm: [Errno 14] curl#35 - "TCP connection reset by peer"19:20
EmilienMdmsimard: ^19:20
pabelangerdmsimard: how large is dlrn current?19:20
pabelangerGB wise19:20
dmsimardpabelanger: not large, iirc it's like 250MB ? The problem is it's fast moving so hard to mirror19:21
dmsimardlet me look19:21
pabelangerdmsimard: how fast would you say19:22
dmsimardon a good day it could change 100 times a day ?19:22
pabelangeris this part of the promotion stuff?19:22
dmsimard /current/ is a symlink that changes when a new package is built19:22
pabelangerk19:23
dmsimardEmilienM: what is using /current/ anyway ?19:23
EmilienMdmsimard: tripleo-ci19:23
dmsimardEmilienM: there's nothing, nowhere, that should be using /current/19:23
pabelangerdepending on the size, we could have more aggressive mirroring19:23
EmilienMdmsimard: we use current for tripleo packages19:23
pabelangerdepends on what uses it19:23
EmilienM(tht, puppet-tripleo, etc)19:23
pabelangerEmilienM: how far back could that lag?19:24
dmsimardEmilienM: why not consistent ? or the CDN repository for promoted packages ?19:24
*** hewbrocca_afk is now known as hewbrocca19:24
openstackgerritDan Prince proposed openstack-infra/tripleo-ci master: Combine HA and non-ha features  https://review.openstack.org/44978519:24
*** jkilpatr has quit IRC19:24
dprinceEmilienM, beekneemech ^^19:24
*** hewbrocca is now known as hewbrocca_afk19:24
EmilienMwe don't want to lag on tripleo packages19:25
EmilienMdprince: sounds good to me, I'll review it, thanks19:26
dmsimardEmilienM: we have a whole process for promoting tripleo packages to a cdn repository for the purpose of the gate, I don't understand why current is used19:26
dprinceEmilienM: I will push a project-configs patch to remove nonha now too?19:26
dprinceEmilienM: or you are doing that?19:26
EmilienMdmsimard: like I said, just for tht, instack, puppet-tripleo19:26
dmsimardEmilienM: that failure came from puppet-mysql19:27
*** jmelvin has joined #tripleo19:27
EmilienMdprince: you can push for it. Some jobs in puppet ci are using ovb-nonha, we need to find an alternative19:27
dmsimardso it's a dependency from puppet-tripleo I guess19:27
dprinceEmilienM: ovb-ha will work I think there?19:27
EmilienMdmsimard: probably and it's not our intention to skip promoted repo on puppet-*19:28
EmilienMdprince: probably, yes19:28
slaglewe use current, b/c we want the absolute latest tripleo packages and puppet modules19:28
slaglesince we are so heavily dependent on the puppet modules19:28
*** jcoufal_ has quit IRC19:28
slagleotherwise, we aren't testing tripleo patches against "latest". it would be against last promote19:28
pabelangerdmsimard: this appears to be how it is setup: http://logs.openstack.org/41/448041/2/gate/gate-tripleo-ci-centos-7-undercloud-oooq/3f7a9c8/logs/undercloud/home/jenkins/repo_setup.sh.txt.gz19:29
*** jcoufal has joined #tripleo19:29
EmilienMslagle: yes that19:29
dmsimardslagle: but I thought that was the whole point of the promotion process, to test things so it doesn't break the gate19:29
dmsimardslagle: and the gate uses the promoted packages19:29
EmilienMdmsimard: except tripleo projects19:29
slagledmsimard: no, that's not the point19:29
EmilienMdmsimard: because tripleo projects are tested by tripleo jobs19:29
dmsimardEmilienM: but they're rebuilt in-flight anyway if required/depends-on, right ?19:30
slaglethe promotion process is to validate an entire repo19:30
dmsimardpabelanger: baseurl=https://trunk.rdoproject.org/centos7/current19:30
dmsimardthat's bad19:30
pabelangerwe should just mirror that to AFS19:30
dmsimardpabelanger: there's  a patch in the queue19:30
dmsimardto fix it19:30
pabelangerdmsimard: which?19:31
EmilienMyes19:31
dmsimardlet me find it19:31
EmilienMhttps://review.openstack.org/#/c/448512/19:31
pabelangerya19:31
pabelangerit is on list to promote19:31
dmsimardpabelanger: https://review.openstack.org/#/c/448512/19:31
pabelangeronce other 2 merge19:31
dmsimardsince /current/ changes every time a package is built, jobs can land on 404s or other errors since the symlink is updated19:32
EmilienMthat's one of the things we're fixing I think19:32
dmsimardyeah19:33
pabelangerdmsimard: we get around that issue in AFS by not deleting existing packages on disk, then mirroring new things. Only after 2 hours do the original packages get deleted19:33
dmsimardjust saying that might explain the yum error we just got19:33
pabelangerin an effort not to break jobs like you say19:33
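A minimal sketch of the "keep old packages for a grace period" idea described above. It assumes the mirror script tracks when each file was first seen missing upstream; the function name, the bookkeeping dict and the file names are hypothetical, and this is not the actual AFS mirroring code.

```python
GRACE_SECONDS = 2 * 60 * 60  # the "2 hours" mentioned above


def files_safe_to_delete(mirror_files, upstream_files, missing_since, now):
    """Return mirrored files that vanished upstream at least GRACE_SECONDS ago.

    mirror_files / upstream_files: sets of file names.
    missing_since: dict mapping file name -> epoch seconds when the file was
    first seen missing upstream (hypothetical bookkeeping, updated in place).
    """
    stale = mirror_files - upstream_files
    # Forget files that are no longer missing (or no longer mirrored).
    for name in list(missing_since):
        if name not in stale:
            del missing_since[name]
    deletable = set()
    for name in stale:
        first_missing = missing_since.setdefault(name, now)
        if now - first_missing >= GRACE_SECONDS:
            deletable.add(name)
    return deletable
```

A job that resolved its package list just before a sync can then still download the old files for up to two hours.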
pabelangerbut, I am not sure how dlrn works when you combine things19:33
dmsimardpabelanger: /current/ and /consistent/ were never meant to be used as baseurls for yum repositories19:34
dmsimardpeople and systems must take the delorean hashed repository (i.e, the target of the symlink) and use that19:34
dmsimardso that it doesn't change in flight19:34
dmsimardso for example19:35
*** gbarros has quit IRC19:35
dmsimardhttps://trunk.rdoproject.org/centos7-master/current/delorean.repo <-- the base url: baseurl=https://trunk.rdoproject.org/centos7/77/e0/77e0621a8968d898877613ebabecda4f32ed353b_cd690faf19:35
dmsimardhttps://trunk.rdoproject.org/centos7/77/e0/77e0621a8968d898877613ebabecda4f32ed353b_cd690faf will never change, it is persistent19:35
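The point above, reading the hashed baseurl out of delorean.repo rather than pointing yum at /current/ directly, could be sketched like this. The helper name is hypothetical, and the section name and extra keys in the sample are assumptions about what a delorean.repo file looks like; only the baseurl value comes from this log. Fetching the file is left out so the sketch runs offline.

```python
import configparser


def pinned_baseurls(repo_text):
    """Map each repo section in a .repo file to its baseurl."""
    # interpolation=None so that any '%' in a URL is taken literally.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(repo_text)
    return {
        section: parser[section]["baseurl"]
        for section in parser.sections()
        if "baseurl" in parser[section]
    }


# Assumed shape of /current/delorean.repo; baseurl is the one quoted above.
sample = """\
[delorean]
name=delorean
baseurl=https://trunk.rdoproject.org/centos7/77/e0/77e0621a8968d898877613ebabecda4f32ed353b_cd690faf
enabled=1
gpgcheck=0
"""

print(pinned_baseurls(sample))
```

Because the hashed URL never changes, a job that resolves it once at the start keeps a consistent package set even if the /current/ symlink moves mid-run.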
dprinceEmilienM: also this https://review.openstack.org/#/c/449791/19:35
dprinceEmilienM: that one is going to be tedious to review :/19:36
pabelangerdmsimard: why do you have /current then?19:36
*** jmelvin has quit IRC19:36
*** gbarros has joined #tripleo19:36
EmilienMdprince: sounds good19:37
dmsimardpabelanger: people are supposed to use /current/delorean.repo, not craft their own .repo file that points to /current/19:37
EmilienMdprince: bookmarked, will review asap19:37
dprinceEmilienM: feel free to push on them too. I don't mind, but I will check back to fix this ASAP19:37
openstackgerritMerged openstack/tripleo-quickstart master: Switch trunk/cbs/buildlogs to use https  https://review.openstack.org/44803619:37
pabelangerdmsimard: seems like an easy fix, don't drop packages into /current :)19:37
dmsimard¯\_(ツ)_/¯19:38
EmilienMdprince: ack19:38
dmsimardThere's a .repo file in there people should be using19:38
EmilienMpabelanger: https://review.openstack.org/#/c/448512/ is in merge conflict, I need to rebase and push again19:40
*** gbarros has quit IRC19:41
openstackgerritEmilien Macchi proposed openstack/tripleo-quickstart master: Change current repo to exact delorean hash  https://review.openstack.org/44851219:41
EmilienMpabelanger: can we promote 448512 please?19:41
*** pkovar has joined #tripleo19:43
*** dprince has quit IRC19:44
openstackgerritMikhail S Medvedev proposed openstack/diskimage-builder master: Create PReP boot partition for PPC  https://review.openstack.org/44773919:44
pabelangerEmilienM: yes, let the next 2 merge first19:45
pabelangeractually19:45
pabelanger1 sec19:45
pabelangerI can enqueue19:45
*** jcoufal_ has joined #tripleo19:46
pabelangerEmilienM: enqueue to gate19:47
EmilienMdone19:47
EmilienMthanks19:47
*** jcoufal has quit IRC19:47
EmilienMpabelanger: what to do with https://review.openstack.org/#/c/448041/ ? should I unqueue it?19:48
EmilienMpabelanger: and recheck?19:48
pabelangerEmilienM: leave for now19:48
*** florianf has quit IRC19:49
*** toure is now known as toure|afk19:49
pabelangeronce it clears out, I will enqueue it again19:49
EmilienMok19:49
*** pkovar has quit IRC19:53
*** liverpooler has quit IRC19:55
*** jmelvin has joined #tripleo19:59
*** axisys has joined #tripleo20:02
*** dprince has joined #tripleo20:03
*** rbrady is now known as rbrady-afk20:09
*** gbarros has joined #tripleo20:12
*** jcoufal_ has quit IRC20:12
*** gbarros has quit IRC20:14
*** jayg is now known as jayg|g0n320:20
EmilienMpabelanger: http://logs.openstack.org/23/449023/2/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/8995277/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2017-03-24_20_09_15_99776920:20
EmilienMI don't know why this one failed20:20
EmilienMit sounds like a multinode issue on this one20:21
*** amoralej is now known as amoralej|off20:25
*** jobewan has joined #tripleo20:25
EmilienMpabelanger: should I recheck https://review.openstack.org/#/c/449023/ ?20:35
*** dougbtv|laptop has quit IRC20:35
openstackgerritIan Main proposed openstack/tripleo-heat-templates master: Remove kolla_config copy from keystone service.  https://review.openstack.org/44767620:35
pabelangerEmilienM: I should be able to enqueue again20:35
pabelangeronce 448511 is done20:36
*** jmelvin has quit IRC20:36
*** jmelvin has joined #tripleo20:38
*** vpickard is now known as vpickard_20:39
*** trown is now known as trown|outtypewww20:41
*** jcoufal has joined #tripleo20:49
*** abishop has quit IRC20:50
openstackgerritIan Main proposed openstack/tripleo-heat-templates master: Remove kolla config files from ironic.  https://review.openstack.org/44562020:50
EmilienMSlower: could we make it for all services in a row? ^ it would save CI resources ...20:52
EmilienMdprince: ^20:52
EmilienMand easier to review imho20:53
dprinceEmilienM, Slower: 2-3 services at a time would be fine I think20:54
dprinceEmilienM: but I think we can go ahead and approve what he has posted probably too20:54
EmilienMwhy not all?20:54
dprinceEmilienM: unless there is a rework20:54
dprinceEmilienM: just saying, batches might be the middle ground20:54
*** yolanda has quit IRC20:54
EmilienMwhy not all in the same patch, see how CI works and merge it20:54
dprinceEmilienM: all in one is fine w/ me20:54
dprinceEmilienM: I post big patches :) so no need to convince me man20:54
EmilienMyeah :D20:55
*** yolanda has joined #tripleo20:56
dprinceSlower: the gist is, the CI pipeline got jammed up (periodic jobs haven't run for days). This was due to the containers job. Anything we can do to help save CI resources is nice I think20:57
EmilienMmburned: wasn't it fixed? https://bugs.launchpad.net/tripleo/+bug/167591420:59
openstackLaunchpad bug 1675914 in tripleo "docker-registry fails to start when installing undercloud on rhel7.3 with rdo-ocata" [Undecided,New]20:59
mburnedEmilienM: i thought so21:00
mburnedEmilienM: https://review.openstack.org/#/q/I89e14cc2a27299ce4c191d2a823deb042469383121:03
mburnedEmilienM: that should have fixed it21:03
EmilienMmburned: yeah21:03
EmilienMdhill_: can you self triage when you file a bug?21:03
*** yolanda has quit IRC21:04
mburnedEmilienM: only thing i can think of is if the test was with released rdo ocata21:04
mburnedand maybe it hasn't hit an rpm there yet?21:04
*** lblanchard has quit IRC21:04
mburnedbut rdo trunk ocata should work21:05
EmilienMprobably21:05
*** dprince has quit IRC21:05
*** yolanda has joined #tripleo21:05
*** links has joined #tripleo21:05
dmsimardEmilienM: fyi I just temporarily disabled DLRN from picking up new package builds for centos7-master/current/21:06
dmsimardonce it finishes its current queue it won't pick up any more packages21:06
dmsimardit should help until the necessary patch lands21:06
EmilienMwhy?21:06
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: WIP: SSH known_hosts config  https://review.openstack.org/44966021:06
EmilienMah I see ok21:06
EmilienMthanks, it will help indeed21:07
dmsimardEmilienM: yeah it means /current/ will stop moving21:07
dmsimardEmilienM: I won't interrupt what's currently building but after that it'll be stopped21:07
EmilienMdmsimard: subscribe to https://review.openstack.org/#/c/448512/ so when it merges you can rollback21:07
dmsimarddone21:07
EmilienMpabelanger: we need to enqueue https://review.openstack.org/#/c/448511/ and https://review.openstack.org/#/c/448512/ again please21:11
mburnedEmilienM: looks like this is the build:  http://cbs.centos.org/koji/buildinfo?buildID=1596421:12
mburnedbut it's not in the release tag yet21:12
EmilienMmburned: ah21:12
mburnedso need to get one of the RDO people to promote it21:12
mburnedonly rc1 is in the release tag21:12
EmilienMright, rdo promotion21:13
dmsimardEmilienM: crap.. https://review.openstack.org/#/c/448512/ passed but failed due to merge fail21:15
EmilienMpabelanger: thx21:15
EmilienMdmsimard: should I rebase https://review.openstack.org/#/c/448511/ ?21:16
EmilienMI think that's fine now21:16
*** jcoufal_ has joined #tripleo21:16
dmsimardEmilienM: I don't know enough about the relationship between oooq and oooq-extras21:17
*** yolanda has quit IRC21:17
*** jcoufal has quit IRC21:17
*** mburned is now known as mburned_out21:17
dmsimardI guess if we're going to "recheck" it, might as well rebase21:17
dmsimardin case it could benefit from one of the patches that managed to merge21:18
pabelanger448511 is already in the gate21:18
pabelangerso, don't bother rebasing21:18
dmsimardpabelanger: ack21:18
pabelangerokay, stepping away for a bit21:19
EmilienMsame21:19
*** yolanda has joined #tripleo21:19
openstackgerritIan Main proposed openstack/tripleo-heat-templates master: Remove kolla config entries from heat services.  https://review.openstack.org/44397421:21
*** bfournie has quit IRC21:24
*** rlandy has quit IRC21:25
*** yolanda has quit IRC21:26
*** fragatina has joined #tripleo21:29
*** radeks has quit IRC21:31
*** jcoufal has joined #tripleo21:31
*** yolanda has joined #tripleo21:32
*** jcoufal_ has quit IRC21:32
*** thrash is now known as thrash|g0ne21:34
*** pradk has quit IRC21:36
*** yolanda has quit IRC21:39
*** yolanda has joined #tripleo21:39
openstackgerritAndreas Florath proposed openstack/diskimage-builder master: Use stevedore for plugin config of block device  https://review.openstack.org/44709021:42
*** eck` is now known as eck`gone21:45
*** jcoufal_ has joined #tripleo21:47
*** jcoufal has quit IRC21:47
*** morazi has quit IRC21:50
*** sshnaidm|off has joined #tripleo22:02
openstackgerritJames Slagle proposed openstack/tripleo-docs master: Additional Networking docs for deployed-server  https://review.openstack.org/44222222:04
*** jmelvin has quit IRC22:08
openstackgerritOliver Walsh proposed openstack/tripleo-heat-templates master: WIP: SSH known_hosts config  https://review.openstack.org/44966022:29
*** jobewan has quit IRC22:33
*** yamahata has quit IRC22:36
openstackgerritMerged openstack-infra/tripleo-ci master: Fix timeout for quickstart jobs  https://review.openstack.org/44902322:45
* mwhahaha cheers on the CI queue22:45
mwhahahaYOU CAN DOO EEEET22:46
pabelangernext 3 should merge23:00
pabelangerbut still 3 failures because of the network23:00
EmilienMyeah23:00
EmilienMpabelanger: do you have telnet on 448041 ?23:01
EmilienMI don't have ipv623:01
pabelangerEmilienM: yes, should pass23:01
EmilienMok23:01
pabelangerjust doing log collection23:01
EmilienMah ok23:01
pabelangersame with 44851123:02
EmilienMpabelanger: well the last one would be https://review.openstack.org/#/c/448512/23:02
EmilienMonce this one merges we can ping dmsimard to unblock current23:02
pabelangerEmilienM: yes, that failed because github.com timeout23:03
EmilienMlol23:03
EmilienMsomeone hates us this week :D23:03
pabelangerEmilienM: so, next week, you should make a list of things you depend on from github.com and we'll try to get a mirror solution23:03
EmilienMI wonder why we clean hosts at the end of jobs23:03
EmilienMit sounds very useless23:04
pabelangerif you stop going out of network, things will fail less often I think23:04
openstackgerritMerged openstack-infra/tripleo-ci master: Switch trunk/cbs/buildlogs to use https  https://review.openstack.org/44804123:04
EmilienMpabelanger: yes, thanks. Added on my list23:04
openstackgerritMerged openstack/tripleo-heat-templates master: Clarify Kolla build overrides for tripleo  https://review.openstack.org/44430823:04
EmilienMpabelanger: do you know which repo?23:04
openstackgerritMerged openstack/tripleo-quickstart-extras master: Support includepkgs option in downloaded repos  https://review.openstack.org/44851123:05
pabelangerEmilienM: I am guessing: Getting https://github.com/rdo-packages/tripleo-heat-templates-distgit.git23:06
EmilienMpabelanger: please enqueue 44851223:06
EmilienMwe can't mirror rdo-packages I guess23:06
pabelangerEmilienM: we should be able to mirror anything off github23:06
EmilienMpabelanger: in openstack git?23:07
pabelangerEmilienM: that has been suggested before23:07
pabelangerwe have a pretty big git farm23:08
EmilienMyeah23:08
EmilienMI would be in favor of that for sure23:08
pabelangeryes, goal should be to remove any dependency on the network now23:09
pabelangerthis is what we did for devstack23:09
pabelangerrarely now it times out because of download failures23:10
EmilienMbkero: you around by any chance?23:10
bkeroEmilienM: Hi23:10
EmilienMbkero: do we teardown something at the end of oooq multinode jobs?23:10
bkeroTear down? No, I don't believe so.23:11
mwhahahayea you do23:11
bkeroCould you point it out?23:12
mwhahahaoh sec no it's misleading23:12
EmilienMlet me find it in logs23:12
mwhahaha--skip-tags teardown-all23:12
EmilienMhere23:12
EmilienMhttp://logs.openstack.org/11/448511/7/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/c0cd008/console.html#_2017-03-24_12_27_01_67128723:12
EmilienMwhat happens in 5 min?23:12
mwhahahathat's the log collection isn't it23:12
EmilienMhow come it takes 5 min?23:12
mwhahahait's skipping the teardown-all23:13
EmilienMcollecting logs should be fast no?23:13
mwhahahanot from remote nodes and all those files23:13
bkeroIt's collecting them and gzipping them and rsyncing them23:13
bkeroI don't know why collect-logs takes so long, that's an adarazs question.23:13
mwhahahasosreport takes like 5 minutes but that collects way more logs23:14
bkeroHere's the collectlogs run: http://logs.openstack.org/11/448511/7/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/c0cd008/logs/quickstart_collect_logs.txt.gz23:14
EmilienMoh thanks23:14
mwhahahait really should just run sosreport23:15
EmilienMhopefully we have timestamps soon23:15
bkerocollect-logs : Gather the logs to /tmp --------------------------------- 99.49s23:15
bkerocollect-logs : Call get_host_info script ------------------------------- 86.72s23:15
EmilienMmwhahaha: yeah, therefore I know someone who did a cli in openstack tripleo client23:15
bkeroThose are the two major timesinks23:15
* mwhahaha is working on it23:15
mwhahahastupid swift23:15
EmilienMI've heard it's cool23:16
EmilienMok this is not the critical topic at this time23:16
EmilienMeven if saving time would be great23:16
bkeroHere's the script that takes 87s: http://git.openstack.org/cgit/openstack/tripleo-quickstart-extras/tree/roles/collect-logs/templates/get_host_info.sh.j223:17
*** yolanda has quit IRC23:17
EmilienMthe heat command takes time I think23:17
mwhahahapcs isn't fast either23:17
EmilienMopenstack stack event list --nested-depth 2 -f json overcloud23:18
EmilienMI think this one is the winner23:18
EmilienM(but quite useful for debug though)23:18
bkeroSomething quite convenient is at the end of every ansible run, the longest runtime tasks are listed.23:19
bkero(which is where my earlier pastes came from)23:19
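The per-task timing summary pasted earlier can be parsed into a sorted list with a few lines of Python. This is a minimal sketch: the regular expression is an assumption based only on the two lines pasted in this log (with nick and timestamp stripped), and the function name is hypothetical.

```python
import re

# Assumed line shape: "<role> : <task name> ----...---- <seconds>s"
TASK_LINE = re.compile(r"^(?P<task>.+?)\s-{2,}\s(?P<seconds>\d+(?:\.\d+)?)s$")


def parse_task_timings(lines):
    """Return (task, seconds) pairs, slowest first; skip lines that don't match."""
    timings = []
    for line in lines:
        match = TASK_LINE.match(line.strip())
        if match:
            timings.append((match.group("task"), float(match.group("seconds"))))
    return sorted(timings, key=lambda item: item[1], reverse=True)


# The two lines pasted above, without the IRC nick and timestamp.
pasted = [
    "collect-logs : Call get_host_info script ------------------------------- 86.72s",
    "collect-logs : Gather the logs to /tmp --------------------------------- 99.49s",
]

for task, seconds in parse_task_timings(pasted):
    print(f"{seconds:8.2f}s  {task}")
```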
pabelangerhttp://logs.openstack.org/19/447419/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/ec733d3/logs/delorean_logs/23/e8/23e86e4116fed979b0ccb5bca8286f41d4c3ba89_dev/build.log.txt.gz23:19
pabelangerthat looks like a broken package23:20
pabelangerwhich makes me ask, how are tripleo-heat-templates getting into the gate23:20
pabelangerif the package is broken23:20
*** links has quit IRC23:24
pabelangerEmilienM: http://logs.openstack.org/19/447419/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/ec733d3/console.html#_2017-03-24_23_05_18_60613923:27
pabelangerthat is wrong23:27
pabelanger447419 is stable/newton23:28
pabelangerbut dlrn is using rpm-master23:28
pabelangerhow did check not catch that23:28
*** gbarros has joined #tripleo23:28
EmilienMI thought we fixed that23:28
EmilienMbkero: can you help on this one? ^23:29
mwhahaharebase?23:29
EmilienMbizarre23:29
bkeroLet me go look at the code.23:30
bkero"zuul_changes":23:33
bkero"openstack-infra/tripleo-ci:master:refs/changes/23/449023/2^openstack-infra/tripleo-ci:master:refs/changes/41/448041/2^openstack/tripleo-heat-templates:master:refs/changes/08/444308/1^openstack/tripleo-quickstart-extras:master:refs/changes/11/448511/7^openstack/tripleo-heat-templates:master:refs/changes/74/449174/1^openstack/tripleo-ui:master:refs/changes/07/449507/1^openstack/tripleo-heat-templates:stable/newton:refs/changes/19/447419/1"23:33
*** toure|afk is now known as toure|gone23:33
bkeroThat's quite the patch stack23:33
bkeroEmilienM: for some reason it thinks the branch is 'master'23:34
pabelangerWow23:37
pabelangerthere is a massive amount of jinja2 magic in that playbook23:38
pabelangerbkero: http://logs.openstack.org/19/447419/1/gate/gate-tripleo-ci-centos-7-nonha-multinode-oooq/ec733d3/console.html#_2017-03-24_23_05_18_19186923:40
pabelangeris the issue23:40
pabelangerit is saying master branch23:40
bkerohttp://git.openstack.org/cgit/openstack/tripleo-quickstart-extras/tree/roles/build-test-packages/library/zuul_deps.py23:42
*** tbonds has joined #tripleo23:43
pabelangerbkero: ya, looks to be the issue23:43
EmilienMI'm down for this week23:43
EmilienMbkero: if you have an idea, feel free to send a patch and irc me, I'll review it over the weekend23:44
EmilienMsee you all23:44
* EmilienM out23:44
bkerofriday 4:45pm issues, heh23:44
pabelangerYa, you'll have to stop approving tripleo-heat-templates patches23:46
bkeropabelanger: correct me if I'm reading that wrong, but isn't it saying openstack/tripleo-heat-templates:stable/newton:refs/changes/19/447419/1?23:46
*** jcoufal has joined #tripleo23:46
pabelangerbkero: Oh23:47
pabelangerI see the issue23:47
bkero?23:47
pabelangerit is likely the length of zuul_changes in the gate pipeline that is breaking the parsing in zuul_deps23:47
pabelangercheck will always be a single change23:48
pabelangerhttp://logs.openstack.org/81/439681/2/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/90015ed/console.html#_2017-03-24_18_49_33_01293123:48
pabelangerthat is why it passes check23:48
pabelangerindependent pipelines only have 1 change23:48
pabelangerbut gate is dependent, which has all the changes23:49
pabelangerso, until you fix that bug23:49
pabelangerthere is no point approving tripleo-heat-templates patches23:49
bkeroYou think the variable is too long and getting truncated?23:49
pabelangerthey will just slow down the gate23:49
*** jcoufal_ has quit IRC23:49
pabelangerbkero: let me run the code quickly23:49
*** jcoufal has quit IRC23:50
bkeroThe only data types here are an environment variable and an ansible (python) variable (string)23:51
pabelangerbkero: ah, i see the issue23:56
bkero??23:56
pabelangerwhen there are duplicate projects, they are selecting the first instance, and skipping all others23:56
bkerooh, ugh23:57
pabelangerya23:57
pabelangerit is backwards23:57
pabelangerthey need to keep the last23:57
pabelangeras that will be the latest code path23:57
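A minimal sketch of the "keep the last entry per project" fix suggested above, assuming the `project:branch:ref` entries separated by `^` seen in the zuul_changes string pasted earlier. The function names are hypothetical and this is not the code in zuul_deps.py.

```python
def parse_zuul_changes(zuul_changes):
    """Split a ZUUL_CHANGES-style string into dicts, preserving queue order."""
    changes = []
    for entry in zuul_changes.split("^"):
        # Branches and refs contain "/" but no ":", so a plain split works here.
        project, branch, ref = entry.split(":")
        changes.append({"project": project, "branch": branch, "ref": ref})
    return changes


def last_change_per_project(changes):
    """Keep only the last change listed for each project."""
    latest = {}
    for change in changes:
        # Later entries overwrite earlier ones, so the last occurrence wins.
        latest[change["project"]] = change
    return latest


# The three tripleo-heat-templates entries from the string pasted above.
zuul_changes = (
    "openstack/tripleo-heat-templates:master:refs/changes/08/444308/1"
    "^openstack/tripleo-heat-templates:master:refs/changes/74/449174/1"
    "^openstack/tripleo-heat-templates:stable/newton:refs/changes/19/447419/1"
)

picked = last_change_per_project(parse_zuul_changes(zuul_changes))
print(picked["openstack/tripleo-heat-templates"])
```

With first-wins selection the same input would pick the `master` change 444308, which matches the symptom seen in the console log. Whether entries for other branches should be considered at all is a separate question.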
*** lblanchard has joined #tripleo23:58
pabelangerin fact23:58
pabelangerwhy does a stable project need to build dlrn package from master?23:58
pabelangerso much wrong here23:58
