Wednesday, 2016-06-15

*** apetrich has joined #tripleo00:00
*** mbound has quit IRC00:00
*** yamahata has quit IRC00:14
*** saneax is now known as saneax_AFK00:15
*** rcernin has joined #tripleo00:17
*** Lokesh_Jain has quit IRC00:17
*** rcernin has quit IRC00:22
*** panda has quit IRC00:24
*** panda has joined #tripleo00:25
*** apetrich has quit IRC00:29
*** apetrich has joined #tripleo00:30
*** TSCHAK_ has quit IRC00:31
*** TSCHAK has joined #tripleo00:34
openstackgerritMerged openstack/puppet-tripleo: add plumgrid neutron profile  https://review.openstack.org/31725900:35
*** rcernin has joined #tripleo00:47
*** cwolferh has quit IRC00:50
*** rcernin has quit IRC00:53
*** weshay_mtg has quit IRC00:55
*** numans has joined #tripleo00:56
*** MaxPC has joined #tripleo00:56
*** saneax_AFK is now known as saneax01:00
*** mbound has joined #tripleo01:01
*** rcernin has joined #tripleo01:05
*** mbound has quit IRC01:06
*** weshay_mtg has joined #tripleo01:07
*** rcernin has quit IRC01:10
*** cwolferh has joined #tripleo01:11
*** MaxPC has quit IRC01:22
*** xinwu has quit IRC01:34
*** links has joined #tripleo01:54
*** jrist has quit IRC01:56
*** saneax is now known as saneax_AFK01:56
*** jrist has joined #tripleo02:02
*** coolsvap has joined #tripleo02:18
*** weshay_mtg has quit IRC02:28
*** r-mibu has quit IRC02:37
*** tzumainn has quit IRC02:37
*** julim has joined #tripleo02:38
*** lblanchard has quit IRC02:38
*** cmyster has quit IRC02:47
*** r-mibu has joined #tripleo02:47
*** apetrich has quit IRC02:56
*** apetrich has joined #tripleo02:56
*** links has quit IRC02:56
*** ramishra has joined #tripleo03:02
openstackgerritNuman Siddique proposed openstack/tripleo-puppet-elements: FOR TESTING ONLY... PLZ DONT MERGE  https://review.openstack.org/32883903:04
*** apetrich has quit IRC03:10
*** apetrich has joined #tripleo03:10
*** saneax_AFK is now known as saneax03:14
*** fragatina has quit IRC03:22
*** morazi has quit IRC03:35
*** ramishra has quit IRC03:38
*** ramishra has joined #tripleo03:57
*** cllewellyn_ has joined #tripleo04:02
*** ramishra has quit IRC04:03
*** xinwu has joined #tripleo04:03
*** julim has quit IRC04:04
*** links has joined #tripleo04:05
*** ramishra has joined #tripleo04:13
*** apetrich has quit IRC04:19
*** apetrich has joined #tripleo04:20
*** panda has quit IRC04:24
*** panda has joined #tripleo04:25
*** apetrich has quit IRC04:25
*** apetrich has joined #tripleo04:26
*** cllewellyn__ has joined #tripleo04:27
*** skramaja has quit IRC04:29
*** oshvartz has quit IRC04:30
*** masco has joined #tripleo04:36
*** skramaja has joined #tripleo04:45
*** ramishra has quit IRC04:50
*** ramishra has joined #tripleo04:50
*** fragatina has joined #tripleo04:52
*** fragatina has quit IRC04:53
*** fragatina has joined #tripleo04:53
*** ramishra has quit IRC04:54
openstackgerritIan Wienand proposed openstack/diskimage-builder: Clear up "already provided" message  https://review.openstack.org/29096804:58
openstackgerritIan Wienand proposed openstack/diskimage-builder: Convert element_dependencies to logging  https://review.openstack.org/32807204:58
*** ramishra has joined #tripleo05:00
*** jaosorior has joined #tripleo05:04
*** olap has quit IRC05:05
*** dixiaoli has quit IRC05:13
*** cllewellyn_ has quit IRC05:18
*** cllewellyn__ has quit IRC05:18
*** ramishra has quit IRC05:18
openstackgerritMerged openstack/tripleo-heat-templates: Updates ControlPlaneSubnetCidr to be a string  https://review.openstack.org/31623305:18
bandinimatbu: that is good news ;)05:18
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: profile/base/nova: declare nova class properly  https://review.openstack.org/32834705:19
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates: Remove usage of ::nova class in THT  https://review.openstack.org/32598305:19
openstackgerritMerged openstack/tripleo-heat-templates: Composable Neutron Plumgrid plugin  https://review.openstack.org/32730705:20
*** cllewellyn__ has joined #tripleo05:20
*** cllewellyn_ has joined #tripleo05:21
openstackgerritMerged openstack/tripleo-heat-templates: Drop extraconfig for neutron-plumgrid.yaml  https://review.openstack.org/32731805:22
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Add fact to get the fqdn for a host in the different networks  https://review.openstack.org/32929905:23
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Drop galera_bootstrapped fact  https://review.openstack.org/32897905:24
*** leanderthal|afk is now known as leanderthal05:30
*** apetrich has quit IRC05:44
*** apetrich has joined #tripleo05:44
*** ramishra has joined #tripleo05:45
*** ramishra has quit IRC05:48
*** tremble has quit IRC05:49
*** apetrich has quit IRC05:50
*** apetrich has joined #tripleo05:50
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Split Heat pacemaker roles into separate services  https://review.openstack.org/32770805:51
*** yamahata has joined #tripleo05:52
*** oshvartz has joined #tripleo05:55
*** mbound has joined #tripleo05:55
*** apetrich has quit IRC05:56
*** coolsvap has quit IRC05:57
*** apetrich has joined #tripleo05:58
*** ramishra has joined #tripleo05:58
*** rlandy has quit IRC05:58
*** numans has quit IRC06:00
*** mbound has quit IRC06:00
*** cllewellyn_ has quit IRC06:00
*** cllewellyn__ has quit IRC06:00
*** ramishra has quit IRC06:03
*** saneax is now known as saneax_AFK06:04
*** coolsvap has joined #tripleo06:10
*** yolanda has joined #tripleo06:13
*** yolanda_ has joined #tripleo06:13
*** yolanda_ has quit IRC06:14
*** itamarl has joined #tripleo06:15
*** olap has joined #tripleo06:16
*** ramishra has joined #tripleo06:16
*** openstackgerrit has quit IRC06:18
*** openstackgerrit has joined #tripleo06:18
*** xinwu has quit IRC06:21
*** rook has quit IRC06:22
*** anshul has joined #tripleo06:23
*** anshul is now known as Guest3405806:24
*** numans has joined #tripleo06:24
*** rcernin has joined #tripleo06:29
*** apetrich has quit IRC06:34
*** apetrich has joined #tripleo06:34
openstackgerritMartin André proposed openstack/tripleo-common: Allow running validation against different plans  https://review.openstack.org/31819406:46
openstackgerritMartin André proposed openstack/tripleo-common: Disable retry files for ansible validations  https://review.openstack.org/32903906:46
openstackgerritMartin André proposed openstack/tripleo-common: Validations actions and workbook  https://review.openstack.org/31363206:46
*** aufi has joined #tripleo06:47
*** ifarkas has joined #tripleo06:49
*** rook has joined #tripleo06:49
*** athomas has joined #tripleo06:56
*** jprovazn has joined #tripleo06:57
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates: Add redis constraint to aodh upgrade manifest  https://review.openstack.org/32965506:59
*** cllewellyn__ has joined #tripleo07:01
*** cllewellyn_ has joined #tripleo07:01
*** tremble has joined #tripleo07:02
*** cwolferh has quit IRC07:02
*** tesseract has joined #tripleo07:03
*** fzdarsky has joined #tripleo07:03
*** rcernin has quit IRC07:04
*** rcernin has joined #tripleo07:04
*** florianf has joined #tripleo07:06
*** dtrainor has quit IRC07:09
tobias_fiberdatathe neutron-server service is timing out after reboot with the latest release07:11
tobias_fiberdatais this a known issue?07:12
tobias_fiberdatait's possible to start it afterwards though07:12
*** saneax_AFK is now known as saneax07:13
matbutobias_fiberdata: reboot of the controller ?07:14
matbutobias_fiberdata: on which release ? master ?07:14
tobias_fiberdatauhm, the tripleO server07:14
tobias_fiberdatai was not clear i believe. the latest mitaka based tripleO07:15
*** milan has quit IRC07:17
*** ebarrera has joined #tripleo07:17
matbutobias_fiberdata: k, and what do you mean by tripleo server ? undercloud or overcloud ?07:19
tobias_fiberdataundercloud07:19
tobias_fiberdatai can priv you the logoutput07:19
matbutobias_fiberdata: i experiment something for CI purpose, and i notice that the overcloud controller, when rebooting, sometimes the neutron-server is down07:20
tobias_fiberdatacould put on verbose and debug if you want more details07:20
tobias_fiberdatacould it be something similar in this case? but this is undercloud though07:20
matbutobias_fiberdata: for the UC i never seen it before, but idk if mean of us try to reboot the nodes :)07:21
matbutobias_fiberdata: yep maybe07:21
matbutobias_fiberdata: could you fill a bug  on launchpad ?07:21
tobias_fiberdatayea sure i could. I'll give myself some more details with verbose and debug07:22
matbutobias_fiberdata: k thx07:22
*** dtrainor has joined #tripleo07:22
*** shardy has joined #tripleo07:24
*** jpena|off is now known as jpena07:26
openstackgerrityolanda.robla proposed openstack/tripleo-quickstart: Allow to specify templates path on overcloud deployment  https://review.openstack.org/32955607:27
tobias_fiberdatamatbu, ah well, i'll try to do it as fast as i can though. gotta prio our openstackdeployment first of all. Seems like Dell R610 is not very nice to me.07:31
*** hjensas__ has joined #tripleo07:31
*** openstackgerrit has quit IRC07:33
*** openstackgerrit has joined #tripleo07:33
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for keystone  https://review.openstack.org/32702907:34
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for heat  https://review.openstack.org/32706907:34
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for glance API and registry  https://review.openstack.org/32747307:35
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for RabbitMQ  https://review.openstack.org/32748207:35
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for cinder-api  https://review.openstack.org/32885907:35
*** cmyster has joined #tripleo07:36
*** pino|work_ has joined #tripleo07:37
*** dsariel has joined #tripleo07:37
*** links has quit IRC07:39
*** pino|work has quit IRC07:40
*** shardy has quit IRC07:43
*** jpich has joined #tripleo07:45
openstackgerritIan Wienand proposed openstack/diskimage-builder: Clear up "already provided" message  https://review.openstack.org/29096807:45
openstackgerritIan Wienand proposed openstack/diskimage-builder: Convert element_dependencies to logging  https://review.openstack.org/32807207:45
*** pino|work_ is now known as pino|work07:45
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for keystone  https://review.openstack.org/32702907:46
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for heat  https://review.openstack.org/32706907:46
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for glance API and registry  https://review.openstack.org/32747307:46
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for RabbitMQ  https://review.openstack.org/32748207:46
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for cinder-api  https://review.openstack.org/32885907:46
*** cmyster has quit IRC07:48
*** dtantsur|afk is now known as dtantsur07:50
*** dtrainor has quit IRC07:52
*** jaosorior is now known as jaosorior_brb07:53
*** jtomasek_ has joined #tripleo07:53
*** links has joined #tripleo07:55
*** ccamacho has joined #tripleo07:56
*** milan has joined #tripleo08:06
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Composable roles within services - NTP  https://review.openstack.org/31072508:06
*** Guest34058 has quit IRC08:07
*** dtrainor has joined #tripleo08:08
*** dbecker has quit IRC08:09
*** jtomasek_ has quit IRC08:10
*** shardy has joined #tripleo08:10
*** dbecker has joined #tripleo08:10
*** zoli_gone-proxy is now known as zoliXXL08:11
*** abehl has joined #tripleo08:12
*** liverpooler has joined #tripleo08:15
*** ohamada has joined #tripleo08:16
jaosorior_brbupgrades gate seems to be broken  in master :/08:17
*** dmk0202 has joined #tripleo08:22
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-tripleo: WIP: integration of the new puppet pacemaker.  https://review.openstack.org/30906908:22
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates: WIP: integration of the new puppet pacemaker.  https://review.openstack.org/30240908:24
*** panda has quit IRC08:24
*** panda has joined #tripleo08:25
*** olap has quit IRC08:27
*** cllewellyn__ has quit IRC08:27
*** cllewellyn_ has quit IRC08:27
*** apetrich has quit IRC08:28
*** olap has joined #tripleo08:28
*** stendulker has joined #tripleo08:28
*** apetrich has joined #tripleo08:30
*** abehl has quit IRC08:32
*** abehl has joined #tripleo08:33
*** paramite has joined #tripleo08:34
*** abehl has quit IRC08:34
*** abehl has joined #tripleo08:34
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Composable roles within services - NTP  https://review.openstack.org/31072508:35
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Composable roles within services - NTP  https://review.openstack.org/31042108:36
dtantsurmorning folks! a second +2 is needed on the ironic-in-overcloud spec https://review.openstack.org/320995 please08:38
*** zoliXXL is now known as zoli|brb08:39
*** jaosorior_brb has quit IRC08:40
*** jaosorior_brb has joined #tripleo08:41
*** cllewellyn__ has joined #tripleo08:41
*** cllewellyn_ has joined #tripleo08:41
*** jaosorior_brb is now known as jaosorior08:41
jaosoriorccamacho hey dude, seems to me like the upgrades gate is broken, have you noticed?08:42
ccamachoupgrades in Master?08:42
jaosorioryes08:42
ccamachojaosorior ^08:42
jaosoriorI recheck a bunch of commits in the morning08:43
jaosoriorand not a single one of them has passed upgrades08:43
*** cllewellyn__ has quit IRC08:43
ccamachommmm yesterday was fine, but were landed a lot of patches..08:43
ccamacholetme check08:43
jaosoriorresources.ControllerNodesPostDeployment: resources.ControllerPostPuppet: resources.ControllerPostPuppetRestartDeployment: Error: resources[0]: Deployment to server failed: deploy_status_code : Deployment exited with non-zero status code: 108:43
*** cllewellyn__ has joined #tripleo08:44
jaosoriorshardy: Any tips on how to debug something like that? ^^08:44
ccamachodead by timeout :$08:44
dtantsurjaosorior, once you have a second of time, could you please look again at the documentation patch https://review.openstack.org/#/c/322776/ ?08:44
*** cllewellyn__ has quit IRC08:44
*** cllewellyn__ has joined #tripleo08:44
shardyjaosorior: get the ID of the failing SoftwareDeployment, then run heat deployment-show <id>08:45
shardythe stderr should give some clues08:45
*** derekh has joined #tripleo08:45
*** cllewellyn__ has quit IRC08:45
ccamachojaosorior, can you post the patch link? or is from your local env?08:46
jaosoriorcheck any recent patch's upgrade job08:46
*** cllewellyn__ has joined #tripleo08:46
jaosoriorccamacho: for instance http://logs.openstack.org/04/329504/1/check-tripleo/gate-tripleo-ci-centos-7-upgrades/2a9a0fc/08:47
openstackgerritMerged openstack/tripleo-docs: Rework nodes registration and configuration  https://review.openstack.org/32277608:47
dtantsurshardy, hi! a kind request to review the ironic-in-overcloud spec https://review.openstack.org/255792 please. we're getting some good progress with the patches already, would be nice to have the spec landed08:47
*** cllewellyn__ has quit IRC08:47
dtantsurmeh, wrong link08:47
dtantsurshardy, the correct link: https://review.openstack.org/32099508:47
*** cllewellyn__ has joined #tripleo08:47
jaosoriorccamacho: This has the same issue http://logs.openstack.org/18/329718/1/check-tripleo/gate-tripleo-ci-centos-7-upgrades/cb4b9c1/08:48
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Composable roles within services - NTP  https://review.openstack.org/31042108:48
*** cllewellyn__ has quit IRC08:48
jaosoriordamn, so if ugprades doesn't fail with the overcloud deploy timeout, it seems to fail with the Controller PostPuppetRestartDeployment error08:49
ccamachochecking logs08:49
*** cllewellyn__ has joined #tripleo08:49
openstackgerritCarlos Camacho proposed openstack/puppet-tripleo: Composable roles within services - NTP  https://review.openstack.org/31072508:50
*** apetrich has quit IRC08:50
*** zoli|brb is now known as zoli08:50
*** cllewellyn__ has quit IRC08:51
shardydtantsur: sure, will do08:51
dtantsurthnx!08:51
*** mbound has joined #tripleo08:52
*** cllewellyn__ has joined #tripleo08:52
*** apetrich has joined #tripleo08:53
*** mcornea has joined #tripleo08:53
*** cllewellyn__ has quit IRC08:54
*** cllewellyn__ has joined #tripleo08:54
*** cllewellyn__ has quit IRC08:56
*** cmyster has joined #tripleo08:56
*** ramishra has quit IRC08:56
*** cllewellyn__ has joined #tripleo08:56
*** jaosorior has quit IRC08:57
*** cllewellyn__ has quit IRC08:58
*** cllewellyn__ has joined #tripleo08:59
*** electrofelix has joined #tripleo08:59
ccamachoI will deploy any job to get the error, from the CI not getting any useful.09:00
*** ramishra has joined #tripleo09:02
*** fzdarsky has quit IRC09:02
*** fzdarsky has joined #tripleo09:04
*** cllewellyn__ has quit IRC09:06
*** cllewellyn_ has quit IRC09:06
*** cllewellyn__ has joined #tripleo09:06
*** cllewellyn_ has joined #tripleo09:06
*** jtomasek_ has joined #tripleo09:07
*** cllewellyn__ has quit IRC09:08
*** cllewellyn__ has joined #tripleo09:08
*** cllewellyn_ has quit IRC09:08
*** cllewellyn_ has joined #tripleo09:08
*** cllewellyn_ has quit IRC09:09
*** cllewellyn_ has joined #tripleo09:09
*** mgould|afk is now known as mgould09:10
*** cllewellyn_ has quit IRC09:11
*** cllewellyn_ has joined #tripleo09:12
*** jtomasek_ has quit IRC09:13
*** sambetts|afk is now known as sambetts09:16
jistrheya folks, do we still manage endpoints via os-cloud-config or did the endpoint management via Puppet make it in?09:17
* jistr can't find it in puppet but keeps looking09:18
*** jaosorior has joined #tripleo09:21
chem``ccamacho: I think I figure out the problem09:21
ccamachochem``, with upgrades?09:22
chem``ccamacho: yeap09:22
ccamachotell me :)09:22
ccamachoim deploying all jobs to check them until now they are running..09:22
chem``ccamacho: looking at the log, it seems that openstack-nova-scheduler, openstack-cinder-volume, nova-api and nova-conductor fail to restart09:23
chem``ccamacho: after the upgrade script say that the cluster is instable for too long and abort09:24
ccamachowoow..09:24
chem``ccamacho: I think this is due to the removal of the openstack-core constraint on the conductor resource09:24
ccamacholet me see if I can reproduce it locally09:25
ccamachoTHe good thing is that we have some clues09:25
*** apetrich has quit IRC09:25
chem``ccamacho: you can see that hapening at the end of the 3.2M log/message file in the controler-0 logs09:26
*** mikelk has joined #tripleo09:26
chem``ccamacho: parsing 3.2M file in firefox is a joy :)  I need to upgrade to a 64GB laptop.09:26
*** akrivoka has joined #tripleo09:26
bandinimarios: can I tickle your brain for an upgrade issue I am seeing?09:26
chem``ccamacho: ... or download the file ...09:26
*** apetrich has joined #tripleo09:28
chem``ccamacho: this is the file http://logs.openstack.org/18/329718/1/check-tripleo/gate-tripleo-ci-centos-7-upgrades/cb4b9c1/logs/overcloud-controller-0/var/log/messages09:28
ccamachochem``: I usually do http://paste.openstack.org/show/516179/ as is really hard to see the logs in the browsre09:28
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for keystone  https://review.openstack.org/32702909:29
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for heat  https://review.openstack.org/32706909:29
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for glance API and registry  https://review.openstack.org/32747309:29
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for RabbitMQ  https://review.openstack.org/32748209:29
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Enable TLS in the internal network for cinder-api  https://review.openstack.org/32885909:29
ccamachohey bandini, marios morning!!!: chem`` is giving us some clues about the upgrades error09:29
matbuchem``: ccamacho you are talking about the compute upgrade ?09:32
ccamachomatbu, :) nope upgrade job in CIç09:33
matbuccamacho: CI job for minor upgrade?09:33
ccamachomarios, matbu, bandini, sorry guys mixing upgrades, is the minor upgrade job in CI09:34
*** apetrich has quit IRC09:34
matbuccamacho: I was talking about major liberty to mitaka upgrade, the compute failed because it failed with nova-scheduler09:34
matbuack :)09:34
*** ohamada_ has joined #tripleo09:34
*** ohamada has quit IRC09:34
bandinimatbu: so my status today is that aodh and keystone work correctly. I am having a very odd issue with the major-pacemaker-upgrade step. the first step completes but the second step is never started09:35
*** cllewellyn__ has quit IRC09:35
chem``ccamacho: thanks for the snippet by the way09:35
bandinimatbu: so I have not yet reached the compute issue ;)09:35
bandiniccamacho: in what situations is the minor upgrade ci job broken? it worked for me this morning09:36
*** cllewellyn__ has joined #tripleo09:36
ccamachochem`` np09:36
ccamachobandini, not sure, there are lot of jobs in CI getting errors in the upgrades job09:37
matbubandini: yep , the same for me09:37
matbubandini: the aodh and keystone works09:37
*** cllewellyn__ has quit IRC09:37
ccamachoso right now im locally deploying all jobs from master to check it locally09:37
matbubandini: but the step2 never start ? (hang forever)09:37
bandinimatbu: EXACTLY09:37
matbubandini: cool :) i'm not crazy09:38
*** apetrich has joined #tripleo09:38
bandinimatbu: it is as if step1 completes, cluster is down. but step2 never starts09:38
matbubandini: nothing happen,09:38
hewbroccabandini: the jury is still out on whether you're crazy09:38
bandinimatbu: right09:38
bandinihewbrocca: oh no, it has very well decided on that ;)09:38
matbubandini: yep, i checked eveyr thing, i don't see what heat is waiting for09:38
bandinimatbu: ok, shall we collect some infos/data on the etherpad ?09:39
*** cllewellyn__ has joined #tripleo09:39
matbubandini: and when i executed the step2 script, it works correctly09:39
bandiniah, good to know09:39
hewbroccaos-collect-config running on the nodes?09:39
*** cllewellyn__ has quit IRC09:39
matbubandini: yep if you want09:39
ccamachomatbu bandini, about that never end timeout, if you are testing locally can you connect to the vms using virt-manager to see the console? In liberty sometimes the deployment hangs and the reason is that the mvs are not booting up. just saying...09:39
*** cllewellyn__ has joined #tripleo09:39
matbuccamacho: nop it during an upgrade steps09:40
bandiniccamacho: the vms are up and running, it is just heat from the undercloud that seems to be stuck09:40
bandinihewbrocca: os-collect-config is running, it seems heat is not telling it to do the second step09:40
matbubandini: i was thinking of a network issue, some of a VIP that heat wants to reach, but which has been disable by the cluster down09:40
matbuhewbrocca: bandini yep if you trigger os-collect-config manually eveyr thing is fine, the previous step is ended correctly09:41
bandinimatbu: that might be a good lead actually09:42
jistrbandini: re AODH and keystone working correctly -- we don't have the AODH endpoints tough yet, right? Just sent a suggestion to pradk how to solve that a while ago.09:43
hewbroccaMan, we really, really need to replace this whole os-collect-config nonsense with a nice push/pull thing like Zaqar09:44
bandinijistr: that is correct. while not ideal, I feel it is a bit of a minor issue (i.e. well the aodh endpoints won't be around until convergence step runs). but yeah worth fixing09:44
bandinijistr: I have got bigger fish to fry at the moment :D09:44
jistrbandini: yea makes sense :D09:44
*** athomas has quit IRC09:45
matbubandini: jistr btw i wonder if those two additionnals steps (aodh / keystone) could be add  to the major upgrade controller step09:45
jistrbandini: though the endopints wouldn't be created on convergence either09:45
matbuto avoid a 5 steps upgrade overcloud09:45
jistrcurrently we don't use Puppet to create endpoints AFAIK, and os-cloud-config only runs on stack create09:45
bandinijistr: ah, I did not know that. That is definitely a bigger problem09:45
*** cllewellyn__ has quit IRC09:46
*** cllewellyn_ has quit IRC09:46
chem``ccamacho: so the final error is 'ERROR: cluster remained unstable for more than 1800 seconds, exiting.' from os-collect-config and this from the pacemaker engine http://paste.fedoraproject.org/379421/46598391/09:46
chem``ccamacho: I'm looking at other logs to see how they look09:46
*** cllewellyn__ has joined #tripleo09:46
*** cllewellyn_ has joined #tripleo09:46
hewbroccaArrgh I thought we had the puppet endpoint creation, at least on trunk09:47
jistrmatbu: yea it could reduce the number of steps, but on the other hand we wanted them separate on purpose i think, to have them separately testable too, and have a smaller failure domain (to avoid "i've attempted to execute this blob of 3 invasive operations on my cloud, and the blob failed, what state is my cloud in now?")09:47
* hewbrocca so tired of os-cloud-config09:47
*** cllewellyn__ has quit IRC09:47
*** cllewellyn__ has joined #tripleo09:47
*** florianf has quit IRC09:48
jistrit is proposed but not merged yet, probably needs an amendment wrt composability09:48
matbujistr: hm yep true09:48
shardyYeah, we should figure out how to land that though, it'll make the composable endpoints much easier I think09:48
chem``ccamacho: well ignore the paste, it's the same on a working one09:49
mariosbandini: hey man, sorry was getting some foods.. gimme couple mins and reading back09:49
*** trumpetnl has joined #tripleo09:49
*** athomas has joined #tripleo09:50
chem``ccamacho: oups sorry wrong file, the paste is still legite ...09:50
ccamachochem`` ack, for me both ha nonha ran without errors, now runing a minor upgrade09:50
bandinimarios: I am trying to describe the issue matbu and I are seeing here: https://etherpad.openstack.org/p/tripleo-liberty-mitaka-upgrades09:51
*** cinerama has quit IRC09:51
mariosbandini: ack thanks (there is a lot of text there)... are you sure there ar eno errors after controller_pacemaker_1.sh09:54
mariosbandini: things like the cluster timing out for example after things are stopped? or even not setting in time after being started?09:54
bandinimarios: I know sorry lots of text :) I believe there are no errors. the cluster is fully stopped and the crudini operations took place09:54
matbumarios: yep the step1 is really done09:55
bandiniand the packages are updated09:55
*** cinerama has joined #tripleo09:55
matbumarios: you can stop/start the cluster manually, it'sworks fine09:55
bandiniif you look at line 69-72 you see that Step2 is never triggered on the controller09:55
bandiniyet heat on the undercloud shows it as CREATE_IN_PROGRESS09:55
mariosmatbu: bandini so the cluster stays stopped?09:56
hewbroccastevebaker: ^^^ this sounds weirdly familiar to me09:56
bandinimarios: correct. cluster is down. step1 completed successfully. step2 is never started so we all hang there09:57
matbumarios: yes09:57
bandiniI have reproduced this on three different systems09:58
bandiniso it is not a race or something09:58
matbumarios: heat show step2 in progress, but the script _2.sh is never start09:58
bandiniexactly, script _2.sh does not even exist in /var/lib/heat-config/heat-config-scripts/09:58
bandiniit really looks like heat is in some la-la-la land here09:59
matbubandini: lol yep09:59
bandinimatbu: I started seeing this today but I was more focused on the aodh/keystone steps. Have you seen this behaviour from day 1?10:00
mariosbandini: is swift-* started on controllers? (cluster is stopped right)10:00
shardybandini: https://paste.fedoraproject.org/379427/98481714/10:00
shardytry that - it shows how to grab the server metadata10:00
shardythen you can grep that and check heat actually exposed the step2 config via the deployment10:00
matbubandini: yep, i have seen it for a long time10:00
chem``ccamacho: on your plateform is the openstack-nova-consoleauth service enabled ?10:01
shardybandini: that will bisect the problem to heat vs something in the node (or network)10:01
mariosbandini: matbu fwiw my additions at pacemaker_common_functions.sh adds a lot of debugging, wondering if it would help here.10:01
mariosbandini: still think there may be an error, timeout for something to stop possibly. but is strange if all the crudini are also set (so 1.sh really did complete)10:01
bandinishardy: thanks will try!10:02
matbumarios: yep, and i think the blockstorage is done also10:02
*** florianf has joined #tripleo10:02
matbushardy: will try to, /me deploying a new env10:02
mariosbandini: matbu (I mean at https://review.openstack.org/#/c/321027/13/extraconfig/tasks/pacemaker_common_functions.sh )10:02
bandinimarios: yes crudini did run, because I had to add another one crudini line due to a change in keystone paste.ini files10:03
ccamachochem`` not enabled by default10:03
chem``ccamacho: so it's not managed by systemd.. weird10:04
bandinimarios: swift is all down btw10:04
mariosbandini: ok thx was wondering if it was just missing the bootstrap for some reason @ https://github.com/openstack/tripleo-heat-templates/blob/bcd726f1242d78169e6a5687e998473c1043c622/extraconfig/tasks/major_upgrade_controller_pacemaker_2.sh#L9 and then just started swift10:04
bandinimarios: nope that is fine. step_2 script does not exist on the controllers yet so it cannot have run10:05
bandinimatbu: you said that running os-collect-config by hand triggers things and it all works, correct?10:05
matbubandini: nop, i try os-collect-config in debug mode, to see what goes wrong10:06
matbubandini: but every thing was fine10:06
bandinimatbu: so if you run it by hand does it run Step2 of the upgrade or not?10:06
matbubandini: but i execute the _2.sh script manually on the controller10:06
matbubandini: nop it didn't run the step210:07
bandinigot it10:07
matbuafair10:07
bandiniI can try, does it need any special parameters?10:07
matbubut i mean, it's not an issue with the step2 script itself, cause, you can run it manually and the cluster will start, the vip too and so on...10:08
bandinimatbu: fully agreed10:08
matbubandini: for running the script ? or os-collect-config ?10:08
bandiniwe need to understand why step2 is not triggered on the controllers10:08
bandinimatbu: yes if I were to rerun os-collect-config manually, how would I do that? just run the binary or are there special parameters10:09
matbujust do sudo service os-collect-config stop10:09
matbusudo os-collect-config --force --one-time --debug10:09
matbubandini: ^10:09
bandinimatbu: ack, trying now10:09
bandinithenI will follow shardy's tips10:09
matbuyep me too, upgrading UC atm10:10
matbubandini: but i happy you hit that too, cause i was wondering if it was only an issue with my env10:11
matbui'm*10:11
bandinimatbu: ack, I confirm that it talks to heat but no Step2 in sight10:12
bandinimatbu: indeed it's good to have common issues :)10:12
matbuhehe yep10:12
matbumarios: do you remember the review that Dan paste about mistral during the composable upgrade meeting ?10:14
bandinishardy: the upgrade timed out, so I guess that is why I get empty strings from your commands? https://paste.fedoraproject.org/379433/14659857/10:16
bandiniI assume I need to run those commands while heat is still trying10:16
mariosmatbu: which one, remote execution one?10:16
*** karthiks has quit IRC10:17
matbumarios: i don't remeber exactly, he pasted it in the bj chat as an example on how to use mistral10:17
yolandahi shardy , i'm having some issues with https://review.openstack.org/#/c/299643 change, the upload-puppet-modules script. I'm hitting that problem with slash removal10:17
mariosmatbu: bandini sure, sec10:18
marioserr sry bandini10:18
marioshttps://etherpad.openstack.org/p/tripleo-remote-execution matbu10:18
yolandaalso when i tried to upload just tripleo package, it may be some problem with my paths, becuse it failed when not having tripleo puppet module updated. I had to use the approach to move to /etc/puppet/modules, and upload the whole directory10:18
matbumarios: thx man10:18
mariosmatbu: maybe it was this one (https://review.openstack.org/#/c/313957/ ) but see the etherpad10:18
mariosack10:18
shardybandini: No, you need to use resource-metadata on the OS::Nova::Server resource, not OS::Heat::SoftwareDeployment10:19
shardyit should work even after a timeout10:19
* bandini whistles innocently10:19
*** apetrich has quit IRC10:19
shardyyolanda: Hi, perhaps we need some more fixes re the slash removal, but it works fine for me just specifying the local directory10:20
shardye.g upload-puppet-modules -d puppet_modules10:20
yolandamm, i was using absolute directory10:21
shardywhere ./puppet_modules exists and contains e.g a "tripleo" directory which is a copy of the puppet-tripleo module10:21
openstackgerritIan Wienand proposed openstack/diskimage-builder: Clear up "already provided" message  https://review.openstack.org/29096810:21
openstackgerritIan Wienand proposed openstack/diskimage-builder: Convert element_dependencies to logging  https://review.openstack.org/32807210:21
shardyyolanda: yeah, perhaps that is still broken, a relative path without any slashes should work10:21
*** apetrich has joined #tripleo10:22
shardyif you wanted to fix the script for absolute paths I'm pretty sure dprince would be fine with you pushing a fix to the patch10:22
*** karthiks has joined #tripleo10:22
yolandashardy i'll retry with relative path to confirm10:22
yolandaalso if you can add some clarification for the change? it has a -1 due to that issue10:23
bandinishardy: https://paste.fedoraproject.org/379441/98624814/10:25
bandinimarios, matbu: ^10:25
bandininot entirely sure how to interpret that yet10:25
yolandahi, when deploying tripleo composable roles, i got that error... http://paste.openstack.org/show/516195/10:26
yolandais that a known problem?10:26
shardyyolanda: https://paste.fedoraproject.org/379442/98634114/ shows it working fine with relative paths10:26
yolandashardy, thx. Knowing that i need to pass a relative path is enough to me. Going to do a try with that to confirm from my side10:28
yolandashardy, also, are you familiar with that error i pasted? that's only failure i see when testing composable roles10:28
matbubandini: marios is not that ControllerAllNodesValidationDeployment which trying to check the status of the ips10:29
matbubut the VIP are down10:29
shardyyolanda: the first thing to check is pull the latest puppet-ceilometer and add it to puppet_modules (named "ceilometer")10:29
mariosmatbu: the vip are brought down at https://github.com/openstack/tripleo-heat-templates/blob/bcd726f1242d78169e6a5687e998473c1043c622/extraconfig/tasks/major_upgrade_controller_pacemaker_1.sh#L2910:29
matbumarios: yep10:29
shardyI have an updated ceilometer module there, IIRC it may have been to fix that issue10:29
bandiniok but does ControllerAllNodesValidationDeployment check for VIPs? that would make little sense to me10:30
matbumarios: maybe the Allnodevalidation steps, is trying to check if all the ip is reachable10:30
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Add websocket utils module  https://review.openstack.org/32261110:32
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Use Mistral for baremetal registration  https://review.openstack.org/32261210:32
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: WIP Use Mistral for baremetal introspection  https://review.openstack.org/32778010:32
yolandashardy, ok, going to try that, also using that relative path approach10:32
yolandai guess it should be the common workflow10:32
* matbu brb lunch time, it would be easier after lunch :D10:33
bandinimatbu: the way I read overcloud.yaml it checks only for the non VIPs ip addresses10:33
bandinimatbu: enjoy ;)10:33
*** jefrite has joined #tripleo10:34
dtantsurwow10:41
dtantsurI mean WOW!!10:41
dtantsurI got a successful pass of my ironic-in-overcloud patch \o/10:41
dtantsurifarkas, ^^^10:41
dtantsurlook, ironic running on the controller: http://logs.openstack.org/28/316128/21/check-tripleo/gate-tripleo-ci-centos-7-ha/941866f/logs/overcloud-controller-0/var/log/ironic/10:42
* dtantsur celebrates10:42
ifarkas\o/10:43
ifarkascongrats dtantsur!10:43
dtantsurfolks, please review https://review.openstack.org/#/c/319297/ it seems to be working10:44
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: Basic support for deploying Ironic in overcloud  https://review.openstack.org/31612810:45
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: DO NOT MERGE: testing ironic  https://review.openstack.org/32987210:45
dtantsurifarkas, cleaned up the patches ^^^ please take a look as well10:45
ifarkaswill do10:46
*** apetrich has quit IRC10:50
*** panda has quit IRC10:50
*** apetrich has joined #tripleo10:50
jaosoriordtantsur: got the commit for t-h-t?10:50
*** zoli is now known as zoli|lunch10:50
*** weshay has joined #tripleo10:51
jaosoriordtantsur: ironic-api doesn't need the database?10:51
*** olap has quit IRC10:51
dtantsurjaosorior, tht is https://review.openstack.org/316128 ironic-api accesses the database as of now10:51
*** olap has joined #tripleo10:52
* dtantsur checks10:53
jaosoriordtantsur: Commented here10:53
jaosoriorhttps://review.openstack.org/#/c/319297/510:53
dtantsurjaosorior, hmm, so maybe it makes sense to move the database creation to the base ironic.pp, right?10:54
*** panda has joined #tripleo10:54
jaosoriordtantsur: that may be the case. Would need to check the other services and see if they do something like that10:55
jaosoriorbut it does seem to me that it's wrong that there is no trace of database creation on the api profile10:56
jaosoriordtantsur: On the other hand, the mysql related values are on the ironic-base template in t-h-t. So I guess it does make sense to move that10:57
dtantsurjaosorior, will do. could you please review the remaining parts, so that I can update them at once?10:58
jaosoriordtantsur: I gave a read to the t-h-t and the puppet parts. It looks pretty good from my side.10:59
dtantsurjaosorior, environments/ironic-generic-config.yaml is for people to include to enable ironic (it's optional). will comment. thanks!10:59
jaosoriorcommented on both10:59
jaosorioris it optional?10:59
dtantsurjaosorior, yes11:00
jaosoriordtantsur: Should those resources be set as OS::Heat::None here then? https://review.openstack.org/#/c/316128/22/overcloud-resource-registry-puppet.yaml11:00
dtantsurjaosorior, dunno, maybe? I don't understand this bit, sorry :)11:01
*** pradk has quit IRC11:01
dtantsurjaosorior, isn't e.g. sahara optional as well?11:01
openstackgerritMarios Andreou proposed openstack/instack-undercloud: Overcloud is not able to deploy with the default 4GB of RAM using instack-undercloud  https://review.openstack.org/32987411:01
openstackgerritDmitry Tantsur proposed openstack/puppet-tripleo: Add base ironic profiles  https://review.openstack.org/31929711:03
jaosoriordtantsur: It should be. However, it needs to be added to the ControllerServices list parameter to be taken into use11:03
jaosoriorso not sure how that file that I commented on will actually be used11:04
ccamachochem`` reproduced locally http://paste.openstack.org/show/516225/   "ERROR: cluster remained unstable for more than 1800 seconds, exiting" the minor upgrades job is failing also locally11:04
dtantsurjaosorior, probably -e /path/to/environment?11:04
chem``ccamacho: great news!  So do you see why the nova-consoleauth service is not restarting ?11:04
dtantsurjaosorior, in the same fashion as network isolation11:05
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: Basic support for deploying Ironic in overcloud  https://review.openstack.org/31612811:05
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: DO NOT MERGE: testing ironic  https://review.openstack.org/32987211:05
dtantsurupdated ^^^11:06
*** cllewellyn_ has quit IRC11:06
*** cllewellyn__ has quit IRC11:06
jaosoriordtantsur: yeah, I understand how it can be added; but not the effect that it will actually have. For example -> OS::Tripleo::Services::IronicApi: is already being set as puppet/services/ironic-api.yaml in the base resource registry11:06
dtantsurjaosorior, I don't see a contradiction, sorry...11:07
jaosoriorso it seems to me taht doing -e environments/ironic-config.yaml is a no-op11:07
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci: [NO MERGY] Test a fake periodic job  https://review.openstack.org/22978911:08
dtantsurjaosorior, well, I'll check it again11:08
shardyjaosorior: are you saying the problem is there's no way to append to the ControllerServices parameter in an environment file?11:08
jaosoriordtantsur: So what I mean that the values that are being set here https://review.openstack.org/#/c/316128/23/environments/ironic-config.yaml is the value that this already has https://review.openstack.org/#/c/316128/23/overcloud-resource-registry-puppet.yaml11:08
jaosoriorunless I'm misunderstanding something11:09
shardyjaosorior: you're right, it won't do anything11:09
dtantsurjaosorior, yeah, I've checked the other files, I think the environment can be dropped.. I'm not sure how a user requests Ironic (Sahara etc) to be deployed though11:09
shardywhat is needed is a way to add OS::Tripleo::Services::IronicApi to ControllerServices, but for now we'll have to document copying the entire default list and adding it11:10
jaosoriordtantsur: well, I guess they manually set the value for the controller services list11:10
jaosoriorshardy: yeah, there is no trivial way of just adding services11:10
jaosoriorshardy: Do you know what the status of OS::Heat::value (or however it's called) is?11:10
dtantsurthat's fine with me, thanks :)11:11
shardyI've been looking into ways we could add a "merge" feature to heat environments so that you could e.g to -e ironic-config.yaml and have it add to ControllerServices just by defining ControllerServices with values to be appended11:11
jaosoriorthat or yaql could help maybe?11:11
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: Basic support for deploying Ironic in overcloud  https://review.openstack.org/31612811:11
openstackgerritDmitry Tantsur proposed openstack/tripleo-heat-templates: DO NOT MERGE: testing ironic  https://review.openstack.org/32987211:11
dtantsurdropped the file for now ^^^11:11
shardyjaosorior: yaql could help if we wanted to have say an ControllerExtraServices param and join it (actually, list_join could do that..)11:11
ccamachochem``: do you know if there is a bug for this?11:12
shardybut then you still can't declare values for that more than once11:12
shardywe need to either support client-side appending of values in tripleo-common, or add a feature to heat11:12
chem``ccamacho: not that I'm aware of11:12
shardyI prefer the latter, going to post a spec11:12
shardyfor now we'll have to just document specifying the entire list11:12
shardywhich to be fair isn't that hard11:12
jaosorioryeah, seems like the only solution for now11:13
jaosoriorit isn't that hard. But not very user-friendly either11:13
chem``ccamacho: I can start one on launchpad, so that we can put our finding there11:13
ccamachochem`` neat! ill post comments11:13
shardyjaosorior: well, it'd be pretty trivial to have either the UI or a wizard in the CLI prompt and askk the user which services they want11:13
shardythen the interface to t-h-t remains clean, we just require the list output from those answers11:14
*** dprince has joined #tripleo11:14
shardyanyway, something we can think about for sure11:14
dtantsurthanks for clarification shardy. please see the updated patches11:14
shardydtantsur: np, will do11:15
jaosoriorshardy: true, well, the documentation for how to add services to the list could go on ccamacho's tutorial.11:15
jaosoriorccamacho: How's your tutorial patch going by the way?11:16
*** ccamacho is now known as ccamacho|lunch11:16
ccamacho|lunchjaosorior, I think is going well just add comments and Ill put more information there :)11:16
*** stendulker has quit IRC11:16
jaosoriorccamacho|lunch can you roll that link?11:16
ccamacho|lunchsure11:17
ccamacho|lunchhttps://review.openstack.org/#/c/311512/2411:17
dprincederekh, bnemec: so switch the IP to .224?11:17
derekhdbecker: yup11:17
dbeckerderekh, ack11:17
derekhdprince: yup, dbecker sorry wrong person11:18
dbeckerderekh, :-)11:18
*** thrash|g0ne is now known as thrash11:19
dprincederekh: the IP is updated. Will have to wait for the TTL to expire11:21
derekhdprince: cool beans, thanks11:23
*** hewbrocca is now known as hewbrocca-afk11:25
*** hewbrocca-afk is now known as hewbrocca11:25
chem``ccamacho|lunch: https://bugs.launchpad.net/tripleo/+bug/159277611:25
openstackLaunchpad bug 1592776 in tripleo "Ha upgrade jobs failing with "cluster remained unstable for more than 1800 seconds"" [Undecided,New]11:25
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-tripleo: WIP: integration of the new puppet pacemaker.  https://review.openstack.org/30906911:27
*** adarazs is now known as adarazs_lunch11:33
*** pkovar has joined #tripleo11:33
*** cllewellyn__ has joined #tripleo11:35
*** cllewellyn_ has joined #tripleo11:36
*** cllewellyn_ has quit IRC11:37
*** bvandenh has quit IRC11:37
*** cllewellyn_ has joined #tripleo11:37
*** rasca has quit IRC11:38
*** bvandenh has joined #tripleo11:47
*** paramite has quit IRC11:50
*** rhallisey has joined #tripleo11:57
*** bfournie has quit IRC11:58
*** fzdarsky has quit IRC11:58
*** MaxPC has joined #tripleo11:58
*** jcoufal has joined #tripleo11:59
*** morazi has joined #tripleo12:00
*** jpena is now known as jpena|lunch12:01
*** jayg|g0n3 is now known as jayg12:02
*** rasca has joined #tripleo12:04
mgouldhi everyone12:08
mgouldcould someone please review https://review.openstack.org/#/c/321118/ ? Trivial patch, already has one +2, passing CI apart from one failure on a broken (and now disabled) gate12:08
openstackgerritBrad P. Crochet proposed openstack/tripleo-puppet-elements: Add mistral packages to controller image  https://review.openstack.org/32950412:09
*** ccamacho|lunch is now known as ccamacho12:10
bandinishardy: here is the output for controller-0 https://paste.fedoraproject.org/379478/65992672/, is there anything in particular that should catch attention?12:14
*** mbound has quit IRC12:15
*** mbound has joined #tripleo12:15
*** ramishra has quit IRC12:16
openstackgerritMerged openstack/instack-undercloud: Add option to enable introspection of UEFI nodes  https://review.openstack.org/32111812:16
*** adarazs_lunch is now known as adarazs12:17
*** paramite has joined #tripleo12:17
ccamachochem`` im back, yesterday landed https://review.openstack.org/#/c/326118/7/puppet/services/pacemaker/nova-consoleauth.yaml and by default is disabled12:18
*** ramishra has joined #tripleo12:18
EmilienMhello12:21
*** fultonj has quit IRC12:22
hewbroccaEmilienM: bonjour et bienvenue12:23
*** trown|outtypewww is now known as trown12:23
*** fultonj has joined #tripleo12:23
mgouldhi EmilienM12:23
trowndtantsur: looks like tripleo got a promote on master last night, I think you were waiting on that for updated IPA?12:25
*** Goneri has joined #tripleo12:26
dtantsurtrown, not IPA, but our ironic-on-overcloud work. it now passed, thanks12:26
jaosoriorEmilienM: Just so you know, TripleO upgrades gate is broken. So if you have a patch that fails that, no need for rechecks12:26
EmilienMI just figured12:26
EmilienMwhy is it failing?12:26
trownhmm... wonder if that is related to the promote12:27
jaosoriorbandini, ccamacho and chem`` are looking into it12:27
trownderekh: we promote solely based on ha+nonha ya?12:27
ccamachoHey EmilienM here is the error https://bugs.launchpad.net/tripleo/+bug/159277612:28
openstackLaunchpad bug 1592776 in tripleo "Ha upgrade jobs failing with "cluster remained unstable for more than 1800 seconds"" [Undecided,New]12:28
EmilienMshardy: so I half-figured why upgrade job is broken on liberty12:28
trownmgould: looking12:28
EmilienMccamacho: thx12:28
EmilienMshardy: i think stable/liberty is missing some ipv6 backports, because it started to fail when we enable ipv6 onupgrade job, on March 30th12:29
*** rlandy has joined #tripleo12:29
EmilienMshardy: I'm doing local testing today and maybe we can sort this out but it should be a big deal12:29
ccamachoEmilienM Im starting to crawl into controller logs.. But after having lunch is much harder, need more coffee..12:29
trownmgould: is there a follow-up backport of the undercloud.conf regeneration?12:30
mgouldtrown: yes, one moment12:30
* mgould thanks jaosorior for the review12:30
EmilienMccamacho, dprince, dprince, thrash, jaosorior: composable standup?12:30
mgouldtrown: https://review.openstack.org/#/c/324553/12:31
ccamachoyeahp12:31
mgouldalso passing all non-broken gates12:31
ccamachojoining12:31
mgouldthere are liberty backports too, but they're still failing CI12:31
*** bfournie has joined #tripleo12:33
*** zoli|lunch is now known as zoli12:37
*** zoli is now known as zoliXXL12:37
openstackgerritMerged openstack/instack-undercloud: Fix inspection_enable_uefi description  https://review.openstack.org/32455312:37
*** apetrich has quit IRC12:40
*** mbound has quit IRC12:40
shardyEmilienM: thanks for the update, good that we understand the root-cause now :)12:41
*** apetrich has joined #tripleo12:42
*** fzdarsky has joined #tripleo12:42
*** tbonds has quit IRC12:47
*** jprovazn has quit IRC12:48
*** itamarl has quit IRC12:58
*** fzdarsky has quit IRC12:58
*** tzumainn has joined #tripleo12:58
*** julim has joined #tripleo12:59
*** rbrady has joined #tripleo12:59
jaosoriorEmilienM: Hey dude, I'm looking into adding a custom fact to get different fqdn's depending on the network; Would this be appropriate for that? https://review.openstack.org/#/c/329299/13:00
*** dprince has quit IRC13:02
EmilienMjaosorior: looking dude13:02
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Enable Manila integration - as a composable controller service  https://review.openstack.org/18813713:02
*** ibravo has joined #tripleo13:02
EmilienMjaosorior: this is awesome!13:02
EmilienMjaosorior: I like the idea!13:02
jaosoriorEmilienM: thanks!13:04
*** jcoufal has quit IRC13:04
*** noslzzp has joined #tripleo13:05
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add scale to roles gate  https://review.openstack.org/32954213:05
*** hewbrocca is now known as hewbrocca-afk13:06
*** hewbrocca-afk is now known as hewbrocca13:06
openstackgerritMerged openstack/tripleo-heat-templates: Add redis constraint to aodh upgrade manifest  https://review.openstack.org/32965513:07
openstackgerritBob Callaway proposed openstack/tripleo-heat-templates: Enable Neutron LBaaS Integration  https://review.openstack.org/31393313:08
*** jpena|lunch is now known as jpena13:10
EmilienMbnemec: hey, my stuff in tripleo-ci for liberty does not seem to work http://logs.openstack.org/64/329664/4/check-tripleo/gate-tripleo-ci-centos-7-upgrades/67ee596/console.html#_2016-06-15_02_11_01_56113:12
EmilienMbnemec: can you look https://review.openstack.org/329663 again please?13:12
jaosoriormarios: Hey dude, regarding https://review.openstack.org/#/c/327029/10/manifests/profile/base/keystone.pp I have kept putting tls_cert_refresh_command's default as something else than undef, because I can't set the default explicitly in the parameter definition. It needs the "include ::apache::params" to come first. Else it won't find the service_name in the resource catalog13:13
mariosjaosorior: ack thanks for clarifications , i am in a call right now. will likely revisit later/tomorrow13:14
jaosoriorsure thing13:14
*** rcernin has quit IRC13:14
jaosoriormarios: Thanks for taking a look at it dude13:14
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: DONT MERGE TESTING  https://review.openstack.org/31643613:14
openstackgerritBrad P. Crochet proposed openstack/puppet-tripleo: Add Mistral profiles  https://review.openstack.org/32343113:15
*** tbonds has joined #tripleo13:16
*** lblanchard has joined #tripleo13:17
EmilienMmarios: damn, stable/mitaka upgrade job also red?13:17
EmilienMmaster + stable/mitaka?13:17
jaosoriorEmilienM: What? It shouldn't13:18
EmilienMjaosorior, marios, shardy, slagle, trown, bnemec: please do not approve patches until we sort things out for upgrade job - I noticed some patches landed this morning without all green jobs13:19
EmilienMmarios: https://review.openstack.org/32987413:19
jaosoriorEmilienM: that is the random error where it times out in the "overcloud deploy"13:19
jaosoriornot sure if it's a nova issue or ironic... but it happens very randomly13:20
Ngccamacho: I never left here! :D13:20
mariosEmilienM: ack don't think i did /usually avoid that13:20
*** ramishra has quit IRC13:20
ccamachoNg hey! :)13:20
Ngccamacho: thanks for the offer, I'm sure I'll be taking you up on it13:21
EmilienMI'm looking when it broke exactly13:21
EmilienMit broke yesterday between 4pm and 11pm on my TZ13:22
EmilienMI'm comparing packaging now13:22
EmilienMthat's the list of package diff: https://www.diffchecker.com/bnubbtry (left job that passed upgrade and right job that failed, a few hours after)13:24
hewbroccaNg: welcome!13:24
EmilienMa lot of puppet modules updates13:24
*** bfournie has quit IRC13:25
*** jprovazn has joined #tripleo13:27
thrashEmilienM: that looks more like it's just a different order than necessarily updates13:27
*** rcernin has joined #tripleo13:28
thrashEmilienM: odd tho... they don't even seem to *be* on the left (passing)13:28
EmilienMthis diff is actually better: https://www.diffchecker.com/b59ji2sc13:28
*** akshai has joined #tripleo13:28
thrashthat seems much more sane. :)13:29
EmilienMI sorted packages13:29
EmilienMsorry yeah13:29
thrashEmilienM: order different? Passing on right now? or left?13:30
EmilienMthrash: left is packages from a job that passed upgrade13:30
EmilienMthrash: right is a failing job13:30
thrashbecause otherwise, openstack-puppet-modules took a huge rollback. :)13:30
thrashopenstack-puppet-modules-8.1.1-0.20160609150428.ab63b38.el7.centos.noarch -> openstack-puppet-modules-8.0.0-0.20160520142355.6a3e8bf.el7.centos.noarch13:31
thrashthat's left to right13:31
thrashline 42013:31
ccamacholot of packages...13:31
thrashAnd puppet modules now coming from packages.13:31
thrashthat's the two things I see.13:31
pandaLooking at overcloud I see that puppet modules are symlinked to /etc/puppet/modules. What part of tripleo is creating those symlinks ?13:32
EmilienMthrash: there is a problem13:32
EmilienMthrash: puppet openstack modules version is 8.013:32
EmilienMit should be 9.0 no?13:32
EmilienMjayg: ^13:32
EmilienMthrash: yeah this regression looks weird13:33
jayg8 is mitaka for opm, what is the question?13:33
EmilienMjayg: we're investigating why upgrade job is broken in tripleo13:33
EmilienMjayg: something between yesterday 4pm and 11pm (our TZ) broke us13:34
EmilienMjayg: https://www.diffchecker.com/b59ji2sc13:34
EmilienMjayg: on your left, packages of a job that passed upgrade and on your right, packages from a job that failed upgrade job13:34
jaygI didn't tag anything in rdo yesterday, only did downstream build13:34
* jayg looks13:34
EmilienMwhy do we have openstack-puppet-modules-8.0.0-0.20160520142355.6a3e8bf.el7.centos.noarch on recent jobs?13:35
thrashEmilienM: and why would all of the puppet modules start coming from packages instead of source?13:35
EmilienMI have no idea13:36
jaygyeah, that is weird that the newer side shows older opm13:36
EmilienMlet's confirm on other jobs13:37
*** dtrainor has quit IRC13:37
EmilienMyeah I confirm13:37
EmilienMon another (failing) very recent job: openstack-puppet-modules-8.0.0-0.20160520142355.6a3e8bf.el7.centos.noarch13:37
trownEmilienM: I wonder if it is promote of current-tripleo13:37
*** dtrainor has joined #tripleo13:37
trownEmilienM: it happened this morning13:37
EmilienMit's installing Mitaka13:37
trownderekh: promote does not check upgrades job?13:38
dtantsurEmilienM, hey, did you have a chance to see my response on  https://review.openstack.org/#/c/319297/ ? I've just checked and the connection is set correctly in the resulting ironic.conf13:38
EmilienMtrown: no, it started to fail between 4pm and 11 pm last night13:38
trownEmilienM: oh, promote happened this morning13:38
EmilienMtrown: but maybe it's related...13:38
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: use the ansible-role-tripleo-inventory to override native inventory  https://review.openstack.org/32993813:38
openstackgerritMerged openstack/tripleo-ui: Register nodes new workflow  https://review.openstack.org/32366513:38
derekhtrown: correct, https://review.openstack.org/#/c/315075/2/scripts/mirror-server/mirror-server.pp13:38
EmilienMdtantsur: will look when our CI is back13:38
*** coolsvap has quit IRC13:39
jaosoriorEmilienM: the column of packages on the left, is it coming from a passing upgrades job?13:39
EmilienMderekh: any idea of what happens?13:39
*** jcoufal has joined #tripleo13:39
trownderekh: hmm that feels optimistic13:39
dtantsurEmilienM, sure, no hurry. just letting you know that it seems to work as expected13:39
ccamachojaosorior yeahp13:39
EmilienMjaosorior: like I said, left is green, right is red13:39
jaosoriordafuq13:39
derekhEmilienM: what happen on what? I havn't been following along, /me reads back13:39
trownI guess upgrades code path is not really dependent on anything external to tripleo that the other jobs are though13:39
EmilienMderekh: start at XX:33:4913:40
*** mbound has joined #tripleo13:41
*** jcoufal_ has joined #tripleo13:41
EmilienMtrown: can we compare packages before/after promotion?13:42
trownEmilienM: ya we can find the previous hash from https://trunk.rdoproject.org/centos7/promote-current-tripleo.log and compare the versions.csv in each13:44
*** jcoufal has quit IRC13:45
trownEmilienM: https://trunk.rdoproject.org/centos7/39/b4/39b44bf2ee28cc21ce92e5cd694cd82a4ad7ac8f_6bf0c01f/versions.csv vs https://trunk.rdoproject.org/centos7/db/aa/dbaa9e6db36181e1ec6d1c00b086fc6fb45e90e2_6686315c/versions.csv13:45
*** mbound has quit IRC13:46
EmilienMtrown: opm is on same hash13:47
trownEmilienM: but puppet modules are not http://chunk.io/f/e8484db8ae1e4fbd82f90f7000a42f1b13:47
*** rodrigods has quit IRC13:47
*** rodrigods has joined #tripleo13:48
trownthere are not many packages that DID NOT change13:48
derekhEmilienM: looking into it now, will shout if I find anything13:49
EmilienMthx13:49
EmilienMi'm quite sure this puppet regression makes the upgrade failing13:50
EmilienMwe can stop investigating pacemaker & things13:50
EmilienMchem``: ^13:50
*** pradk has joined #tripleo13:51
jaosoriorNow I'm seeing some errors related to the installation of tripleo-common13:51
jaosoriorError: Execution of '/bin/yum -d 0 -e 0 -y list tripleo-common' returned 1: Error: No matching Packages to list13:51
jaosoriorError: /Stage[main]/Main/Package[tripleo-common]/ensure: change from absent to present failed: Execution of '/bin/yum -d 0 -e 0 -y list tripleo-common' returned 1: Error: No matching Packages to list13:52
EmilienMtrown: thi sis the last patch that worked: https://review.openstack.org/#/c/312420/13:52
jaosoriorI've seen that in a couple of patches in the past hour13:52
EmilienMit was almost 4pm13:52
jaosoriorEmilienM: That's stable/mitaka; shouldn't we be looking for CRs for master?13:52
*** dtrainor has quit IRC13:54
*** rcernin has quit IRC13:54
EmilienMjaosorior: indeed13:54
*** dtrainor has joined #tripleo13:54
EmilienMjaosorior: the most recent patch I have is https://review.openstack.org/#/c/328361/13:55
EmilienMit was in the morning13:55
EmilienMdoing diff again13:55
*** links has quit IRC13:56
EmilienMjaosorior: https://www.diffchecker.com/f9hkwqmj13:59
*** dprince has joined #tripleo13:59
EmilienMthis is diff between https://review.openstack.org/#/c/324541/ and https://review.openstack.org/#/c/328361/14:00
jaosoriornow that looks like a more reasonable search14:00
EmilienMyeah14:01
jaosorioronly thing that merged in os-net-config was this https://review.openstack.org/#/c/291384/214:01
jaosoriorwhich is only adding debug statements14:01
*** ibravo2 has joined #tripleo14:01
*** egafford has joined #tripleo14:01
EmilienMyeah14:02
jaosoriorso it must be something from t-h-t14:02
*** cdearborn has joined #tripleo14:02
jaosoriorwe now have to fix the undercloud14:03
jaosoriorwhich is now broken it seems14:03
jaosoriorso now officially all the gate is red14:03
jaosoriorcrapo14:03
jaosoriorderekh, any idea what might be causing tripleo-common not to be found like I posted above? ^^14:04
EmilienMjaosorior: ask on #rdo14:04
EmilienMmaybe it's a repo thing14:04
*** ibravo has quit IRC14:05
ccamachoIm deploying from https://review.openstack.org/#/q/project:openstack/tripleo-heat-templates+status:merged the patches from the morning to see when fails.. time consuming task..14:05
derekhjaosorior: no idea off the top of my head, will take a look in a few minutes, tracking down the other error first14:06
*** tbonds has quit IRC14:06
jaosoriorEmilienM, derekh: they said that that package got renamed not too long ago14:07
jaosoriorto openstack-tripleo-common14:07
*** tbonds has joined #tripleo14:07
derekhjaosorior: that would do it14:08
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart: Prepare tempest if running it  https://review.openstack.org/32908214:08
jaosoriorgonna change the name in instack-undercloud14:08
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: Update tripleo-common package name  https://review.openstack.org/32996114:08
EmilienMjaosorior: ^14:08
*** olap has quit IRC14:08
*** rcernin has joined #tripleo14:09
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud: Use renamed tripleo-common package  https://review.openstack.org/32996214:09
jaosoriorEmilienM: ok, let me abandon my change14:09
EmilienMjaosorior: sorry man14:10
jaosoriorhaha no worries14:10
*** olap has joined #tripleo14:10
jaosoriorthe point is to get it fixed; not who fixes it14:10
jaosoriorEmilienM: +2ed your change14:10
EmilienMtrown: when was last promotion before the latest?14:10
EmilienMpackage was renamed 14 days ago14:10
EmilienMhow did we miss it?14:11
jaosoriorthat's an excellent question14:11
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-tripleo: WIP: integration of the new puppet pacemaker.  https://review.openstack.org/30906914:12
EmilienMlike apevec said, there is a Provides, so I don't understand14:13
EmilienMah indeed puppet3 does not support well virtual packages14:13
EmilienMso we need this undercloud patch14:13
*** masco has quit IRC14:13
jaosorioralright, makes sense then14:14
EmilienMI'm not sure it's going to fix upgrade job though14:14
jaosoriorit won't14:15
jaosoriorthat seems to be another issue14:15
*** fzdarsky has joined #tripleo14:17
jaosoriorpradk: upgrades gate is broken. recheck won't help14:18
*** rcernin has quit IRC14:18
ccamachojaosorior this patch https://review.openstack.org/#/c/327307/14:18
*** trown is now known as trown|mtg14:18
ccamachommm im deploying the previous one14:18
pradkjaosorior, good to know, thx14:19
jaosoriorccamacho: it had passed at some point. I can try doing a revert for that though14:19
ccamachojaosorior, no wait until I have the prev one deployed..14:19
jaosoriorccamacho: ??14:20
jaosoriorah14:20
jaosorioryou're testing locally14:20
jaosorioralright14:20
ccamachoyeahp14:20
*** hjensas__ has quit IRC14:22
*** jrist has quit IRC14:23
*** zoliXXL is now known as zoli|mtg14:24
*** dprince has quit IRC14:25
shardyslagle: Hey, any thoughts on how we might decommission https://github.com/agroup/ ?14:26
shardyI encountered some folks referring to the old instack* stuff in there recently14:26
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: use the ansible-role-tripleo-inventory to override native inventory  https://review.openstack.org/32993814:26
*** rcernin has joined #tripleo14:30
derekhshardy: slagle I see a "Delete this organization" button14:32
*** apetrich has quit IRC14:34
*** jefrite has quit IRC14:34
*** apetrich has joined #tripleo14:36
*** jrist has joined #tripleo14:36
*** jrist has joined #tripleo14:36
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: use environmental variables for ansible ssh configuration  https://review.openstack.org/32912414:37
*** bfournie has joined #tripleo14:43
*** trumpetnl has quit IRC14:44
openstackgerritBrad P. Crochet proposed openstack/python-tripleoclient: Add Mistral password to deployment  https://review.openstack.org/32998714:49
openstackgerritBrad P. Crochet proposed openstack/tripleo-heat-templates: Composable Mistral services  https://review.openstack.org/32343614:50
*** panda has quit IRC14:50
*** panda has joined #tripleo14:50
*** numans has quit IRC14:55
*** yamahata has quit IRC14:59
*** yamahata has joined #tripleo14:59
*** cllewellyn_ has quit IRC15:01
*** cllewellyn__ has quit IRC15:01
*** zoli|mtg is now known as zoli15:03
openstackgerritAdriano Petrich proposed openstack/tripleo-quickstart: inject debug options on the under/overcloud images  https://review.openstack.org/32999915:03
*** hewbrocca is now known as hewbrocca-afk15:05
EmilienMjaosorior: back15:05
EmilienMjaosorior: so it seems my patch helps, it's passing more than 50 min15:06
EmilienMso I guess it's deploying overcloud now15:06
*** cllewellyn_ has joined #tripleo15:06
*** cllewellyn__ has joined #tripleo15:06
*** hewbrocca-afk is now known as hewbrocca15:06
EmilienMccamacho, jaosorior, derekh: any update on upgrade job failure?15:06
ccamachoEmilienM im finishing to run the minor upgrade in https://review.openstack.org/#/c/328361/ @ Jun 14 7:17 PM as passed/merged the next one, failed/merged https://review.openstack.org/#/c/327318/ @ Jun 15 7:22 AM <-Networking related15:08
ccamachoif my test pass might be related to THT15:08
ccamachoif not the problem happened between Jun 14 7:17 PM and Jun 15 7:22 AM15:08
*** tobias_fiberdata has quit IRC15:09
*** numans has joined #tripleo15:09
ccamachoim basically based in this patches list https://review.openstack.org/#/q/project:openstack/tripleo-heat-templates+status:merged15:09
jaosoriorccamacho: You da man15:10
chem``EmilienM: ccamacho jaosorior derekh I've got an error I've never had before: http://logs.openstack.org/09/302409/43/check-tripleo/gate-tripleo-ci-centos-7-ha/8aca348/console.html#_2016-06-15_15_03_52_96715:10
jaosoriorEmilienM: Just waiting for your patch to pass CI so I can merge it15:10
chem``EmilienM: does it looks related to your patch or I just recheck ?15:11
jaosoriorchem`` that error is being fixed15:11
jaosoriorby EmilienM's patch15:11
chem``jaosorior: oki, so this is the tripleo package name stuff15:11
EmilienMchem``: http://logs.openstack.org/09/302409/43/check-tripleo/gate-tripleo-ci-centos-7-ha/8aca348/logs/undercloud/var/log/undercloud_install.txt.gz#_2016-06-15_14_51_58_00015:11
mariosjistr: sorry forgot to say, didn't get round to revisit the docs patches... bnemec thanks for comments there will do, i'll have another pass tomorrow15:12
EmilienMchem``: yes it is15:12
derekhEmilienM: I havn't find anything in the logs yet that seems relevant15:12
EmilienM:(15:12
jistrmarios: sure thing15:12
*** bfournie has quit IRC15:16
EmilienMthe good news is upgrade job still working on stable/mitaka15:20
EmilienMso it's really something in master15:20
*** ebarrera has quit IRC15:21
*** hjensas__ has joined #tripleo15:22
jistrbandini, marios: to have the thing complete and behaving nice, one probably needs all three of these patches https://github.com/openstack/tripleo-common/commits/master/undercloud_heat_plugins/server_update_allowed.py15:22
mariosjistr: ack nice thanks15:23
ccamachojaosorior, EmilienM, chem``, derekh got a different issue deploying the minor upgrade in https://review.openstack.org/#/c/328361/ (with the depends puppet-tripleo) , damn http://paste.openstack.org/show/516289/15:23
*** ifarkas has quit IRC15:23
EmilienMccamacho: logs of nova compute?15:24
*** aufi has quit IRC15:24
EmilienMwhy doesn't it start?15:24
*** leanderthal is now known as leanderthal|afk15:24
ccamachologging in there15:24
bandinijistr: + this that marios mentioned right? https://review.openstack.org/#/c/28383215:25
bandiniso 4 patches in total15:25
openstackgerritSanjay Upadhyay proposed openstack/tripleo-specs: new spec: tripleo-sriov  https://review.openstack.org/31387215:26
mariosbandini: no should be those three in total15:26
chem``ccamacho: failing restart is the problem we've seen this morning15:27
mariosbandini: you can get to the reviews like gerrit_url="http://review.openstack.org/#q,$1,n,z" where $1 is the change id from those commits15:27
bandinimarios: right, I am blind15:27
ccamachoyeahp15:27
ccamachoEmilienM, http://paste.openstack.org/show/516290/15:27
EmilienMmhh15:28
d0ugalMy undercloud install has stopped at: 2016-06-15 15:24:32 - Notice: /Stage[main]/Nova::Cert/Nova::Generic_service[cert]/Service[nova-cert]/ensure: ensure changed 'stopped' to 'running'15:28
EmilienMit sounds like rabbitmq is down or something?15:28
d0ugalany one else hitting this?15:28
EmilienMccamacho: it's only during upgrade?15:28
EmilienMccamacho: or also during deployment15:28
ccamachoyeahp15:28
ccamachothe deployment went fine15:28
ccamachoonly in the upgrade15:28
EmilienMok15:28
EmilienMthat"s super interesting15:28
EmilienMccamacho: rabbitmq status? up?15:29
EmilienMccamacho: can you also paste nova.conf please?15:29
ccamachoyeahp wait a min15:29
openstackgerritBrad P. Crochet proposed openstack/python-tripleoclient: Add Mistral password to deployment  https://review.openstack.org/32998715:29
openstackgerritBrad P. Crochet proposed openstack/puppet-tripleo: Add Mistral profiles  https://review.openstack.org/32343115:31
*** coolsvap has joined #tripleo15:31
EmilienMoh wait15:31
*** apetrich has quit IRC15:31
EmilienMI need to see your nova.conf15:32
EmilienMI can also look at jobs15:32
EmilienMmhh, no15:32
ccamachoEmilienM http://paste.openstack.org/show/516295/15:33
ccamachorabbit on the controller is running, let me check the config15:33
*** apetrich has joined #tripleo15:34
EmilienMlet me check something15:34
ccamachoThis is wrong.. auth_url=http://192.0.2.21:35357/v3 right ?15:34
*** bfournie has joined #tripleo15:34
*** shardy has quit IRC15:35
*** dsariel has quit IRC15:35
EmilienMon the compute, we have rabbit_userid=guest15:35
EmilienMnot on the controller15:35
hewbroccaO NOES not the damn rabbit password again15:36
*** saneax is now known as saneax_AFK15:37
*** olap has quit IRC15:37
EmilienMok I might have something15:37
*** mcornea has quit IRC15:40
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: compute: align rabbitmq configuration with nova-base service  https://review.openstack.org/33002215:41
EmilienMccamacho: ^ I'm not sure it will fix the problem15:41
EmilienMbut it's at least a good cleanup15:41
EmilienMccamacho: were you able to start nova compute by hand?15:42
*** tobias_fiberdata has joined #tripleo15:44
*** mikelk has quit IRC15:45
jaosoriorEmilienM:  it makes sense as a cleanup15:45
EmilienMjaosorior: yeah but I figured that nova.conf rabbit things are diff on controler/compute15:47
EmilienMthat's not good ^15:47
ccamachoEmilienM dead.. http://paste.openstack.org/show/516306/15:47
*** zoli is now known as zoli|gone15:48
EmilienMccamacho: wait15:49
EmilienMdon't touch anything15:49
ccamachoEmilienM ack15:49
*** oshvartz has quit IRC15:50
EmilienMccamacho: can I ssh maybe15:50
EmilienM?15:50
ccamachosure15:50
*** pcaruana has quit IRC15:50
*** xinwu has joined #tripleo15:50
EmilienMccamacho: how is controller going? nova conductor for example, can you restart it?15:51
*** jrist has quit IRC15:52
EmilienMI want to see if rabbit is only unavailable for compute service or for everything15:52
jaosoriorEmilienM: Well, they should be different, I guess15:52
*** trown|mtg is now known as trown15:52
*** dmk0202 has quit IRC15:52
EmilienMjaosorior: what different?15:52
jaosoriornow that I think about it, this is gonna end up being good for security; they should have different credentials15:52
jaosoriorcontrollers and computes15:52
*** rcernin has quit IRC15:53
EmilienMjaosorior: we talked about it during summit with ayoung15:53
EmilienMlet me find this issue first15:53
jaosorioryep15:53
EmilienMthis is not related15:53
ayoungI'll; join the convo in a sec...in another right now15:54
jaosoriorayoung: No convo about that yet. First we gotta debug something wrong in the upgrades gate15:55
ayoungk15:55
EmilienMok same for nova conductor15:55
EmilienMso it's not only compute15:55
*** tesseract has quit IRC15:55
EmilienMsame for cinder schedule as an example15:58
EmilienMso something is wrong with credentials in general15:58
*** zoli|gone is now known as zoli_gone-proxy15:58
*** ohamada_ has quit IRC15:59
ccamachoEmilienM so the creds are messed up?15:59
EmilienMmaybe, let me find why15:59
*** noslzzp has quit IRC15:59
*** xinwu has quit IRC16:00
EmilienMccamacho: do you have a nova.conf pre upgrade by any chance?16:00
*** pkovar has quit IRC16:01
EmilienMI'm wondering if credentials were updated during update16:01
*** jcoufal has joined #tripleo16:01
EmilienMccamacho: we should try again by 1) deploying overcloud 2) backup nova.conf on controller/compute nodes 3) run update 4) compare config files16:02
EmilienMcan we try that?16:02
EmilienMI suspect a change during the update that breaks services16:03
*** jcoufal_ has quit IRC16:03
EmilienMwait, credentials are good16:03
EmilienMyou can see them in /etc/rabbitmq/rabbitmq.config16:04
ccamachoEmilienM nope :( but16:04
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: Use quickstart.sh to manage venv in all ci-scripts  https://review.openstack.org/33004016:04
ccamachojust installed the pacemaker env, and then minor upgrade16:04
ccamachothe only thing i did there16:05
EmilienMccamacho: trying to restart rabbit16:05
*** athomas has quit IRC16:05
*** numans has quit IRC16:07
*** ramishra has joined #tripleo16:07
*** ramishra has quit IRC16:07
EmilienMccamacho: it works16:07
EmilienMI did one thing: pcs resource restart rabbitmq16:08
EmilienMso I'm not sure why but rabbitmq needs to be restarted during the update16:08
ccamachowithout changing anything ?¿16:08
openstackgerritJakub Libosvar proposed openstack/tripleo-heat-templates: Rename Neutron database name  https://review.openstack.org/33004216:08
ccamachommm16:08
EmilienMnope16:08
EmilienMand I'm not sure it's related to our CI issue16:08
EmilienMccamacho: is it?16:08
*** krotscheck is now known as krotscheck_dcm16:09
*** tremble has quit IRC16:09
ccamachonot sure, it should worked.. I will re-deploy it.. but it should worked.. as It passed CI...16:10
*** [1]cdearborn has joined #tripleo16:10
dtrainorI have a failed deployment.  I show CREATE_FAILED for Compute and Controller with 'heat stack-list --show-nested -f stack_status=CREATE_FAILED', but no failures in 'heat resource-list foo'.  Looking at the resource details via resource-show doesn't give me any clues either.  Where else can I look for information?16:10
*** tobias_fiberdata has quit IRC16:11
EmilienMccamacho: so16:12
EmilienMccamacho: if we compare with our CI failures16:12
EmilienMin CI we also have rabbit issues, or? let me verify16:13
*** ramishra has joined #tripleo16:13
EmilienMhttp://logs.openstack.org/14/329714/1/check-tripleo/gate-tripleo-ci-centos-7-upgrades/dec4380/logs/overcloud-novacompute-0/var/log/nova/nova-compute.txt.gz#_2016-06-15_07_56_50_31216:14
ccamachoEmilienM, if you deploy master and execute the minor upgrade, you will reproduce the error as is in the bug ticket, I tried to deploy a prev commit which passed CI to see if the error is related to THT but the rabbit issue hit me in the face..16:14
*** tobias_fiberdata has joined #tripleo16:15
*** milan has quit IRC16:16
EmilienMit sounds failing during ControllerPostPuppetRestartDeployment16:16
hewbroccasilly wabbit16:16
ccamachoEmilien, in the env you logged I was launching https://review.openstack.org/#/c/328361/ which passed16:16
EmilienMI think it fails during extraconfig/tasks/pacemaker_resource_restart.sh16:17
EmilienMwhen it restart rabbit16:17
EmilienMeverything in logs point to rabbit16:17
EmilienMjistr: you still around?16:18
hewbroccaEmilienM: He just left :(16:18
derekhslagle: bnemec been waiting for DNS to update so I can rerecord the rh2 deployment, will be doing it later tonight16:18
derekhslagle: bnemec gonna try and condense down to a 15 minute video to post somewhere16:19
EmilienMlaunchpad 156738516:19
openstackLaunchpad bug 1567385 in tripleo "Minor update always triggered on first stack-deploy after major upgrade" [High,Fix released] https://launchpad.net/bugs/1567385 - Assigned to Jiří Stránský (jistr)16:19
*** pkovar has joined #tripleo16:19
hewbroccaOh no, is it that one?16:19
EmilienMno16:20
EmilienMlaunchpad 156738416:20
openstackLaunchpad bug 1567384 in tripleo "Services not restarted on stack-update - config changes can go unapplied" [High,Fix released] https://launchpad.net/bugs/1567384 - Assigned to Jiří Stránský (jistr)16:20
ccamachoderekh let me know when published to link it to the tripleo channel I have created (https://www.youtube.com/channel/UCNGDxZGwUELpgaBoLvABsTA)16:20
EmilienMok that might be that one16:20
*** jaosorior has quit IRC16:20
derekhccamacho: will do16:20
EmilienMccamacho: ok I know where is the issue but can't find why, I'm going to add debug in script and kick off CI jobs16:21
openstackgerritDan Radez proposed openstack/tripleo-heat-templates: Adding Congress Support  https://review.openstack.org/33005016:21
derekhEmilienM: gotta run, sorry wasn't any help16:21
ccamachoEmilienM nice! What's the problem then?16:22
EmilienMwe got it covered16:22
EmilienMjust please don't merge anything until we fix this16:22
derekhack16:22
*** derekh has quit IRC16:22
EmilienMccamacho: the pcs resource restart rabbit16:22
EmilienMccamacho: maybe it fails16:22
ccamachoack16:22
EmilienMit causes to cloud to go down16:22
EmilienMccamacho: let me some time, I continue to read logs16:24
ccamachotoo late :(16:24
EmilienMccamacho: not on your setup16:25
EmilienMccamacho: you can break your setup16:25
*** cdearborn has quit IRC16:25
ccamachoEmilienM :)  sure then :)16:25
ccamachoAnyway your keys will remain in the undercloud without problems..16:26
*** pkovar has quit IRC16:28
*** numans has joined #tripleo16:30
*** dprince has joined #tripleo16:31
*** tobias_fiberdata has quit IRC16:34
*** jpena is now known as jpena|off16:35
*** cwolferh has joined #tripleo16:36
*** trown is now known as trown|lunch16:39
*** mgould is now known as mgould|afk16:40
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: DO NOT MERFGE - debug why upgrade job fails  https://review.openstack.org/33006916:42
EmilienMccamacho: ^16:42
*** pkovar has joined #tripleo16:42
EmilienMlet's see how it goes16:42
ccamachoEmilienM ack16:42
*** dmacpher is now known as dmacpher-afk16:43
*** mbound has joined #tripleo16:43
EmilienMany core around please look https://review.openstack.org/#/c/329961/ and see if we can land it16:43
EmilienMI think yes16:43
EmilienMnow I see some HA jobs failing too16:44
EmilienMhttp://logs.openstack.org/61/329961/1/check-tripleo/gate-tripleo-ci-centos-7-ha/20f4a7a/logs/postci.txt.gz#_2016-06-15_15_42_27_00016:44
*** cdearborn has joined #tripleo16:44
* EmilienM brb lunch16:46
*** yolanda has quit IRC16:46
*** xinwu has joined #tripleo16:49
*** [2]cdearborn has joined #tripleo16:50
*** cllewellyn_ has quit IRC16:53
*** cllewellyn__ has quit IRC16:53
*** apetrich has quit IRC16:53
*** oshvartz has joined #tripleo16:56
openstackgerritMike Burns proposed openstack/tripleo-common: update removed undercloud-package-install  https://review.openstack.org/33008416:56
*** milan has joined #tripleo16:56
*** yamahata has quit IRC16:56
*** apetrich has joined #tripleo16:58
*** [1]cdearborn has quit IRC16:58
EmilienMtrown|lunch: I know it's bad but I think we can land https://review.openstack.org/#/c/329961/ as it fix undercloud16:58
EmilienMand ha/upgade failures are not related I think16:59
EmilienMbut yeah it's bad16:59
openstackgerritLars Kellogg-Stedman proposed openstack/tripleo-quickstart: make --requirements cumulative  https://review.openstack.org/33008616:59
* EmilienM afk lunch16:59
*** NobodyCam has quit IRC16:59
*** igorbelikov has quit IRC16:59
*** dtantsur is now known as dtantsur|afk17:00
openstackgerritDan Prince proposed openstack/tripleo-common: Add RegisterNodesAction action  https://review.openstack.org/31958717:00
*** NobodyCam has joined #tripleo17:00
*** igorbelikov has joined #tripleo17:01
openstackgerritLars Kellogg-Stedman proposed openstack/tripleo-quickstart: Use quickstart.sh to manage venv in all ci-scripts  https://review.openstack.org/33004017:01
openstackgerritDan Prince proposed openstack/tripleo-common: Add baremetal workflows  https://review.openstack.org/30020017:05
*** cdearborn has quit IRC17:06
*** cdearborn has joined #tripleo17:07
openstackgerritwes hayutin proposed openstack/tripleo-quickstart: move release var from positional to an argument  https://review.openstack.org/33009117:09
openstackgerritBrad P. Crochet proposed openstack/puppet-tripleo: Add Mistral profiles  https://review.openstack.org/32343117:09
*** pkovar has quit IRC17:10
*** coolsvap has quit IRC17:11
*** numans has quit IRC17:13
*** bswartz has quit IRC17:13
openstackgerritBrad P. Crochet proposed openstack/tripleo-heat-templates: Composable Mistral services  https://review.openstack.org/32343617:14
*** fzdarsky is now known as fzdarsky|afk17:14
*** yamahata has joined #tripleo17:15
openstackgerritPradeep Kilambi proposed openstack/python-tripleoclient: Fix keystone init  https://review.openstack.org/33009617:19
*** [2]cdearborn has quit IRC17:21
*** pcaruana has joined #tripleo17:21
*** ramishra has quit IRC17:25
*** noslzzp has joined #tripleo17:29
*** fragatina has quit IRC17:29
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: Nodes Introspection new workflow  https://review.openstack.org/33011517:34
*** electrofelix has quit IRC17:34
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: Nodes Introspection new workflow  https://review.openstack.org/33011517:34
openstackgerritMerged openstack/tripleo-common: Add --json-output option to tripleo-build-images  https://review.openstack.org/32783017:40
openstackgerritBrad P. Crochet proposed openstack/puppet-tripleo: Add Mistral profiles  https://review.openstack.org/32343117:44
openstackgerritMerged openstack/instack-undercloud: Update tripleo-common package name  https://review.openstack.org/32996117:45
*** akshai has quit IRC17:47
*** akshai has joined #tripleo17:47
ccamachoEmilienM jaosorior, just to update you, I have redeployed a passing submission not affected by the timeout issue (The one with the rabbit issue i just re-deployed it again https://review.openstack.org/#/c/328361/) and had failed with the same error (1800 secs timeout when minor upgrade), so I dont think is related to THT or puppet-tripleo. It might be a package breaking the deployment?17:48
ccamachoEmilien I will leave the the deployment in that state just in case you want to log in and see the environment17:49
EmilienMback from lunch17:49
EmilienMccamacho: mhh ok17:49
ccamachoin this case rabbit restarted without issues but then the timeout issue, this is the current state of the overcloud deployment http://paste.openstack.org/show/516324/17:51
*** trown|lunch is now known as trown17:51
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: DO NOT MERFGE - debug why upgrade job fails  https://review.openstack.org/33006917:58
EmilienMccamacho: really I have no idea what's going on17:59
openstackgerritPradeep Kilambi proposed openstack/python-tripleoclient: Run post deploy config on force  https://review.openstack.org/33009617:59
*** bswartz has joined #tripleo18:04
openstackgerritPradeep Kilambi proposed openstack/python-tripleoclient: Run post deploy config on force  https://review.openstack.org/33009618:04
trownEmilienM: I am pretty confused how we got to this mess with tripleo-common vs openstack-tripleo-common18:05
trownEmilienM: as you said that rename happened two weeks ago18:05
*** mbound has quit IRC18:06
EmilienMtrown: me too18:08
EmilienMso many CI issues this week18:08
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add scale to roles gate  https://review.openstack.org/32954218:08
*** ccamacho is now known as ccamacho|out18:08
*** fragatina has joined #tripleo18:08
*** akshai_ has joined #tripleo18:10
*** akshai has quit IRC18:13
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: Use quickstart.sh to manage venv in all ci-scripts  https://review.openstack.org/33004018:14
*** cwolferh has quit IRC18:15
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: controller/cinder: set auth_uri with version-less endpoint  https://review.openstack.org/33012918:15
EmilienMbnemec: thx18:17
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add scale to roles gate  https://review.openstack.org/32954218:17
*** akshai_ has quit IRC18:17
*** akshai has joined #tripleo18:17
dprincetrown: how long as it been since we promoted?18:19
dprinceEmilienM: what did you want help w/?18:20
trowndprince: there was a promote this morning18:20
dprincetrown: so is that why we are broken perhaps?18:21
dprincetrown: perhaps the timing of that along with some other packaging change got us broken?18:21
EmilienMI think it broke before18:21
EmilienMit seems like yesterday18:21
trowndprince: ya, that was my first thought, but 1) tripleo-common is in our includepkgs so it should not get affected by promote and 2) tripleo-ci is doing the promote via periodic job, so how did periodic job pass18:22
EmilienMit does not seem related to promotion18:23
EmilienMtrown: wait, does promotion run upgrade job right?18:23
*** chem``` has joined #tripleo18:23
dprinceEmilienM: I think it runs all 3 (upgrade job included)18:24
EmilienMhttp://logs.openstack.org/periodic/periodic-tripleo-ci-centos-7-upgrades/3040af8/console.html18:24
EmilienMfailure18:24
trownEmilienM: dprince, but upgrade job does not vote18:24
trownalso, I think https://github.com/openstack-infra/tripleo-ci/blob/master/scripts/tripleo.sh#L233 may have got us18:24
EmilienMwere are the promotion logs ?18:24
trowntripleo-common in there and not openstack-tripleo-common18:24
dprincetrown: so this could be an issue18:24
EmilienMwhere*18:25
EmilienMI want to check if upgrade job passed the promotion18:25
*** chem`` has quit IRC18:25
trownya only ha and nonha are checked for promote https://github.com/openstack-infra/tripleo-ci/blob/master/scripts/mirror-server/mirror-server.pp#L5118:25
trownnot sure why we did that18:25
EmilienMdamn18:26
openstackgerritLars Kellogg-Stedman proposed openstack/tripleo-quickstart: make --requirements cumulative  https://review.openstack.org/33008618:26
EmilienMwe need to fix that18:26
*** cwolferh has joined #tripleo18:26
trownI dont know how to find logs from the "real" periodic job, but I have not seen upgrades passing on the fake one18:26
EmilienMtrown: the promotion upgrade job failed for the same reason18:27
EmilienMhttp://logs.openstack.org/periodic/periodic-tripleo-ci-centos-7-upgrades/3040af8/console.html#_2016-06-15_11_33_36_89618:27
EmilienMControllerPostPuppetRestartDeployment error18:27
EmilienMhopefully my patch https://review.openstack.org/#/c/330069/ can help to debug18:27
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: promote: add upgrade job part of voting  https://review.openstack.org/33013918:29
EmilienMtrown: is it good? ^18:29
trownEmilienM: I think we need to understand why it was left out originally... I am doubtful it was simply an oversight18:30
EmilienMwell, what I understand now is that we allowed a promotion that was not passing our CI jobs18:30
trownEmilienM: dprince, wdyt of merging https://review.openstack.org/#/c/329961/ without upgrades job, since upgrades job is known broken, and that at least fixes the other two18:30
EmilienMtrown: we landed it18:30
trownoh. great :)18:31
EmilienMyeah18:31
EmilienMthis one was safe to land18:31
*** jpich has quit IRC18:32
*** sambetts is now known as sambetts|afk18:32
*** hjensas__ has quit IRC18:33
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add full-deploy-with-scale script  https://review.openstack.org/33014618:34
*** egafford has quit IRC18:35
openstackgerritJohn Trowbridge proposed openstack-infra/tripleo-ci: Update current repo setup includepkgs  https://review.openstack.org/33014818:37
trownEmilienM: I think ^ fixes the bit that broke the undercloud on promote18:38
*** akshai has quit IRC18:39
EmilienMtrown: nice18:39
EmilienMtrown: +218:39
*** akshai has joined #tripleo18:39
EmilienMtrown: maybe related to upgrade job failure? (not sure)18:40
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add full-deploy-with-scale script  https://review.openstack.org/33014618:41
trownEmilienM: not sure, probably not18:41
*** jaosorior has joined #tripleo18:44
jaosoriorEmilienM: Still around?18:47
EmilienMyes18:47
jaosoriorhow's the upgrades job debugging going?18:48
jaosoriorany news about that?18:48
EmilienMjaosorior: not much18:48
EmilienMjaosorior: we figured that promotion job didn't run upgrade (it will in future)18:48
EmilienMwe also merged the tripleo-common package thing18:48
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add full-deploy-with-scale script  https://review.openstack.org/33014618:48
jaosoriorEmilienM: yeah, +2ed that18:48
EmilienMbut really nothing else18:48
jaosoriorthe promotion18:48
EmilienMjaosorior: waiting on https://review.openstack.org/#/c/330069/18:48
jaosoriorgot a patch with logs of the upgrades failure?18:48
EmilienMso we can have more debug on where it fails18:49
jaosoriorok18:49
EmilienMjaosorior: it's a ControllerPostPuppetRestartDeployment error18:49
EmilienMthe bash script that restart resurces fail18:49
jaosorioryeah, that's up to where I figured out18:49
dprincetrown: do we think the non-ha and ha jobs will pass with just 330148?18:49
dprincetrown: just wondering if if we should consider going ahead and sending it?18:50
*** apetrich has quit IRC18:50
*** panda has quit IRC18:50
*** panda has joined #tripleo18:50
jaosoriorEmilienM: got some logs from a run that had failed with it18:50
jaosoriorI'm not sure if it's a red herring18:51
trowndprince: kind of depends if something we landed in tripleo-common in the last 3 days is broken18:51
jaosoriorbut there's a bunch of "client unexpectedly closed TCP connection" in the end of the puppet logs18:51
trowndprince: I dont think that patch should be a requirement for ha and non-ha to pass though18:51
trowndprince: and I do not have much hope that patch will fix the upgrades job18:52
*** apetrich has joined #tripleo18:52
jaosoriorother than that I haven't noticed much :/18:53
dprincetrown: the recent ha and non-ha jobs I'm looking at all fail with Execution of '/bin/yum -d 0 -e 0 -y list tripleo-common' returned 1: Error: No matching Packages to list18:58
EmilienMdprince: yeah we fixed it18:58
trowndprince: that should be fixed by instack-undercloud patch18:58
jaosoriordprince: that was fixed already with a commit from EmilienM18:58
EmilienMhttps://review.openstack.org/32996118:59
*** akrivoka has quit IRC18:59
dprinceI saw that, just got confused.18:59
dprinceokay, so ha and non-ha should be fine then...18:59
EmilienMyes only upgrade is failing19:00
jaosorioryeah19:00
jaosoriorEmilienM: Seen this? http://paste.openstack.org/show/516336/19:04
EmilienMyes19:04
jaosorioraw19:04
jaosoriordamn19:04
jaosorioralright19:04
EmilienMbut thanks19:05
EmilienMit's really the problem19:05
EmilienMcluster dies during upgrade19:05
*** mbound has joined #tripleo19:06
dprincejaosorior, EmilienM that error message comes from our pacemaker_common_functions.sh19:06
openstackgerritSagi Shnaidman proposed openstack/tripleo-quickstart: Create playbook for running ansible tempest role  https://review.openstack.org/33016419:06
dprinceso it could just be we are hitting that timeout now19:06
EmilienMdprince: see https://review.openstack.org/#/c/330069/19:07
EmilienMI'm trying to debug it19:07
EmilienMwe thought it was rabbitmq19:07
jaosoriordprince: EmilienM is waaay ahead of us O_O19:07
EmilienMwhy would we get a timeout?19:07
EmilienMjaosorior: I spent my day on it19:07
jaosoriorI still thought it was rabbitmq19:08
dprinceEmilienM: perhaps because something (anything) is taking longer...19:08
jaosoriorit's very weirdly closing connections (from the logs)19:08
EmilienMwe hit this timeout 100% of time19:08
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-common: Updated from global requirements  https://review.openstack.org/32309019:09
*** egafford has joined #tripleo19:09
dprinceEmilienM: the resource is openstack-core19:10
dprinceso that could be a lot of things right?19:10
*** mbound has quit IRC19:11
EmilienMdprince: yeah I checked in logs, afict it was only rabbitmq that crashed19:12
openstackgerritLars Kellogg-Stedman proposed openstack/tripleo-quickstart: return global control of force_cached_image  https://review.openstack.org/33016619:19
*** dprince has quit IRC19:21
*** apetrich has quit IRC19:22
*** karthiks has quit IRC19:24
*** skramaja has quit IRC19:24
*** apetrich has joined #tripleo19:24
trownlarsks: do you have strong feelings about bumping the default stopping point of quickstart to post undercloud install? ie just after running `openstack undercloud install`19:30
larskstrown: that seems reasonable to me.19:32
openstackgerritgreghaynes proposed openstack/diskimage-builder: Move hook generation in to python  https://review.openstack.org/27113919:32
larskstrown: but not the post-install?19:32
bandinimarios, jistr: I think I know why heat decides to ignore Step2 in the major upgrade. The yum upgrade -y -q to mitaka at the end of Step 1, breaks the process somehow. Not sure why yet, but if I comment the yum update, Step2 takes place19:32
*** openstackgerrit has quit IRC19:33
trownlarsks: k, it increases the time of our "quick" gates a bit, but I think the user experience is a bit better19:33
*** openstackgerrit has joined #tripleo19:33
larskstrown: right, but i was asking, should we also run the post-install (e.g., stop after the tripleo/undercloud role is complete)?19:33
*** akshai_ has joined #tripleo19:33
larsksIn particular, that makes sure your network is set up correctly.19:34
trownlarsks: ya, there are quite a few things people can do after `openstack undercloud install` but before running deploy... though I guess that is what skip tags are for19:35
*** akshai has quit IRC19:35
trownfwiw, shardy would like to stop just before someone would run `openstack overcloud deploy` https://bugs.launchpad.net/tripleo-quickstart/+bug/156947719:35
openstackLaunchpad bug 1569477 in tripleo-quickstart 0.1 "Undercloud install should be automated by default" [High,Confirmed]19:35
*** bfournie1 has joined #tripleo19:35
trownI guess if we are changing it we could go for the full change...19:36
*** karthiks has joined #tripleo19:36
*** bfournie has quit IRC19:36
*** skramaja has joined #tripleo19:36
*** egafford1 has joined #tripleo19:39
openstackgerritBrad P. Crochet proposed openstack/tripleo-heat-templates: Composable Mistral services  https://review.openstack.org/32343619:40
openstackgerritBrad P. Crochet proposed openstack/puppet-tripleo: Add Mistral profiles  https://review.openstack.org/32343119:40
trownlarsks: now that we have the ability to run arbitrary playbooks with quickstart.sh, I am less convinced that tags are even worth the effort19:41
trowncould just have different playbooks for different flows19:41
*** egafford has quit IRC19:41
larskstrown: it may be worthwhile to maintain some sort of big switches (e.g., "do not install undercloud", "do not deploy overcloud", "do not validate") maybe.19:42
trownthe '*-scripts' tags are still nice19:42
larsksOr at least some way to control that via the quickstart.sh script.  Maybe we just include multiple playbooks or something...19:43
*** dsariel has joined #tripleo19:43
*** egafford1 is now known as egafford19:44
*** cllewellyn_ has joined #tripleo19:54
*** cllewellyn__ has joined #tripleo19:54
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Enable libvirt as a micro-service  https://review.openstack.org/32971819:55
*** jprovazn has quit IRC19:56
*** jaosorior has quit IRC19:57
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: Move default stopping point to just before overcloud deploy  https://review.openstack.org/33017619:57
trownpanda: ^ we should rebase your stuff on that I think19:57
trownpanda: specifically the ironic config for qemu://session19:58
*** jcoufal_ has joined #tripleo19:59
pandatrown: before or after it's merged ?19:59
*** krotscheck_dcm is now known as krotscheck20:00
trownpanda: suppose it doesn't matter20:00
openstackgerritHarry Rybacki proposed openstack/tripleo-quickstart: [WIP] Add scale to roles gate  https://review.openstack.org/32954220:00
*** jcoufal has quit IRC20:02
EmilienMtrown, bnemec: you want me to update commit message? or can we land it like it?20:02
bnemecEmilienM: If it passed CI, just fix the commit message and then land it.20:03
trownEmilienM: I think we should just update commit just before merging20:03
bnemecNo need to wait for another CI run on a commit message change.20:03
EmilienMk20:03
trownyep20:03
EmilienMI don't think CI test this code (or does it?)20:04
*** cllewellyn_ has quit IRC20:04
*** cllewellyn__ has quit IRC20:04
bnemecProbably not.20:04
bnemecIn fact, you might want to check with derek that once it's merged it gets applied to the actual CI env.20:04
EmilienMyep20:04
trownya, I think there is no automation to do the puppet apply20:05
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: promote: add upgrade job part of voting  https://review.openstack.org/33013920:05
EmilienMkilling useless job20:05
EmilienMsending an email to derek20:05
EmilienMfeel free to +2 again20:05
EmilienMI'll let him +A20:05
openstackgerritMerged openstack-infra/tripleo-ci: promote: add upgrade job part of voting  https://review.openstack.org/33013920:05
trownoh whoops20:05
trownmerged :)20:06
*** toure has joined #tripleo20:06
EmilienMlol20:06
EmilienMthanks trown !20:06
EmilienMso fast20:06
*** fzdarsky|afk has quit IRC20:07
*** dprince has joined #tripleo20:07
*** karts has joined #tripleo20:07
*** krsacme has joined #tripleo20:07
EmilienMtrown: no worries, I emailed him and he'll figure20:07
trowncool20:08
*** karthiks has quit IRC20:11
*** skramaja has quit IRC20:11
*** toure is now known as toure|biab20:19
*** MaxPC has quit IRC20:27
EmilienMtrown: ok my patch to debug finished CI20:31
EmilienMI'm currently digging into http://logs.openstack.org/69/330069/2/check-tripleo/gate-tripleo-ci-centos-7-upgrades/aecb52e/logs/overcloud-controller-0/var/log/messages20:31
EmilienMit failed before trying to stop rabbit20:31
EmilienMsee Jun 15 19:35:35 localhost systemd: Unit openstack-ceilometer-collector.service entered failed state.20:31
EmilienMdprince: ^20:32
EmilienMit failed earlier than you said in the review20:32
EmilienMI don't see the "pacemaker is about to restart rabbit"20:32
*** egafford has quit IRC20:32
EmilienMI see nothing special in http://logs.openstack.org/69/330069/2/check-tripleo/gate-tripleo-ci-centos-7-upgrades/aecb52e/logs/overcloud-controller-0/var/log/ceilometer/collector.txt.gz20:33
EmilienMJun 15 19:52:56 localhost pengine[11455]: warning: Processing failed op start for ip-fd00.fd00.fd00.3000..18 on overcloud-controller-0: unknown error (1)20:35
EmilienMwe didn't have it on previous jobs ^20:36
EmilienMwe really need a pacemaker guru20:38
EmilienMlet's file a bug20:39
EmilienMtrown: do we have a bug alraedy for it ^20:39
*** julim has quit IRC20:39
trownEmilienM: not that I am aware of20:39
EmilienMkk20:39
EmilienMtrown: https://bugs.launchpad.net/tripleo/+bug/159277620:40
EmilienMit's not only HA job20:40
openstackLaunchpad bug 1592776 in tripleo "Ha upgrade jobs failing with "cluster remained unstable for more than 1800 seconds"" [Undecided,New]20:40
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart: Update downloaded images to latest delorean repos  https://review.openstack.org/32789820:41
openstackgerritGabriele Cerami proposed openstack/tripleo-quickstart: Move ironic config to post install  https://review.openstack.org/32830020:41
*** noslzzp has quit IRC20:50
dprinceEmilienM: ack, I was looking at a different patch I think21:03
*** bfournie1 has quit IRC21:04
*** ooolpbot has joined #tripleo21:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION21:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159277621:10
*** ooolpbot has quit IRC21:10
openstackLaunchpad bug 1592776 in tripleo "upgrade jobs failing with "cluster remained unstable for more than 1800 seconds"" [Critical,Confirmed]21:10
*** trozet has quit IRC21:13
*** dprince has quit IRC21:13
*** trozet has joined #tripleo21:14
*** trown is now known as trown|outtypewww21:15
*** cmyster has quit IRC21:15
*** cmyster has joined #tripleo21:15
ayoungEmilienM, jmiu and Ozz helped me  figure out the problem from yesterday.  I was updating the HA Controller template, but deploying non HA.21:15
ayoungGot it working now21:16
EmilienMcool21:16
ayoungEmilienM, I'm even more dangerous than I was before21:16
ayounghttp://adam.younglogic.com/2016/06/custom-overcloud-deploys/21:16
ayoungEmilienM, I need to go play Dad for a while, but tomorrow, lets confer about V3 Keystone everywhere...21:17
*** rhallisey has quit IRC21:17
EmilienMayoung: enjoy :)21:18
*** jayg is now known as jayg|g0n321:24
*** lblanchard has quit IRC21:28
*** ccamacho|out has quit IRC21:33
*** myoung is now known as myoung|afk21:35
*** cdearborn has quit IRC21:47
*** weshay has quit IRC22:00
*** openstackgerrit has quit IRC22:02
*** yamahata has quit IRC22:04
*** openstackgerrit has joined #tripleo22:05
*** ibravo2 has quit IRC22:08
*** paramite has quit IRC22:08
*** jcoufal_ has quit IRC22:09
*** ooolpbot has joined #tripleo22:10
ooolpbotURGENT TRIPLEO TASKS NEED ATTENTION22:10
ooolpbothttps://bugs.launchpad.net/tripleo/+bug/159277622:10
*** ooolpbot has quit IRC22:10
openstackLaunchpad bug 1592776 in tripleo "upgrade jobs failing with "cluster remained unstable for more than 1800 seconds"" [Critical,Confirmed]22:10
*** mbound has joined #tripleo22:10
*** egafford has joined #tripleo22:20
*** rlandy has quit IRC22:24
*** abehl has quit IRC22:27
*** yamahata has joined #tripleo22:27
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates: Enable firewall by default on the overcloud  https://review.openstack.org/32183322:27
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates: Allow pcsd port in firewall  https://review.openstack.org/33024922:27
*** myoung|afk has quit IRC22:29
*** jcoufal has joined #tripleo22:55

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!