Monday, 2016-03-07

*** tiswanso has joined #tripleo00:52
*** Marga_ has quit IRC00:53
*** shivrao has joined #tripleo01:02
*** tiswanso has quit IRC01:03
*** shivrao has quit IRC01:13
*** shivrao has joined #tripleo01:44
*** shivrao has joined #tripleo01:45
*** shivrao has quit IRC01:50
*** tiswanso has joined #tripleo02:00
openstackgerritMerged openstack/tripleo-common: Expose TENANT_STACK_DEPLOY_ARGS  https://review.openstack.org/28816102:13
*** tiswanso has quit IRC02:15
*** dmacpher has joined #tripleo02:38
*** chlong has joined #tripleo02:45
*** tiswanso has joined #tripleo03:11
*** yuanying has quit IRC03:22
*** tiswanso has quit IRC03:24
*** yuanying has joined #tripleo03:40
*** yuanying has quit IRC03:44
*** shivrao has joined #tripleo03:48
*** shivrao has quit IRC03:53
*** ayoung has quit IRC03:54
*** yuanying has joined #tripleo04:08
*** links has joined #tripleo04:15
*** Marga_ has joined #tripleo04:17
*** tiswanso has joined #tripleo04:20
*** cwolferh has quit IRC04:20
*** tiswanso has quit IRC04:27
*** masco has joined #tripleo04:39
openstackgerritIan Wienand proposed openstack/diskimage-builder: Fix spurious = in dib-python readme  https://review.openstack.org/28699104:47
openstackgerritMerged openstack/diskimage-builder: Fix cloud-init-disable-resizefs README title  https://review.openstack.org/28699904:49
openstackgerritMerged openstack/diskimage-builder: Fix spurious = in dib-python readme  https://review.openstack.org/28699104:59
*** chlong has quit IRC05:12
*** oshvartz has quit IRC05:23
*** tiswanso has joined #tripleo05:24
*** saneax_AFK is now known as saneax05:25
*** chlong has joined #tripleo05:29
*** tiswanso has quit IRC05:29
*** trozet has quit IRC05:31
openstackgerritJuan Antonio Osorio Robles proposed openstack/puppet-tripleo: Make OpenStack service ports configurable in HAProxy  https://review.openstack.org/28719905:40
*** jaosorior has joined #tripleo05:43
*** rcernin has joined #tripleo05:47
*** rcernin has quit IRC05:57
*** cmyster has joined #tripleo06:03
*** cmyster has quit IRC06:03
*** cmyster has joined #tripleo06:03
*** tiswanso has joined #tripleo06:25
*** tiswanso has quit IRC06:32
*** olap has joined #tripleo06:48
*** akuznetsov has joined #tripleo06:52
*** aufi has joined #tripleo06:56
*** oshvartz has joined #tripleo07:07
*** aufi has quit IRC07:07
*** aufi has joined #tripleo07:09
*** Marga_ has quit IRC07:17
marios:( gerrit...07:20
jaosoriormarios: What's up?07:21
*** liverpooler has joined #tripleo07:21
*** dmacpher has quit IRC07:21
mariosjaosorior: slow/getting 503 proxy errors07:23
*** ccamacho has joined #tripleo07:23
-openstackstatus- NOTICE: gerrit is going to be restarted due to bad performance07:25
*** ChanServ changes topic to "gerrit is going to be restarted due to bad performance"07:25
*** ChanServ changes topic to "TripleO | CI status: http://tripleo.org/cistatus.html | Docs: http://tripleo.org/"07:28
*** tiswanso has joined #tripleo07:29
*** chlong has quit IRC07:30
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Fixup swift device string to delimit the ipv6 address with []  https://review.openstack.org/26752307:32
*** cmyster has quit IRC07:33
*** tiswanso has quit IRC07:41
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Fixup swift device string to delimit the ipv6 address with []  https://review.openstack.org/26752307:42
*** leanderthal|afk is now known as leanderthal07:43
*** akuznetsov has quit IRC07:43
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Fixup the memcached servers string in nova.conf for v6  https://review.openstack.org/27011007:46
*** olap has quit IRC07:46
openstackgerritMerged openstack/tripleo-heat-templates: Introduce a UpgradeScriptDeliveryWorfklow as part of tripleo upgrades  https://review.openstack.org/28731807:55
*** jprovazn has joined #tripleo07:57
*** ohamada has joined #tripleo07:58
*** akuznetsov has joined #tripleo08:00
*** ohamada has quit IRC08:01
*** xinwu has quit IRC08:01
*** akuznetsov has quit IRC08:01
*** akuznetsov has joined #tripleo08:01
*** akuznetsov has quit IRC08:02
*** ohamada has joined #tripleo08:02
*** xinwu has joined #tripleo08:02
*** fgimenez has joined #tripleo08:03
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Introduce a UpgradeScriptDeliveryWorfklow as part of tripleo upgrades  https://review.openstack.org/28921208:04
*** dshulyak has joined #tripleo08:05
*** rdopiera has joined #tripleo08:07
*** mikelk has joined #tripleo08:07
*** xinwu has quit IRC08:07
*** ccamacho has quit IRC08:09
*** athomas has joined #tripleo08:16
*** ishant has joined #tripleo08:23
*** pcaruana has joined #tripleo08:25
*** hjensas has joined #tripleo08:27
openstackgerritIshant Tyagi proposed openstack/os-collect-config: Add insecure option to the cfn collector  https://review.openstack.org/28472508:29
*** ccamacho has joined #tripleo08:33
*** tiswanso has joined #tripleo08:37
*** ccamacho has quit IRC08:37
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud: Fix default IP addressed violating rfc5737  https://review.openstack.org/28922108:42
*** ccamacho has joined #tripleo08:42
*** ccamacho_ has joined #tripleo08:44
*** ccamacho has quit IRC08:47
*** ccamacho_ is now known as ccamacho08:47
*** bandini has quit IRC08:47
*** bandini has joined #tripleo08:48
*** tiswanso has quit IRC08:48
*** olap has joined #tripleo08:53
*** akrivoka has joined #tripleo08:56
*** jaosorior has quit IRC09:00
*** jaosorior has joined #tripleo09:00
*** gfidente has joined #tripleo09:01
*** paramite has joined #tripleo09:01
*** mbound has joined #tripleo09:05
*** akuznetsov has joined #tripleo09:06
*** akuznetsov has quit IRC09:06
gfidenteanyone wants to merge netiso in ci? https://review.openstack.org/#/c/288163/09:10
*** ccamacho has quit IRC09:10
*** ccamacho has joined #tripleo09:11
*** shardy has joined #tripleo09:11
*** ifarkas has joined #tripleo09:12
*** palexster has joined #tripleo09:15
jprovazngfidente: good morning09:16
jprovazngfidente: do you have a sec/minute/hour for a couple of questions about IPv6?09:17
jaosoriorgfidente: Was there a reason why the 192.0.2.0/24 network was used for the defaults of network isolation?09:20
openstackgerritMerged openstack/tripleo-heat-templates: Fixup swift device string to delimit the ipv6 address with []  https://review.openstack.org/26752309:20
gfidentejprovazn, sure09:21
gfidentejaosorior, not sure I got your question right, which network you mean?09:21
shardyMorning all09:22
shardyHey, I have a question re the ipv6 patches - we're just landing them to master right, not backporting to liberty?09:22
jprovazngfidente: nova db sync is failing with http://paste.openstack.org/show/489508/ - shadower hit the same too, have you already seen this?09:22
gfidenteshardy, we will backport to liberty09:23
jaosoriorwell, addresses such as the ControlPlaneDefaultRoute, EC2MetadataIp which are in the netiso templates use 192.0.2.X addresses. And I was just wondering if there's a reason for that09:23
jaosoriorgfidente ^^09:23
gfidentejprovazn, yep there are two more patches I added to the list in trello09:23
gfidentejprovazn, this is for the db_sync issues https://review.openstack.org/#/c/288813/09:23
gfidentejprovazn, this you will also hit https://review.openstack.org/#/c/288826/09:23
jprovazngfidente: thanks, I will try with these09:24
gfidentejaosorior, oh the ctlplane is the network where the undercloud does baremetal provisioning09:24
shardygfidente: That seems pretty risky from a regressions perspective - we'll have to ensure trown|outtypewww is aware09:24
jprovaznshadower: ^09:24
jaosoriorgfidente: ah, thanks, the thing is, I'm changing the network the undercloud does the provisioning on, because of this bug https://bugs.launchpad.net/tripleo/+bug/1553222 . So yeah, seems I'll have to change that too. Thanks09:25
openstackLaunchpad bug 1553222 in tripleo "Default undercloud control plane network violates rfc5737" [Undecided,In progress] - Assigned to Juan Antonio Osorio Robles (juan-osorio-robles)09:25
jprovazngfidente: and second one - on Fri I was hitting keepalived segfault issue because I didn't use pacemaker env file. Is keepalived expected to work, do we support non-pacemaker deployments?09:25
gfidentejprovazn, I'd say not initially no09:26
gfidentejaosorior, I see, unfortunate decision that was :(09:26
gfidentejaosorior, we probably have references to that subnet in the docs around too :()09:26
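[Editor's sketch] The bug discussed above concerns the default control plane subnet 192.0.2.0/24, which is one of the three IPv4 blocks that RFC 5737 reserves for documentation. A minimal way to test whether an address falls into one of those blocks, assuming Python's standard ipaddress module; the helper name is_documentation_address is hypothetical and is not part of TripleO:

```python
import ipaddress

# The three documentation-only IPv4 blocks reserved by RFC 5737.
RFC5737_NETWORKS = [
    ipaddress.ip_network("192.0.2.0/24"),     # TEST-NET-1
    ipaddress.ip_network("198.51.100.0/24"),  # TEST-NET-2
    ipaddress.ip_network("203.0.113.0/24"),   # TEST-NET-3
]


def is_documentation_address(value):
    """Return True if `value` lies inside an RFC 5737 documentation block.

    Hypothetical helper for illustration only; it is not TripleO code.
    """
    addr = ipaddress.ip_address(value)
    return any(addr in net for net in RFC5737_NETWORKS)
```

Under this check, the 192.0.2.X defaults mentioned in the discussion would be flagged, while an ordinary private address such as 192.168.24.1 would not.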
jprovazngfidente: ack, should we update https://etherpad.openstack.org/p/tripleo-ipv6-support instructions - I don't see pacemaker env file included there09:27
jprovazn?09:27
gfidentejprovazn, yes!09:27
gfidentethanks!09:27
jprovaznnp09:27
gfidenteshardy, regressions you mean breaking ipv4 deployments?09:28
openstackgerritMoshe Levi proposed openstack/diskimage-builder: Add lshw package to ironic-agent  https://review.openstack.org/28923309:32
*** jistr has joined #tripleo09:33
*** mkovacik has joined #tripleo09:33
shadowerjprovazn, gfidente: awesome, thanks!09:34
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates: Make defautl addresses comply with rfc5737  https://review.openstack.org/28923409:35
*** panda has quit IRC09:40
*** olap has quit IRC09:40
*** panda has joined #tripleo09:40
*** olap has joined #tripleo09:42
*** mgould has joined #tripleo09:42
*** electrofelix has joined #tripleo09:44
*** derekh has joined #tripleo09:44
*** tiswanso has joined #tripleo09:45
*** dtantsur|afk is now known as dtantsur09:50
mariosgfidente: is there stable/liberty for the main v6 review? /me looks09:53
gfidentemarios, no not yes09:53
gfidente*yet09:53
gfidenteyou can cherry-pick that if it's clean!09:53
mariosgfidente: ah k... i want to make the stable/liberty for that swift-device one09:53
gfidenteyeah I think we need to figure what stable/liberty misses to make the port of the main ipv6 patch clean09:54
mariosgfidente: kk thanks man i'll revisit later today see if is done/if i have time to try the cherrypick myself09:54
*** tiswanso has quit IRC09:57
dtantsurMorning TripleOwl, happy monday: https://dl.dropboxusercontent.com/u/1730743/owls/owl-d16023ad6f67d3ca66722c195d18878e.jpg09:59
*** jtomasek_ has joined #tripleo10:02
*** xinwu has joined #tripleo10:03
openstackgerritMerged openstack/tripleo-common: Adds override for the overcloud node user in upgrade-non-controller  https://review.openstack.org/28795310:05
*** xinwu has quit IRC10:09
shadowermy Controller & Compute resource groups are stuck in DELETE_IN_PROGRESS. Any ideas on how to get them unstuck?10:12
shadowere.g. could restarting heat engine help?10:12
dtantsurshadower, sometimes repeating stack-delete several times helps :)10:12
shadowerdtantsur: I've tried a few times, but I'll try some more :-)10:13
dtantsurlol10:13
dtantsureventual consistency, y'know..10:13
openstackgerritMerged openstack/tripleo-heat-templates: Remove unsafe "unset" defaults  https://review.openstack.org/28805710:14
shadowershardy: any suggestions for a stuck stack-delete command? I'm not too comfortable with the "edit the database" hammer10:17
shardyshadower: Any clues as to why it's stuck?10:18
dtantsurshadower, btw, do you have problems deleting nova instances or what?10:18
* dtantsur is not familiar with heat concepts10:19
*** ccamacho has quit IRC10:19
shadowershardy, dtantsur: yeah it's the nova instances. I deleted them manually after that kept failing, but Heat's still stuck on deleting them (even though they're gone now)10:19
shardydtantsur: that's a good question actually - if nova can't delete the instances, doing the delete via heat won't help10:19
shadowershardy: nova deleted them fine when I asked it to directly10:20
shardyshadower: heat should ignore any resources which 404 on delete10:20
shardyso I'd check the heat-engine log for other errors10:20
shadowernothing suspicious, it just keeps running the destroy step10:23
shardyhttps://github.com/openstack/heat/blob/master/heat/engine/resources/openstack/nova/server.py#L142510:24
shardyyou should just hit that ignore_not_found path and return10:24
shadoweryeah10:24
shadowerI'll just edit the db10:25
*** ccamacho has joined #tripleo10:27
*** tzumainn has joined #tripleo10:28
*** olap has quit IRC10:29
*** olap has joined #tripleo10:30
shardyshadower: OK - it'd be good to actually understand the bug tho10:31
shadowershardy: yeah I know, but I'm feeling the pressure to test & review the ipv6 stuff, too :-(10:32
shadowerif it happens again, I'll debug it properly10:32
shardykk, fair enough :)10:32
openstackgerritMerged openstack/tripleo-heat-templates: Support network isolation without external nets  https://review.openstack.org/28759910:32
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci: Use IP addresses compliant with rfc5737  https://review.openstack.org/28925010:34
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud: Fix default IP addressed violating rfc5737  https://review.openstack.org/28922110:34
*** tzumainn has quit IRC10:34
*** jtomasek_ has quit IRC10:36
jaosoriorshardy, derekh: the above commits depend on each other, and I set the Depends-On flag for both... do you guys know if that will work for the CI to test those commits together?10:39
dtantsurgfidente, hi! could you please take a look at the liberty backport https://review.openstack.org/#/c/287713/ ? 1x +2, gate is fine10:39
*** ishant has quit IRC10:40
shadowershardy: okay so it may have been a neutron deletion issue instead: Heat tries to delete subnets that have ports with IP addresses in them10:41
derekhjaosorior: https://review.openstack.org/#/c/289221/ seems weird, when I click on the depends on it brings me back to itself10:43
derekhjaosorior: you mean (a needs b) and also (b needs a) ?10:43
jaosoriordamn10:44
shardyshadower: that's odd - it should delete all the ports first10:44
jaosoriorcopy paste problem10:44
jaosorioryeah, that should be the case10:44
jaosoriorderekh: I messed up the SHAs10:45
jaosoriorlet me fix that10:45
jaosoriorbut yeah, they both depend on each other10:45
shadowershardy: I've seen that happen a few times throughout the years but it's quite inconsistent10:45
derekhjaosorior: never tried it that way, somebody mentioned one time using merge-with10:45
derekhjaosorior: or something like that instead of depends on10:45
*** chlong has joined #tripleo10:45
shardyshadower: you can look at the events output from tripleoclient and figure out if it's trying to do things in the wrong order10:46
derekhjaosorior: but I haven't tried that either, worth looking for to see if it works, but I'm not sure exactly of the wording10:46
shardyshadower: FWIW I did add overcloud deletes to our CI, but had to remove it again because it took too much time :(10:46
*** athomas has quit IRC10:46
shadowershardy: what's the command for that?10:46
shadowerI know heat event-list, but dunno about tripleoclient10:47
shardyshadower: it outputs all the events by default to stdout10:47
shardyit's roughly the same as heat event-list -n5 overcloud10:47
shadowershadower: ah, I've been doing  "heat stack-delete overcloud"10:47
shadowerlol10:47
shadowershardy: ^10:47
shardyshadower: No, that's fine10:47
shardyduh10:48
shardyI was thinking about the deployment, where we output the events from tripleoclient10:48
shardy-enocoffee ;)10:48
shadowerright10:48
shadoweralthough I've deleted the ports manually and the stack is gone now. I'll look at the events next time this happens10:48
shadowerbut yeah, the error Heat reported was identical to me deleting the subnet manually -- before removing the ports10:49
openstackgerritJuan Antonio Osorio Robles proposed openstack/instack-undercloud: Fix default IP addressed violating rfc5737  https://review.openstack.org/28922110:49
*** tosky has joined #tripleo10:50
*** ccamacho has quit IRC10:51
*** ccamacho has joined #tripleo10:52
*** tiswanso has joined #tripleo10:53
*** dsneddon has joined #tripleo10:57
openstackgerritMerged openstack/tripleo-heat-templates: updating enable_ceph conditions for controller  https://review.openstack.org/28806010:59
openstackgerritMerged openstack/instack-undercloud: Enable extra hardware data collection and processing for ironic-inspector  https://review.openstack.org/28771311:00
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci: Add IP addresses compliant with rfc5737  https://review.openstack.org/28925011:00
*** tiswanso has quit IRC11:00
jaosoriorderekh: Well, at least in the documentation they highly discourage the usage of circular depends-on... And I didn't find that merge-with flag (or similar). If you find it or remember it at some point let me know. I have the feeling I will need it in subsequent changes.11:01
*** ccamacho has quit IRC11:01
jaosoriorstill looking for it though11:01
openstackgerritMerged openstack/tripleo-heat-templates: Fix password issue with mysql address for ceilometer  https://review.openstack.org/28797011:02
openstackgerritMerged openstack/tripleo-heat-templates: Add a sample network-environment.yaml file to environments  https://review.openstack.org/28614411:08
jaosoriormarios, jistr, gfidente: If there is a value in the puppet manifests being set. And that same value is also being set in the hieradata, what gets prioritized?11:11
gfidentejaosorior, it depends on which hiera11:12
jaosoriorgfidente: puppet/hieradata/compute.yaml11:12
gfidentejaosorior, the static controller.yaml stuff is *after* tht11:12
gfidenteoh sorry you're asking manifests and hiera or templates and hiera?11:12
jaosoriorin a commit, there is a value in puppet/manifests/overcloud_compute.pp that is being set, and is also being set in puppet/hieradata/compute.yaml11:13
gfidentethe one in pp prevails11:13
jaosoriorI see11:13
jaosorioralright, thanks dude11:14
gfidenteI was confused by tht vs hiera because we had similar conversation with jistr the other day11:14
gfidentethen I figured we were talking about manifest vs hiera , not tht11:14
jaosoriorgfidente: Ah, now I see you co-authored this https://review.openstack.org/#/c/270831/8 I +1ed it since it won't be a problem. But yeah, there is that same value being set in the hieradata11:15
gfidenteoh man no problem, -1 that11:15
gfidentethanks for pointing it out11:15
*** jcoufal has joined #tripleo11:15
jaosoriordtanstur, I've actually tried it11:16
jaosorioryour comment on this CR https://review.openstack.org/#/c/289221/11:16
jaosoriorand it gets correctly updated by the puppet manifests with the br-ctlplane updated11:17
jaosorior* dtantsur ^^11:17
openstackgerritMerged openstack-infra/tripleo-ci: IPv4 network isolation testing for HA and Ceph  https://review.openstack.org/28816311:19
gfidentederekh, that was bold :)11:19
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: puppet: allow config of ad-hoc Neutron settings  https://review.openstack.org/28927011:21
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: puppet: allow config of ad-hoc Cinder settings  https://review.openstack.org/28927111:21
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: puppet: allow config of ad-hoc Heat settings  https://review.openstack.org/28927211:21
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: puppet: allow config of ad-hoc Glance settings  https://review.openstack.org/28927311:21
dtantsurjaosorior, ok good, but please mention it in the commit message11:21
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Set swift replicas = min(device_count, replicas)  https://review.openstack.org/28927411:22
jaosoriordtantsur: I usually don't add test results in the commit message.. Do you mind if I just mention it in a comment in that CR?11:23
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Update VNI and TunnelID ranges.  https://review.openstack.org/28927611:23
dtantsurjaosorior, in case of upgrade implications, the note should be in the commit message IMO11:23
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates: [WIP] Enable IPv4/IPv6 dual-stack Public API endpoints  https://review.openstack.org/28927911:26
openstackgerritDan Sneddon proposed openstack/tripleo-heat-templates: [WIP] Enable IPv4/IPv6 dual-stack Public API endpoints  https://review.openstack.org/28927911:27
*** chlong has quit IRC11:29
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: DO NOT MERGE: Add network templates for multiple NIC configuration  https://review.openstack.org/28928611:38
*** ccamacho has joined #tripleo11:41
*** chlong has joined #tripleo11:42
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Allow the containerized compute node to spawn larger VMs  https://review.openstack.org/28882211:46
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Remove unused Neutron Agents container  https://review.openstack.org/28791811:46
*** masco has quit IRC11:47
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Change default bond-mode  https://review.openstack.org/28760311:48
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Adding ManagementIpSubnet to linux bridge net conf  https://review.openstack.org/28760211:48
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Add Management Network For System Administration.  https://review.openstack.org/26496311:48
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Add all isolated networks to all nodes.  https://review.openstack.org/26883311:48
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Add network templates for multiple NIC configuration  https://review.openstack.org/28760011:48
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Allow for usage of pre-allocated IPs for the management network  https://review.openstack.org/28928911:48
*** tiswanso has joined #tripleo11:56
*** jtomasek_ has joined #tripleo11:57
*** saneax is now known as saneax_AFK11:58
gfidenteguys what is going to happen with aodh in liberty?11:58
gfidenteshould we merge all tht/puppet changes for that or rather not?11:59
gfidenteshardy, ^^11:59
gfidente(backport)11:59
*** lucasagomes is now known as lucas-hungry12:03
jprovazngfidente: it seems my compute has unset rabbit_hosts in /etc/nova/nova.conf which causes nova-compute to fail to start, I wonder if you hit this?12:05
*** xinwu has joined #tripleo12:05
*** tiswanso has quit IRC12:06
jprovaznshadower: ^ how is your rabbit_hosts doing on compute node?12:07
gfidentejprovazn, I got a working guest with the patches12:08
gfidente(in the overcloud)12:08
* jprovazn wonders what's different :(12:08
gfidenteso you have it set on the controllers but not on the computes?12:09
openstackgerritJames Slagle proposed openstack-infra/tripleo-ci: Use swap-partition.yaml environment  https://review.openstack.org/28908512:09
*** xinwu has quit IRC12:10
jprovazngfidente: good point. controller is unset too12:10
gfidenteshit ?12:10
jprovaznI checked compute first because heat stack-create is stuck on it ATM12:10
openstackgerritDmitry Tantsur proposed openstack/instack-undercloud: Enable IPA debug logging during introspection when undercloud_debug is True  https://review.openstack.org/28929712:11
jprovazngfidente: just to confirm, I used the exact same network environment yaml file and OC deploy command as is here: https://etherpad.openstack.org/p/tripleo-ipv6-support12:11
*** trown|outtypewww is now known as trown12:12
gfidentejprovazn, nah, let me update the pad12:13
shardygfidente: I'd chat with EmilienM - my assumption was that we wouldn't backport all those patches to liberty though12:13
jprovaznshadower: ^ you will hit this too I guess12:14
gfidenteshardy, yeah I'd be inclined to trop aodh in liberty too12:15
gfidente*drop12:15
*** mburned_out is now known as mburned12:15
*** jtomasek_ has quit IRC12:16
gfidentejprovazn, though I am not sure if/how that would change the rabbit_hosts thing12:16
jprovazngfidente: what did you add? -e $THT/environments/net-single-nic-with-vlans-v6.yaml?12:17
gfidentejprovazn, yep12:17
jprovazngfidente: this is 14 patches I have applied on Friday afternoon, since then at least the biggest one was merged. There wasn't any fix/change in code itself though, was it? should I re-apply?12:18
gfidentejprovazn, nothing, it doesn't change anything12:18
jprovaznhttp://paste.openstack.org/show/489527/12:18
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Generate fedora-atomic images using dib  https://review.openstack.org/28716712:19
*** cmyster has joined #tripleo12:21
shadowerjprovazn, gfidente: my overcloud actually deployed, but the post-deployment steps couldn't reach keystone12:31
jprovaznshadower: nice12:31
shadowerthat's with gfidente's sdn and rabbit patches on top12:31
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Update typos  https://review.openstack.org/28930512:32
gfidenteshadower, ok that's because the undercloud can't reach the keystone public address12:32
shadowerah ok12:32
gfidenteshadower, let me paste something12:32
gfidenteshadower, try this on the undercloud https://gist.github.com/gfidente/729fa398c368640c739a12:32
gfidenteyou should be able to ping the public_virtual_ip from the undercloud then12:33
shadowergfidente: I can!12:33
jaosoriorHey guys, do you know if there are already any defined talks for tripleo on the openstack summit?12:34
gfidentejaosorior, I think dprince and shardy have one?12:34
gfidenteshadower, so in theory if you re-deploy now ...12:35
gfidente:)12:35
shadowerI'll try :-)12:35
shadoweralso time to show some love to the patches :-)12:36
gfidenteshadower, try using the overcloud, things might just fail on stuff which puppet doesn't figure itself during configuration12:38
shadoweryeah12:39
shadowergfidente: what's the difference (in puppet) between "include" and "class"?12:39
shadowerI've seen you replace one with the other in the rabbitmq patch12:39
gfidenteinclude sets the class param values to its default or hiera bindings12:41
gfidenteclass allows you to override some of those within the manifest12:41
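[Editor's sketch] The answer above, together with the earlier remark that a value set in the manifest wins over the same value in hieradata, describes a lookup order: parameters passed in a class declaration first, then Hiera data, then the class default. A toy Python model of that order, purely for illustration; resolve_param and the flat dictionaries are hypothetical, and real Puppet looks Hiera keys up as classname::param:

```python
def resolve_param(name, defaults, hiera, declared=None):
    """Toy model of how a Puppet class parameter gets its value.

    Lookup order modelled here (highest priority first):
      1. parameters passed in a `class { ... }` declaration,
      2. Hiera data bindings,
      3. the default written in the class definition.
    Hypothetical helper; this is not how Puppet is implemented.
    """
    declared = declared or {}
    for source in (declared, hiera, defaults):
        if name in source:
            return source[name]
    raise KeyError(name)


# Illustrative values only.
defaults = {"port": 5672}
hiera = {"port": 5673}

# `include`-style: nothing is passed in the manifest, so Hiera wins.
included = resolve_param("port", defaults, hiera)

# `class { ... }`-style: the value passed in the manifest wins.
overridden = resolve_param("port", defaults, hiera, declared={"port": 5674})
```

In this model `included` takes the Hiera value and `overridden` takes the value passed in the declaration, which mirrors the behaviour described in the two messages above.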
*** dprince has joined #tripleo12:42
*** weshay has joined #tripleo12:43
gfidentemarios, that looks like communication across the pacemaker nodes is unstable? this just changed in ci yes, it's using multiple nics on each vm to test netiso12:44
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates: Make injected CA file readable by others  https://review.openstack.org/28930712:48
jprovazngfidente: shadower: it's failing like a boss today :/ - http://paste.openstack.org/show/489529/12:48
jprovaznI haven't hit this one yet though12:48
*** rhallisey has joined #tripleo12:51
shadower:-(12:53
shadowerjprovazn: I'm having issues with Heat deleting stuff today12:54
gfidentejprovazn, shadower I remember something with the selinux context in which httpd was running?12:54
*** ccamacho has quit IRC12:55
shadowergfidente: yea me too but that was over a year ago...12:56
*** jayg|g0n3 is now known as jayg12:56
openstackgerritMerged openstack/tripleo-heat-templates: Add IPv6 versions of the Controller NIC configs  https://review.openstack.org/26987212:56
shadowerthough it's possible there's more than one instance of that12:56
gfidenteI think jistr might help us here12:57
* jistr scans the conversation12:57
gfidentejistr, do you have any pointer to the systemd/selinux issue which was causing httpd start to fail?12:57
*** cmyster has quit IRC12:59
*** colonwq has joined #tripleo12:59
*** cmyster has joined #tripleo13:01
*** cmyster has quit IRC13:01
*** cmyster has joined #tripleo13:01
jistrthat was because something in horizon/httpd wanted to use syslog, but systemd-journald was down due to some selinux-mislabeled file, so things went bad. The paste doesn't look too familiar to me though. jprovazn is systemd-journald running on that system?13:01
*** cmyster has quit IRC13:02
*** cmyster has joined #tripleo13:03
*** tiswanso has joined #tripleo13:03
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Remove Ceilometer Alarm from the overcloud  https://review.openstack.org/28812013:04
jprovaznjistr: I deleted the stack meantime :(, I'll tell when I hit it again13:05
*** nico_auv has joined #tripleo13:09
*** lucas-hungry is now known as lucasagomes13:10
*** rbrady has joined #tripleo13:14
*** cmyster has quit IRC13:16
*** cmyster has joined #tripleo13:16
*** cmyster has quit IRC13:16
*** cmyster has joined #tripleo13:16
*** tiswanso has quit IRC13:17
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: loadbalancer: add Aodh API support  https://review.openstack.org/28812213:18
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Deploy Aodh services, replacing Ceilometer Alarm  https://review.openstack.org/28813013:21
shadowershardy: I hit the stack-delete issue again: http://paste.fedoraproject.org/335112/45735700/raw/13:24
gfidentejistr, is the keystone/httpd thing going to liberty?13:24
shadowershardy: I misunderstood it before -- looks like it tries to delete the nested stacks under Compute & Controller RGs but then doesn't actually try deleting the resources under them13:25
shardyshadower: Are you running an older heat?13:25
shadowershardy: could be -- it's on a centos-based instack13:26
shardyshadower: Ah, if it's upstream instack based that should be fairly recent13:26
shadowershardy: $ heat-engine --version13:26
shadower6.0.013:26
shardyshadower: there was a bug fixed a month or so ago where hooks weren't cleared properly and operations got stuck as a result13:27
* shardy finds the patch13:27
*** links has quit IRC13:28
shardyhttps://review.openstack.org/#/q/Ie894f416a898edeca3b6a123392853213893ff74,n,z13:28
jprovaznjistr: gfidente: after re-deployment I hit httpd issue again, to answer your prev question - yes systemd-journald is running13:28
jistrgfidente: no it's not AFAIK. It would need upgrades support. And i have the same concern about Ceilometer / AODH.13:28
gfidentejistr, ah good point about aodh13:28
gfidenteEmilienM, ^^13:29
jistrjprovazn, gfidente: hmm so it's probably something different than the selinux issue we had before13:29
EmilienMjistr: what concerns?13:29
EmilienMjistr: have you pinged pradk about that?13:29
shardyshadower: it might be worth double checking there's no hooks around - I guess you could query the resource data in the DB just to be sure13:29
*** ccamacho has joined #tripleo13:30
*** ohamada has quit IRC13:30
openstackgerritMerged openstack/tripleo-heat-templates: Add meta notify=true to rabbitmq resource  https://review.openstack.org/28519813:30
*** ohamada has joined #tripleo13:30
jistrEmilienM: missing migration logic for upgrades. Do we absolutely need Ceilo/AODH change in Liberty? At this point we need to cut off even the things we need, only the things that we *absolutely* need can stay :D13:30
shadowershardy: thanks, I do seem to have that in though (checked the source)13:30
jistrEmilienM: didn't ping him. Doesn't seem like he's around atm.13:31
gfidentejistr just set another channel topic to me13:32
jistrgfidente: :))13:32
jistrgfidente: btw if we land this, we'll have a place where we can start putting the upgrade migration logic (removing 'delay' resource, eventlet->wsgi migration, ceilometer/aodh migration) https://review.openstack.org/#/c/285416/13:33
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-common: Change IPs used for the pingtest to comply with rfc5737  https://review.openstack.org/28933113:33
jistrgfidente: thx!13:34
*** dmacpher has joined #tripleo13:35
*** pradk has joined #tripleo13:35
openstackgerritMerged openstack/tripleo-heat-templates: Function library for major upgrades  https://review.openstack.org/28541613:38
*** xinwu has joined #tripleo13:39
*** panda has quit IRC13:39
jistrpradk: hi, i wanted to verify with you, is it absolutely necessary to have the AODH change backported to Liberty? At this point we're trying to keep potentially invasive changes away from Liberty. If we don't have the AODH patch, will something break?13:40
*** panda has joined #tripleo13:40
*** ishant has joined #tripleo13:41
pradkjistr, Hi, So Aodh replaces ceilometer alarm in liberty which is deprecated. So if aodh doesnt make it into liberty we'll be shipping pretty much dead code and will incur a large backport cost in liberty.13:41
shadowershardy: good idea, thanks13:41
pradkjistr, so imho it is necessary13:42
* shadower has been reading the scheduler code (where it seems stuck) in the meantime but nothing useful yet13:42
jprovazngfidente: jistr: httpd is failing to start because haproxy is sitting on the port already - http://paste.openstack.org/show/489540/13:42
jprovaznI would expect then that httpd has to listen on localhost port only13:43
jprovaznbut *:8042 is set:13:43
jprovazn[root@overcloud-controller-0 systemd]# grep -r 8042 /etc/httpd/13:43
jprovazn/etc/httpd/conf/ports.conf:Listen 804213:43
jprovazn/etc/httpd/conf.d/10-aodh_wsgi.conf:<VirtualHost *:8042>13:43
shadowershardy: do you happen to remember which tables we store hooks in?13:43
shadoweroh you actually said resource data, sorry13:44
jistrpradk: ok. Just to give more context, the main concern here is about migrating existing deployments to the new model. E.g. despite a change that moves Keystone from eventlet to WSGI is done in Mitaka, we're not backporting it to Liberty because we miss the upgrade migration logic for this change.13:44
*** jdob has joined #tripleo13:44
*** xinwu has quit IRC13:44
jistrjprovazn: hmm ok that looks related to the AODH change we're just talking about  /cc pradk13:45
gfidentejprovazn, so I think haproxy is meant to bind there, httpd to use a local ip13:45
gfidenteand yes that's from the aodh changes we're trying to figure if should be ported to liberty13:45
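The collision described above (one process already listening on a port, a second one then trying to take the same port on the wildcard address, as `Listen 8042` / `<VirtualHost *:8042>` does) can be reproduced with plain sockets. This is only a minimal sketch, assuming a Linux host where all of 127.0.0.0/8 is local; the helper name `try_bind` is made up for illustration and a kernel-chosen port stands in for 8042. It says nothing about how haproxy or httpd are actually configured.

```python
import errno
import socket


def try_bind(address, port):
    """Return (socket, None) on success, or (None, errno) if bind fails."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((address, port))
        sock.listen(1)
        return sock, None
    except OSError as exc:
        sock.close()
        return None, exc.errno


# First listener takes a specific address (standing in for haproxy on a VIP).
first, _ = try_bind("127.0.0.1", 0)  # port 0: let the kernel pick a free port
port = first.getsockname()[1]

# A second listener on the wildcard address (like "*:8042") collides with it.
second, wildcard_error = try_bind("0.0.0.0", port)

# A second listener on a different specific address is attempted as well;
# on Linux this normally succeeds (assumption: 127.0.0.2 is bindable here).
third, specific_error = try_bind("127.0.0.2", port)

for sock in (first, second, third):
    if sock is not None:
        sock.close()
```

The wildcard bind is the case that fails with "address already in use", which is why a configurable bind address for the vhost matters once haproxy owns the port on the VIP.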
shadowershardy: so, all I see in resource_data are credentials and keys, no hooks13:46
pradkjistr, i see, so we cannot start aodh with wsgi in liberty13:46
pradk?13:46
gfidentejprovazn, it's interesting it didn't happen to me though or shadower ?13:46
jprovazngfidente: yes, weird13:46
jprovaznmight be a race issue maybe13:46
jprovazngfidente: can you please try "grep -r 8042 /etc/httpd/" on your OC?13:47
jprovaznfor shadower it passed because he doesn't have any aodh config under httpd at all13:47
jprovaznI wonder if I should pass an additional param when deploying stack?13:47
gfidentejprovazn, I was doing just that and I don't have it indeed13:48
*** ishant has quit IRC13:48
gfidentejprovazn, we're also using the same overcloud image right?13:49
jprovazngfidente: only difference I know about is that shadower uses his own OC images, I use yours copied on Friday13:49
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137613:49
*** tiswanso has joined #tripleo13:50
jistrpradk: we could use wsgi, but for every change we do, we need to think about how to get an existing deployment into the new state. E.g. here we remove the pacemaker resources for ceilometer alarm from the manifest, which means they won't be created in a new deployment, but it doesn't mean that they will be removed on an already existing deployment.13:52
jistrpradk: https://review.openstack.org/#/c/288120/313:52
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612413:55
*** morazi has joined #tripleo13:55
*** tiswanso has quit IRC13:55
pradkjistr, i understand now, thanks .. i can try to help with upgrade logic for ceilo to aodh if thats something we need for this to get into liberty13:55
pradkjistr, can you tell me what and where it needs to be done?13:56
jistrpradk: we just merged this change https://review.openstack.org/285416 where the functions for upgrade logic can go, and they'd then be called from13:58
jistrhttps://github.com/openstack/tripleo-heat-templates/blob/master/extraconfig/tasks/major_upgrade_controller_pacemaker_1.sh13:58
jistror https://github.com/openstack/tripleo-heat-templates/blob/master/extraconfig/tasks/major_upgrade_controller_pacemaker_2.sh13:58
jistras applicable13:58
*** rlandy has joined #tripleo13:59
*** dustins has joined #tripleo14:00
jprovaznEmilienM: it might be https://review.openstack.org/#/c/241408/ introduces a bug with httpd failing to start: http://paste.openstack.org/show/489540/14:01
gfidentejistr, pradk, EmilienM I think the issue jprovazn just found out tells we should even revert it in master14:01
jprovaznthanks gfidente for pointing this out14:01
thrashtrown: you ever see this or have an idea on how to fix? /tmp/in_target.d/install.d/60-ironic-agent-install: line 13: virtualenv: command not found14:02
gfidentejprovazn, you did all the things :)14:02
jprovazngfidente: hah, sure - I made it fail :)14:02
trownthrash: you are hitting that building the IPA ramdisk?14:02
thrashtrown: yes14:03
gfidentejprovazn, exactly it's always been like that14:03
trownthrash: seems like python-virtualenv is not installed14:03
gfidenteI remember it14:03
jprovaznhehe14:03
trownthrash: but... why is IPA element needing a virtualenv?14:03
thrashtrown: not really sure.14:04
pino|work(dib world, never know what nonsense you might find)14:04
trownlol pino|work :)14:04
pino|work(and yet the biggest nonsense is that people keep using it... "but that's none of my business")14:05
pradkgfidente, i think there is already a revert in progress in master, so apache is not binding to that port is the issue?14:05
pradkgfidente, interesting i dont see that locally14:06
gfidentepradk, apache is binding on the port but it's in use by haproxy14:06
gfidentepradk, I think we need to pass via hieraconfig the virtualhost ip14:06
gfidentepradk, like we do already for horizon14:07
pradkgfidente, thought we already did that.. lemme check14:07
*** cmyster has quit IRC14:07
gfidentepradk, but even then it's not working as expected because jprovazn gets *:804214:07
thrashtrown: It looks like it's doing the source install of ironic agent and not package.14:07
trownthrash: ya that would not be what we want :)14:07
thrashtrown: let me figure out why that is happening.14:08
*** cmyster has joined #tripleo14:09
*** cmyster has quit IRC14:09
*** cmyster has joined #tripleo14:09
*** lblanchard has joined #tripleo14:09
pradkgfidente, the *:8042 is what puppet configures the apache vhost config to be14:10
thrashtrown: maybe this? Missing package name for distro/element: centos7/ironic-agent14:10
pradkgfidente, so if we want that to be localhost bind only, then we'll have to update puppet-aodh to use the right template14:11
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Remove unused Neutron Agents container  https://review.openstack.org/28791814:11
openstackgerritRyan Hallisey proposed openstack-infra/tripleo-ci: Allow the continer job to run again  https://review.openstack.org/28891514:13
trownthrash: ya, I guess it is falling back to source because that package is missing? All of this could be avoided by starting with base images that have repos already setup14:14
openstackgerritRyan Hallisey proposed openstack-infra/tripleo-ci: Allow the continer job to run again  https://review.openstack.org/28891514:14
*** tzumainn has joined #tripleo14:16
*** Marga_ has joined #tripleo14:17
gfidentepradk, not localhost only, we need to be able to configure the binding address ... it depends on a few factors when doing network isolation14:19
*** absubram has joined #tripleo14:21
*** absubram_ has joined #tripleo14:23
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Add IPv6 versions of the Controller NIC configs  https://review.openstack.org/26988314:24
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Add IPv6 Support to Isolated Networks  https://review.openstack.org/28935514:24
*** jpeeler has joined #tripleo14:27
*** absubram has quit IRC14:27
*** absubram_ is now known as absubram14:27
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Ensure access to Redis is password protected  https://review.openstack.org/21040514:27
*** apetrich has quit IRC14:28
*** tiswanso has joined #tripleo14:29
openstackgerritMerged openstack/tripleo-heat-templates: Revert "Deploy Aodh services, replacing Ceilometer Alarm"  https://review.openstack.org/28871414:31
*** absubram has quit IRC14:31
*** jaosorior is now known as jaosorior_climbi14:32
gfidentepradk, ^^ so it was reverted, can you pick it up again so we can test it more?14:38
openstackgerritNisha Agarwal proposed openstack/diskimage-builder: Add psmisc to the packages for ironic-agent  https://review.openstack.org/28936414:40
*** apetrich has joined #tripleo14:41
*** tiswanso has quit IRC14:43
jprovazngfidente: so revert of the patch solved httpd issue, unfortunately I'm hitting again the issue from this morning - rabbit_hosts is unset both on controller and compute nodes14:44
gfidentejprovazn, which sounds like glance-api thing?14:45
jprovazngfidente: I'm hitting it with nova, but I also have the 2 patches you linked today, applied14:46
gfidenteno I mean sounds like could be due to that patch14:46
openstackgerritMerged openstack/tripleo-heat-templates: Make the Neutron subnet ipv6_{ra,address}_mode configurable  https://review.openstack.org/27120814:52
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Allow the containerized compute node to spawn larger VMs  https://review.openstack.org/28882214:54
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Remove unused Neutron Agents container  https://review.openstack.org/28791814:54
*** trozet has joined #tripleo14:55
*** pradk_ has joined #tripleo14:55
EmilienMhello, would it be possible to have a review on puppet-tripleo for Ipv6 dual stack ? https://review.openstack.org/#/c/286344/14:55
*** rpothier has joined #tripleo14:55
*** pradk has quit IRC14:56
*** ohamada has quit IRC14:56
*** pradk_ is now known as pradk14:56
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: Test Puppet Parser Future - Do not merge  https://review.openstack.org/28673214:56
*** dustins_ has joined #tripleo14:59
*** dustins_ is now known as dustins|away14:59
*** jprovazn has quit IRC15:00
EmilienMslagle or dprince : do we have a matrix / document that explains what our CI tests? AFAIK we did a lot of changes lately15:00
*** dustins has quit IRC15:01
slaglewe don't have a document15:02
dprinceEmilienM: I'm not sure we maintain an official matrix document. We could add one into tripleo-ci if it helps15:02
EmilienMno problem, I was just asking15:03
EmilienMdprince: so now we have netiso for ceph and ha jobs: what plans we have for ipv6?15:03
*** ayoung has joined #tripleo15:03
EmilienMI like slagle's proposal on the ML about reducing the jobs in check queue15:04
dprinceEmilienM: I would like to work on that w/ gfidente15:04
dprinceEmilienM: I replied to your thread. I'm not a fan of dropping nonha at this time15:04
EmilienMdprince: sure, it was just an idea but sounds like a bad one15:04
dprinceEmilienM: IMO there are good reasons to have it. And also it has proven to be more stable than the HA job15:04
dprinceEmilienM: IPV6 will come shortly. We've got the architecture sorted out for that so that once IPv6 works adding it into CI should be easy15:05
EmilienMdprince: slagle also kind of agreed with me on non-ha job15:05
dprinceEmilienM: The #1 way to help at this point with CI is getting CI on upgrades15:06
dprinceEmilienM: that will unblock many features that will allow us to improve other various areas in CI15:07
EmilienMI'm afraid upgrade testing will consume lot of resources15:07
EmilienMbut I agree it's very important15:07
dprinceEmilienM: which is why the periodic image build caching work derekh is pursuing is of interest15:08
EmilienMdprince: do we have plans to increase overcloud RAM? 4GB sounds like causing problems15:09
trowndprince: maybe it would change if we did not have a job for it, but I have never seen the nonHA job fail legitimately when the HA job passed15:10
EmilienMshardy: if I plan to patch split stack, should I use split_stack2 Gerrit topic?15:10
dprinceEmilienM: we can look into it. We would need to reallocate things for sure to accommodate it15:10
shardyEmilienM: sure, that was the series which I was planning to do the t-h-t refactoring on15:11
dprincetrown: I'd still want a separate HA job to ensure single node HA works anyways15:12
*** cmyster has quit IRC15:14
trowndprince: right, all I am pointing out is that if the failure rate module HA passing is near zero, we could do that in a periodic job15:14
trowndprince: I think that is what slagle was getting at15:14
dprinceEmilienM, trown, slagle: I get the interest in consolidating our CI. But I gotta say... as one who helps deploy our baremetal resources I'm not in favor of complicating the TripleO CI rack w/ pacemaker15:14
trowns/module/modulo/15:14
*** cmyster has joined #tripleo15:14
*** cmyster has quit IRC15:14
*** cmyster has joined #tripleo15:14
trowndprince: isnt the TripleO CI rack using Icehouse?15:15
trownor maybe Juno?15:15
slagledprince: i didnt know the tripleoci rack was using updated templates to be honest15:15
shardydprince: Hey, when you get a sec I wanted to clarify your plan ref https://review.openstack.org/#/c/288626/1/scripts/tripleo.sh15:15
slagleif it is, that's certainly a good reason to not drop CI coverage of single node15:16
dprincetrown, slagle: it is old. I'm talking about when we rebuild it15:16
slaglestill, we'd have a periodic job15:16
trownya, I think periodic jobs can tell us if we need check jobs15:16
dprincetrown: I'd still like more than one job running that accurately represents the developer case. Most developers can't use 3 controller nodes all the time.15:17
trownie, single HA check job, if periodic jobs start failing based on bad merges that passed HA check, we can revisit15:17
dprincetrown: single node HA (a separate job for that) would be a required case too I think regardless15:17
dprinceI don't think just periodic is enough for that15:17
shardydprince: I was initially looking at changing the puppet-modules element, so it'd support setting a global reporef, e.g to stable/liberty or whatever15:17
shardyhttps://github.com/openstack/tripleo-puppet-elements/blob/master/elements/puppet-modules/environment.d/01-puppet-modules-install-types.sh15:17
shardybut that doesn't work because we mix up puppetlabs and openstack modules there15:18
shardyare you proposing I break that element into two?15:18
shardyOr can we just tag another element on which overrides the branches pulled by the main element?15:19
dprinceshardy: we could do that if it makes sense. I'm really open to whatever here and I'm not actually meaning to block your tripleo-ci change. Just asking the question: what if we did this via elements instead?15:19
dprinceshardy: I suppose I was thinking of adding another element which overrides the specific values15:20
dprinceshardy: that was my initial thought anyways (your last response)15:20
openstackgerritRyan Hallisey proposed openstack/tripleo-docs: Document using node capabilities to control placement  https://review.openstack.org/27421715:20
shardydprince: Ok, I can definitely give that a try15:20
rhalliseyshardy, ^ updated your doc and added to it15:20
shardydprince: for CI, all we actually need is to force building from source, because all the repos are there and reset to stable/liberty15:20
shardyso this is only for developer convenience really15:21
*** olap has quit IRC15:21
shardyrhallisey: thanks!15:21
dprinceshardy: cool. For developers I think using this element would work well15:22
dprinceshardy: also, for developers we might go a totally different approach and have a Puppetfile (or shell script, etc.) to help stage all the liberty modules. And then use the artifact deployment to dynamically deploy them into the overcloud15:24
dprinceshardy: now that we have puppet modules via swift, I'd like to consider the option to move away from using DIB as a means to deploy the puppet modules15:24
shardydprince: Yup, we could definitely look at that too, this was really just a first step to move us away from opm for liberty - I'm trying the element approach now, thanks!15:30
trownshardy: dprince, doesnt the whole path of puppet modules from source go away once we have a package per puppet module?15:34
trowndprince: shardy at that point arent the puppet packages just another package?15:35
dprincetrown: no, I don't think it does. Turns out packages aren't how most developers (and some operators) like to consume puppet15:36
dprincetrown: we can support both I think15:36
openstackgerritJavier Peña proposed openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/28706815:37
trowndprince: if the delorean puppet packages are just packaged source with no patches... does that change?15:37
trowndprince: I get not wanting to consume a monolithic package of puppet modules with no clear origin story for the individual modules, but it seems like we want to get rid of the special case of puppet modules15:38
dprincetrown: not for me :)15:38
trowndprince: using them from source leaves it as a special case15:39
shardytrown: that will certainly help, but I think ref the deploy artefacts thing, some folks would still prefer a local tree vs packages15:39
*** dustins|away is now known as dustins15:39
dprincetrown: forcing packages for all puppet users in TripleO IMO is a step sideways. Not everyone wants it15:39
dprincetrown: we support both. Lets just go with that15:40
shardydprince: not everyone wants to deploy from source either tho, e.g in the case where folks consume tripleo and the puppet modules from a distributor15:40
shardy+1 :)15:40
trowndprince: ok, I am thinking alot about unifying the CI between RDO and TripleO, and that is where I am coming from with this15:40
shardyFWIW I think anything which reduces our reliance on elements is a good thing15:41
trownshardy: +115:41
dprinceshardy: right, for CI I'd rather see us use swift deployment to do that15:41
derekhdprince: wouldn't it be nice though if we could be more consistent with what we are deploying with, i.e. have the puppet modules and core packages come from the same pinned repository?15:41
trownelements are very opaque15:41
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Test overcloud SSL  https://review.openstack.org/28198815:41
dprincederekh: yes, we can do that with swift modules deployment though right?15:42
shardydprince: if everything comes from a yum repo, why do we need the swift transport, vs building the packages into the image, or updating them via yum pointing at the repo?15:42
shardyI get it for the from-source case, but I'm not clear on the advantage in the packaged case15:43
openstackgerritMiles Gould proposed openstack/python-tripleoclient: Use Ironic API v1.11 to support ENROLL state  https://review.openstack.org/27220615:43
dprinceshardy, derekh: okay, sure. I'm fine with that angle too15:43
dprinceshardy, derekh: I don't think that is very helpful for people who will actually develop t-h-t -> puppet15:43
dprinceit is a lot of work to build packages for each little change15:43
dprinceagain, I would like CI to represent both cases I think15:44
derekhdprince: yes, we could I suppose (haven't tried it), what I mean is I'd like to see us approach the situation where everything is being deployed from a trunk repository and the only thing we use from git is the projects we are actually testing15:44
shardydprince: sure, for developers the swift from-source workflow makes a lot of sense15:44
shardyalthough running a local delorean repo with source isn't that hard, it is added complexity I guess15:44
trowndprince: the nice thing with that approach, is that the 'current-tripleo' or hopefully just the 'current-passed-ci' repo would be totally fixed. currently puppet changes could break that repo since they are external15:44
dprincederekh: I had envisioned installing puppet modules (from packages) on the undercloud. And then deploying those via t-h-t via Swift to the overcloud.15:44
shardydprince: for non-developer usage, that has the disadvantage that you lose versioning of the deployed modules on the overcloud nodes15:45
shardywhich is probably difficult from a support standpoint when something breaks ;)15:45
dprinceshardy: we can test both I think. Again, I think most puppet users would use their own Puppetfile (or equivalent) rather than packages.15:46
shardyyup, supporting both sounds like a good plan15:46
derekhdprince: I'd be fine with that, I just want us to stop looking at git repositories for our default deployment, so long as they originate from a yum repository that would be fine15:46
trownI think we could create tooling (in tripleo-quickstart or elsewhere) that makes delorean usage really simple15:46
dprinceshardy: that is my take anyways. I think the distro angle is fine, I just think it is perhaps too much of a RedHat'ism to state it is the most common/useful case. There are advanced operators who won't like puppet being packaged at all15:47
trowntripleo.sh already does IMHO15:47
*** dprince has quit IRC15:49
*** jprovazn has joined #tripleo15:49
openstackgerritMerged openstack/tripleo-heat-templates: Use MysqlVirtualIPUri for nova_api and sahara database  https://review.openstack.org/28881315:51
*** ohamada has joined #tripleo15:54
openstackgerritJavier Peña proposed openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/28706815:54
openstackgerritMerged openstack-infra/tripleo-ci: Collect status of all nested stacks in resource-list  https://review.openstack.org/28606215:58
*** oshvartz has quit IRC16:00
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137616:00
*** lazy_prince has joined #tripleo16:01
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612416:01
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Make the Neutron subnet ipv6_{ra,address}_mode configurable  https://review.openstack.org/28941716:01
openstackgerritMerged openstack/tripleo-heat-templates: Allow to enable IPv6 on Corosync  https://review.openstack.org/26707316:02
*** liverpooler has quit IRC16:03
*** Goneri has quit IRC16:06
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Allow to enable IPv6 on Corosync  https://review.openstack.org/28942216:08
jprovazngfidente, what is your heat version?16:11
dtantsurlucasagomes, could you please take one more look at https://review.openstack.org/288417 ?16:12
lucasagomesdtantsur, hi there, sure16:12
* lucasagomes looks16:12
jprovaznshardy, do you know if openstack-heat-engine-5.0.1-3.el7ost.noarch is sufficient version for running current master tripleo-heat-templates?16:14
shardyjprovazn: not sure tbh, but you will require this fix:16:15
shardyhttps://review.openstack.org/#/q/Ib934f443a8b8e4f75335a9d8b992e7f86791aa45,n,z16:15
jprovaznshardy, I have that one applied, I wonder how it could be that rabbit_hosts is empty both on my compute and controller nodes16:17
jprovaznand I noticed a new syntax in tht - nova::rabbit_hosts: *rabbit_nodes_array16:17
jprovaznat least I was not familiar with the asterisk reference16:17
shardyjprovazn: that's a yaml alias, not something specific to heat16:18
jprovaznah16:18
openstackgerritJohn Trowbridge proposed openstack/instack-undercloud: Deploy Monitoring on the undercloud with Puppet  https://review.openstack.org/28942716:18
shadowershardy: So during stack-delete a controller has a call to {get_attr: [TenantPort, ip_subnet]} which ends up going through:  https://github.com/openstack/tripleo-heat-templates/blob/master/network/ports/tenant.yaml#L56 and that results to a string split on a None which in turn results in this exception: http://paste.openstack.org/show/489561/16:18
*** Goneri has joined #tripleo16:19
shadowershardy: regarding my overcloud delete woes16:19
shardyshadower: Hmm, why is it splitting on None at delete time?16:20
openstackgerritJohn Trowbridge proposed openstack/instack-undercloud: Deploy Monitoring on the undercloud with Puppet  https://review.openstack.org/27612716:20
shadowershardy: yeah, that I don't know16:20
shadowercould it be it's running some sort of validation?16:20
shardyshadower: it may still be a heat bug tho, if we created it, we should delete it16:21
shadoweryeah16:21
jprovazngfidente, ha, so the reason is that rabbit_node_ips parameter passed to the heat nested stack puppet/all-nodes-config.yaml is already empty16:21
shardyshadower: can you raise a heat bug please - I'm not sure why we're doing the enforce_stack thing on delete16:22
*** dprince has joined #tripleo16:22
shadowershardy: will do16:22
shardyneed to dig deeper, but it seems like we shouldn't be doing that preview_resources on delete16:22
shadowershardy: yeah I was surprised to see it there but it's been a while I hacked on heat16:23
lucasagomesdtantsur, left a comment re overwrite16:23
lucasagomeslemme know what you think16:23
openstackgerritMerged openstack/tripleo-heat-templates: Fix rabbit_hosts list for glance-api for IPv6  https://review.openstack.org/28882616:24
dtantsurlucasagomes, actually I was planning this to be a normal situation (see the comment)16:26
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Fix rabbit_hosts list for glance-api for IPv6  https://review.openstack.org/28943216:26
openstackgerritJohn Trowbridge proposed openstack/instack-undercloud: Deploy Logging on the undercloud with Puppet  https://review.openstack.org/25865316:29
lucasagomesdtantsur, will take a look16:29
*** jaosorior_climbi is now known as jaosorior16:29
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Deploy Aodh services, replacing Ceilometer Alarm  https://review.openstack.org/28943516:30
openstackgerritMerged openstack/instack-undercloud: Enable notifications on undercloud  https://review.openstack.org/24134116:31
jaosoriorbnemec: well, this did work with the upgrade https://review.openstack.org/#/c/289221/ only thing is that when upgrading, this needs to be taken into consideration to set some parameters for the overcloud (if needed)16:31
gfidentebnemec, ack on https://review.openstack.org/21040516:32
jaosoriorbnemec: Or at least it did work when going from a deployment with the old range, to do an $ openstack undercloud install that has the new range16:32
gfidentemarios, jistr ^^ do you guys have any way to check https://review.openstack.org/210405 ?16:32
*** absubram has joined #tripleo16:33
gfidenteI think now that we do pcmk restart it should be okay but ...16:33
*** aufi has quit IRC16:33
lucasagomesdtantsur, re-commented... I was just thinking about someone reconfiguring the node16:34
bnemecjaosorior: It can't work with upgrades when you have a deployed overcloud.  When you re-ran the undercloud install it would try to re-create the subnet and fail because ports are still allocated.  I think anyway.16:34
lucasagomesdtantsur, in case a new disk is added so he wants to refresh the root device hints or situations like that16:34
*** mkovacik has quit IRC16:35
bnemecgfidente: That's fine, I just want someone to test that it doesn't break upgrades before we merge it.16:35
dtantsurlucasagomes, the last thing we want to do is to silently change the root device if a new disk is added, to be honest16:35
gfidentebnemec, agreed16:35
jaosoriorbnemec: well, crap... isn't there a way to get the undercloud to re-allocate those addresses?16:35
mariosgfidente: i just tried to apply it to my 7.1 env but has merge conflict and really can't handle vimdiff right now16:36
bnemecjaosorior: Not that I know of.16:36
lucasagomesdtantsur, it will only change if it's requested right?16:36
lucasagomesif one does not pass the --root-device-hints we won't touch it16:36
jaosoriorbnemec: Do you have any other ideas? The choosing of those addresses was quite unfortunate :/16:37
lucasagomesor will we?16:37
gfidentemarios, to 7.1?16:37
gfidentemarios, shouldn't it go into 8 only?16:37
mariosgfidente: in which case i can't help you my envs are all 7.x16:38
bnemecjaosorior: I don't disagree.  I also hate that the provisioning network is called ctlplane, but it's not trivial to change that either. :-/16:38
jaosoriorbummer :/16:38
dtantsurlucasagomes, we won't. changing the root device on rebuild without cleaning is a recipe for problems16:38
jaosoriorwell, lets talk about the ip addresses tomorrow in the meeting as you suggested16:38
mariosgfidente: (well 7.3 which is being upgraded to 8 but can't use that /is in progress)16:38
*** xinwu has joined #tripleo16:39
bnemecjaosorior: Yeah, somebody else may know something we don't. :-)16:39
*** yamahata has quit IRC16:39
gfidentemarios, so I think the problem is understanding if landing that change in liberty makes an update from kilo to succeed16:40
thrashbnemec: I want to say that was fixed...16:40
lucasagomesdtantsur, right, maybe I'm out of context where that function will be called16:41
lucasagomescause I thought it was the operator manually calling it to update the nodes16:41
dtantsurlucasagomes, unfortunately I can't show you the bug, it's super-private.. well, yes. but still it should not overwrite anything without a warning16:42
dtantsurotherwise we won't be able to fine-tune it AND it will change the root device in cases like adding a new disk16:42
bnemecthrash: It's fixed in the case where the cidrs match.  If we change the default they wouldn't.  You can't change your provisioning IP range with an overcloud deployed (AFAIK).16:42
dtantsur(which IMO we should not do)16:42
lucasagomesdtantsur, yeah, exactly. I'm not saying we should overwrite it, quite the opposite we should fail saying "the root device hints is already set, use --overwrite if you mean to overwrite it"16:43
dtantsurlucasagomes, if we fail, how do we fine-tune it?16:43
dtantsuri.e. I want another root device only for one nodes16:43
lucasagomesbecause if the user request B but A is set and we return success16:43
dtantsur* node16:43
lucasagomesthat's the bad form16:43
dtantsurno. that's how it should work16:43
lucasagomeswhy?16:44
dtantsurlucasagomes, otherwise how do you fine-tune it? or how do you rerun this command btw?16:44
lucasagomesI'm requesting something, this returns success saying it was applied but it wasn't16:44
dtantsurlucasagomes, well, we document it does not override the existing things..16:44
dtantsurwhich is the same e.g. for "overcloud image upload"16:44
*** leanderthal is now known as leanderthal|afk16:45
thrashbnemec: i haven't looked at it in a couple of weeks, but I think you're correct now that you put it like that. :)16:45
*** david_lyle has quit IRC16:46
dtantsurlucasagomes, in other words: we're changing *the default* behavior. it can still be overwritten16:46
lucasagomesdtantsur, sounds a bit odd to me... I'm not familiar with "overcloud image upload" but if I built a new image and call this command again pointing to it I would expect it to upload the new image for me16:46
dtantsurlucasagomes, it doesn't :)16:46
lucasagomesnot just say "success" and the old image is uploaded16:46
lucasagomesso that sucks16:46
lucasagomesit's very misleading16:46
dtantsurbut ok, nevermind about image upload16:46
*** david_lyle has joined #tripleo16:46
dtantsurlucasagomes, another example: IPA does not fail when root device hints are already set, right? :) it just uses them. the same here: we define the default, but users can explicitly override them by setting root device hints16:47
lucasagomesdtantsur, right... maybe I don't have the full picture here16:47
lucasagomesdtantsur, I get it, but disks fail16:47
lucasagomessay the disk you were using died... now you replaced it16:48
lucasagomesso you run the command again and it says "success", but it's still pointing to the old disk, what happens?16:48
lucasagomesI think it should fail at least saying "look, device hints are already set, do you mean to overwrite them? if so, use --overwrite"16:49
openstackgerritBrad P. Crochet proposed openstack/tripleo-common: Build image files from definitions in yaml  https://review.openstack.org/23556916:49
dtantsurlucasagomes, then how do we use it with existing root device hints?16:49
*** adarazs has quit IRC16:50
lucasagomesdtantsur, maybe I'm missing how this function will be called. Do we expect it to be called over and over without failing?16:51
*** mikelk has quit IRC16:51
*** dmacpher is now known as dmacpher-afk16:51
dtantsurlucasagomes, well, yes. I think so. we may issue a warning on existing hints, this is a sane idea. but it should be callable. e.g. if you add 2 nodes, you should be able to call it.16:52
*** Marga_ has quit IRC16:53
lucasagomesdtantsur, so it's not a per node thing? when you call it it goes and fills out all nodes?16:53
openstackgerritAttila Darazs proposed openstack-infra/tripleo-ci: WIP: add IPv6 gate job  https://review.openstack.org/28944516:53
dtantsurlucasagomes, yep (like all commands in tripleo) (and it's not something I really like)16:54
lucasagomesdtantsur, gotcha16:55
lucasagomesx.x16:55
lucasagomesdtantsur, ok... well yeah warning then to keep it consistent16:55
dtantsuryeah, not so obvious :(16:55
dtantsuryep. so that people clearly know when nodes are NOT updated16:55
lucasagomesbut urgh, it's very ugly16:55
lucasagomesthese "bulk" things are not ideal16:55
lucasagomesIMHO16:55
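The behavior agreed on above (set the default on every node, but warn and skip nodes that already carry hints) could be sketched roughly as follows. This is only an illustration: nodes are modeled as plain dicts, and `apply_default_root_device` and its `overwrite` flag are hypothetical names, not the python-tripleoclient or Ironic API.

```python
import logging

LOG = logging.getLogger("root_device_sketch")


def apply_default_root_device(nodes, default_hints, overwrite=False):
    """Set default_hints on every node; warn about and skip nodes with hints.

    nodes is a list of plain dicts (a hypothetical stand-in for Ironic nodes).
    Returns (updated_uuids, skipped_uuids) so callers can report both sets.
    """
    updated, skipped = [], []
    for node in nodes:
        properties = node.setdefault("properties", {})
        existing = properties.get("root_device")
        if existing and not overwrite:
            # Do not fail the bulk call; make it obvious the node was NOT updated.
            LOG.warning("Node %s already has root device hints %s; not updated",
                        node["uuid"], existing)
            skipped.append(node["uuid"])
            continue
        properties["root_device"] = dict(default_hints)
        updated.append(node["uuid"])
    return updated, skipped
```

Returning the skipped nodes alongside the updated ones keeps the command re-runnable after adding nodes, while still telling the operator which nodes kept their old hints.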
dtantsurlucasagomes, ideally we should allow changing the default logic in IPA.. nice feature to consider for newton16:56
lucasagomesdtantsur, yeah, I wish we had operators for root device hints too (less than and so on)16:56
dtantsurlucasagomes, or extend our root device hints to support things like "largest"... actually we already have "name", but it's mitaka only and does not allow several options16:56
lucasagomesfor size that would be great16:56
dtantsuryep16:56
lucasagomesdtantsur, yeah indeed16:57
dtantsurso yeah "in" operator would be of great value16:57
lucasagomesI will try to add a quick RFE for it16:57
lucasagomessee if we can do w/o the need of a spec16:57
lucasagomesshouldn't be hard16:57
lucasagomesdtantsur, yeah16:57
dtantsurlike "root_device": {"name": ["sda", "hda"]}16:57
lucasagomesyeah, it should be easily done since it's all python now16:59
* lucasagomes starts a small RFE about it16:59
*** mgrohar has joined #tripleo16:59
dtantsurlucasagomes, and then "root_device": {"strategy": "largest"}16:59
dtantsurthat would be much better16:59
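The two ideas floated above, a list-valued hint acting as an "in" operator and a "strategy" key such as "largest", might look something like this. It is a sketch of the proposal only, not how Ironic or IPA match hints; the disk dicts, the `size_gb` field, and `pick_root_device` are made up for illustration.

```python
def pick_root_device(disks, hints):
    """Pick a disk from a list of dicts using proposed-style hints.

    A list value means "any of these" (the "in" operator); the hypothetical
    "strategy" key chooses among the remaining candidates.
    """
    strategy = hints.get("strategy")
    if strategy not in (None, "largest"):
        raise ValueError("unknown strategy: %r" % (strategy,))

    candidates = list(disks)
    for key, wanted in hints.items():
        if key == "strategy":
            continue
        allowed = wanted if isinstance(wanted, (list, tuple)) else [wanted]
        candidates = [d for d in candidates if d.get(key) in allowed]

    if not candidates:
        return None
    if strategy == "largest":
        return max(candidates, key=lambda d: d["size_gb"])
    # Without a strategy, fall back to the first match in discovery order.
    return candidates[0]
```

With `{"name": ["sda", "hda"], "strategy": "largest"}` the name filter is applied first and the strategy then picks the biggest of the surviving disks.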
*** dcain has joined #tripleo16:59
dtantsurbut for liberty + mitaka we have to have something......16:59
lucasagomes++17:00
*** jaosorior has quit IRC17:00
Erming__ xxxx has no server available!17:00
dtantsurlucasagomes, so yeah, lets try to land it at least for newton17:00
*** jaosorior has joined #tripleo17:01
lucasagomesdtantsur, yeah, I will try to come up with something about it after the meeting (or tomorrow morning)17:01
dtantsurawesome, thnx17:02
shadowershardy: https://bugs.launchpad.net/heat/+bug/155412417:03
openstackLaunchpad bug 1554124 in heat "Deleting a stack fails during preview_resources" [Undecided,New]17:03
*** ifarkas has quit IRC17:03
shadowerit's one of the worst bug reports I've filed, but I didn't get to dig deep enough yet :-(17:03
*** mkovacik has joined #tripleo17:04
Erming__trown: After manually starting mysqld, I tried to restart pacemaker to restart all the services, however, it reports:  proxy xxxx has no server available.17:04
Erming__trown: for example:  haproxy[1896]: proxy keystone_admin has no server available!17:04
*** yamahata has joined #tripleo17:05
Erming__trown: same for all services except the swift ones. what should I do to start them normally?17:05
*** trown is now known as trown|lunch17:05
*** absubram has quit IRC17:06
*** mbound has quit IRC17:09
*** cwolferh has joined #tripleo17:10
bnemecshadower: I believe that's already fixed on Heat master.  It's just a problem on the pinned tripleo repo.17:13
bnemecAt least moving to master Heat got me past the problem.17:13
*** dustins has quit IRC17:14
derekhbnemec: shadower FYI if it helps, we've moved the pin on saturday17:14
*** lazy_prince has quit IRC17:15
shadowerbnemec, derekh: ah, okay. My undercloud comes from Friday17:16
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Add glance api & registry classes  https://review.openstack.org/28945917:16
bnemecYeah, it was definitely still broken Friday.17:16
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Set /64 cidr_netmask for pcmk VIPs when IPv6  https://review.openstack.org/28946117:17
openstackgerritMerged openstack/tripleo-heat-templates: Set /64 cidr_netmask for pcmk VIPs when IPv6  https://review.openstack.org/26764717:19
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: deploy glance using split roles  https://review.openstack.org/28946617:22
EmilienMdprince, shardy: this is a first attempt to split Glance stack https://review.openstack.org/#/q/topic:glance_splitstack17:23
*** Marga_ has joined #tripleo17:24
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Add glance api & registry classes  https://review.openstack.org/28945917:24
*** dustins has joined #tripleo17:25
*** absubram has joined #tripleo17:29
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: deploy glance using split roles  https://review.openstack.org/28946617:30
*** ohamada has quit IRC17:32
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: deploy glance using split roles  https://review.openstack.org/28946617:32
*** trown|lunch is now known as trown17:33
openstackgerritDmitry Tantsur proposed openstack/instack-undercloud: Add an option to enable cleaning  https://review.openstack.org/28947317:34
*** Marga_ has quit IRC17:35
*** chem has quit IRC17:36
*** mgrohar has quit IRC17:39
*** panda has quit IRC17:40
*** panda has joined #tripleo17:40
*** jaosorior has quit IRC17:41
*** jaosorior has joined #tripleo17:42
*** cmyster has quit IRC17:45
*** jaosorior has quit IRC17:46
*** fgimenez has quit IRC17:48
*** Marga_ has joined #tripleo17:48
openstackgerritBrad P. Crochet proposed openstack/tripleo-common: Build image files from definitions in yaml  https://review.openstack.org/23556917:50
*** Marga_ has quit IRC17:51
openstackgerritBrad P. Crochet proposed openstack/tripleo-common: Build image files from definitions in yaml  https://review.openstack.org/23556917:51
*** Marga_ has joined #tripleo17:51
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Deploy Aodh services, replacing Ceilometer Alarm  https://review.openstack.org/28943517:54
*** lucasagomes is now known as lucas-dinner17:56
*** adarazs has joined #tripleo18:01
slagledprince: fyi, this https://review.openstack.org/#/c/289085/ and the 2 depends-on should fix the ValidationDeployment problems18:03
openstackgerritGiulio Fidente proposed openstack/tripleo-common: Change the private subnet of the overcloud tenant network  https://review.openstack.org/28948918:04
gfidenteslagle, ^^ I think I nailed the subnet collision18:04
slaglegfidente: ok. any idea how it's been passing on master though, and not stable/liberty?18:05
*** ayoung has quit IRC18:06
gfidenteslagle, exactly what I was thinking and no, I don't18:06
slaglemaybe neutron in master allows subnet overlaps18:06
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Test overcloud SSL  https://review.openstack.org/28198818:08
*** tremble has quit IRC18:09
*** dtantsur is now known as dtantsur|afk18:11
openstackgerritGiulio Fidente proposed openstack-infra/tripleo-ci: Don't override the private_net settings for the tenant  https://review.openstack.org/28949518:11
*** yamahata has quit IRC18:11
gfidenteslagle, though ^^ was supposed to be customizing that already!18:11
openstackgerritDmitry Tantsur proposed openstack/python-tripleoclient: Allow 'openstack baremetal configure boot' to guess the root device  https://review.openstack.org/28841718:12
openstackgerritBrad P. Crochet proposed openstack/python-tripleoclient: Add 'undercloud upgrade' command  https://review.openstack.org/28949818:12
gfidenteslagle, oh tripleo-common has stable/liberty ...18:15
gfidenteslagle, so given in stable/liberty we were creating 10.0.0.0/8 I think the heat client in liberty just wasn't doing well with -P parameters18:18
gfidenteslagle, they weren't being effectively overridden causing the overlap to happen only in liberty18:18
gfidenteslagle, makes sense?18:18
gfidenteslagle, in that case it's https://review.openstack.org/#/c/289495/ which together with the depends-on should fix liberty18:19
*** dustins has quit IRC18:19
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137618:19
openstackgerritGiulio Fidente proposed openstack-infra/tripleo-ci: Don't override the private_net settings for the tenant  https://review.openstack.org/28949518:20
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612418:22
*** dustins has joined #tripleo18:22
*** mgould has quit IRC18:22
*** Marga_ has quit IRC18:23
*** xinwu has quit IRC18:24
slaglegfidente: k, makes sense18:24
*** shivrao has joined #tripleo18:27
*** akrivoka has quit IRC18:31
*** slagle changes topic to "TripleO | stable/liberty CI failing: https://bugs.launchpad.net/tripleo/+bug/1554169 | CI status: http://tripleo.org/cistatus.html | Docs: http://tripleo.org/"18:33
Erming__larsks: I noticed your article on deploying HA openstack, and you also use the undercloud.qcow2. Is that the same as Trown's image? Assuming you know Trown already. :-)18:37
trownErming__: ya larsks has been making a ton of contributions to tripleo-quickstart18:38
larskstrown: beat me to the reply... :)18:38
Erming__trown: thanks18:38
larsksI was just using tripleo-quickstart, which pulls images from centosci infra.18:38
Erming__larsks: what's the difference between your two solutions?18:39
trownErming__: just got to your question above... no clue, probably need someone with a better clue about pacemaker to answer18:39
larsksErming__: there is no difference.18:39
Erming__larsks: thanks18:39
larsksErming__: I was using trown's solution...18:39
Erming__larsks: I got proxy xxxx has no server available! issue18:39
trownErming__: ya the blog details different (better) usage instructions for tripleo-quickstart, but it is the same ansible bits being used in both cases18:40
larsksThat generally means that one of the services fronted by haproxy is not running.18:40
larsksFigure out which one, check the logs...18:40
larsks...lather, rinse, repeat.18:40
Erming__larsks: after deploying the overcloud trown's way, all services are like this. so I guess it's an haproxy issue?18:40
larsksProbably *not* an haproxy issue, actually.18:41
Erming__larsks: haproxy[1896]: proxy keystone_admin has no server available!18:41
*** jistr has quit IRC18:41
*** dustins has quit IRC18:41
larsksErming__: Okay.  So you need to (a) verify that keystone is not, in fact, running, and (b) look at the keystone server logs to see if there is a clue there as to the problem.18:42
Erming__Yes. I did. I found this: Lost connection to MySQL server during query'18:42
trownErming__: are you using the recent (from Friday) mitaka undercloud.qcow2?18:42
Erming__and then I restarted mysqld, it's running now18:42
trownErming__: ah... if mysqld was stopped, you probably have OOM issues18:43
Erming__trown: no I deployed a few weeks ago18:43
Erming__trown: really? good point.18:43
Erming__trown: but i have 128GB on the host18:43
Erming__memory18:43
trownErming__: oh, as in this environment has been running for 2 weeks then stopped working?18:43
Erming__trown: yes18:44
trownErming__: cool, that is the longest running tripleo-quickstart environment I have encountered :)18:44
*** rdopiera has quit IRC18:44
trownErming__: I would check /var/log/messages on overcloud nodes for OOM kill messages18:45
Erming__trown: sorry, a bit of confusion. I meant I deployed a few weeks ago, but didn't get time to check it carefully. later I found mysqld and almost all cloud services not running18:45
Erming__on the host?18:45
Erming__sorry18:46
Erming__overcloud18:46
trownErming__: ya18:46
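As a rough aid for the check suggested above, a few lines of Python can pull OOM-killer victims out of a syslog-style file such as /var/log/messages. The pattern below is an assumption based on common kernel wording ("Out of memory: Kill process ..." or "Killed process ..."); the exact text varies between kernel versions, so treat it as a starting point rather than a complete detector. `find_oom_kills` is a hypothetical helper name.

```python
import re

# Assumed message shape; the kernel's wording differs between versions.
OOM_PATTERN = re.compile(r"Out of memory: Kill(?:ed)? process (\d+) \(([^)]+)\)")


def find_oom_kills(lines):
    """Return (pid, process_name) for each OOM kill found in the given lines."""
    kills = []
    for line in lines:
        match = OOM_PATTERN.search(line)
        if match:
            kills.append((int(match.group(1)), match.group(2)))
    return kills


if __name__ == "__main__":
    # Path taken from the discussion; reading it usually needs root on the node.
    with open("/var/log/messages", errors="replace") as log_file:
        for pid, name in find_oom_kills(log_file):
            print("OOM kill: pid=%d process=%s" % (pid, name))
```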
trownErming__: how much RAM did you give the overcloud nodes? by default it is pretty low, but on a 128GB virthost you could give them a lot18:47
Erming__should I check instackenv.json?18:47
trownthat would work18:47
Erming__4096, I kept your instackenv.json unchanged18:47
trownok that is the problem then18:48
Erming__why?18:48
trownthe defaults are really meant for CI/dev env use case18:48
trownwhich is 32G hosts mostly18:48
Erming__32G?18:48
trownand the envs dont need to stay up for 2 weeks :)18:48
trownin your case, you would want to give the overcloud nodes more RAM18:49
trown32G RAM18:49
Erming__how much for each ?18:49
Erming__32GB for each controller?18:49
trowndo you use that host for anything else?18:49
Erming__basically no18:49
*** Marga_ has joined #tripleo18:50
trownI would do 16GB undercloud, 8GB controllers18:50
*** jprovazn is now known as jprovazn_bbl18:50
trownthat leaves plenty for compute or ceph nodes18:51
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Create new partitioning-sfdisk element.  https://review.openstack.org/25988118:51
Erming__do I have a place to set RAM for undercloud?18:52
Erming__undercloud.conf?18:52
trownErming__: it would be on deploy of tripleo-quickstart... you could manually change up libvirt, but I would recommend redeploy18:52
Erming__trown: in redeploy, how can I set that in your script18:53
trownI plan to make more "scenario" example settings files, but https://github.com/redhat-openstack/tripleo-quickstart/blob/master/playbooks/centosci/centosci_minimal_nodes.yml shows everything you would want to change18:53
trownErming__: as far as how to use it, I would follow larsks blog post to use ansible-playbook directly instead of quickstart.sh18:54
*** dustins has joined #tripleo18:54
trownErming__: quickstart.sh is really a quick try it out script, I always just use ansible-playbook directly18:55
Erming__trown: I will try to find the place to change that. thanks. what else do you think I need to change in your instackenv file?18:55
trownErming__: you would not need to change the instackenv file if you redeploy, it will get created from the settings18:55
Erming__trown: makes sense.18:55
Erming__trown: for baremetal, I don't see a place in lars's instruction, how to set up mac address for IPMI.18:56
*** tosky has quit IRC18:56
Erming__I just took a glance18:56
trownErming__: you have baremetal too?18:56
openstackgerritBen Nemec proposed openstack/instack-undercloud: Enable notifications on undercloud  https://review.openstack.org/28951818:56
Erming__trown: I am going to have.18:57
Erming__trown: that's the target18:57
trownErming__: ah ok, that is bleeding edge for tripleo-quickstart... as in I have not even personally tried it :)18:57
trownyou will want to manually create the instackenv.json for that18:57
Erming__trown: why? in the rdo-manager, baremetal is the other option18:57
trownErming__: well, I dont see why it wouldnt work, but I have not pursued that use case18:59
trownmostly due to lack of hardware18:59
Erming__trown: I will create my own instackenv.json for sure. So all in all, you think the virtual machine case is basically OOM issue? In default it should work as I don't have anything customized.19:00
Erming__trown: I have hardware for pilot deployment, maybe we could work together if you want :-)19:01
trownErming__: ya, 4GB overcloud nodes is barely enough to pass CI, I am sure they would have memory problems if left running for 2 weeks19:01
trownErming__: I am up for collaborating on that19:01
*** pradk_ has joined #tripleo19:01
Erming__trown: I can create account(s) for you to check with me. I have 3 head nodes and a bunch of compute nodes.19:01
Erming__each head node has 128GB mem. Just a bit old. (IBM x3650)19:02
trowncool19:03
*** pradk has quit IRC19:03
Erming__trown: Awesome. I will show you more details about accessing the nodes later afternoon.  (going to play badminton now in lunch time) Thanks!!!!19:03
trownnice, have fun!19:04
*** tzumainn has quit IRC19:07
*** pradk has joined #tripleo19:10
*** pradk_ has quit IRC19:11
*** cwolferh has quit IRC19:16
*** cwolferh has joined #tripleo19:17
*** shardy_ has joined #tripleo19:18
*** yamahata has joined #tripleo19:18
*** shardy has quit IRC19:18
*** tzumainn has joined #tripleo19:19
dprinceslagle: thanks for these validation deployment fixes19:19
*** jprovazn_bbl has quit IRC19:24
openstackgerritMerged openstack/tripleo-heat-templates: Make AllNodesExtraConfig depend on the validation deployments  https://review.openstack.org/28874719:27
*** gfidente has quit IRC19:29
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates: Set host in nova.conf for compute nodes  https://review.openstack.org/28886619:32
*** paramite has quit IRC19:44
*** jtomasek_ has joined #tripleo19:51
*** dcain has quit IRC20:01
*** dcain has joined #tripleo20:03
*** Marga_ has quit IRC20:04
*** akuznetsov has joined #tripleo20:08
*** david-lyle_ has joined #tripleo20:12
*** david_lyle has quit IRC20:13
*** Marga_ has joined #tripleo20:15
*** akuznetsov has quit IRC20:20
*** akuznetsov has joined #tripleo20:25
*** david-lyle_ is now known as david-lyle20:26
openstackgerritgreghaynes proposed openstack/diskimage-builder: Add element to force loading of the 3ware module  https://review.openstack.org/28955220:28
openstackgerritgreghaynes proposed openstack/diskimage-builder: Add element to force loading of the 3ware module  https://review.openstack.org/28955220:29
*** Goneri has quit IRC20:30
*** Goneri has joined #tripleo20:30
*** admin0 has joined #tripleo20:36
*** zeroshft has joined #tripleo20:38
*** nico_auv has quit IRC20:39
EmilienMdprince: when you have some time, I have a patch in puppet-tripleo that I would like some feedback so I can continue with other services: https://review.openstack.org/#/c/28634420:42
EmilienMdsneddon: do you have a PoC handy in THT?20:42
EmilienMdsneddon: about dual stack20:42
*** jtomasek_ has quit IRC20:43
*** admin0 has quit IRC20:44
*** xinwu has joined #tripleo20:44
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Only unset proxy for deploy command  https://review.openstack.org/28955620:50
bnemecdprince: derekh: ^ We're spending a bunch of time and bandwidth downloading the Fedora image every run now.20:51
*** yamahata has quit IRC20:55
*** yamahata has joined #tripleo20:55
openstackgerritRyan Hallisey proposed openstack/tripleo-common: Use Fedora 23 atomic in container gate  https://review.openstack.org/28956321:04
dprincebnemec: ack. looking21:05
openstackgerritRyan Hallisey proposed openstack/tripleo-common: Use Fedora 23 atomic in container gate  https://review.openstack.org/28956521:06
dprincebnemec: when it passes CI feel free to +A21:07
openstackgerritRyan Hallisey proposed openstack-infra/tripleo-ci: Allow the continer job to run again  https://review.openstack.org/28891521:07
bnemecdprince: Thanks, will do.21:07
dprincebnemec: I would really like CI metrics so we notice stuff like this front and central21:11
*** akuznetsov has quit IRC21:11
bnemecdprince: Yeah, we should see if we could do something like http://status.openstack.org/openstack-health/#/ for toci.21:12
bnemecAlthough I thought that had test time metrics too, which I'm not seeing now.21:14
*** dcain has quit IRC21:15
dprincebnemec: yeah. That is a nice report but I want metrics on each part (fine grained) of our CI setup21:15
bnemecdprince: I thought it had something kind of like that, but now I don't see it so I must have been thinking of something else.21:16
openstackgerritJames Slagle proposed openstack/tripleo-heat-templates: Make AllNodesExtraConfig depend on the validation deployments  https://review.openstack.org/28956821:16
dprincebnemec: I used to have custom metrics in SmokeStack. I might could leverage some of my old work and we could raise it from there21:17
EmilienMbnemec: I think you need tempest runs for openstack-health21:17
*** jcoufal has quit IRC21:17
dprincebnemec: I want DIB build times, image download times, RPM build times, etc.21:17
EmilienMin puppet CI, we use dstat a lot to grab metrics and store them in /var/log/dstat.log21:17
bnemecdprince: Yeah, that would definitely be good.21:17
EmilienMit helped us to find out timeouts21:17
bnemecEmilienM: We actually have dstat now too.  I haven't looked at the output yet though.21:18
EmilienMcool!21:18
dprinceEmilienM: dstat would be cool too21:19
dprinceEmilienM: I really just think having wall time for all of our tasks is what I'm after21:19
EmilienMyeah, I can submit a patch for that21:20
*** ayoung has joined #tripleo21:20
EmilienMI was looking at code to see how it works21:20
*** dcain has joined #tripleo21:20
dprinceEmilienM: as a start could you just link me how puppet CI does this?21:20
derekhWe got dstat on the jenkins node and undercloud, iirc not the overcloud  nodes21:21
*** jayg is now known as jayg|g0n321:22
dprincederekh: I'm interested in undercloud and jenkins for starters anyways21:22
EmilienMdprince: about dstat? sure. Code is here: https://github.com/openstack/puppet-openstack-integration/blob/master/run_tests.sh#L81 and example of file: http://logs.openstack.org/46/289446/1/check/gate-puppet-openstack-integration-scenario002-tempest-dsvm-centos7/35e11b8/logs/dstat.txt.gz21:22
derekhwhat I'd like to see is metrics for timings between specific checkpoints that we can litter around the ci test21:22
dprincederekh: yes, exactly. I was thinking just a shell script function. (this is what I had in smokestack FWIW)21:23
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Deploy Gnocchi as a Ceilometer metrics storage backend  https://review.openstack.org/25203221:23
EmilienMdprince, derekh: I like this idea21:23
dprincederekh: See my start_metric/stop_metric functions here https://github.com/dprince/smokestack/blob/master/app/templates/puppet_runner.sh.erb#L7821:24
dprinceEmilienM: this is how I kept my CI runtime < 25 minutes21:24
EmilienMdprince: multi node?21:25
dprinceEmilienM: yes21:25
EmilienMo_O21:25
EmilienMthat's what we have, on a single node (in puppet CI)21:25
dprinceEmilienM: before there was a devstack man21:25
dprinceEmilienM: there didn't used to be so many services I think21:26
dprinceEmilienM: things have exploded a bit21:26
EmilienM"a bit" :)21:26
*** shardy_ has quit IRC21:27
derekhdprince: where does start_metric set the data to?21:27
dprinceEmilienM: anyways, a combination of dstat and some custom shell functions we can put in our scripts would help21:27
dprincederekh: let me check and see21:27
*** weshay has quit IRC21:28
derekhdprince: doesn't matter too much, was just curious, what I had in my imaginary world was some kind of graphite server21:28
dprincederekh: https://github.com/dprince/smokestack/blob/master/app/templates/common.sh.erb#L30421:28
dprincederekh: This was built to log to a file, and then I had a POST script that either created HTML, or could offload it to a production service somewhere (graphite, etc.)21:29
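The checkpoint idea discussed here (wrap each CI step, log its wall time to a file, post-process the file later) can be sketched as below. This is not the SmokeStack start_metric/stop_metric code, which is shell; `timed_checkpoint`, `summarize`, and the JSON-lines log format are assumptions made for the example.

```python
import json
import time
from contextlib import contextmanager


@contextmanager
def timed_checkpoint(name, log_path, clock=time.time):
    """Append one JSON line with the wall time of the wrapped step."""
    start = clock()
    try:
        yield
    finally:
        # Record the timing even if the step raised, so failures are visible too.
        record = {"metric": name, "start": start, "seconds": clock() - start}
        with open(log_path, "a") as log_file:
            log_file.write(json.dumps(record) + "\n")


def summarize(log_path):
    """Total seconds per metric name, e.g. as input for a report or graphite."""
    totals = {}
    with open(log_path) as log_file:
        for line in log_file:
            record = json.loads(line)
            name = record["metric"]
            totals[name] = totals.get(name, 0.0) + record["seconds"]
    return totals
```

A step would then be wrapped as `with timed_checkpoint("dib_build", "metrics.log"): ...`, and a later script could turn the log into HTML or forward the totals to a central service.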
derekhdprince: cool21:29
*** admin0 has joined #tripleo21:29
dprincederekh: I think perhaps having both is useful. The central service really makes changes pop out. But having each run self-contain something that is readable can be useful too21:30
derekhdprince: ack, makes sense21:30
dprincederekh: whatever we use. I want the report on tripleo.org. Alongside the owl :)21:31
derekh;-)21:31
dprincejrist: Hey, BTW how is the new owl logo21:31
dprincejrist: want to mention your prototype in the tripleo IRC meeting tomorrow?21:32
jristsure, was going to ask around today about writing a spec - jtomasek and a few others mentioned a 'style guide' for usage21:33
dprincejrist: if people like it the code it plugs into lives here for now: https://github.com/dprince/tripleosphinx21:33
jristI was going to throw it into the header and login screen21:33
jristdprince: ok21:33
jristyeah lets add it to the agenda if you don't mind21:33
jristI wanted to get it going today but I'm staring blankly at keystone at the moment21:33
dprincejrist: fine, you mentioned it last week when I was really busy w/ some sysadmin tasks, it looked coolish to me21:34
jristdprince: no worries21:34
dprincejrist: only the feet... owl's gotta have some talons I think21:35
jristhaha21:35
jristshould we hold a vote about that?21:35
jristperhaps he's looking up at you and his feet are hidden below his body21:35
dprincejrist: meetings have been running long, but perhaps on the list we can21:36
*** derekh has quit IRC21:36
jrist10-421:37
*** admin0 has quit IRC21:46
*** r-mibu has quit IRC21:47
*** r-mibu has joined #tripleo21:47
*** dshulyak has quit IRC21:47
*** ccamacho has quit IRC21:55
*** rpothier has quit IRC21:55
*** trown is now known as trown|outtypewww21:57
*** david-lyle has quit IRC22:02
*** david-lyle has joined #tripleo22:02
*** dshulyak has joined #tripleo22:03
*** rhallisey has quit IRC22:06
*** lblanchard has quit IRC22:06
*** yamahata has quit IRC22:07
*** dshulyak has quit IRC22:08
*** dprince has quit IRC22:09
*** admin0 has joined #tripleo22:19
*** psanchez has quit IRC22:23
*** shadower has quit IRC22:23
*** slagle has quit IRC22:24
*** adarazs has quit IRC22:25
*** stevebaker has quit IRC22:25
*** adarazs has joined #tripleo22:27
*** jtomasek_ has joined #tripleo22:33
*** dcain has quit IRC22:36
*** dustins has quit IRC22:41
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates: Add an environment to use a swap partition  https://review.openstack.org/28961022:59
*** jtomasek_ has quit IRC23:01
*** jtomasek has quit IRC23:02
*** absubram has quit IRC23:06
*** admin0 has quit IRC23:18
*** palexster has quit IRC23:19
*** zeroshft has quit IRC23:25
*** palexster has joined #tripleo23:32
*** saneax_AFK is now known as saneax23:35
*** palexster has quit IRC23:37
*** palexster has joined #tripleo23:50
