Thursday, 2016-03-10

*** openstack has joined #tripleo00:09
*** openstackgerrit has quit IRC00:17
*** openstackgerrit has joined #tripleo00:18
*** xinwu has quit IRC00:19
*** morazi has quit IRC00:29
*** yamahata has joined #tripleo00:29
*** ccamacho has quit IRC00:34
openstackgerritMerged openstack/instack-undercloud: Store events in Undercloud Ceilometer  https://review.openstack.org/28978900:37
*** thrash is now known as thrash|g0ne00:37
*** xinwu has joined #tripleo00:42
openstackgerritMerged openstack/tripleo-heat-templates: Store events in Ceilometer  https://review.openstack.org/29015300:43
*** Erming__ has joined #tripleo00:44
*** michchap has joined #tripleo00:45
*** eggmaste` has joined #tripleo00:45
*** ryansb_ has joined #tripleo00:45
*** ryansb_ has quit IRC00:45
*** ryansb_ has joined #tripleo00:45
*** isq_ has joined #tripleo00:46
*** pino|work_ has joined #tripleo00:47
*** saneax is now known as saneax_AFK00:47
*** prometheanfire has quit IRC00:47
*** Nakato_ has joined #tripleo00:48
*** Erming_ has quit IRC00:49
*** pino|work has quit IRC00:49
*** michchap_ has quit IRC00:49
*** eggmaster has quit IRC00:49
*** isq has quit IRC00:49
*** ryansb has quit IRC00:49
*** Nakato has quit IRC00:49
*** ryansb_ is now known as ryansb00:49
*** prometheanfire has joined #tripleo00:50
*** lblanchard has joined #tripleo00:57
*** rhallisey has quit IRC01:00
openstackgerritMerged openstack/tripleo-heat-templates: Add missing createUser line to /etc/snmp/snmpd.conf  https://review.openstack.org/29031701:11
openstackgerritMerged openstack/tripleo-common: Add capabilities filter for Nova  https://review.openstack.org/28808701:12
*** dmacpher-afk has quit IRC01:14
*** xinwu has quit IRC01:22
*** panda has quit IRC01:40
*** panda has joined #tripleo01:40
*** xinwu has joined #tripleo01:56
openstackgerritJames Slagle proposed openstack/tripleo-common: Add capabilities filter for Nova  https://review.openstack.org/29094201:56
openstackgerritSam Yaple proposed openstack/diskimage-builder: Use fstrim to prep the block device  https://review.openstack.org/29094402:02
*** lblanchard has quit IRC02:26
*** rbrady has quit IRC02:31
*** dmacpher has joined #tripleo02:49
*** Marga_ has quit IRC02:54
*** Marga_ has joined #tripleo02:56
openstackgerritIan Wienand proposed openstack/diskimage-builder: centos-minimal does not provide base  https://review.openstack.org/29095603:00
*** Marga_ has quit IRC03:01
*** xinwu has quit IRC03:30
*** xinwu has joined #tripleo03:34
*** shivrao has quit IRC03:44
*** rlandy has quit IRC03:45
*** yamahata has quit IRC03:49
*** Marga_ has joined #tripleo03:50
*** links has joined #tripleo03:50
*** Marga_ has quit IRC03:51
*** Marga_ has joined #tripleo03:51
*** akuznetsov has joined #tripleo03:55
openstackgerritIan Wienand proposed openstack/diskimage-builder: Clear up "already provided" message  https://review.openstack.org/29096803:59
*** jaosorior has quit IRC04:02
*** jaosorior has joined #tripleo04:03
*** panda has quit IRC04:22
*** panda has joined #tripleo04:22
*** saneax_AFK is now known as saneax04:25
*** xinwu has quit IRC04:27
*** dmacpher has quit IRC04:30
*** dmacpher has joined #tripleo04:31
*** akuznetsov has quit IRC04:35
*** saneax is now known as saneax_AFK04:54
*** masco has joined #tripleo05:15
*** dmacpher_ has joined #tripleo05:23
*** dmacpher has quit IRC05:26
*** cmyster has quit IRC05:35
*** panda has quit IRC05:40
*** jaosorior has quit IRC05:40
*** Erming__ has quit IRC05:40
*** panda has joined #tripleo05:40
*** Erming__ has joined #tripleo05:46
*** jaosorior has joined #tripleo05:46
*** jaosorior has quit IRC05:46
*** jtomasek has joined #tripleo05:53
*** ayoung has quit IRC05:53
*** Erming__ has quit IRC06:02
*** ayoung has joined #tripleo06:04
*** Erming__ has joined #tripleo06:08
*** veteran has joined #tripleo06:21
*** veteran has quit IRC06:22
*** jprovazn has joined #tripleo06:23
*** dmacpher has joined #tripleo06:27
*** dmacpher_ has quit IRC06:28
*** cmyster has joined #tripleo06:29
*** cmyster has quit IRC06:29
*** cmyster has joined #tripleo06:29
openstackgerritPurandhar Sairam Mannidi proposed openstack/diskimage-builder: Add support for building images capable of UEFI  https://review.openstack.org/28778406:39
*** jtomasek has quit IRC06:53
openstackgerritPurandhar Sairam Mannidi proposed openstack/diskimage-builder: Add support for building images capable of UEFI  https://review.openstack.org/28778406:54
*** jprovazn has quit IRC06:55
*** jprovazn has joined #tripleo06:55
*** saneax_AFK is now known as saneax06:58
*** xinwu has joined #tripleo07:04
openstackgerritMerged openstack/tripleo-heat-templates: Introduce a UpgradeScriptDeliveryWorfklow as part of tripleo upgrades  https://review.openstack.org/28921207:04
*** yamahata has joined #tripleo07:07
*** rwsu has quit IRC07:07
*** ohamada has joined #tripleo07:13
bandinimornin'07:14
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Set default locale to image in ubuntu-minimal  https://review.openstack.org/29078907:15
*** olap has quit IRC07:22
*** ohamada has quit IRC07:23
*** liverpooler has quit IRC07:23
*** trozet has quit IRC07:27
*** trozet has joined #tripleo07:28
*** ccamacho has joined #tripleo07:28
openstackgerritMerged openstack/tripleo-heat-templates: stable/liberty: set default upgrade level to kilo  https://review.openstack.org/29058407:29
*** dshulyak has joined #tripleo07:32
*** rcernin has joined #tripleo07:33
*** pino|work_ is now known as pino|work07:44
*** rdopiera has joined #tripleo07:48
*** olap has joined #tripleo07:50
*** shivrao has joined #tripleo07:52
*** shivrao_ has joined #tripleo07:54
*** fgimenez has joined #tripleo07:56
*** shivrao has quit IRC07:57
*** shivrao_ is now known as shivrao07:57
*** rwsu has joined #tripleo08:01
*** ifarkas has joined #tripleo08:02
*** xinwu has quit IRC08:03
*** rain has joined #tripleo08:08
*** rain is now known as Guest7863108:09
*** Guest78631 is now known as leanderthal08:09
*** aufi has joined #tripleo08:09
*** xinwu has joined #tripleo08:10
*** paramite has joined #tripleo08:14
*** xinwu has quit IRC08:16
*** stendulker has joined #tripleo08:28
*** mbound has joined #tripleo08:31
*** mikelk has joined #tripleo08:32
*** pcaruana has joined #tripleo08:34
*** liverpooler has joined #tripleo08:34
*** liverpooler has quit IRC08:35
*** liverpooler has joined #tripleo08:35
*** rhefner has quit IRC08:36
*** Ng has quit IRC08:36
*** igorbelikov has quit IRC08:36
*** shivrao has quit IRC08:38
*** rhefner has joined #tripleo08:40
*** igorbelikov has joined #tripleo08:42
*** xinwu has joined #tripleo08:43
*** Ng has joined #tripleo08:44
*** ChanServ sets mode: +v Ng08:44
*** chem has joined #tripleo08:46
*** jaosorior has joined #tripleo08:52
*** ohamada has joined #tripleo08:55
openstackgerritIshant Tyagi proposed openstack/os-collect-config: Add insecure option to the cfn collector  https://review.openstack.org/28472508:58
*** ishant has joined #tripleo08:58
*** dmacpher has quit IRC08:59
*** shardy has joined #tripleo09:05
*** jaosorior has quit IRC09:09
*** jaosorior has joined #tripleo09:09
*** jistr has joined #tripleo09:09
*** dtantsur|afk is now known as dtantsur09:11
*** jcoufal has joined #tripleo09:12
*** lucas-dinner is now known as lucasagomes09:14
*** xinwu has quit IRC09:15
openstackgerritMerged openstack/tripleo-heat-templates: Moves the swift start/stop into the common_functions.sh file  https://review.openstack.org/28796009:23
*** openstackgerrit has quit IRC09:30
*** openstackgerrit_ has joined #tripleo09:30
*** openstackgerrit_ is now known as openstackgerrit09:31
*** openstackgerrit has quit IRC09:31
*** openstackgerrit_ has joined #tripleo09:31
*** pblaho has joined #tripleo09:31
*** openstackgerrit_ is now known as openstackgerrit09:32
*** openstackgerrit has quit IRC09:32
*** openstackgerrit_ has joined #tripleo09:32
*** openstackgerrit_ is now known as openstackgerrit09:33
*** openstackgerrit has quit IRC09:33
*** openstackgerrit_ has joined #tripleo09:33
*** openstackgerrit_ is now known as openstackgerrit09:34
*** panda has quit IRC09:40
*** panda has joined #tripleo09:41
*** electrofelix has joined #tripleo09:41
*** rasca has quit IRC09:49
*** mgould has joined #tripleo09:53
*** rasca has joined #tripleo09:54
*** akrivoka has joined #tripleo09:57
openstackgerritImre Farkas proposed openstack/tripleo-docs: Fix url for current-passed-ci  https://review.openstack.org/29107809:57
*** tosky has joined #tripleo09:58
openstackgerritMerged openstack/tripleo-heat-templates: Fixup swift device string to delimit the ipv6 address with []  https://review.openstack.org/28975709:58
*** dmacpher has joined #tripleo10:02
*** Marga_ has quit IRC10:03
*** liverpooler has quit IRC10:05
*** liverpooler has joined #tripleo10:10
*** derekh has joined #tripleo10:13
*** nico_auv has joined #tripleo10:16
*** jtomasek has joined #tripleo10:16
*** liverpooler has quit IRC10:17
openstackgerritMerged openstack/tripleo-docs: Extending the image build information  https://review.openstack.org/27029010:22
openstackgerritMerged openstack/tripleo-heat-templates: Fixup systemctl_swift stop/start  during the controller upgrade  https://review.openstack.org/29050110:22
openstackgerritMarios Andreou proposed openstack/tripleo-heat-templates: Fixup systemctl_swift stop/start  during the controller upgrade  https://review.openstack.org/29108810:25
*** paramite is now known as paramite|afk10:26
openstackgerritPurandhar Sairam Mannidi proposed openstack/diskimage-builder: Add support for building images capable of UEFI  https://review.openstack.org/28778410:26
openstackgerritMerged openstack/tripleo-docs: Update python-rdomanager-oscplugin to python-tripleoclient  https://review.openstack.org/29055310:27
*** paramite|afk is now known as paramite10:28
*** liverpooler has joined #tripleo10:29
shardySimple docs patch needs a second +2/A please - https://review.openstack.org/#/c/283582/10:32
openstackgerritMerged openstack/tripleo-docs: Document deploying the overcloud with ssl  https://review.openstack.org/26500610:33
*** athomas has joined #tripleo10:34
mariosshardy: done10:35
shardythanks!10:35
mariosshardy: can you +2 the cherrypick for that systemctl fix you just +A (thanks for that) https://review.openstack.org/#/c/291088/110:37
openstackgerritMerged openstack/tripleo-docs: Removed reference to SpinalStack to prevent confusion  https://review.openstack.org/28358210:37
shardymarios: done!10:38
mariosshardy: tyvm10:39
*** stendulker has quit IRC10:45
*** Marga_ has joined #tripleo10:47
*** paramite is now known as paramite|afk10:50
jistrceph upgrades ready to land, just need a +2 https://review.openstack.org/#/c/289896/10:57
jistrsame for upgrade init command for repo switching https://review.openstack.org/#/c/290465/10:57
shardyjistr: So, what's the rationale behind not actually running the script?10:59
shardyObviously we're going for the manual approach for computes due to the need for migrating workloads, but could this safely be automated?11:00
openstackgerritAttila Darazs proposed openstack-infra/tripleo-ci: Use IPv6 on the ceph gate job  https://review.openstack.org/28944511:01
jistrshardy: it would run on all nodes at the same time, i'm not sure if that's safe if we want to keep ceph data availability. I'd guess it isn't.11:02
shardyjistr: Ok, so we need a way to do a rolling apply of the script11:02
shardymakes sense, thanks11:02
shardyjistr: I wonder if we should modify SoftwareDeploymentGroup, so it has an option to serialize the deployments11:04
shardyand/or do them in batches11:04
shardywe already have that support in ResourceGroup (which SoftwareDeploymentGroup is based on), so would potentially be quite easy11:05
* shardy adds that to the list of things to look into11:05
openstackgerritMarios Andreou proposed openstack/tripleo-common: Install the upgrade-non-controller.sh script with tripleo-common  https://review.openstack.org/29110111:06
mariosjistr: not sure if that is right yet... testing ^^^11:06
jistrshardy: yeah that would be quite useful i think. It could avoid a CDN hit in other situations, for example. And we might be able to do a minor update without having to control everything synchronously from tripleoclient.11:06
shardyramishra: ^^ Hey maybe this might be something you'd be interested in looking at?11:10
shardyramishra: we'd like SoftwareDeploymentGroup to expose the new rolling update features of ResourceGroup11:10
*** pblaho has quit IRC11:11
*** aufi has quit IRC11:11
ramishrashardy: hey, surely I'll add to my newton todo:)11:13
*** paramite|afk is now known as paramite11:16
openstackgerritMerged openstack/tripleo-heat-templates: Updated the heat_template_version  https://review.openstack.org/28811611:19
shardyramishra: thanks! :)11:23
*** ishant has quit IRC11:25
*** akrivoka has quit IRC11:26
*** trown|outtypewww is now known as trown11:27
*** paramite is now known as paramite|afk11:31
*** oshvartz has joined #tripleo11:33
openstackgerritMerged openstack/tripleo-heat-templates: Increase default netdev_max_backlog to 10x  https://review.openstack.org/28990711:39
*** akrivoka has joined #tripleo11:40
openstackgerritMerged openstack/puppet-tripleo: Make OpenStack service ports configurable in HAProxy  https://review.openstack.org/28796111:42
*** mbound has quit IRC11:47
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Updated the heat_template_version  https://review.openstack.org/28813411:48
*** paramite|afk is now known as paramite11:51
*** trown is now known as trown|outtypewww11:56
slaglelook at all that green12:02
*** jaosorior has quit IRC12:02
*** jaosorior has joined #tripleo12:03
shardyI think we do have an issue with the lint check on stable tho:12:04
shardyhttps://review.openstack.org/#/c/288867/12:04
shardyit's failing on the verify gate on lines unrelated to the patch I think12:04
* shardy looks for patch which fixes it12:05
openstackgerritJuan Antonio Osorio Robles proposed openstack/tripleo-heat-templates: Make service certificate come from explicit input  https://review.openstack.org/29113212:06
jaosoriorshardy: That's the quick patch for the SoftwareConfig stuff we talked about in the morning ^^12:07
openstackgerritDmitry Tantsur proposed openstack/python-tripleoclient: Remove hardcoded delay between introspections  https://review.openstack.org/29113512:09
shardyHrm, we have the same long line on master12:09
jaosoriorEnabling TLS for the CI is all green :D https://review.openstack.org/#/c/281988/ if someone has time to check that out12:09
slagleshardy: yea, these liberty patches passed the lint job in the check, but then failed it in the gate12:10
shardyhttps://github.com/rodjek/puppet-lint/commit/2b48ab36bb5334a41f98f8bd75867cd69eb6f85912:11
shardyNeeds to be under 140 chars12:11
* shardy fixes12:11
shardyHmm, that should just be a warning tho12:12
openstackgerritSteven Hardy proposed openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113612:16
openstackgerritSteven Hardy proposed openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113712:17
openstackgerritJuan Antonio Osorio Robles proposed openstack/python-tripleoclient: Remove hardcoded delay between introspections  https://review.openstack.org/29113512:23
openstackgerritMike Burns proposed openstack/tripleo-heat-templates: Use service tenant for ceilometer  https://review.openstack.org/29114012:25
openstackgerritMike Burns proposed openstack/instack-undercloud: Use service tenant for ceilometer  https://review.openstack.org/29114212:25
openstackgerritMike Burns proposed openstack/tripleo-heat-templates: controller/ceilometer: use internalURL for os endpoint type  https://review.openstack.org/29114512:31
*** weshay has joined #tripleo12:31
slagleEmilienM: hi, getting a CI failure on one of the ipv6 patches, https://review.openstack.org/#/c/272089/12:32
openstackgerritSteven Hardy proposed openstack/python-tripleoclient: Allow node import via yaml not only csv/json  https://review.openstack.org/25522812:32
slagleEmilienM: "Error: Cannot reassign variable nova_ipv6"12:32
slagleEmilienM: i don't see where we are reassigning it12:32
*** lucasagomes is now known as lucas-hungry12:34
openstackgerritSteven Hardy proposed openstack/tripleo-docs: Update baremetal import to not use --json option  https://review.openstack.org/29114712:35
openstackgerritSteven Hardy proposed openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113612:37
EmilienMhello12:38
openstackgerritSteven Hardy proposed openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113712:39
EmilienMslagle: will look asap12:39
*** paramite is now known as paramite|afk12:39
slagleEmilienM: ok, thanks. i'm stumped on it. b/c I only see nova_ipv6 assigned to one time12:40
slagleunless the variable name is used in another module?12:40
*** paramite|afk is now known as mmagr12:42
*** mmagr is now known as paramite12:42
EmilienMslagle: https://review.openstack.org/#/c/272089/11/puppet/extraconfig/ceph/ceph-external-config.yaml12:45
*** pcaruana has quit IRC12:45
shardyOk https://review.openstack.org/#/q/I7b3e6177d160f6f0cb775636f25baed2164d2002,n,z does appear to fix the lint failures for instack-undercloud12:46
shardyI'm not sure why that wasn't failing before tho tbh12:46
*** Goneri has quit IRC12:46
adarazsfolks, SOS, I'm still seeing trouble with IPv6 and rabbit, I have this in the gate job (on the new IPv6 gate):12:48
adarazsError: curl -k --noproxy localhost --retry 30 --retry-delay 6 -f -L -o /var/lib/rabbitmq/rabbitmqadmin http://guest:guest@fd00:fd00:fd00:2000::12:15672/cli/rabbitmqadmin returned 7 instead of one of [0]12:48
adarazshttp://logs.openstack.org/45/289445/4/check-tripleo/gate-tripleo-ci-f22-ceph/b6997a4/console.html12:48
adarazsthat is supposed to be bracketed. and the rabbitmq IPv6 change was merged, so it's supposed to work.12:49
jistrmarios: o/12:49
jistrmarios: could you please review the cinder upgrade when you have a minute, it finally passed CI :)) https://review.openstack.org/#/c/287929/12:50
mariosjistr: sure12:50
adarazsI don't have enough tripleo-fu to figure out where that command comes from.12:50
slagleshardy: cool12:51
*** trown|outtypewww is now known as trown12:52
slagleEmilienM: you are likely right in that review, but that environment file doesnt get used in CI, so i dont think that would cause the nova_ipv6 issue12:53
EmilienMslagle: let me look again, I'm still reading logs12:53
*** pblaho has joined #tripleo12:55
jistrmarios: thanks!12:55
mariosjistr: after I +A I thought of the -q... do you want it there?12:55
*** aufi has joined #tripleo12:56
EmilienMslagle: we might have merged something already that would cause that12:56
mariosjistr: commented there fwiw12:56
EmilienMslagle: I'm looking at it, it's in HA jobs only12:57
*** rhallisey has joined #tripleo12:57
openstackgerritMerged openstack/tripleo-heat-templates: Upgrade of Cinder block storage nodes  https://review.openstack.org/28792912:57
rhalliseyderekh, morning.  Can you see what the journal log returned for the container job?12:57
*** oshvartz has quit IRC12:58
jistrmarios: good catch, thanks!12:59
mariosjistr: ... bit late now :/ sry12:59
*** yamahata has quit IRC13:00
openstackgerritMartin Mágr proposed openstack/tripleo-heat-templates: Keystone domain for Heat  https://review.openstack.org/18056613:00
EmilienMslagle: I think I found it13:00
jistrmarios: no it's fine, i'm not sure if that *always* has to cause problems, probably not, so good that we have it merged so that we can progress with backporting the most important stuff13:00
*** pcaruana has joined #tripleo13:00
jistrhrmm we don't have gfidente13:01
*** dprince has joined #tripleo13:01
EmilienMslagle: would it be possible that HA jobs fail since bb05fa304a2eed2caa4840e8039832d369a357f7 ?13:01
EmilienMslagle: other HA jobs fail too?13:03
slagleEmilienM: other ha jobs are passing13:03
EmilienMlooking at http://tripleo.org/cistatus.html, it seems ok13:03
slagleEmilienM: they also passed on that patch that added nova_ipv6, https://review.openstack.org/#/c/270110/13:03
EmilienMslagle: ok I had a patch but in fact i think it's useless13:04
EmilienMslagle: can I push over the patch to address my comment?13:04
EmilienMslagle: I think gfidente is not here today13:05
slaglesure13:05
EmilienMok13:05
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6  https://review.openstack.org/27208913:06
EmilienMlet's try again, see if the errors happens again13:07
EmilienMslagle: is it possible that a bug in heat would dupplicate puppet manifests?13:07
adarazsderekh: ^ can you help me to track this down?13:07
slagleEmilienM: i've not seen it before. but anything is possible13:07
derekhadarazs: which error do you want help tracking down?13:10
*** thrash|g0ne is now known as thrash13:10
*** akrivoka has quit IRC13:11
*** jayg|g0n3 is now known as jayg13:13
*** pradk has joined #tripleo13:13
derekhAll, we now have 51 testenvs and (currently) 59 jenkins slaves trying to use them, of the jenkins slaves are waiting too long for testenvs when they eventually do get one it will be too late to run a full test and ZUUL will time them out, so they spend 1.5 hours using up a testenv only to fail13:14
derekhThen the other jobs behind them will fail because they in turn were waiting even more on testenvs,13:15
derekhAnd the whole thing will become a big sea of red timeouts13:15
derekhWe need to kill jobs that have been waiting for a testenv for more then X minutes to avoid this13:15
*** morazi has joined #tripleo13:15
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: Upgrade of Cinder block storage nodes  https://review.openstack.org/29116713:16
derekhhttps://review.openstack.org/#/c/290731/13:16
slagleshould we just merge that?13:16
derekhThat should do it ^^13:16
*** lucas-hungry is now known as lucasagomes13:17
derekhslagle: I thinks so, PS1 proved that it can kill the tests, PS2 has passed at least on of the tests13:17
trown+1 to just merging13:17
slagleyea, it passed nonha13:17
*** masco has quit IRC13:18
openstackgerritMerged openstack-infra/tripleo-ci: Kill CI job if it doesn't get a testenv quickly  https://review.openstack.org/29073113:18
slaglederekh: overall, i am seeing a lot more green so far today. i think the redeploy helped13:19
derekhslagle: shardy trown thanks13:19
derekhslagle: Yup, I'm hoping now that we're fully loaded I'm hoping it stays that way over the next hour or 213:20
*** paramite is now known as paramite|afk13:21
trownderekh: with that job killer patch, are we going to get more single job of the three fails requiring full recheck?13:21
trownit would be nice if a job killed because of the 20min timeout was autorqueued and voted with the other jobs based on the requeued job13:22
derekhtrown: Yup, quite probably, the real solution it to reduce the number of jenkins slaves13:22
trownright, that is simpler solution :)13:22
derekhtrown: that patch needs to go into infra/project-config , I'm gonna line that up now, but sometimes it takes a while to get things in13:22
pradkcan i request some reviews on https://review.openstack.org/#/c/289435/ please13:22
trownderekh: yep, makes sense13:23
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: Upgrades: quiet yum upgrade on cinder nodes  https://review.openstack.org/29117313:24
derekhrhallisey: that last run of the containers jobs, failed to deploy the compute node, I got not logs there to look at, (its the compute node logs you wanted isn't it?)13:25
derekh2016-03-10 04:26:22.953 | | e9db9ca2-5fc7-4c14-ac30-19c4fc54e8eb | overcloud-novacompute-0 | ERROR  | -          | NOSTATE     |                    |13:25
rhalliseyreally I thought it passed earlier..13:25
rhalliseyrather spwaned the node..13:25
rhalliseylet me look again13:25
derekhrhallisey: ok, if it has, and you have a computenode tarball from the ci run, you should now have the jourlan log entries in that tarball13:26
rhalliseywhere would the tarball be though13:26
*** paramite|afk is now known as paramite13:31
*** akrivoka has joined #tripleo13:37
openstackgerritMerged openstack/tripleo-heat-templates: Enable glance-api show_image_direct_url for COW  https://review.openstack.org/29035813:37
openstackgerritMerged openstack/tripleo-heat-templates: Set notification driver for nova to send  https://review.openstack.org/28849713:38
*** tremble has joined #tripleo13:38
openstackgerritMerged openstack/tripleo-heat-templates: Upgrades: install zaqarclient  https://review.openstack.org/28770813:41
jistradarazs: looking further, i'm thinking this might actually need a fix in puppetlabs-rabbitmq. I'm a bit puzzled how this could have worked for anyone before.13:42
openstackgerritMerged openstack/tripleo-heat-templates: Add support for DeployArtifactURLs  https://review.openstack.org/28909413:42
jistradarazs: we're passing unbracketed IP into puppetlabs-rabbitmq, and it seems like it's using it for both cases where unbracketed IP would go, and where a bracketed IP would go13:44
jistradarazs: i'll try submitting a patch to t-h-t to pass a bracketed IP, which should fix the problem you're having, but it might break rabbitmq's config file for a change13:45
*** saneax is now known as saneax_AFK13:45
jistradarazs: there might be something we're missing though. Do you know who from the network team had rabbitmq working on IPv6?13:46
*** links has quit IRC13:48
*** jdob has joined #tripleo13:50
*** akuznetsov has joined #tripleo13:51
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: IPv6: pass bracketed IP to rabbitmq puppet module  https://review.openstack.org/29118613:54
*** mkovacik has quit IRC13:54
*** mkovacik has joined #tripleo13:54
*** akuznetsov has quit IRC13:55
jistradarazs: ^^ that's the patch, but as i said, it might fix the curl but break something else. So i gave it WIP status. Maybe a fix in puppetlabs-rabbitmq would be better.13:57
*** Goneri has joined #tripleo13:57
* jistr back to upgrades13:57
*** snecklifter has joined #tripleo13:59
*** jaosorior has quit IRC13:59
*** mbound has joined #tripleo14:00
snecklifterHello, I've been debugging OSP-d installation on Lenovo hardware14:00
adarazsjistr: do you have a patch for the rabbit uri thing? (sorry to pester you about it, just trying to make the ipv6 gate asap)14:00
snecklifterIt looks like the latter exposes a cdc_ether device which potentially tripleo is seeing as an active nic14:01
jistradarazs: :D yes14:01
jistradarazs: i wrote you ~5 or so messages about it, see above14:01
*** rlandy has joined #tripleo14:01
*** akuznetsov has joined #tripleo14:02
snecklifterDoes this sound plausible? I'm wondering what the logic is for determining if a nic is active14:02
snecklifter2: enp0s29u1u1u5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN qlen 100014:02
snecklifterip reports as unknown14:03
snecklifterbut ethtool reports link is up14:03
*** openstackgerrit has quit IRC14:03
*** openstackgerrit_ has joined #tripleo14:03
snecklifterLink detected: yes14:03
shardysnecklifter: https://github.com/openstack/os-net-config/blob/master/os_net_config/utils.py#L5314:04
shardythat's the logic14:04
*** trozet has quit IRC14:04
*** ifarkas has quit IRC14:04
sneckliftershardy, thanks14:04
*** openstackgerrit_ is now known as openstackgerrit14:04
*** openstackgerrit has quit IRC14:04
*** openstackgerrit_ has joined #tripleo14:05
*** openstackgerrit_ is now known as openstackgerrit14:05
*** openstackgerrit has quit IRC14:05
*** openstackgerrit_ has joined #tripleo14:06
sneckliftershardy, what tool is it using to determine presence of carrier and address - sorry, not clear to me14:06
*** openstackgerrit_ is now known as openstackgerrit14:06
*** openstackgerrit has quit IRC14:07
*** lblanchard has joined #tripleo14:07
*** openstackgerrit_ has joined #tripleo14:07
*** openstackgerrit_ is now known as openstackgerrit14:08
*** openstackgerrit has quit IRC14:08
*** openstackgerrit_ has joined #tripleo14:08
shardysnecklifter: it's looking in /sys/class/net/<device>/carrier14:08
*** openstackgerrit_ is now known as openstackgerrit14:09
*** openstackgerrit has quit IRC14:09
*** rodrigods has joined #tripleo14:09
snecklifterah, simple as that, great, will poke further14:09
*** openstackgerrit_ has joined #tripleo14:09
*** openstackgerrit_ is now known as openstackgerrit14:10
*** Guest41345 has joined #tripleo14:10
shadowerthe CI is no longer busted, is it?14:10
shadower(not seeing any problems, just making sure that's the case)14:11
shardyshadower: it's working apart from the lint job on instack-undercloud14:11
shadowershardy: thanks!14:13
sneckliftershardy, could we add /sys/class/net/<device>/operstate == up14:16
openstackgerritAttila Darazs proposed openstack-infra/tripleo-ci: Use IPv6 on the ceph gate job  https://review.openstack.org/28944514:18
derekhdprince, hey if you got a spare minute or two can you add your setup to here as its different to everybody elses, https://etherpad.openstack.org/p/tripleo-dev-env-census14:18
shardysnecklifter: yup probably, looks like that could possibly replace the current carrier test?14:18
shardyhttps://www.kernel.org/doc/Documentation/networking/operstates.txt14:19
*** rbrady has joined #tripleo14:20
sneckliftershardy, sounds like a better test14:20
snecklifterlocal interface also reports UNKNOWN rather than UP but no harm in leaving that check14:21
dprincederekh: yeah, I can14:21
snecklifterI will prep a patch14:21
derekhdprince: thanks14:26
*** rwsu has quit IRC14:33
dtantsurcan someone confirm my understanding of heat that if I need to change something in a template, I have to stack-delete and rebuild?14:35
dtantsuri.e. it's not like puppet which modified the existing thing to the declared state?14:36
dtantsurshardy, shadower ^^?14:36
* dtantsur tries to figure out if it's a bug or expected behavior14:37
shadowerdtantsur: you should be able to do a "heat stack-update" with the updated templates/parameters14:38
dtantsurshadower, so, I'm testing $stuff on the rdo day, and people told me I should add https://github.com/redhat-openstack/tripleo-quickstart/blob/master/playbooks/roles/tripleo/overcloud/templates/overcloud-deploy.sh.j2#L18-L4014:38
dtantsurshadower, I tried adding -e /path/to/such/file.yaml to the deploy command and start it again14:38
dtantsurand it resulting in something like "error 4" (the next attempt -9), and I dunno if it's worth investigating or just give up and stack-delete first14:39
shadowerdtantsur: so what you linked is not a heat template14:41
*** pcaruana has quit IRC14:41
* dtantsur is clueless :)14:41
dtantsurshadower, that's fine with me :) was it expected to work at all?14:41
shadowerdtantsur: I'm not sure. Not familiar with tripleo-quickstart at all yet :-(14:42
dtantsurshadower, I'm rather asking if adding this file with -e flag on stack update is expected to work (at least potentially)14:42
dtantsurignore the context for a while :)14:42
shadowerah, well yeah adding an environment file on update should work imho14:42
dtantsurit didn't :)14:43
shadoweryeah so assuming the yaml file has the right contents, that sounds like a bug14:44
dtantsurdo you think I should report it against tripleo? or upstream heat? or tripleo-heat-templates? :)14:44
toskyall of them!14:45
tosky(sorry)14:45
dtantsureasily :D14:45
*** rwsu has joined #tripleo14:50
shardydtantsur: I'd start with a bug against tripleo, then we can re-route it if needed14:50
shardydtantsur: in answer to your original question, in nearly all cases it should be possible to update a stack, even from a failed state14:51
shardydeleting it and starting again is valid, but shouldn't be mandatory unless you want a clean start14:52
dtantsurshardy, ok, do you have some quick checklist which things I should collect for a report in addition to heat resource-list/show?14:52
shardydtantsur: The exact steps to reproduce, any error output, and any associated error in /var/log/heat/engine.log on the undercloud14:53
*** pcaruana has joined #tripleo14:53
dtantsurshardy, ok14:53
shardydtantsur: for resource-list, do heat resource-list -n5 overcloud | grep FAILED14:54
shardythen you'll grab all the nested resources too14:54
*** pradk_ has joined #tripleo14:54
*** jaosorior has joined #tripleo14:56
jaosoriorHas anybody seen the following error while deploying the overcloud in HA? Error: Must pass auth_password to Class[Aodh::Auth] at /var/lib/heat-config/heat-config-puppet/fa59863d-38c0-463f-8754-dbb8b43d4156.pp:1048 on node overcloud-controller-1.localdomain14:58
openstackgerritTomas Sedovic proposed openstack/tripleo-heat-templates: Allow the vnc server to bind on IPv6 address on computes  https://review.openstack.org/27083114:58
openstackgerritTomas Sedovic proposed openstack/tripleo-heat-templates: Surround MongoDB IPs with braces in the connection string if IPv6  https://review.openstack.org/27015414:58
*** akuznetsov has quit IRC14:59
openstackgerritGonéri Le Bouder proposed openstack/instack-undercloud: add INTERFACE_MTU parameter  https://review.openstack.org/28804114:59
dtantsurshardy, https://bugs.launchpad.net/tripleo/+bug/1555676 and trying to get more information now15:00
openstackLaunchpad bug 1555676 in tripleo "Failed to add a simple environment file when updating the stack" [Undecided,New]15:00
shardydtantsur: can you add the output of heat deployment-show b1dfc129-91f4-4bce-86d8-fe79aa1c08a4 please?15:02
shardythat should give us the stderr of the failed puppet run15:03
shardyActualluy sorry that's 3db70738-20c4-47b1-9c96-cad55212c05515:03
*** trozet has joined #tripleo15:03
shardyyou need the ID of the OS::Heat::StructuredDeployment resource that's FAILED15:04
openstackgerritBen Nemec proposed openstack/instack-undercloud: Secure haproxy stats endpoint  https://review.openstack.org/29091215:04
*** devvesa has joined #tripleo15:05
dtantsurshardy, mmm, that's long, lemme fetch it as a file15:05
*** thrash has quit IRC15:05
*** rdopiera has quit IRC15:06
slagledprince: derekh : i think i might have found an issue with the mulitple nics in ci15:08
*** thrash has joined #tripleo15:08
*** thrash has joined #tripleo15:08
derekhslagle: ya?15:08
slaglei was looking at one of the jobs that was about to time out15:09
slaglethe compute node had deployed fine, but when it rebooted, the ctlplane ip became unreachable15:09
slaglecouldn't ssh or ping15:09
slagleturns out there is another job in a different testenv that is also using that same ip for one of it's nodes15:09
slaglei think this is causing an issue15:10
dprinceslagle: hmmm. They should be on different bridges though right?15:10
slaglei suppose so, yes15:10
dprinceslagle: like each testenv' should have it's own bridge's now, for each network15:10
derekhslagle: ya, the seperate bridges should keep them isolated, it it isn't we got a problem15:10
dprinceslagle: perhaps some even ARP flux is going on or something15:10
dtantsurshardy, updated15:11
slagledprince: yea, arp could be it15:11
slaglethese jobs will probably tiem out soon, but it's testenv32-testenv1-sbiwjg32inl615:12
shardyCannot allocate memory - fork(2)15:12
shardydtantsur: You need more memory or some swap on the overcloud nodes15:12
dtantsurshardy, something is terribly wrong with out installer if 4 GiB is not enough even for launching a simple instance...15:13
shardydtantsur: I agree, which is why we need composable services, so you can turn off stuff you don't want15:13
shardyas it is, people keep adding stuff and we have no way to turn it off15:13
dtantsurshardy, now I understand it's not a question for you, but I have no clues how I (the developer) is supposed to test anything15:14
shardydtantsur: we had to increase the memory on CI nodes from 4G recently for this reason15:14
shardydtantsur: how much ram does your test box have?15:14
dtantsurshardy, so, what's the minimum with which I would be able to pass the pingtest on HA?15:14
dtantsurshardy, I have a dell box with 32 GiB15:14
dtantsurso I can probably bump memory to 6 (with risking of swapping, but still)15:15
trowndtantsur: I think the pingtest would have passed with that single worker heat environment... it was just updating the stack without that to include it that bombed out15:15
slagledprince: all the seeds are bridged into the same br-ctlpane though?15:16
dtantsursigh...15:16
trownya15:16
dtantsurok, I'll try stack-delete and rebuild. otherwise I won't be able to test scaling up..15:16
dprinceslagle: seriously? did I miss this!?15:16
slaglei dunno :) i'm grasping at straws here15:17
derekhslagle: yes, they always have been, one nic on br-ctlpane for external access and one bridge on brbmX for internal15:17
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137615:17
shardydtantsur: I run my undercloud and overcloud nodes with 8G (5 nodes total) on a 32G ram box, but with KSM enabled (default) you can just about deploy a 4 node overcloud15:18
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612415:18
shardydtantsur: that said, I mostly stick to 2-3 node deployments (nonha) personally15:18
slaglederekh: right, ok. i'm confused how this works i guess15:18
trownshardy: KSM enabled is default?15:19
slaglethere's some networking related issue going on15:19
slaglethe node is definitely up with the right ip and networking applied (i added a console and watched the cloud-init output)15:19
shardytrown: /sys/kernel/mm/ksm is present on my centos7 host15:19
slaglebut the ip is not reachable externally15:19
derekhslagle: dprince could the route on the undercloud be sending traffic out the wrong nic ?15:20
dprinceslagle: br-ctlplane isn't used for the undercloud ctlplane I think15:21
dprinceslagle: I think that only gets used for the seed -> jenkins communication.15:21
derekhYa br-ctlplane on the TE host is used for trafic between jenkins slaves and the undercloud15:22
dprincederekh: which route?15:22
trownshardy: does /sys/kernel/mm/ksm/pages_shared actually show some pages shared though?15:22
dprinceslagle: agree the naming of br-ctlplane is confusing though.15:22
trownshardy: I have a deployment on my centos host, and that shows 015:22
slagledprince: when the node first comes up, it has a default route of 192.0.2.1 though15:22
shardytrown: Hmm, I'm just deploying some VMs to find out :)15:22
slaglei saw that in the cloud-init output15:22
openstackgerritDan Radez proposed openstack/os-cloud-config: Adding support for pxe_amt and amt_agent  https://review.openstack.org/29123215:23
derekhdprince: dunno, I took some straws out of the packet slagle was clutching15:23
*** jaosorior has quit IRC15:23
slagledprince: and if there are multiple 192.0.2.1's on that bridge...15:23
dprinceslagle: what if once the undercloud is installed we deleted the route?15:23
slaglefrom the neutron subnet?15:24
dprinceslagle: the seed vm only needs external connectivity while it is installing instack-undercloud right15:24
slagleuntil it does the pingtest15:24
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Allow the containerized compute node to spawn larger VMs  https://review.openstack.org/29123515:24
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Remove unused Neutron Agents container  https://review.openstack.org/29123615:24
openstackgerritRyan Hallisey proposed openstack/tripleo-heat-templates: Parameterize the heat-docker-agents image  https://review.openstack.org/29123715:24
slagleit will need it again to download the image15:24
openstackgerritMerged openstack/tripleo-heat-templates: Fixup systemctl_swift stop/start  during the controller upgrade  https://review.openstack.org/29108815:24
derekhslagle: dprince how about we take a test env host out of rotation, and bring up 3 jenkins slaves using the same TE host for envs and manually run jobs we can poke at15:25
openstackgerritAttila Darazs proposed openstack-infra/tripleo-ci: Use IPv6 on the ceph gate job  https://review.openstack.org/28944515:25
dprincederekh: yep, lets do it15:25
openstackgerritMerged openstack/tripleo-heat-templates: Upgrades: object storage node upgrade fix  https://review.openstack.org/28982615:25
shardytrown: weird, ksm and ksmtuned services are running, the kernel stuff is loaded, and overcommitting does seem to work, but it's not sharing pages AFAICS15:25
*** fgimenez has quit IRC15:25
derekhdprince: ok, this will take a little time to setup, I'll be back with login details in a bit15:26
trownshardy: hmm, that would be huge win if that worked... there has to be alot that could be shared15:27
trownlarsks: do you know anything about ksm ^15:27
larskstrown: not really, other than that it exists :)15:27
trownlarsks: k, that is the extent of my knowledge as well15:28
openstackgerritDan Radez proposed openstack/os-cloud-config: Adding support for pxe_amt  https://review.openstack.org/28207715:28
derekhafazekas: your not using that instance on the ci cloud are you? wanna zap it if I can15:28
openstackgerritJaume Devesa proposed openstack/tripleo-docs: Add MidoNet documentation in advanced deployment  https://review.openstack.org/27032015:28
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137615:30
openstackgerritChristopher Brown proposed openstack/os-net-config: Fixes lp bug 1555669  https://review.openstack.org/29124315:30
openstackLaunchpad bug 1555669 in os-net-config "better link state detection" [Undecided,New] https://launchpad.net/bugs/1555669 - Assigned to Christopher Brown (snecklifter)15:30
sneckliftershardy, ^^^15:31
* bnemec glares at puppet-lint15:31
larskstrown: shardy: ...but on my system, ksmtuned is running and looking at the values in /sys/kernel/mm/ksm it seems as if there is page sharing going on.15:31
bnemecI turned on KSM for my single-node OpenStack box.  It crashed within 24 hours.15:32
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612415:32
bnemec /data point15:32
larsksbnemec: I ate lunch yesteday and my phone crashed.  I'm not sure that's a data point, unless there is some corraborating evidence :)15:33
bnemeclarsks: Well, in this case the kernel oops included a trace that went through ksm. :-)15:33
larsksIt looks like ksm is enabled by default on centos7 (and presumably RHEL).15:33
slaglederekh: what is the tedev port on br-ctlplane?15:33
larsks(That is, I didn't explicitly enable it on my system...)15:34
openstackgerritMerged openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113615:34
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Generate fedora-atomic images using dib  https://review.openstack.org/28716715:34
*** mbound has quit IRC15:34
openstackgerritMerged openstack/instack-undercloud: Fix long line in puppet-stack-config.pp  https://review.openstack.org/29113715:34
derekhslagle: thats was put there to give the Host an IP on the 192.168.1.0/24 network15:34
slagleok15:35
trownlarsks: shardy, going to try a drastically overcommited setup to test ksm page sharing... it does appear to be running by default on CentOS, but doesn't try to share pages unless they would otherwise be swapped out15:37
*** Goneri has quit IRC15:40
*** Goneri has joined #tripleo15:40
*** trozet has quit IRC15:40
*** adarazs has quit IRC15:41
*** adarazs has joined #tripleo15:42
dtantsurfolks, could you please take a look at https://review.openstack.org/#/c/288417/ ?15:43
*** paramite is now known as paramite|afk15:43
dtantsurwithout this thing, people are complaining that IPA has a different root device selection logic15:43
dtantsurand we have no way to override it15:43
dtantsurmeaning that without root device hints, the root device will change for many people on the next rebuild :(15:44
dtantsur(I don't really like this patch, but I dunno what we could do)15:44
dtantsurlucasagomes, ^^15:44
* lucasagomes looks15:44
lucasagomeslook*15:44
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Basic beaker one node test.  https://review.openstack.org/28137615:45
openstackgerritPradeep Kilambi proposed openstack/os-cloud-config: add aodh and gnocchi to keystone service list  https://review.openstack.org/27211015:45
*** paramite|afk is now known as paramite15:46
adarazswhen I get "Merge Failed." from Gerrit, how can I figure out what depends-on patch is actually failing to merge?15:46
openstackgerritAthlan-Guyot sofer proposed openstack/puppet-pacemaker: Add a service provider.  https://review.openstack.org/28612415:46
adarazsor what method does Jenkins use to merge them? I tried to cherry pick them all in the specified order and it worked.15:46
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Deploy Gnocchi as a Ceilometer metrics storage backend  https://review.openstack.org/25203215:47
dtantsuradarazs, it's overly paranoid sometimes15:47
openstackgerritMerged openstack/instack-undercloud: Remove trailing / on keystone admin endpoint  https://review.openstack.org/29072415:47
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: Upgrades: object storage node upgrade fix  https://review.openstack.org/29125415:47
openstackgerritMerged openstack/instack-undercloud: Enable heat-manage purge_deleted cron job  https://review.openstack.org/28289915:47
lucasagomesdtantsur, +1 and left a nit inline15:49
*** bvandenh has joined #tripleo15:49
lucasagomesdtantsur, not sure if it's possible tho, I'm not very familiar with the project and it may not be multi-threaded as I think15:49
lucasagomesbut added the nit to document it anyway15:49
dtantsurlucasagomes, it's not multithreaded right now15:50
dtantsuryeah, thanks15:50
shardyadarazs: sometimes it just means the verify gate jobs failed, not necessarily that there's a merge conflict15:50
adarazsshardy: okay, so what can I do? play with the "Depends-On" order of the patches until it passes?15:51
shardyadarazs: what patch is it?15:51
shardyadarazs: you can do reverify if it's a transient error15:52
*** yamahata has joined #tripleo15:52
adarazsshardy: https://review.openstack.org/289445 -- adding IPv6 to the gate. it needs a bunch of THT patches for having a chance to pass.15:52
adarazsit worked until recently until I added a hotfix from jistr. but I doubt that the problem is Jiri's patch.15:54
shardyadarazs: Hmm, yeah that's not very clear - I thought you meant a merge failure after approval15:57
openstackgerritBen Nemec proposed openstack/instack-undercloud: Enable notifications on undercloud  https://review.openstack.org/28951815:57
adarazsshardy: nope. I don't even know what method does it use to merge all these depends-on stuff if they are in the same repo.15:57
shardyadarazs: ah, that may be the problem15:59
shardyyou may have a merge conflict between two different Depends-On changes in the same repo, e.g t-h-t15:59
*** xinwu has joined #tripleo15:59
adarazsyes. all of them are tht changes.15:59
shardyideally you only want one Depends-On per repo, pointing to the head of any series there15:59
*** jistr has quit IRC15:59
shardyadarazs: You may need to rebase the t-h-t patches into a series, then just depend on the top of the branch16:00
adarazsshardy: hm, okay. I will try that.16:00
openstackgerritBen Nemec proposed openstack/instack-undercloud: Remove trailing / on keystone admin endpoint  https://review.openstack.org/29126616:00
*** paramite has quit IRC16:00
*** jtomasek has quit IRC16:00
openstackgerritMerged openstack/instack-undercloud: Use pymysql database driver for OpenStack DBs  https://review.openstack.org/28495516:01
*** absubram has joined #tripleo16:02
*** absubram_ has joined #tripleo16:04
*** mbound has joined #tripleo16:05
*** pradk has quit IRC16:06
*** absubram has quit IRC16:06
*** pradk_ is now known as pradk16:06
*** absubram_ is now known as absubram16:06
*** eggmaste` is now known as eggmaster16:07
dtantsurtrown, do I need more memory on computes or only controllers? or only computes?16:08
*** bvandenh has quit IRC16:08
trowndtantsur: controllers are where the pressure is16:08
trownshardy: larsks: [root@desk-trown ~]# cat /sys/kernel/mm/ksm/pages_sharing16:09
trown154751116:09
trownso it does "just work" on centos16:09
larskstrown: yeah, that's pretty much what I was seeing...16:09
trownyou just have to be overcommitted for it to kick on16:09
dtantsureven more for me :)16:09
shardycool, that explains why I've been able to overcommit then - I guess I didn't launch enough VMs to see it this time16:09
trowndtantsur: ya, I am currently running an HA deploy with 12G undercloud and 4 8GB overcloud nodes on a 32G host... so far so good16:10
dtantsurawesome16:11
d0ugalslagle: See my comment here: https://review.openstack.org/#/c/288869/ - does it make sense for the parameter to be different between the two files?16:13
openstackgerritJames Slagle proposed openstack/instack-undercloud: Revert "run keystone in a wsgi process"  https://review.openstack.org/29127816:19
EmilienMayoung: ^16:22
ayoungEmilienM, what is his nick?16:23
ayoungcan someone please -2 that16:23
slagled0ugal: it's ok as it is i guess, the other parameters are like that16:23
ayoungslagle, kill that please16:23
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: IPv6: pass bracketed IP to rabbitmq puppet module  https://review.openstack.org/29118616:23
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Allow the vnc server to bind on IPv6 address on computes  https://review.openstack.org/27083116:23
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Surround MongoDB IPs with braces in the connection string if IPv6  https://review.openstack.org/27015416:23
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/28706816:23
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6  https://review.openstack.org/27208916:23
slagled0ugal: but yea, you'd have to specify both parameters16:24
*** pblaho has quit IRC16:24
*** leanderthal has quit IRC16:24
ayoungslagle, you are going to be inflicting countless Keystomne errors on the rest of the world16:24
ayoungdo16:24
ayoungnot16:24
slagled0ugal: which is odd16:24
ayoungrevert16:24
d0ugalslagle: Right, that confused me - I got it working by passing both16:24
adarazschooo-chooo here goes the patch train.16:24
ayoungbnemec, please remove +2 on https://review.openstack.org/#/c/291278/116:24
slagleayoung: it doesnt work on upgrades16:25
ayoungslagle, then lets fix that16:25
EmilienMdo we have upgrade jobs?16:25
slagleayoung: go for it :)16:25
slagleEmilienM: we don't. it still has to work16:25
ayoungslagle, abandon your patch please16:25
EmilienMlet's figure what is wrong16:25
openstackgerritAttila Darazs proposed openstack-infra/tripleo-ci: Use IPv6 on the ceph gate job  https://review.openstack.org/28944516:25
bnemecIt's.  Broken.16:25
slagleEmilienM: please16:25
ayoungbnemec, Eventlet is broken16:26
slagleEmilienM: the error is in there16:26
*** fgimenez has joined #tripleo16:26
ayoungbnemec, Please remove your +2 and put a workflow on it16:26
bnemecNo16:26
bnemecIt's fine to revert something if a breakage is found after it merges.16:27
bnemecThat's what is happening here.16:27
ayoungbnemec no16:27
ayoungbnemec, lets fix the upgrade16:27
bnemecIf/when we come up with a fix then it can go back in.16:27
slagleayoung: please do16:27
ayoungbnemec, Eventlet is broken16:27
slagleor, we can keep discussing16:27
ayoungyou are going to be putting errors into the installed service16:27
EmilienMslagle: do you have httpd logs?16:28
ayoungand we don;'t catch them in Keystone anymore...its HTTPD only16:28
shardyslagle: would it help to raise a bug with more details of the issues found - it's not clear from the revert other then apparently it's broken?16:28
EmilienMpuppet fails to start apache but don't show why16:28
ayoungso lets fix this, but if you revert you are going to cause a major load of pain16:28
slaglemarios: do you have the httpd logs?16:28
ayoungshardy, can you please put a stop on the revert.16:28
EmilienMmarios: or journalctl16:28
EmilienMwe have missed something I guess, the problem is just apache does not work16:29
EmilienMI'm sure eventlet was started before16:29
EmilienMand binding can't happen.16:29
shardyayoung: lets stop arguing over the revert - quick reverts are OK, and can easily be un-reverted when the issues are resolved16:29
ayoungshardy, not this one16:29
shardylet's hear what the actual issue is, then make a call if it can be fixed quickly16:29
EmilienMhttps://github.com/openstack/puppet-keystone/blob/stable/liberty/manifests/init.pp#L92416:30
EmilienMit should stop eventlet before running apache16:30
slagleapetrich: were you getting the keystone error on upgrade too?16:30
slaglethrash: ^?16:30
ayoungshardy, fine, but please workflow -1 the revert until we know16:30
EmilienMbut if eventlet was already started maybe apache started before16:30
apetrichslagle, aye16:30
ayoungEmilienM, ok...so I suspect Systemd16:30
apetrichslagle, cheers!16:30
EmilienMwe can suspect anything I want to see apache logs16:31
EmilienMdo not revert this patch before at least providing useful logs16:31
mariosslagle: EmilienM gimme few will attach to https://bugzilla.redhat.com/show_bug.cgi?id=131658816:31
openstackbugzilla.redhat.com bug 1316588 in instack-undercloud "Upgrade undercloud fails on keystone error" [High,New] - Assigned to brad16:31
EmilienMmarios: let me 2 min16:31
ayoungEmilienM, what we need to do is remove the whole openstack-keystone systemd config, as that is what is kicking off the start of eventlet16:31
ayoungat a minimum, it should be explicitly disabled and the service not run16:32
EmilienMthe problems looks like in HAproxy16:32
EmilienMhaproxy[9505]: proxy keystone_admin has no server available!16:32
EmilienMhaproxy[9505]: proxy keystone_public has no server available!16:32
EmilienMno actually eventlet is stopped16:33
EmilienMso haproxy is not happy16:33
*** aufi has quit IRC16:33
EmilienMbut apache does not sart16:33
EmilienMI'm waiting for httpd logs16:33
slagledprince: got some new info16:33
*** trozet has joined #tripleo16:33
bkerothat means that haproxy can't reach the servers associated with keystone_admin and keystone_public16:34
ayoungEmilienM, that sounds like HAProxy depends on Keystone being up before staring httpd16:34
*** rwsu has quit IRC16:34
slagledprince: managed to get the os-collect-config logs from teh failed node, and it looks like the fallback mode of os-net-config took down networking16:34
ayoungbkero, of course it can't16:34
EmilienMthat's not that, I'm pretty sure the answer is in httpd logs16:34
EmilienMHAproxy is a warning16:34
EmilienMwe don't care about that ^16:34
ayoungEmilienM, you think HTTPD failed to start?16:34
derekhslagle: dprince I've hijacked testenv32-testenv116:35
derekhAnd I've kicked off 3 ha jobs using the following fake jenkins slaves16:35
EmilienMayoung: yes.16:35
derekhslagle: dprince 1. 66.187.229.58 2. 66.187.229.82 3. 66.187.229.12416:35
EmilienMhaproxy is crying because keystone is done, during upgrade, because we stop eventlet and try to start httpd16:35
slagledprince: http://paste.openstack.org/show/490030/16:35
slaglederekh: ok, i've got a new straw ^^16:36
derekhits only started once the underclouds comes up we can take a look16:36
EmilienMayoung: let's wait for marios's logs16:36
slaglederekh: something went wrong in os-net-config, and it looks like the fallback mode took down all of networking, or at least the default route to 192.0.2.116:36
*** bvandenh has joined #tripleo16:36
slaglederekh: that paste is from the failed node...i had to save the disk off, mount it, and get the log out16:37
apetrichEmilienM, posted systemctl status and error_logs16:37
* EmilienM looking16:37
*** mikelk has quit IRC16:37
EmilienMbingo16:37
derekhslagle: nice, we're getting places16:37
EmilienMMar 10 09:18:19 instack.localdomain httpd[10499]: (98)Address already in use: AH00072: make_sock: could not bind to address 0.0.0.0:3535716:37
EmilienMso either keystone eventlet is still started at this time, or HAproxy is stealing the binding but I don't think it does, since it's binded on VIPs16:38
EmilienMit's a problem in vhost config I think, let me check code16:38
apetrichEmilienM, I assumed that because of Process: 31065 ExecStop=/bin/kill -WINCH ${MAINPID} (code=exited, status=1/FAILURE) it probably could not kill it first16:38
ayoungEmilienM, this is all from logs, right?  If keystone eventlet started up, it should show in the keystone log16:38
EmilienMyeah it's a problem in code16:39
EmilienMwe let default apache binding in puppet, which is 0.0.0.016:39
EmilienMand I think haproxy is already using it16:39
EmilienMwe need to patch https://github.com/openstack/instack-undercloud/blob/stable/liberty/elements/puppet-stack-config/puppet-stack-config.pp#L11216:39
EmilienMto add bind_host options16:40
EmilienMI'm doing a patch right now16:40
EmilienMthe question is, how it works when not testing upgrade16:40
bnemecEmilienM: I thought you had already fixed that.  Maybe it just needs a backport?16:41
EmilienMyeah16:41
EmilienMlet me check that16:41
EmilienM5d020c717ccc9c4b8758de105816ab6a108dd14616:41
openstackgerritEmilien Macchi proposed openstack/instack-undercloud: keystone/wsgi: bind on local IP  https://review.openstack.org/29129216:41
EmilienMok it should fix upgrade ^16:42
*** ohamada has quit IRC16:42
slagleEmilienM: cool, can you test it please16:42
*** ohamada has joined #tripleo16:42
EmilienMno16:42
EmilienMmy env is f* up16:42
ayoungHA16:42
ayoungAnd there is the rub16:42
slagleayoung: do you want to test it?16:42
ayoungslagle,16:42
EmilienMI'm sure this is the fix16:42
EmilienMbut yeah we need to test it16:43
ayoungslagle, I can honestly say I do not know how to test it.  I have a machine running tripleo16:43
ayoungbut it has the patch already16:43
ayoungslagle, how was the error originally discovered16:43
apetrichEmilienM, wait a moment I can test that now16:44
slagleayoung: osp testing16:44
EmilienMapetrich++16:44
slagleapetrich: thanks16:44
ayoungapetrich, TYVM16:44
*** pcaruana has quit IRC16:47
pradkEmilienM, do we need/have a similar fix on overcloud? iirc there was a similar issue jprovazn ran into where  wsgi was conflicting with ha proxy bind16:47
EmilienMthat's a good question16:47
EmilienMlet me check16:47
EmilienMyes we have16:48
*** xinwu has quit IRC16:48
EmilienMpuppet/controller.yaml has the right options so we are good16:48
EmilienMpuppet/controller.yaml:                keystone::wsgi::apache::bind_host: {get_input: keystone_public_api_network}16:48
EmilienMpuppet/controller.yaml:                keystone::wsgi::apache::admin_bind_host: {get_input: keystone_admin_api_network}16:48
thrashslagle: yes16:48
pradkcool16:48
*** ohamada_ has joined #tripleo16:52
*** MaxPC has joined #tripleo16:52
*** ohamada has quit IRC16:52
thrashEmilienM: slagle ayoung I'll manually test that backport.16:52
*** ohamada_ has quit IRC16:52
*** ohamada_ has joined #tripleo16:53
ayoungthrash, cool thanks16:53
ayoungslagle, in the interest of keeping me from blowing a gasket incase the revert gets accidentally approved, and as a courtesy to me, who has to fix all the nastiness in Keystone Eventlet bufgs when they are reports, could you please -1 Workflow the revert for now?16:54
slaglei liked it better when you were making demands16:55
ayoungslagle, 4 years I've battled this beast16:55
ayounghttp://adam.younglogic.com/2012/03/keystone-should-move-to-apache-httpd/16:55
ayoungslagle, and, in that time, we built all the federation infrastructure, which is dependent on the Apache modules for Crypto.  So, without this, I don;t even have a prayer of setting it up, even as a one off or proof of concept16:56
*** devvesa has quit IRC17:00
apetrichEmilienM, slagle, ayoung good stuff17:00
*** tosky has quit IRC17:00
apetrichEmilienM, it's still deploying but passed that step17:00
EmilienMcool17:01
ayoungapetrich, is that a request? That I buy some good stuff for the next face to face?  You all going to Austin?17:01
apetrichayoung, not this time unfortunately17:02
apetrichUndercloud install complete.17:02
EmilienMok, we can merge my backport then17:03
EmilienMapetrich: thx a lot for the testing17:03
ayoungEmilienM, so that was not an upgrade test, just a smoke test?17:03
apetrichEmilienM, No worries I'm stuck on that one17:03
ayoungapetrich, ^^?17:03
*** lucasagomes is now known as lucas-afk17:04
apetrichayoung, no this was an upgrade test. I had an env that failed on the upgrade. now it passed all the way17:04
apetrichno, *17:04
*** derekh has quit IRC17:05
apetrichayoung, so this was an actual CI upgrade test that passed the undercloud upgrade17:05
thrashayoung: I'll be there. :D17:05
ayoungthrash, excellent.17:05
ayoungapetrich, nice17:05
trownI think we should revert the wsgi patch just for giggles anyways17:06
ayoungSo happy that this is going ahead.17:06
ayoungtrown, have you ever met me in person?17:06
trown:)17:06
thrashtrown: you are evil. I like that.17:06
trownayoung: ya we met in Tokyo17:07
* EmilienM coughs17:07
thrashayoung: my test appears to be running fine as well.17:07
ayoungwhew17:08
thrashayoung: I put a W-1 on slagle's patch for the sake of your blood pressure.17:08
MaxPCis this the right time to mention ayoung thought this was for OSP 9 and not 8 ?17:08
thrashhaha17:09
ayoungOh, MaxPC17:09
MaxPC:p17:09
*** bvandenh has quit IRC17:09
slagleMaxPC: i dunno, i was going to point out to him that keystone isnt wsgi in the overcloud17:09
MaxPClooool17:09
slaglewhich is where i think he'd care about it the most17:10
slagle:)17:10
ayoungslagle, that is true17:10
thrashfinally... a clean CI run on https://review.openstack.org/#/c/288568/17:11
ayoungslagle, its like a cancer.  You want to kill it where ever you see it, and any remission is deathly17:11
MaxPCI do agree with the sentiment thought let's not give up on it at the first gray cloud17:13
MaxPCthere might be fait weather on the other side17:13
slagleMaxPC: no one did that.17:13
MaxPCI know :-)17:13
slaglebroken is broken, reverts are an option in that case17:13
slaglein this case, we got a quick fix, that is great17:14
MaxPCI just got a lot of noise around this very quickly :-) it's all good, no blame here17:14
*** fgimenez has quit IRC17:14
MaxPConly love17:15
MaxPCwe all want the best product possible17:15
trownin my experience posting a revert is the quickest way to get a quick fix if one is possible :)17:16
bnemectrown: +1 :-)17:16
EmilienMslagle: I should have backported that patch - mea culpa...17:16
bnemec I missed it too.  It would have broken SSL undercloud on Liberty, so we needed it anyway.17:17
openstackgerritMerged openstack/tripleo-common: Add capabilities filter for Nova  https://review.openstack.org/29094217:17
*** sthillma has joined #tripleo17:18
*** ccamacho has quit IRC17:18
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Allow the vnc server to bind on IPv6 address on computes  https://review.openstack.org/27083117:19
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/28706817:21
bnemecSpeaking of not breaking SSL, https://review.openstack.org/#/c/281988 has passed CI and the dependency is in.17:21
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6  https://review.openstack.org/27208917:21
*** sthillma_ has joined #tripleo17:22
bnemecOh, and I should note that it works on stable too: https://review.openstack.org/#/c/287425/17:22
openstackgerritAttila Darazs proposed openstack/tripleo-heat-templates: IPv6: pass bracketed IP to rabbitmq puppet module  https://review.openstack.org/29118617:23
*** sthillma has quit IRC17:23
*** sthillma_ is now known as sthillma17:23
*** tosky has joined #tripleo17:23
*** ccamacho has joined #tripleo17:27
dprinceslagle: is this related to http://git.openstack.org/cgit/openstack/os-net-config/commit/?id=c545e46f8fe2362df81e86c187aa6e50be185ad617:28
dprinceslagle: sorry, I was away for a bit. Is os-net-config still what you think the root of our problem is now?17:28
slagledprince: could be, did you see the paste?17:28
dprinceslagle: that would have landed last week around the time things went south I think17:28
slaglei guess nic5 didnt get mapped to anything, and then the fall back took down all of networking on the node17:29
dprinceslagle: yes, that is why I linked this patch17:29
slagledprince: the tb is from earlier up, in interface_name17:30
slaglenot sure it's related to that patch17:30
dprinceslagle: tb? sorry?17:30
slagletraceback :)17:30
bnemecHmm, trunk.rdoproject.org seems to be down.17:32
slaglenic5 should have gotten mapped to eth417:32
dprinceslagle: yeah17:32
slagleand then eth4 passed into utils.interface_name17:32
slagleerr, utils.interface_mac17:32
*** thrash is now known as thrash|biab17:32
trownbnemec: ruh roh, asking in #rdo17:33
dprinceslagle: this is from the compute node you say? probably doesn't matter which node it is17:35
slagledprince: yes it was a compute node17:36
*** olap has quit IRC17:37
dprinceslagle: is os-net-config perhaps running too early on that first pass? So it isn't settled enough to get the correct nic mapping?17:37
*** liverpooler has quit IRC17:37
*** ccamacho has quit IRC17:38
*** shivrao has joined #tripleo17:38
slagledprince: i don't think so. mainly b/c there was a 5 minute delay on boot trying to start the networking service17:39
slagledprince: for some reason, the network systemd service tried dhcp on eth1, which took 5 minutes to time out.17:39
*** panda has quit IRC17:39
dprinceslagle: okay, is perhaps the MAC address assigned to that nic just plain bad then?17:40
*** panda has joined #tripleo17:40
jristugh17:40
dprinceslagle: I can't imagine libvirt would allow that17:40
slagledprince: i don't see any dupes for this mac, i had already checked17:40
slagledprince: i have the image saved and mounted17:41
slagleif you want to poke at it17:41
dprinceslagle: yeah, I don't think there would be dups, I'm just wondering if something in the image thinks it is a bad MAC for some reason17:41
dprinceslagle: but it worked the second pass, so couldn't be17:41
dprincedsneddon: are you following this, we have a paste file showing an odd os-net-config failure http://paste.openstack.org/show/490030/17:42
dsneddondprince, I am following17:43
dprincedsneddon: thanks17:43
*** athomas has quit IRC17:45
slagledprince: are we sure that the nic mapping is fully populated before add_bridge would be called?17:45
dprinceslagle: I'd like to try reverting the is_active_nic change17:45
dprinceslagle: it is the only thing that merged last week and it isn't critical I think, or at least we could re-add it easily later17:46
dprinceslagle: it could potentially be effecting the mappings because _is_active_nic is called from ordered_active_nics17:47
slagledprince: sure, worth a try17:48
dprinceslagle: is there a bug to reference for this?17:48
EmilienMfyi trunk.rdoproject.org is down17:48
dprinceEmilienM: yeah, bummer17:48
*** bnemec changes topic to "TripleO | trunk.rdoproject.org is down. CI will fail until it's back up. | CI status: http://tripleo.org/cistatus.html | Docs: http://tripleo.org/"17:48
dprinceEmilienM: that will give us time to talk perhaps :)17:49
slagledprince: let me file one17:50
openstackgerritDan Prince proposed openstack/os-net-config: Revert "launchpad bug 1537330, fix _is_active_nic"  https://review.openstack.org/29132217:50
openstackLaunchpad bug 1537330 in os-net-config "os_net_config.utils._is_active_nic gives wrong result for linux bond" [Undecided,New] https://launchpad.net/bugs/153733017:50
dprinceslagle: ^^^17:50
slagleok17:51
*** jtomasek has joined #tripleo17:51
EmilienMdamn we don't need CI outage *now*17:51
dprinceslagle: oh, I din't file an actual bug17:51
slaglei'll file a new one :)17:51
dprinceslagle: thanks, you've done the best detective work here so far17:51
dprinceEmilienM: we need our own mirrors man!17:51
bnemecThings never go down when it's convenient.17:52
EmilienMdprince: we need to mirror Internet17:52
EmilienMdo we have enough space?17:52
bnemecDisk is cheap. ;-)17:52
dprinceEmilienM: I would actually suggest we run the mirror outside of our cloud I think17:56
dprinceEmilienM: like on RAX or something17:56
EmilienMI would love seeing packaging mirror hosted by OpenStack Infr17:56
EmilienMInfra*17:56
dprinceEmilienM: hey, I wanted to organize the puppet-tripleo stuff a bit better17:56
dprinceEmilienM: I really like your initial patches here. https://review.openstack.org/#/c/289459/217:57
EmilienMdprince: my stuff on glance?17:57
dprinceEmilienM: one comment I had was to eliminate the "defined" pacemaker bits. I don't like that17:57
dprinceEmilienM: I'd rather just see us control it directly via Heat17:57
EmilienMdprince: and I did not know how to do that17:57
EmilienMwe need a Hiera level for Pacemaker17:57
EmilienMthat is loaded when running HA17:57
dprinceEmilienM: I've done this before17:58
EmilienMdprince: can you show me an example?17:58
EmilienMso I can pick it and do the same for my work17:58
dprinceEmilienM: yes, probably easier if I just update your patches17:58
EmilienMdprince: go ahead man17:58
dprinceEmilienM: okay, lets work this out17:58
dprinceEmilienM: so hey, I noticed micheal chapin filed an LP blueprint for a similar thing too17:58
dprinceEmilienM: https://blueprints.launchpad.net/tripleo/+spec/refactor-puppet-manifests17:59
dprinceEmilienM: basically similar to what we talked about in Tokyo, what you are doing now, etc.17:59
EmilienMdprince: michchap nice17:59
dprinceEmilienM: should we organize all this under a spec and his blueprint?17:59
dprinceEmilienM: and then we make composable services depend on it?17:59
EmilienMdprince: that would be an approach, yes18:00
EmilienMdo we really need a spec?18:00
dprinceEmilienM: perhaps a slightly slower path but it would make the composable services patches slightly smaller in some places18:00
EmilienMAFIK it's just moving code18:00
dprinceI'm asking that same question, would it make sense to organize it? Or just do it18:01
slagledprince: https://bugs.launchpad.net/tripleo/+bug/155574918:01
openstackLaunchpad bug 1555749 in tripleo "CI: compute node networking unresponsive after os-net-config run" [Undecided,New]18:01
shardyI was wondering the same thing - seems like a candidate for a specless blueprint or a spec-lite bug to me18:01
shardyIOW just do it under the existing BP you just mentioned ;)18:01
dprinceEmilienM: lets just reference Michael's LP blueprint for all this code18:01
dprinceshardy: ack, I agree18:01
dprinceI'm gonna say this is officially approved then on LP. Any objectsion to approving https://blueprints.launchpad.net/tripleo/+spec/refactor-puppet-manifests now?18:02
openstackgerritDan Prince proposed openstack/os-net-config: Revert "launchpad bug 1537330, fix _is_active_nic"  https://review.openstack.org/29132218:03
openstackLaunchpad bug 1537330 in os-net-config "os_net_config.utils._is_active_nic gives wrong result for linux bond" [Undecided,New] https://launchpad.net/bugs/153733018:03
bnemecdprince: Just noticed this: https://review.openstack.org/#/c/29124318:07
bnemecWonder if it could be related.18:07
slagledprince: ah, i see that is_active_nic directly influences what gets mapped, so yea, that could explain it. it must not have seen nic5/eth4 as active18:08
dprincebnemec: yep, it could. This same function was changed last week (March 2nd) and is similar to what I'm suggesting reverting18:08
bnemecYeah, that's how I found it.  It's listed in the conflicts for the revert.18:09
dprinceslagle/bnemec: a blind revert (no CI passes) of the os-net-config change from last week should be safe and get us results faster18:09
shardysnecklifter: ^^ FYI18:10
bnemecdprince: Agreed.  I'm not suggesting we don't do the revert, just pointing out a possible fix.18:10
sneckliftershardy, thanks18:10
*** Marga_ has quit IRC18:11
snecklifterexcept I'm having this issue on OSP-d so ^^^ not affecting it18:11
*** Marga_ has joined #tripleo18:12
openstackgerritDan Prince proposed openstack/os-net-config: Revert "launchpad bug 1537330, fix _is_active_nic"  https://review.openstack.org/29132218:12
openstackLaunchpad bug 1537330 in os-net-config "os_net_config.utils._is_active_nic gives wrong result for linux bond" [Undecided,New] https://launchpad.net/bugs/153733018:12
dprincebnemec: think/fixed ^18:12
snecklifterbnemec, I'm not using bonded in this env, I think its down to the way linux kernel handles ethernet over usb18:13
*** trown is now known as trown|lunch18:14
snecklifteror the device reports to kernel or whatever18:14
bnemecdprince: Thanks18:14
openstackgerritSam Yaple proposed openstack/diskimage-builder: Revert "Zerofree the image if possible"  https://review.openstack.org/29135018:15
openstackgerritSam Yaple proposed openstack/diskimage-builder: Use fstrim to prep the block device  https://review.openstack.org/29135118:15
*** thrash|biab is now known as thrash18:16
*** electrofelix has quit IRC18:17
slaglewow, 3 ha jobs on the same testenv hosts really brigns it to a crawl18:17
slaglewe need ha ci job testenv anti-affinity18:17
dprinceslagle: I think controllers are by definition IO intensive18:17
dprinceslagle: and to even think that some would even suggest we only run HA jobs. Imagine what would happend then :)18:18
*** mgould has quit IRC18:18
slagleindeed :)18:18
EmilienMrdo server is back18:19
bnemec\o/18:19
EmilienMwell, need to be tested18:19
EmilienMbecause some other stuffs are still down18:19
openstackgerritGonéri Le Bouder proposed openstack/instack-undercloud: add INTERFACE_MTU parameter  https://review.openstack.org/28804118:20
openstackgerritRyan Hallisey proposed openstack/tripleo-docs: Docs for containerized compute node  https://review.openstack.org/25474318:21
*** ohamada_ has quit IRC18:21
*** xinwu has joined #tripleo18:22
*** sthillma has quit IRC18:28
*** jaosorior has joined #tripleo18:28
*** openstackgerrit_ has joined #tripleo18:30
openstackgerritMerged openstack/tripleo-heat-templates: Add a ceph-storage node upgrade script for the upgrade workflow  https://review.openstack.org/28989618:32
jaosoriorbnemec: Noticed that the overcloud ssl patch is all green? :D https://review.openstack.org/#/c/281988/18:33
openstackgerritMerged openstack/tripleo-heat-templates: Upgrades: initialization command/snippet  https://review.openstack.org/29046518:34
bnemecjaosorior: I did.  I was begging for reviews earlier. :-)18:34
bnemecLotta stuff going on right now though.18:34
jaosorioryou got my +1... but I guess that doesn't really do much :/18:34
jaosoriorbnemec: Yeah, noticed also the change you did for the keystone endpoint in the overcloud18:34
jaosoriorI somehow thought I was using an old undercloud or something18:34
bnemecjaosorior: The admin endpoint?18:35
jaosorioryeah18:35
jaosorioraah, now I noticed it got merged already18:35
jaosoriorthat was fast18:35
bnemecjaosorior: Is it wrong in the overcloud too?18:35
bnemecThings can merge fast when CI is running reasonably well.18:36
jaosoriorbnemec: It isn't (that I know of)18:36
jaosoriorbnemec: Have you seen this error, by the way:18:36
jaosoriorError: Must pass auth_password to Class[Aodh::Auth] at /var/lib/heat-config/heat-config-puppet/fa59863d-38c0-463f-8754-dbb8b43d4156.pp:1048 on node overcloud-controller-1.localdomain18:36
bnemecjaosorior: You may have images that were built when Aodh was merged, but that was since reverted.18:37
*** xinwu has quit IRC18:37
bnemecSo the templates will no longer pass the Aodh configuration to the puppet modules on the image.18:37
jaosoriorcrap18:37
bnemecAlthough that's in a heat-config manifest.18:37
jaosorioralright, will have to re-build the images then18:37
bnemecI would have thought that's coming from t-h-t anyway. :-/18:37
jaosoriorgonna give it another try18:38
bnemecjaosorior: That's the first thing I would try.18:38
*** sthillma has joined #tripleo18:40
jaosoriorbnemec: I don't really understand this change https://review.openstack.org/#/c/290570/18:41
jaosoriorwill this no longer be worked on then? https://review.openstack.org/#/c/244162/18:42
bnemecjaosorior: Not at all, that's why there's a revert also pushed for the deprecation message change.18:42
bnemecI just don't see https://review.openstack.org/#/c/244162/ merging before we branch Mitaka at this point, so the deprecation message is just wrong.18:43
jaosorioroh, I see18:43
bnemecThe deprecation shouldn't have merged before the functional patch in the first place.18:43
jaosorioryeah... that change doesn't look like it's gonna be merged any time soon :/18:43
bnemecRight now we're telling people that a thing is deprecated, without the non-deprecated replacement having merged.18:44
openstackgerritMerged openstack/instack-undercloud: Use service tenant for ceilometer  https://review.openstack.org/29114218:44
jaosoriorbnemec: Now I see18:44
openstackgerritSam Yaple proposed openstack/diskimage-builder: Use fstrim to prep the block device  https://review.openstack.org/29094418:46
openstackgerritMerged openstack/tripleo-heat-templates: Upgrade of Cinder block storage nodes  https://review.openstack.org/29116718:47
EmilienMcould we have a review on https://review.openstack.org/#/c/274492/ please ?18:53
*** rhallisey has quit IRC18:58
openstackgerritBen Nemec proposed openstack/instack-undercloud: Switch to package-installs  https://review.openstack.org/29136719:00
jaosoriorbnemec: Where can I get info about that package-installs?19:02
jaosoriordocumentation and such19:02
bnemecjaosorior: It's a dib element: https://github.com/openstack/diskimage-builder/tree/master/elements/package-installs19:03
bnemecSpeaking of which, I need to add element-deps to those elements now too.19:03
*** rohitpagedar__ has joined #tripleo19:04
jaosoriorbnemec: I see19:05
jaosoriorthanks19:05
openstackgerritBen Nemec proposed openstack/instack-undercloud: Switch to package-installs  https://review.openstack.org/29136719:05
*** tosky has quit IRC19:06
*** sthillma has quit IRC19:09
*** trown|lunch is now known as trown19:12
*** jistr has joined #tripleo19:15
*** dmsimard has quit IRC19:17
*** akrivoka has quit IRC19:19
*** absubram has quit IRC19:19
*** dmsimard has joined #tripleo19:22
openstackgerritOpenStack Proposal Bot proposed openstack/os-cloud-config: Updated from global requirements  https://review.openstack.org/28504919:28
*** jcoufal has quit IRC19:29
*** rhallisey has joined #tripleo19:30
*** sthillma has joined #tripleo19:31
*** jaosorior has quit IRC19:31
openstackgerritJames Slagle proposed openstack/os-net-config: Add some debugging output to ordered_active_nics  https://review.openstack.org/29138419:35
openstackgerritBen Nemec proposed openstack/tripleo-image-elements: Remove mysql-dev dependency from os-svc-install  https://review.openstack.org/29138519:36
*** dmsimard has quit IRC19:36
bnemec^Removes a gross legacy hack that is pulling in unnecessary packages on our images.19:36
*** dmsimard has joined #tripleo19:37
openstackgerritSam Yaple proposed openstack/diskimage-builder: Use fstrim to prep the block device  https://review.openstack.org/29094419:39
*** rcernin has quit IRC19:40
*** ayoung has quit IRC19:52
*** nico_auv has quit IRC19:54
openstackgerritDan Prince proposed openstack-infra/tripleo-ci: Add common bash functions to help track metrics.  https://review.openstack.org/29139219:55
openstackgerritDan Prince proposed openstack-infra/tripleo-ci: Metrics tracking for TripleO deployment tasks  https://review.openstack.org/29139319:55
*** trown has quit IRC19:56
*** sshnaidm_ has joined #tripleo19:58
*** sshnaidm has quit IRC19:59
*** trown has joined #tripleo19:59
dprinceEmilienM: metrics https://review.openstack.org/#/c/291393/20:00
*** rhallisey has quit IRC20:01
*** rhallisey has joined #tripleo20:02
pradkquick question, whats the right place to set the user/tenant roles for a new service? is os-cloud-config the right place?20:04
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: Upgrades: initialization command/snippet  https://review.openstack.org/29140020:06
dprincepradk: we use puppet for that now20:07
dprincepradk: so have a look at the puppet module for the respective service20:07
pradkdprince, oh we do? i updated this https://review.openstack.org/#/c/272110/7/os_cloud_config/keystone.py20:08
dprincepradk: is this what you are asking about: http://git.openstack.org/cgit/openstack/puppet-heat/tree/manifests/keystone/auth.pp20:08
pradkdprince, EmilienM mentioned to me that his puppet keystone manage patch was reverted20:08
EmilienMdprince: w00t20:08
pradkdprince, so basically i need to define the gnocchi user/tenant and set it ResellerAdmin role20:08
dprincepradk: correct, we still use os-cloud-config, but we need to get the keystone patch back in20:08
dprincepradk: perhaps just do it in both places for now20:09
dprincepradk: not a great solution but for now (today) it is the lay of the land20:09
dprincepradk: I'm optimistic we'll get the keystone patch into our overcloud heat templates soon again20:09
bnemecslagle: https://review.openstack.org/#/c/291322/ passed CI20:09
pradkdprince, hmm so if i updated os-cloud-config that should have worked? or i strill need keystone patch for it to work20:10
pradkdprince, the above patch i mentioned doesnt seem to set the role for me.. i picked the latest pkg build and updated overcloud image with virt-customize20:10
EmilienMbnemec: same for https://review.openstack.org/#/c/290568/, except ceph job... not sure it's supposed to pass20:12
dprincepradk: this needs to be deployed in your undercloud20:12
dprincepradk: did you (perhaps manually) deploy the latest os-cloud-config in your patch to your undercloud?20:13
pradkdprince, yea i updated my undercloud os-cloud-config as well20:13
dprincepradk: when using os-cloud-config... the configuration occurs externally from the undercloud node20:13
pradkos-cloud-config-999.9.9-99999.noarch is what i ahve from jenkins rpm-build20:14
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: Add a ceph-storage node upgrade script for the upgrade workflow  https://review.openstack.org/29140820:17
openstackgerritMerged openstack/tripleo-heat-templates: Add Rabbit IPv6 only support  https://review.openstack.org/29056820:22
*** thrash is now known as thrash|bbl20:28
*** sthillma has quit IRC20:28
*** bnemec changes topic to "TripleO | stable/liberty blocked by https://bugs.launchpad.net/tripleo/+bug/1555803 | CI status: http://tripleo.org/cistatus.html | Docs: http://tripleo.org/"20:29
dprinceslagle: did you want to try this? https://review.openstack.org/#/c/291322/20:29
*** mbound has quit IRC20:30
pradkdprince, once i update the os-cloud-config on undercloud, it should set up the roles when overcloud install runs automatically? or do i need to update something in between for it to kick in?20:34
openstackgerritBen Nemec proposed openstack/tripleo-heat-templates: Use puppet parameter for Heat notifications  https://review.openstack.org/29141620:34
bnemecNeed ^ before any stable jobs will pass.20:35
dprincepradk: I think it'll just go20:35
openstackgerritBen Nemec proposed openstack/instack-undercloud: Enable notifications on undercloud  https://review.openstack.org/28951820:37
openstackgerritSam Yaple proposed openstack/diskimage-builder: Revert "Zerofree the image if possible"  https://review.openstack.org/29135020:38
pradkdprince, any way to confirm that happened? since the overcloud install failed there is no rc file to poke keystone on overcloud20:38
bnemecpradk: It won't re-run keystone init on an already deployed overcloud.20:39
slagledprince: it's hard to reproduce...so i dont need to explicitly try it20:39
bnemecYou have to either redeploy or hack the client to force it to re-run keystone init.20:39
slagledprince: e.g., i cant confirm it fixes it20:39
dprinceslagle: no, neither can I. But a revert is safe and we can re-add this later I think20:40
pradkbnemec, ah ok.. would it log the keystone init somewhere i can check what services it did for?20:40
openstackgerritDan Sneddon proposed openstack/os-net-config: Fix order-of-operations bug in os-net-config restart_interfaces  https://review.openstack.org/29142020:42
*** absubram has joined #tripleo20:44
bnemecpradk: I'm not sure if that's logged anywhere.20:45
*** MaxPC has quit IRC20:50
openstackgerritMerged openstack/tripleo-common: Install the upgrade-non-controller.sh script with tripleo-common  https://review.openstack.org/29110120:50
openstackgerritDan Sneddon proposed openstack/os-net-config: Fix order-of-operations bug in os-net-config restart_interfaces  https://review.openstack.org/29142020:51
*** yamahata has quit IRC20:52
slaglebnemec: should we just merge the liberty fix?20:53
slagleit's about 20th in the queue20:53
slagleno point in waiting everything to fail before it20:54
bnemecslagle: Might as well.  It can't break things worse than they are.20:56
openstackgerritDan Sneddon proposed openstack/os-net-config: Fix order-of-operations bug in os-net-config restart_interfaces  https://review.openstack.org/29142020:56
*** weshay has quit IRC21:01
openstackgerritMerged openstack/tripleo-heat-templates: Use puppet parameter for Heat notifications  https://review.openstack.org/29141621:04
openstackgerritDan Sneddon proposed openstack/os-net-config: Fix order-of-operations bug in os-net-config restart_interfaces  https://review.openstack.org/29142021:06
*** bnemec changes topic to "TripleO | stable/liberty fix for https://bugs.launchpad.net/tripleo/+bug/1555803 merged | CI status: http://tripleo.org/cistatus.html | Docs: http://tripleo.org/"21:09
pradkso looking at tripleclient, the keystone-init runs after the stack is created, but i need the user while the deploy is in progress so gnocchi can auth with swift21:09
openstackgerritMerged openstack/tripleo-heat-templates: Surround MongoDB IPs with braces in the connection string if IPv6  https://review.openstack.org/27015421:09
pradkso where does the user/role get created in tripleo ?21:10
pradkis os-cloud-config run as a pre or post stack creation step during overcloud deploy?21:13
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Surround MongoDB IPs with braces in the connection string if IPv6  https://review.openstack.org/29142921:14
openstackgerritMerged openstack/tripleo-heat-templates: Allow the vnc server to bind on IPv6 address on computes  https://review.openstack.org/27083121:15
openstackgerritMerged openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/28706821:15
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Allow the vnc server to bind on IPv6 address on computes  https://review.openstack.org/29143521:18
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: Fix vncproxy_host for IPv6  https://review.openstack.org/29143921:18
*** Marga_ has quit IRC21:20
*** weshay has joined #tripleo21:20
*** penick has joined #tripleo21:21
pradkdprince, ^^ could you clarify that for me please?21:22
slaglebnemec: can you review https://review.openstack.org/#/c/272089/21:24
slaglebnemec: it passed ceph earlier on PS 1221:25
slagleand it was cruising for a pass before timed out21:25
slaglethe images took 90mins to build for whatever reason21:25
slaglebnemec: i think we could merge it is what i'm trying to say21:25
bnemecWhy aren't any services smart enough to handle ipv6 automatically?21:25
slagleHah21:26
bnemecIt seems like everything has a magic "use ipv6" bit that needs to be flipped.21:26
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Change default host reserved memory to 2048MB from 512MB  https://review.openstack.org/29144621:29
*** mbound has joined #tripleo21:30
*** liverpooler has joined #tripleo21:32
bnemecslagle: Done21:32
* bnemec crosses his fingers that he didn't just break the ceph job21:32
*** dprince has quit IRC21:32
*** mbound has quit IRC21:32
*** mbound has joined #tripleo21:33
openstackgerritMerged openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6  https://review.openstack.org/27208921:35
*** panda has quit IRC21:40
*** panda has joined #tripleo21:40
*** jistr has quit IRC21:40
*** lblanchard has quit IRC21:40
*** openstackstatus has quit IRC21:42
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: compute: include VIR_MIGRATE_TUNNELLED when doing VM shared storage  https://review.openstack.org/28658421:43
*** dshulyak has quit IRC21:43
*** snecklifter has left #tripleo21:43
*** snecklifter has joined #tripleo21:43
*** openstackstatus has joined #tripleo21:45
*** ChanServ sets mode: +v openstackstatus21:45
jdobpradk: so yeah, I remember this coming up and I think it was in relation to manila21:46
jdobbut the workaround there was to remove the need for the user that early on21:46
jdobtrying to remember who i was working with on that21:47
*** r-mibu has quit IRC21:47
pradkjdob, oh interesting, having uses while services come up is prtty standard case i thought.. so i cant get a gnocchi user until the stack is created?21:47
*** r-mibu has joined #tripleo21:47
jdobit was a guy named ryan, but he's not online right now (not even sure if he's still working on the project)21:48
jdobok, so, this is me dredging my memories from about 6 months ago21:48
jdobbut IIRC, we want to move the keystone init out of os-cloud-config21:48
jdobwhich would help alleviate this21:48
jdobbut from what I remember, that's not the case and so far it hasn't been a blocker21:49
pradkyea this is not good :(21:49
pradkEmilienM, ^^21:49
pradkjdob, wait how does glance do it?21:50
jdobmagic?21:50
pradkjdob, glance uses swift as default backend too21:50
pradki guess our only work around is perhaps to default to file driver instead21:50
jdobholy shit: https://review.openstack.org/#/c/209594/21:51
jdobthats the patch I was thinking of21:51
jdobcannot believe I found that21:51
openstackgerritMerged openstack/tripleo-heat-templates: Enable predictable IPs on non-controllers  https://review.openstack.org/29068721:51
pradklooking21:51
jdobthis might be apples and oranges now that I look at it21:51
openstackgerritGiulio Fidente proposed openstack/tripleo-heat-templates: Support the deployment of Ceph over IPv6  https://review.openstack.org/29145521:51
pradki guess this is the endpoint itself being accessible21:52
jdobya, you're right21:52
jdobi was a bit off, i just remembered it as a keystone related timing issue21:53
jdobas for glance, I don't know off the top of my head, someone else in here should21:53
pradkreading the tripleoclient code, keystone-init is called once the stack is created.. in our case the stack fails due to the user not available21:54
*** Marga_ has joined #tripleo21:54
pradkso are we in a chicken-egg problem then?21:54
pradkglance uses swift, but it could be glance db sync doesnt require swift to be up?21:55
jdobya :(21:55
jdobwithout knowing much about it, is there a reason a db sync would need the service running?21:56
jdobon the surface it seems like it'd be better to execute prior to starting the service21:56
jdobthough i think i'm reading your comment wrong; your issue is that gnocchi uses keystone for auth during db sync, right?21:56
*** jayg is now known as jayg|g0n321:57
pradkyea so gnocchi has two backends, one is for metadata and other is for metrics .. metadata uses sqlalchemy, and metrics use carbonara which has swift/ceph/file backends21:57
pradkjdob, so db sync upgrades both the indexer and storage .. hence swift should be accessible if swift is the default backend22:00
jdobok, so swift is running before gnocchi, so that's fine, but keystone doesn't know about it yet, so gnocchi can't get to it22:00
pradkyep thats exactly the issue22:01
jdobok, so it's not a surprise this os-cloud-config patch didn't work, since that's just dorking with users22:01
jdobwell, shit.22:01
jdobthere are hooks in the THT templates that can be used for post deployment configuration22:02
pradkyea i asumed that would run while keystone is coming up, guess not22:02
jdobi can see why you'd think that22:02
pradkshouldnt init-keystone run as part of keystone setup :)22:02
jdobits an artifact of the past22:02
pradkah ok22:02
jdobi wonder if we can put off the db sync until the post config22:03
jdobthough I suppose the service won't start without db sync running, huh.22:03
pradkyea22:03
pradkdbsync is part of the puppet manifest22:03
jdobah, shit, so it's not easy to move22:04
jdobcould probably pass a flag in to disable it, but this is ultimately a not-good line of thought to follow in the first palce22:04
pradkif we use file driver we wont have this issue but swift /cepg are recommended for large deployments22:05
pradkjdob, and document swift use case perhaps22:06
pradkas by then the user exists22:06
jdobin the interim, that's not a bad solution22:06
pradki hope we can find a solution by default but worst case we could do this22:06
jdobit comes down to timeframe, we can push to address keystone init in newton, but that won't be available until osp 1022:07
jdob(talking inside baseball in an upstream channel, but whatever)22:07
*** shardy has quit IRC22:07
*** dshulyak has joined #tripleo22:09
pradkjdob, yea, i was banging my head figure out why it wasnt picking up when all the config is in place22:10
jdobits tricky to wrap your head around a few thousand lines of THT + python + puppet :)22:11
*** rcernin has joined #tripleo22:11
pradki'll run this by ceilo team to see if we can get an agreement on using file driver as an interim solution for near term22:11
pradkjdob, hehe no kidding22:11
jdobok cool. tomorrow i'll start looking at the aodh patches (mostly just don't want to screw up this gnocchi environment right now)22:12
*** sthillma has joined #tripleo22:13
pradkjdob, sounds good, thx for your reviews and testing .. aodh patch is in good shape and passing ci22:13
jdoboh good, that should go smoother then22:13
*** jtomasek has quit IRC22:18
openstackgerritBen Nemec proposed openstack/tripleo-puppet-elements: Use package-installs for puppet installation  https://review.openstack.org/29146522:19
*** trown is now known as trown|outtypewww22:21
*** rcernin has quit IRC22:22
*** Goneri has quit IRC22:25
*** dshulyak has quit IRC22:26
*** jprovazn has quit IRC22:38
*** chlong has quit IRC22:39
openstackgerritJeff Peeler proposed openstack/tripleo-docs: Docs for containerized compute node  https://review.openstack.org/25474322:40
*** trozet has quit IRC22:42
openstackgerritEmilien Macchi proposed openstack/tripleo-heat-templates: compute: include VIR_MIGRATE_TUNNELLED when doing VM shared storage  https://review.openstack.org/28658422:42
*** trozet has joined #tripleo22:49
*** trozet_ has joined #tripleo22:50
openstackgerritGiulio Fidente proposed openstack/tripleo-common: Use m1.tiny instead of m1.demo for the pingtest VM  https://review.openstack.org/28984522:51
*** morazi has quit IRC22:53
*** trozet has quit IRC22:54
*** trown|outtypewww has quit IRC22:55
*** trown has joined #tripleo22:58
*** derekh has joined #tripleo23:09
*** dmsimard is now known as dmsimard|pto23:10
derekhslagle: Are you still using them hosts? gonna gick them off again, this time without destrying the Test envs at the end23:13
derekhbnemec: you either ? ^23:14
*** nico_auv has joined #tripleo23:14
bnemecderekh: I am not23:15
derekhbnemec: ack , I doubt slagle is either cause their not running, ok gonna kick them off so the'll be there in the morning when I get here23:17
bnemecderekh: Yeah, it's getting late his time, so hopefully he's logged off by now. :-)23:17
*** dmsimard|pto has quit IRC23:26
*** absubram has quit IRC23:31
*** xinwu has joined #tripleo23:34
*** Goneri has joined #tripleo23:36
*** derekh has quit IRC23:39
*** dmsimard|pto has joined #tripleo23:42
*** yamahata has joined #tripleo23:46
*** mkovacik has quit IRC23:47
*** penick has quit IRC23:51
*** nico_auv has quit IRC23:52
*** penick has joined #tripleo23:54
*** penick has quit IRC23:57

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!