Thursday, 2016-09-22

*** akuznetsov has joined #tripleo00:10
*** akuznetsov has quit IRC00:16
*** rlandy is now known as rlandy|bbl00:25
*** lblanchard has joined #tripleo00:45
*** limao has joined #tripleo00:52
*** dhill_ has quit IRC00:54
*** bana_k has quit IRC01:06
*** akuznetsov has joined #tripleo01:11
*** akuznetsov has quit IRC01:15
*** yamahata has quit IRC01:15
*** bkopilov has quit IRC01:26
*** cmyster has quit IRC01:26
*** saneax is now known as saneax-_-|AFK01:28
*** cmyster has joined #tripleo01:29
openstackgerritzhangyanxian proposed openstack/tripleo-image-elements: Fix typos in rootwrap.conf  https://review.openstack.org/37392201:32
*** bkopilov has joined #tripleo01:32
*** dmacpher-afk has quit IRC01:44
*** alop has joined #tripleo02:00
*** mburned is now known as mburned_out02:08
openstackgerritMerged openstack-infra/tripleo-ci: Re-enable temprevert/cherry-pick/pin functionality  https://review.openstack.org/37096102:08
*** akuznetsov has joined #tripleo02:12
*** akuznetsov has quit IRC02:16
*** rlandy|bbl is now known as rlandy02:38
*** alop has quit IRC02:58
*** david-lyle has quit IRC03:03
*** rlandy has quit IRC03:07
*** dmacpher has joined #tripleo03:11
*** akuznetsov has joined #tripleo03:13
*** akuznetsov has quit IRC03:17
*** ebalduf has joined #tripleo03:25
*** akshai has joined #tripleo03:33
*** coolsvap has joined #tripleo03:36
*** akshai has quit IRC03:37
*** cmyster has quit IRC03:40
openstackgerritRedHat RDO CI proposed openstack/tripleo-heat-templates: GATE TEST, please ignore  https://review.openstack.org/36544903:40
*** rwsu has joined #tripleo03:43
*** cmyster has joined #tripleo03:48
*** akuznetsov has joined #tripleo04:13
*** kberger has quit IRC04:15
*** kberger has joined #tripleo04:16
*** akuznetsov has quit IRC04:18
*** michchap has quit IRC04:20
*** yolanda has quit IRC04:20
*** yamahata has joined #tripleo04:37
*** kberger has quit IRC04:45
openstackgerritMerged openstack/python-tripleoclient: Remove excessive output when configuring nodes  https://review.openstack.org/37247704:46
*** kberger has joined #tripleo04:46
*** bana_k has joined #tripleo04:52
openstackgerritMerged openstack/python-tripleoclient: Remove openstackclient imports in the new parameters command  https://review.openstack.org/37262104:55
*** sshnaidm|afk is now known as sshnaidm05:03
*** saneax-_-|AFK is now known as saneax05:03
sshnaidmmorning05:03
*** jaosorior has joined #tripleo05:07
*** akuznetsov has joined #tripleo05:14
*** akuznetsov has quit IRC05:18
*** ianw has quit IRC05:23
*** ianw has joined #tripleo05:27
*** mcornea has joined #tripleo05:30
*** tzumainn has quit IRC05:42
*** rajinir has quit IRC05:45
*** bana_k has quit IRC05:46
*** florianf has joined #tripleo05:51
*** florianf has quit IRC05:51
*** limao has quit IRC05:52
*** florianf has joined #tripleo05:54
cmystermorning05:56
*** apetrich has quit IRC06:01
*** apetrich has joined #tripleo06:02
*** rcernin has joined #tripleo06:07
*** ccamacho has joined #tripleo06:09
*** pgadiya has joined #tripleo06:09
*** akuznetsov has joined #tripleo06:15
*** rasca has joined #tripleo06:15
*** akuznetsov has quit IRC06:20
*** limao has joined #tripleo06:20
*** shardy has joined #tripleo06:23
*** oshvartz has joined #tripleo06:23
*** nyechiel has joined #tripleo06:23
*** bana_k has joined #tripleo06:24
*** jprovazn has joined #tripleo06:25
*** jbadiapa has joined #tripleo06:34
bandinimorning06:34
bandinimatbu: ever seen this one? http://paste.openstack.org/show/582501/06:34
bandiniProperty KeystoneCredential0 not assigned when doing upgrades06:35
*** aufi has joined #tripleo06:35
openstackgerritMerged openstack/python-tripleoclient: Add missing unit tests for the 'configure' workflows  https://review.openstack.org/36933206:35
jaosoriorbandini: that property was introduced very recently to puppet-keystone06:37
*** aufi has quit IRC06:38
*** aufi has joined #tripleo06:38
bandinijaosorior: thanks. This might be a problem, because we this is during upgrade where we have new tht but the older puppet modules on the overcloud (it is the init step of the upgrade)06:39
* bandini looks06:39
matbubandini: /me looks06:39
ccamachomorning guys06:39
matbubandini: nop not yet :)06:40
bandinimatbu: be quick in hitting it, otherwise I feel lonely :D06:40
jaosoriorshardy: hey dude are you around yet>06:40
socialmoin06:40
jaosoriorshardy: I'm adding a script to an openstack service. But since the script has not been packages it fails when I do --delorean-build. How do I address that?06:41
matbubandini: lol yes06:43
*** apetrich has quit IRC06:44
*** apetrich has joined #tripleo06:46
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Use the passed in workflow when creating or updating a plan  https://review.openstack.org/37420106:50
openstackgerritSaravanan KR proposed openstack/os-net-config: WIP: Handle deployment update for DPDK nic changes  https://review.openstack.org/37368006:50
*** bana_k has quit IRC06:52
openstackgerritSaravanan KR proposed openstack/os-net-config: WIP: Handle deployment update for DPDK nic changes  https://review.openstack.org/37368006:52
openstackgerritMerged openstack/tripleo-ui: Retrieve zaqar websocket url from keystone  https://review.openstack.org/37249906:55
*** limao has quit IRC06:58
shardyjaosorior: Hi - you need to submit a patch to get the script added to RDO packaging AFAIK07:03
shardyI think the repo where the specs are maintained moved a while back, so I'd ask the latest process in #rdo07:03
*** limao has joined #tripleo07:04
matbubandini: is there a fix for "Could not fetch contents for file:///home/stack/tripleo-heat-templates/puppet/post.yaml" ?07:05
jaosoriorshardy: but I'm still developing that script. I'm trying to test the deployment and the script at the same time :/07:05
bandinimatbu: there is a workaround: pushd <tht-directory>; swift download overcloud; popd07:05
bandiniit got me past the issue07:05
shardymatbu: see https://bugs.launchpad.net/tripleo/+bug/162472707:05
openstackLaunchpad bug 1624727 in tripleo "Could not fetch contents for file:///home/stack/tripleo-heat-templates/puppet/post.yaml" [Critical,Triaged]07:05
matbushardy: bandini thx07:06
shardymatbu: can you please add details of how you're reproducing to the bug07:06
shardyI still cannot reproduce the issue locally, but am aiming to figure out a fix when I can07:06
* shardy will probably have to rebuild his undercloud07:06
matbushardy: i'm wondering, i used to copy the tht dir in $HOME07:07
shardyjaosorior: you can test locally with a locally built package or DeployArtifacts, but in the gate, we test with packages07:07
shardyso I'm not sure what to tell you07:07
shardyadd a placeholder script, add it to the package, then iterate on it?07:07
shardyor prove it locally then add it to the package07:07
matbushardy: but it's producible in the upgrade M to N workflow07:08
shardymatbu: are you sure you have the latest tripleo-common?07:08
jaosoriorshardy: yeah, I'm talking about local testing07:08
shardythere was a bug specific to updating the plan which d0ugal fixed recently07:08
shardymatbu: but thanks, I'll try again w/upgrade/update and see if I can reproduce07:08
* d0ugal hopes he fixed them all07:08
jaosoriorshardy: not sure where the spec is for the package in the tripleo.sh --delorean-build workflow07:08
matbushardy: i deploy a "current-passed-ci" , so something probably not really up to date07:09
shardymatbu: ack, sounds like you may not have the fixes for update yet then07:09
shardymatbu: can you confirm the tripleo-common version?07:09
shardy(and tripleoclient)07:09
shardyd0ugal can then help figure out if you have the needed fixes (sounds like probably not)07:10
matbushardy: python-tripleoclient-5.1.1-0.20160920215837.fdbb7be.el7.centos.noarch openstack-tripleo-common-5.1.1-0.20160920134327.2d87e96.el7.centos.noarch07:10
*** chem has joined #tripleo07:11
*** zoli_gone-proxy is now known as zoliXXL07:11
openstackgerritDougal Matthews proposed openstack/tripleo-common: Fix the default plan creation  https://review.openstack.org/37134707:12
*** zoliXXL is now known as zoli|wfh07:12
openstackgerritDougal Matthews proposed openstack/instack-undercloud: Ensure that the default plan was created successfully  https://review.openstack.org/37344607:15
shardyjaosorior: you can build the package directly via dlrn in dev mode, then iterate on the spec file locally I think07:16
*** akuznetsov has joined #tripleo07:16
shardy. ~/tripleo/delorean/venv/activate && dlrn --config-file projects.ini --dev --package-name openstack-keystone07:16
shardyfor example07:16
shardyyou can see where it's pulling down the packaging stuff from07:16
b00tcatHi, can someone review this? https://review.openstack.org/#/c/373352/ :-) it's a biggie though07:17
shardyb00tcat: will do, thanks for the update :)07:18
openstackgerritDougal Matthews proposed openstack/tripleo-common: Remove the old, deprecated Mistral action names  https://review.openstack.org/36652907:20
*** akuznetsov has quit IRC07:20
*** ebarrera has joined #tripleo07:23
*** jlinkes has joined #tripleo07:23
*** zoli|wfh is now known as zoli_gone-proxy07:24
*** zoli_gone-proxy is now known as zoliXXL07:30
openstackgerritDougal Matthews proposed openstack/tripleo-common: Remove the unused service_host arg from node registration  https://review.openstack.org/32603607:32
*** jpena|off is now known as jpena07:34
*** chem has quit IRC07:35
*** chem has joined #tripleo07:35
*** jpich has joined #tripleo07:39
*** abehl has joined #tripleo07:41
*** akuznetsov has joined #tripleo07:50
*** apetrich has quit IRC07:50
*** apetrich has joined #tripleo07:52
*** akuznetsov has quit IRC07:55
*** ohamada has joined #tripleo07:55
*** akuznetsov has joined #tripleo07:55
*** panda|zZ is now known as panda07:56
*** dsariel has joined #tripleo07:59
*** oshvartz has quit IRC08:02
*** athomas has joined #tripleo08:03
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Upgrade needs to create the keystone credential  https://review.openstack.org/37460008:04
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Upgrade needs to create the keystone credential  https://review.openstack.org/37460008:04
*** rasca has quit IRC08:07
*** rasca has joined #tripleo08:08
dsarielhi, is there a way to install controller, compute and ceph on the same node (like packstack all-in-one)?08:08
shardydsariel: Yes, you can add the compute & ceph OSD services to the ControllerServices list08:10
shardydsariel: e.g see:08:10
shardyhttps://github.com/openstack-infra/tripleo-ci/blob/master/test-environments/multinode.yaml#L608:10
shardyhttp://hardysteven.blogspot.co.uk/2016/08/tripleo-composable-services-101.html08:11
dsarielshardy, awesome! thanks a lot :-)08:12
shardydsariel: np08:12
shardydsariel: we're testing the controller+compute single node setup in CI, but not with the ceph OSD co-located08:13
shardyin theory it should work tho, let us know how you get on :)08:13
sshnaidmslagle, hi08:18
openstackgerritDougal Matthews proposed openstack/tripleo-common: Return the result of create_plan in create_deployment_plan workflow  https://review.openstack.org/37134808:18
d0ugalsshnaidm: I'd guess slagle wont be awake for a few hours yet - unless he is in a different timezone than normal.08:20
sshnaidmd0ugal, ok :)08:20
*** dmacpher has quit IRC08:22
*** dsneddon has quit IRC08:24
*** tremble has joined #tripleo08:37
*** dsneddon has joined #tripleo08:38
*** derekh has joined #tripleo08:41
*** zoliXXL is now known as zoli|wfh08:41
jaosorioranybody with a running deployment that can answer something quick for me?08:44
*** pkovar has joined #tripleo08:46
*** pkovar has quit IRC08:47
*** pkovar has joined #tripleo08:48
jistrjaosorior: undercloud or overcloud? i have only undercloud atm08:48
*** akuznetsov has quit IRC08:48
*** akuznetsov has joined #tripleo08:48
*** akuznetsov has quit IRC08:49
jaosoriorjistr: undercloud is fine08:50
jaosoriorjistr: do you have glance-api-paste.ini in /etc/glance ?08:51
jistrjaosorior: hmm no it's not there08:51
jaosoriorjistr: where is glance getting that config from?08:52
jistrhmm i think there are defaults somewhere08:52
jaosoriorJokke_: are you around?08:52
*** hewbrocca-afk is now known as hewbrocca08:52
*** akuznetsov has joined #tripleo08:53
mcorneajistr: jaosorior I think this is the default: /usr/share/glance/glance-api-dist-paste.ini08:55
jistrmcornea: ah yea, thanks!08:55
*** akuznetsov has quit IRC08:57
jaosoriormcornea: thanks08:58
jaosoriorjistr, mcornea: Do you have any idea how we point to that in puppet? can't seem to find it08:59
jistri don't think we point to that in puppet, those are not meant for editing afaik... i'm not sure how are they applied in glance though, e.g. if they are merged with the content in /etc, or if they are only used if there's not a matching "overriding file" in /etc09:00
*** rasca has quit IRC09:10
*** dsneddon has quit IRC09:10
*** milan has joined #tripleo09:10
*** dsneddon has joined #tripleo09:11
*** rasca has joined #tripleo09:14
*** yamahata has quit IRC09:17
*** akuznetsov has joined #tripleo09:19
*** akuznetsov has quit IRC09:23
*** stendulker has joined #tripleo09:27
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: POC: WIP: oooq undercloud install  https://review.openstack.org/35891909:28
*** milan is now known as milan|f00d09:28
*** oshvartz has joined #tripleo09:29
*** chem has quit IRC09:37
*** chem has joined #tripleo09:38
*** chem has quit IRC09:39
*** chem has joined #tripleo09:39
*** andrey-mp has joined #tripleo09:39
tbarronmarios: i updated https://review.openstack.org/#/c/358525 with results of my cephfs/manila overcloud deploy attempt09:39
andrey-mpHi! anybody knows where to get list of all nodes in post hook on cotroller?09:40
tbarronmarios: /etc/manila/manila.conf is getting edited as we expect and systemd controlled services (api, scheduler) are started but not pcs controlled share service, perhaps b/c the deploy itself is failing earlier, on the db sync: error running 'manila manage-db sync'.09:41
tbarronmarios: that db sync code looks to be in manila-puppet ?  merged a long time ago.09:42
shardyandrey-mp: do you need the ips/names running a specific service, or really all nodes?09:42
andrey-mpshardy: names of nodes09:42
shardyandrey-mp: the hosts file contains all the nodes, and we write hiera with $service_node_names for each service09:43
andrey-mpshardy: then i can parse name and do specific steps for each09:43
tbarronmarios: one thing I notice is that cinder runs it (on the lead controller) when api service is started, but manila is waiting to run the db sync when scheduler service is started09:43
mariostbarron: great thanks for update ... I don't think I'll have time to look at that ftr... I believe Jokke_ is driving that09:43
tbarronmarios: the cinder way makes sense intuitively since api service starts first and it interacts with DB09:43
panda2016-09-22 09:03:25Z [overcloud-Controller-cuwh4w2oe7qy-0-6ohgoxw2xx56]: CREATE_FAILED  Engine went down during stack CREATE09:43
pandawut ?09:43
tbarronmarios: sure, i will ping Jokke_09:44
shardypanda: did you run out of RAM?09:44
tbarronJokke_: updates on cephfs backend testing for manila in backlog above ^^^^^^09:44
shardythe OOM killer often decides heat-engine is a good candidate for killing when that happens09:44
pandashardy: yep, dmesg is full of OOM ... :(09:44
mariostbarron: thanks - but getting the info on the review would be useful regardless, and then you can just point at it, instead of repeating yourself :)09:44
pandashardy: I thought 32G were enough for ha09:44
*** florianf has quit IRC09:45
shardypanda: how much memory does your undercloud have?09:45
tbarronmarios: yeah, i've got the info in the review09:45
mariostbarron: right thanks just looked, i will try and cycle back to the review later and see if i can spot something09:45
shardyyou could add some swap if it's nearly enough09:45
pandashardy: 8G09:45
tbarronmarios: on the review that's outstanding even if the issue is elsewhere09:45
mariostbarron: so you only get this trying manila-cephfs backend?09:45
shardypanda: I run a 32G dev box, and IMO it's not enough to do a 3 controller HA deployment with compute09:46
shardyyou might be better doing a single controller with pacemaker enabled09:46
tbarronmarios: no, i see it with netapp too, just had been hitting blockeers that kept the deploy from getting this far before09:46
tbarronmarios: i'll put a note there09:46
andrey-mpshardy: thanks. hosts file realy contains all nodes. but service_node_names I can't find in my installation (I installed Mitaka version...)09:46
mariostbarron: i see so then i will try harder to have a look today :)09:46
pandashardy: that's not what I need to reproduce ... :(09:47
tbarronmarios: indicating that the issue is not cephfs specific09:47
shardyandrey-mp: ah, yeah the node_names thing is new for Newton09:47
pandashardy: but thanks for the advices09:47
andrey-mpshrady: ok, thank you09:47
shardypanda: you might want to consider disabling some services in the overcloud deployment then09:47
shardyso you can do a minimal 3 node HA deployment09:47
pandashardy: how ?09:48
*** florianf has joined #tripleo09:49
shardyhttp://hardysteven.blogspot.co.uk/2016/08/tripleo-composable-services-101.html09:49
tbarronmarios: i may play around with manila-puppet a bit to make the manila stuff look more like cinder unless you think that's a dead end approach09:49
pandashardy: wonderful, thanks!09:49
mariostbarron: i think it would be easier to fix the problem in the tht/puppet-tripleo (s/easier/faster) than trying to fix something in puppet-manila... unless it really is broken. so the answer may be both, tht/tripleo in the shorter term (like rc2 fix )09:50
openstackgerritJiri Stransky proposed openstack-infra/tripleo-ci: Sync worker-config.yaml with low-memory-usage.yaml  https://review.openstack.org/37466009:50
tbarronmarios: k, good advice!09:50
openstackgerritMerged openstack/tripleo-heat-templates: Make sure major upgrade script fails.  https://review.openstack.org/36662309:52
tbarronmarios: the only reason i was looking at puppet-manila was that i only saw the DB sync code there, not in THT, but I'll look again09:52
*** tosky has joined #tripleo09:57
pandaI'm also seeing these in today's ha periodic job Error: /Stage[main]/Swift::Storage::Account/Swift::Storage::Generic[account]/Swift::Service[swift-account-replicator]/Service[swift-account-replicator]: Cannot allocate memory - fork(2)m10:00
pandamemory requirements are generally increasing ?10:01
*** andrey-mp has quit IRC10:01
shardypanda: folks keep integrating new services, and quite a few ended up enabled by default10:03
shardyso, yes, memory requirements have crept up over the last couple of cycles10:04
shardyhappily we now have a way for folks to easily disable stuff they don't need10:04
shardyif it's in a CI job we may need to investigate further tho, as we shouldn't have suddenly needed more memory very recently10:05
pandashardy: is there something we can disable in the ha periodic job ? There coverage should be considered too.10:06
shardypanda: hard question to answer, I would assume with the new HA lite architecture it's less critical we have full coverage of those services not managed by pacemaker10:08
shardyand/or we can have a mixture of scenarios which achieve full coverage10:08
shardythat said, if the job was working, and now it's not, there may be a regression somewhere10:08
*** andrey-mp has joined #tripleo10:08
shardyI'd suggest doing a local HA deployment and looking at where the memory is going before stuff gets killed10:09
openstackgerritJulie Pichon proposed openstack/python-tripleoclient: Display error message when socket is closed  https://review.openstack.org/37466910:13
openstackgerritJulie Pichon proposed openstack/python-tripleoclient: Provide more information when 'node provide' fails  https://review.openstack.org/37467010:13
*** jprovazn has quit IRC10:15
pandashardy: that is launch a deploy and look at ps continuously until we get OOM or the fork error, or configure dstat to give per-process statistics (don't know if it's possible)10:17
shardypanda: your choice, we run dstat in CI, but I tend to use top or htop locally10:19
shardyspeaking of which gnocchi-statsd is hogging all my CPU on an otherwise idle overcloud :(10:19
jaosoriorayoung: let me know when you're online. I need some httpd help10:21
*** limao has quit IRC10:21
*** andrey-mp has quit IRC10:21
shardyhttps://bugs.launchpad.net/tripleo/+bug/162647310:22
openstackLaunchpad bug 1626473 in tripleo "gnocchi is eating all my CPU :(" [High,Triaged]10:22
*** akuznetsov has joined #tripleo10:22
shardybe interested to see if anyone else can reproduce that10:22
shardypradk: ^^ FYI10:22
*** dtantsur|afk is now known as dtantsur10:26
*** akuznetsov has quit IRC10:27
*** fzdarsky has joined #tripleo10:32
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Configure heat engine to not use convergence  https://review.openstack.org/33389010:34
openstackgerritAttila Darazs proposed openstack/tripleo-quickstart: Stop using deprecated network range  https://review.openstack.org/34344310:34
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: Stacks and Resources data storing in app state  https://review.openstack.org/37422710:37
openstackgerritMerged openstack/tripleo-quickstart: Add settings to general config  https://review.openstack.org/37114410:38
*** mburned_out is now known as mburned10:41
bandinimarios: might want to keep an eye on this one https://bugs.launchpad.net/tripleo/+bug/162645210:42
openstackLaunchpad bug 1626452 in tripleo "M/N upgrades - Error: Could not find class ::tripleo::trusted_cas" [Critical,New]10:42
*** dprince has joined #tripleo10:42
mariosbandini: thanks10:43
mariosbandini:ouch that one sounds messy.10:43
bandinimarios: yeah I am a little concerned tbh. will keep trying until I get at least the init step working10:44
mariosbandini: so we may need to OS::Heat::None all the services...10:44
bandiniam afraid so, unless we can come up with a better plan10:44
mariosbandini: 'new' i mean... yeah... ok, well thanks for the heads up for now10:45
bandininp ;)10:45
mariosbandini: about that other one... i am guessing obviously yes we want the fsid to stay the same @ https://review.openstack.org/#/c/374600/210:46
mariosbandini: so it may/should be its own review, esp if it becomes more involved to make it so10:47
*** hjensas has joined #tripleo10:47
*** kberger has quit IRC10:47
bandinimarios: let's see. if it is simple enough we might want to keep it in a single review otherwise we can split, sure10:48
bandinifor now I am doing just workaround until I at least get the init step working10:48
*** kberger has joined #tripleo10:48
bandinithen I will start working on proper patches for all the issues10:48
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-common: Updated from global requirements  https://review.openstack.org/37372210:48
mariosbandini: thanks... seems your about a day ahead of the osp10 packages atm :) we started hitting the things you complained about yesterday morning10:48
mariosbandini: so far nothing _too_ nasty... even that OS::Heat::None one in the worst case is just  a list in the env file10:49
bandinimarios: yeah let's cross fingers. I will join the call today (sorry about yesterday was a bit poorly)10:50
mariosbandini: scrum at ... right was going to say, welcome to join, i will harrass you later too10:50
bandiniaye ;)10:51
openstackgerritMerged openstack/python-tripleoclient: Use the passed in workflow when creating or updating a plan  https://review.openstack.org/37420110:53
*** ohamada has quit IRC10:55
*** ohamada has joined #tripleo10:55
pandawhen is newton release deadline ?10:55
*** hjensas has quit IRC10:57
*** hjensas has joined #tripleo10:59
*** hjensas has quit IRC10:59
*** hjensas has joined #tripleo10:59
pandamaybe we can just bump undercloud memory to 12G temporarily until release ..11:00
*** pkovar has quit IRC11:01
pandasshnaidm: ^11:01
pandaand then take countermeasures11:02
*** thrash|g0ne is now known as thrash11:02
*** fzdarsky_ has joined #tripleo11:06
*** pkovar has joined #tripleo11:06
*** hjensas has quit IRC11:07
sshnaidmpanda, I don't know, to focus on memory leaking in heat is also the option11:08
pandasshnaidm: I don't know how much time do we have to do this ... for example I cant get result on the ipv6 ha job right now, and it should be tested before relase11:08
*** dprince has quit IRC11:10
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Add metricd workers support in gnocchi  https://review.openstack.org/37470411:11
openstackgerritCarlos Camacho proposed openstack-infra/tripleo-ci: Setting to 1 GnocchiMetricdWorkers  https://review.openstack.org/37470911:14
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: TEST: DONT RECHECK: periodic jobs  https://review.openstack.org/35921511:15
*** jprovazn has joined #tripleo11:15
*** fzdarsky_ has quit IRC11:17
*** fzdarsky has quit IRC11:17
*** fzdarsky has joined #tripleo11:17
*** lucas-afk is now known as lucasagomes11:18
sshnaidmpanda, I think it's better to discuss it here11:19
openstackgerritCarlos Camacho proposed openstack/tripleo-heat-templates: Add metricd workers support in gnocchi  https://review.openstack.org/37470411:19
*** jistr is now known as jistr|mtg11:20
openstackgerritCarlos Camacho proposed openstack-infra/tripleo-ci: Setting to 1 GnocchiMetricdWorkers  https://review.openstack.org/37470911:21
openstackgerritGabriele Cerami proposed openstack-infra/tripleo-ci: Add IPv6 network configuration for ipv6 job types  https://review.openstack.org/36367411:21
*** akuznetsov has joined #tripleo11:23
*** lblanchard has quit IRC11:23
*** stendulker has quit IRC11:24
shardypanda: https://releases.openstack.org/newton/schedule.html11:25
shardypanda: we're aiming to cut RC2 (and branch stable/newton) next week, the the final release will hopefully be aligned with the main newton release during w/c 3rd October11:27
shardywe are following the cycle-trailing model, so it's permitted to declare our final release later for some repos, but it'd be best if we can release very close to the time of the rest of OpenStack11:27
*** akuznetsov has quit IRC11:28
pandasshnaidm: I don't think we can solve the memory crysis in one week efficiently for everyone ... let's hope it has been a one time problem, if not it's probably best to increase memory for this last week11:35
pandashardy: thanks.11:35
openstackgerritMerged openstack/tripleo-common: Return the result of create_plan in create_deployment_plan workflow  https://review.openstack.org/37134811:38
*** bfournie has quit IRC11:42
*** pkovar has quit IRC11:47
*** ccamacho is now known as ccamacho|lunch11:47
*** coolsvap has quit IRC11:52
*** jpena is now known as jpena|lunch11:53
*** zigo has quit IRC11:54
*** zigo has joined #tripleo11:58
*** zigo is now known as Guest8360111:59
*** trown|outtypewww is now known as trown12:03
*** Guest83601 has quit IRC12:03
*** pkovar has joined #tripleo12:05
*** jayg|g0n3 is now known as jayg12:09
*** mbozhenko has joined #tripleo12:10
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Upgrade needs to create the keystone credential  https://review.openstack.org/37460012:11
*** cdearborn has joined #tripleo12:11
*** zigo_ has joined #tripleo12:12
mbozhenkoHello all. I want to start to work with TripleO and contribute to it. Can anyone kick me into some sort of quick start doc for setting up dev env?12:12
honzajtomasek: do you know if there is a way to get all of the resources for a stack in one API call including all the attributes?  i don't think there is and it's making my code ugly :(12:15
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: Stacks and Resources data storing in app state  https://review.openstack.org/37422712:16
*** zigo_ has quit IRC12:17
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: Switch default image location back to CentOS CDN  https://review.openstack.org/37475312:17
*** zigo_ has joined #tripleo12:18
*** rhallisey has joined #tripleo12:19
*** milan|f00d is now known as milan12:20
*** rodrigods has quit IRC12:24
*** rodrigods has joined #tripleo12:24
*** akuznetsov has joined #tripleo12:25
bandinimarios: do you have any tips as to how I can find all the services affected by https://bugs.launchpad.net/tripleo/+bug/1626452, maybe I should just noop them all in the init step?12:27
openstackLaunchpad bug 1626452 in tripleo "M/N upgrades - Error: Could not find class ::tripleo::trusted_cas" [Critical,New]12:27
shardymbozhenko: Hi, welcome!12:27
shardymbozhenko: We have some docs here http://docs.openstack.org/developer/tripleo-docs/12:27
bandinimarios: so far I got three (snmp, trusted_cas and libvirt)12:27
shardymbozhenko: if you want a quick summary of how I set up my environment (uses a script from our CI to automate a few steps from the docs), see here:12:27
shardyhttp://paste.fedoraproject.org/432545/47454728/12:28
shardymbozhenko: ideally you need a machine with >= 32G ram to test the virt setup I describe there12:28
*** florianf has quit IRC12:29
openstackgerritDougal Matthews proposed openstack/tripleo-common: Remove the old, deprecated Mistral action names  https://review.openstack.org/36652912:29
shardymbozhenko: also there is the tripleo-quickstart tool, which is an ansible approach to bootstrapping a tripleo environment12:29
shardyhttps://github.com/openstack/tripleo-quickstart/12:29
*** akuznetsov has quit IRC12:30
*** bfournie has joined #tripleo12:35
jtomasekhonza: there is not, how many of the resources do you actually need?12:35
jtomasekhonza: I thought, you need just a specific one12:35
*** pkovar has quit IRC12:36
honzajtomasek: just one, code is not that bad --- there just needs to be some logic in the reducer to prevent overwriting12:36
openstackgerritChris Jones proposed openstack/tripleo-quickstart: Add ssh option IdentitiesOnly.  https://review.openstack.org/37476912:36
*** jistr|mtg is now known as jistr12:36
*** gfidente has joined #tripleo12:37
jtomasekhonza: you dispatch the action for that single resource and in reducer you can update that specific resource with additional attributes12:37
jtomasekhonza: what do you mean by overwriting?12:37
honzajtomasek: yep, but that API call usually comes back before the fetchResources one so fetchResources overwrites your data12:38
honzajtomasek: no big deal12:38
honzarace conditions ftw12:38
*** jeckersb_gone is now known as jeckersb12:38
jtomasekhonza: I see12:39
mariosbandini: so noop all in the init should be ok because we don't run the postconfig ... *i think* ... not sure how we'd come up with a list of new services... thinking12:43
mariosbandini: i mean i know we don't run the postconfig cos we noop it, but i mean i think it should be ok12:43
mariosbandini: i.e wont delete all the things/services12:44
mariosbandini: we could compare /var/lib/tripleo/installed-packages/overcloud_controller_pacemaker1 2 3 etc on the nodes for list of thigns already there12:45
*** pkovar has joined #tripleo12:45
mariosbandini: but noop all sounds better if it works for now12:45
bandinimarios: yeah I just did a noop for all of them and I have the init step terminating successfully12:45
mariosbandini: cool. do you still possess an overcloud? with things running on it?12:45
bandinimarios: looking right now ;)12:46
bandinimarios: yep still functional12:46
mariosbandini: thanks bandini even more beer for you12:46
*** andrey-mp has joined #tripleo12:47
mbozhenkoshardy: thank you!12:48
*** zigo_ has quit IRC12:48
mariosbandini: but for converge we would need a list of things not there already12:49
mariosbandini: so we will still have to solve that12:49
mbozhenkoshardy: I will try and let you know on the outcomes12:49
bandinimarios: but during converge the puppet-tripleo rpms are the newton ones so it should not matter, right?12:50
*** rlandy has joined #tripleo12:50
openstackgerritMichele Baldessari proposed openstack/tripleo-heat-templates: Noop all the TripleO::Services during the major-upgrade-pacemaker-init step  https://review.openstack.org/37478812:50
mariosbandini: reading bug again, i thought it was a case of new services getting started ... ah no it is missing puppet-tripleo being pulled in by the new service templates12:51
*** zigo_ has joined #tripleo12:51
mariosbandini: and they aren't nooped./.. yeah fine so when we have the puppet-tripleo should be good12:51
shardybandini: Question re ^^12:51
shardybandini: why noop the services, vs the deployment applying them?12:51
shardythe actual services don't do anything except contain data12:51
mariosshardy: this is just init step that fails and it fails because the puppet-tripleo isn't there yet, in this step of the upgrade12:52
shardyprobably you want to noop the deployment steps in puppet/post.yaml (or rather that entire stack)12:52
mariosshardy: so the tht is referencing classses in puppet-tripleo that aren't there12:52
shardymarios: but those resources don't apply puppet12:52
shardythey just contain some text12:52
b00tcatIs there any way to re-run *only* the puppet code after having provisioned a heat stack with tripleo? provisioning the overcloud takes a huge amount of time, and if I have -let's say- a syntax error it'd be a pita ^^"12:52
shardyhttps://github.com/openstack/tripleo-heat-templates/blob/master/overcloud.j2.yaml#L45512:53
mariosshardy: bandini so am not clear why are we getting the error for the missing class then still12:53
shardymarios: Can we instead Noop OS::TripleO::PostDeploySteps?12:53
mariosshardy: we already do12:53
shardyhmm12:53
*** Goneri has joined #tripleo12:53
bandinishardy: so we already noop some stuff https://github.com/openstack/tripleo-heat-templates/blob/master/environments/major-upgrade-pacemaker-init.yaml12:54
mariosshardy: i mean we oh i mean we noop the ControllerPostDeployment... all nodes12:54
shardymarios: that may be the issue, we've reworked how *PostDeployment works12:54
bandinishardy: so you think just nooping OS::TripleO::PostDeploySteps will do?12:54
marioshttps://github.com/openstack/tripleo-heat-templates/blob/master/environments/major-upgrade-pacemaker-init.yaml like this shardy12:54
shardyso you may need to noop the PostDeploySteps instead12:54
mariosshardy: right sounds like it may be it :)12:54
shardymarios: sec, let me show you the patch12:54
jistrhehe i was just looking into it :)12:55
jistrindeed12:55
mariosbandini: so it sounds like it was actually trying to deploy the services12:55
jistryea12:55
mariosbandini: cos our 'don't run the post deploy config  puppet stuff' isn't working now12:55
jistri have a patch actually :)12:55
shardyhttps://review.openstack.org/#/c/365763/14/overcloud.yaml12:55
mariosjistr: cool link12:55
jistrmarios: lemme write a commit message :)12:56
shardymarios: that's probably your issue, nooping PostDeployment stuff won't do anything anymore12:56
mariosjistr: haha :D12:56
bandinijistr: feel free to slap it on top of https://review.openstack.org/#/c/374788/12:56
mariosshardy: fantastic thanks shardy bandini jistr12:56
bandinithanks shardy jistr12:56
shardynp! :)12:56
gfidenteccamacho|lunch, thanks for the ext4/liberty comments!12:57
gfidentethanks a lot12:57
*** david-lyle has joined #tripleo12:58
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: No-op Puppet for upgrades/migrations according to composable roles  https://review.openstack.org/37479112:58
jistrmarios bandini ^^12:58
gfidenteI am still unsure at this point why what seems a grub issue should not be seen with newer versions of openstack12:58
*** andrey-mp has left #tripleo12:58
jistrmeh copy pasta12:58
gfidentebut I think we spent enough time and resources on it, given it's going EOL12:58
EmilienMpanda: something must goes wrong in ipv6 experimental job, it's timeouting all the time :(12:59
EmilienMbnemec: were you lucky in your tests?12:59
openstackgerritJiri Stransky proposed openstack/tripleo-heat-templates: No-op Puppet for upgrades/migrations according to composable roles  https://review.openstack.org/37479112:59
mariosjistr: thanks12:59
matbubandini: cool for https://review.openstack.org/#/c/374788 did you test it ?12:59
jistrmarios, bandini ^^ more like that12:59
*** ccamacho|lunch is now known as ccamacho12:59
jaosoriorEmilienM: have you ever manipulated HTTP headers with httpd?12:59
matbubandini: and does the # close bug works better ? ;)12:59
*** cylopez has joined #tripleo12:59
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: WIP - Deploy TripleO with Puppet 4  https://review.openstack.org/37120912:59
mariosjistr: heh, i went looking for puppet/post.yaml i thought it was a special noop template :)13:00
bandinimatbu: I tested it but we're going for jistr review here https://review.openstack.org/37479113:00
EmilienMjaosorior: with apache you mean?13:00
jistrmarios: yea just my copypasta :))13:00
ccamachogfidente np man, just bad I dont have something to fix it :(13:00
jaosorioryeah13:00
*** jpena|lunch is now known as jpena13:00
EmilienMjaosorior: you need a l7 proxy13:01
EmilienMmod_headers does it in apache13:01
matbubandini: jistr ha cool :)13:01
EmilienMit's well documented here: http://httpd.apache.org/docs/2.0/mod/mod_headers.html13:01
jaosoriorEmilienM: yeah, I've been checking that out13:02
EmilienMjaosorior: I used it in the past yes13:02
EmilienMjaosorior: let me know if you need more help13:02
pandaEmilienM: this morning even periodic jobs had memory problems during tests. I tried to reproduce the build locally, but apparently 8G for the undercloud are not enough anymore, and I had OOM killer shut down heat-engine13:02
jaosoriorEmilienM: but haven't figured out how to get apache to accept underscores in HTTP headers in an elegant way13:02
jaosoriorthe only way I've found out is to do a bunch of setenvif with each header, and then add it with mod_headers13:02
jistrbandini, marios, matbu: i'm trying to think about some possible drawbacks, hopefully there wouldn't be any, as the puppet only gets data generated from the service chains etc., but in itself it doesn't produce any data to be consumed elsewhere, so i hope it would work the same way as we had for K/L and L/M, simply not running the puppet, but still producing the Hiera etc. Hopefully that's ok even now that we13:03
jistruse composability.13:03
jaosoriorEmilienM: basically I'm trying to get Glance to work with httpd... but glance uses a bunch of headers that contain underscores13:03
jaosoriorEmilienM: so apache filters those13:03
jaosoriorEmilienM: the only documented way to fix that is http://httpd.apache.org/docs/trunk/env.html#fixheader which I would have to do for each header...and I was looking for a more compact solution13:03
EmilienMpanda: ok13:03
mariosjistr: it looks much simpler... if the end result is same (none of the post-config puppet run) it should be fine. So on converge we will end up with any new services that weren't default/existing in 9 as well13:03
bandinijistr: it is certainly fine for the init step, I will give feedback as soon as I manage to get past init ;)13:04
mariosjistr: (as side note)13:04
EmilienMjaosorior: the modea headers can help you to do that13:04
matbubandini: jistr marios yep me too, testing the tripleoclient fix right now13:04
jaosoriorEmilienM: yeah... I've been trying for 3 hours now. Haven't gotten mod_headers to do what I need13:05
mariosjistr: yeah this is what shardy was saying earlier... the tht will contain/define the new services, all of them, including ones we didn't have before. but it is just the config so as long as that isn't passed to puppet to run it should be fine13:05
pandaEmilienM: we were  discussing about bumpping temporarily the undercloud memory for tripleo-ci to 12G until release, and then change the jobs configuration, but we'lre trying to understand if it's only a peek of memory or it's a permanent requirement13:05
jistrmarios: yea hopefully :) Btw any idea what services are the ones that are different between M and N? i saw a BZ email recently where tosky brought up that Sahara actually was default before but isn't anymore, and that it will probably cause some issues...13:06
toskyjistr: iirc Sahara is the only one which was enabled by default and not core, but maybe there was nother13:07
mariosjistr: no this is what bandini and i were discussing earlier, when we thought we should noop individual 'new' services. we could compare the service chains/enabled_services with the lists from  /var/lib/tripleo/installed-packages/overcloud_controller_pacemakerX13:07
jistrhmm right13:08
*** jcoufal has joined #tripleo13:08
*** coolsvap has joined #tripleo13:08
*** zoli|wfh is now known as zoli|lunch13:09
mariosjistr: but perhaps we don't need to do that. i mean we let the config deploy 'stock' newton new services and all. or if we need to we can noop things i guess13:09
mariosjistr: i mean on converge13:09
jistrmarios: yea deploying new could hopefully be ok, it's maybe potential removing that we might need to pay more attention to13:10
jistrit's probably a bit complicated due to the fact that some people might want to remove Sahara while others would like to keep it...13:10
toskymarios: also, you may want to apply the new configuration before restarting the services13:11
mariosjistr: keeping it *should* be ok, assuming we can just make it one of the enabled services13:12
mariostosky: do you mean, with sahara from M, if you want to upgrade to N with sahara, yu also need som more config? like a migration?13:12
*** jaosorior has quit IRC13:12
mariostosky: yes i see the note in the issue13:13
*** jaosorior has joined #tripleo13:13
toskymarios: if you deployed M, you have a certain set of plugins in sahara.conf (explicitely written down by TripleO); if you upgrade, even if you include the environment file, it seems that the new configuration is applied after the restart, not before13:13
mariosjistr: tosky seems we are already tracking this on the lifecycle readme issues13:13
toskyand the restart fails because the set of available plugins in N is different (one is lacking)13:13
toskymarios: partially, it does not cover the case "we want to remove it" and apparently even the "yes, please keep it" case is buggy13:14
mariostosky: right this is what omri was hitting earlier i think (the plugin issue https://bugs.launchpad.net/tripleo/+bug/1615056 )13:15
openstackLaunchpad bug 1615056 in tripleo "M/N upgrade sahara-api fails to start." [Undecided,Fix released] - Assigned to Emilien Macchi (emilienm)13:15
*** akuznetsov has joined #tripleo13:15
toskymarios: ah, a manual fix13:15
toskya bit... hackish13:15
toskybut I see13:15
openstackgerritMerged openstack/tripleo-quickstart: Switch default image location back to CentOS CDN  https://review.openstack.org/37475313:15
mariostosky: no no we won't go with a manual fix,13:16
mariostosky: i am just saying we are tracking this and we still need to work it out13:16
toskymarios: sorry, bad working on my side; the call to crudini is "manual" compared to the proper "converge the configuration to this point" provided by puppet13:16
EmilienMmatbu: https://review.openstack.org/#/c/374600/13:17
mariostosky: oh to remove the plugin you mean13:17
EmilienMwhy do you patch stable/newton?13:17
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-common: Updated from global requirements  https://review.openstack.org/37372213:17
toskymarios: yes13:17
openstackgerritJohn Trowbridge proposed openstack/tripleo-quickstart: Add swap to the undercloud when using an overcloud image  https://review.openstack.org/37480913:18
mariostosky: well it has to happen during the controller upgrade step otherwise we can't bring up sahara-api13:19
*** akuznetsov has quit IRC13:19
*** kjw3 has joined #tripleo13:19
matbuEmilienM: well, cause i hit the issue, but yes i should patch master13:19
mariostosky: converge won't happen until after controllers, then computes and ceph nodes updated so sahara-api would be down that whole time13:19
toskymarios: right; but then why isn't the new configuration applied directly before restartin the services? I mean, the configuration coming from hiera?13:20
mariostosky: if mean if we just let it happen on converge as defined by the newton templates13:20
mariostosky: we don't want to apply the config at that point we need that to happen everywhere at once, on converge13:20
*** pgadiya has quit IRC13:21
toskymarios: and then you restart everything again?13:21
mariostosky: e.g. what if your config changes passwords? if we ran config on controller upgrade then we'd get the new passwords only on controllers13:21
toskyuhm, I see (I think)13:21
toskyso with that crudini line the case "please keep Sahara" should be fixed13:22
mariostosky: yeah there is also a controlled service restart after converge13:22
mariostosky: so it should solve the case of yeah keep sahara13:22
toskystill the case where Sahara needs to be removed should be handled somehow13:22
mariostosky: yes, we are tracking it so it will get done at some point. I mean, if it is OS::Heat::None now it should mean that nothing will be done to it so we may need to manually remove during one of the upgrade steps, probably the controller upgrade13:23
toskymarios: or at least disable it13:24
mariostosky: i mean nothing will be done to the existing installation13:24
rhalliseyd0ugal, hey Dougal, is there a mistral workflow in place that will trigger updates?13:24
rhalliseyor is the workflow to deploy again13:25
shardyrhallisey: the workflow is to run the deploy workflow again I think, with some tweaks to how we prepare/update the plan in tripleoclient13:26
shardywe update the stack instead of creating it13:26
*** fultonj has joined #tripleo13:26
rhalliseyshardy, should we have an update workflow instead?13:27
rhalliseyI do think it's similar to deploy, but the process itself is a series of steps13:28
rhalliseythere is some variation13:28
shardyrhallisey: Yeah, there was some discussion about this yesterday13:28
shardye.g how do we maintain the openstack overcloud update command13:28
shardythat probably will need a different workflow, as you say due to the breakpoints etc13:28
shardyI'm not sure on the status of that, but I have a feeling we have a gap there which needs to be fixed asap13:29
shardyrhallisey: so, we have updates (just update configuration or scale out) == deploy workflow13:29
shardythe update (package update for applying errata) == update workflow13:29
rhalliseyfrom what I can tell, the current update is a redeloy13:29
shardyrhallisey: there's a bunch of stuff in there related to breakpoints which doesn't exist in the overcloud deploy path13:30
shardyand it changes UpdateIdentifier which triggers yum update13:30
*** ayoung has quit IRC13:30
shardybut the actual interaction with heat to start the update is the same as a deploy I guess13:30
rhalliseyyea I agree, I'm just saying the 'old' update path isn't hooked into the new mistra; + swift workflow13:30
socialshardy: slagle: how are composable roles going to update? what'll replace overcloud-without-mergepy.yaml ?13:30
socialah13:30
* social just reads backlog13:31
shardyrhallisey: Yeah, we either need to live with tripleoclient's existing functionality there13:31
shardy(which means updates won't work via the UI)13:31
shardyor wire it in via mistral13:31
shardyyou're right13:31
shardysocial: there's an overcloud.j2.yaml, which is rendered in a mistral action during plan creation13:31
rhalliseyok let's go with mistral then13:31
shardysocial: you can see the rendered version via swift download overcloud overcloud.yaml13:32
EmilienMplease add your rc2 patches on this gerrit topic: https://review.openstack.org/#/q/topic:tripleo/rc213:32
shardyovercloud-without-mergepy.yaml no longer exists, and it's been deprecated for several years13:32
shardywe've got a spurious error coming from tripleoclient tho which needs to be fixed13:32
rhalliseyovercloud-without-mergepy is still a constant O.o13:33
shardyrhallisey: yeah, we need to remove it13:33
rhalliseyshardy, ok will do.13:33
shardyd0ugal: was going to do it I think13:33
rhalliseygotcha13:33
*** ramishra has quit IRC13:34
*** akuznetsov has joined #tripleo13:35
socialshardy: and it's not only a constat but it's used during stack update13:35
rhalliseyI think upgrade references too13:35
*** ramishra has joined #tripleo13:35
shardyYeah, those are bugs in the client13:35
shardybecause we don't test those paths in CI13:35
shardyhttps://review.openstack.org/#/c/365735/13:36
shardyI fixed it there for deploy, but missed that we use it for update13:36
shardyso we'll need to do a similar fix13:36
shardyto read the file from the plan instead of the local disk13:36
*** myoung|gone is now known as myoung13:37
*** pkovar has quit IRC13:37
honzajtomasek: looks like we'll also need rbrady's password patch before we can wrap this up13:37
rbradyhonza, jtomasek: I'm currently trying to figure out why it doesn't pass CI.  will keep you updated13:38
honzarbrady: thanks!13:38
rhalliseyshardy, cool thanks!13:38
openstackgerritMerged openstack/tripleo-quickstart: Teardown libvirt pool: fix pool file removal  https://review.openstack.org/37440813:38
jtomasekhonza: you can set the passwords manually through GUI, it is slightly hard to find them though13:38
jtomasekhonza: most of them are in controller services13:38
socialshardy: how this will work if I have newton undercloud and want to update mitaka?13:39
rhalliseysocial, good question13:39
honzajtomasek: my original patch made a call to /stack/<name>/environment to get the password but it looks like it'll be available in environmentConfiguration13:40
bandinigfidente: is it correct to assume that CephClusterFSID must change during an upgrade?13:41
jtomasekhonza: I think it is safer to get it from Heat, rather than from plan configuration13:42
shardysocial: if you're updating to a new release, the first step will be to upgrade the undercloud, so you'll have the new templates and mistral pieces to render the new overcloud.yaml13:42
honzajtomasek: ok13:42
shardythe rendered file will just get passed into heat as if it was on the local disk13:42
shardybut we're still working through a few client issues around this atm13:42
d0ugalrhallisey: Yeah, so this was on my TODO list but I've not started it yet.13:42
socialshardy: I think the requirement is to update old overcloud13:42
socialeg scale/downscale and system updates13:42
rhalliseyd0ugal, gotcha.  No worries13:42
gfidentebandini, no it does not have to13:42
shardysocial: Sure, then you will point at an old version of tripleo-heat-templates13:43
shardywhich will have the old overcloud.yaml in it13:43
bandinigfidente: ack thanks13:43
shardysocial: in that case, the j2 rendering in mistral does nothing13:43
shardybut the deployment flow works the same13:43
d0ugalrhallisey: two related bugs: https://bugs.launchpad.net/tripleo/+bug/1614928 and https://bugs.launchpad.net/tripleo/+bug/162612813:43
openstackLaunchpad bug 1614928 in tripleo "openstack overcloud update stack should be powered by a mistral workflow" [High,Triaged]13:43
openstackLaunchpad bug 1626128 in tripleo "openstack overcloud update stack is broken" [Critical,Triaged]13:43
rhalliseyd0ugal, ya I also reported one too: https://bugs.launchpad.net/tripleo/+bug/162397813:44
openstackLaunchpad bug 1623978 in tripleo "overcloud update fails because of missing template in swift" [Undecided,New]13:44
d0ugalrhallisey: hah :)13:44
rhalliseyd0ugal, :)13:45
ccamachoguys is there any doc re how to use/test TripleO UI ? just curious as never used it before and there are some bugs related13:45
shardyjtomasek: ^^13:46
*** tzumainn has joined #tripleo13:46
*** dsneddon has quit IRC13:47
jtomasekccamacho: I've been using this tool up to now https://github.com/flofuchs/o3-virt-setup13:49
jtomasekccamacho: although it should be much easier to install GUI as it is going to get installed as part of undercloud. mandre has last puppet patch pending to achieve that afaik13:50
*** ramishra has quit IRC13:50
*** pkovar has joined #tripleo13:51
pandasshnaidm: still happening on the gates http://logs.openstack.org/92/363592/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha-newton/adba039/logs/postci.txt.gz13:52
mandrejtomasek: that last patch was merged yesterday13:52
jpichjtomasek, ccamacho: That patch merged earlier today :) https://review.openstack.org/#/c/363167/13:52
jpichyestoday13:52
pandasshnaidm: master worked, newton failed.13:52
jtomasekmandre, jpich: thanks, great!13:52
pandasshnaidm: out of memory13:52
socialrhallisey: for now I'm testing update with manualy provided overcloud.yaml from swift13:53
socialrhallisey: I kinda expect it to break more :)13:53
jtomasekccamacho: in any case, installing GUI manually is also possible by following https://github.com/openstack/tripleo-ui/blob/master/README.md13:54
openstackgerritDougal Matthews proposed openstack/tripleo-common: Separate Template Processing From Create/Update Plan  https://review.openstack.org/37086813:54
jtomasekccamacho: it is slightly out of date - no validations api and tripleo-api is needed any more, so GUI setup is basically just: clone GUI repo, set cors for services, do endpoints tunnels, and run GUI using 'npm install and npm start'13:55
*** tzumainn has quit IRC13:55
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates: Deprecate the NeutronL3HA parameter  https://review.openstack.org/37483513:55
*** tzumainn has joined #tripleo13:55
sshnaidmpanda, yeah, I see13:56
sshnaidmpanda, more and more such errors in last days13:56
pandasshnaidm: we're doomed.13:56
sshnaidmpanda, can you please start a ML thread? flavor update is 5 min change, but I think we need a consensus about it13:59
toskypanda: always13:59
pandasshnaidm: a bug is not enough ?14:00
ccamachojtomasek awesome, just taking notes to trying to test it.14:01
sshnaidmpanda, no, nobody looks at them14:01
pandasshnaidm: lol, ok.14:01
rhalliseysocial, ya wfm14:01
sshnaidmpanda, sad-but-true14:01
pandasshnaidm: ok, on openstack-dev with tags TripleO and CI14:02
sshnaidmpanda, yep, thanks, let's start the flame14:03
shardyperhaps try to avoid using "we're doomed" as the subject line ;)14:03
pandatosky: Snape style answer14:03
pandashardy: we're f$^$#d ?14:03
openstackgerritJulie Pichon proposed openstack/python-tripleoclient: Stop plan creation when container exists  https://review.openstack.org/36962314:04
*** paramite has joined #tripleo14:04
jpichd0ugal: It felt too weird also updating someone else's name in a TODO() in https://review.openstack.org/#/c/369623/2 so I didn't do it, sorry!!14:06
d0ugaljpich: haha, I didn't expect you to. It was just an observation of failure14:06
*** ramishra has joined #tripleo14:06
jpichd0ugal: You'll just have to fix the actual TODO in order to make it disappear... ;)14:07
d0ugaljpich: that TODO isn't for me! It's for a dmatthews, no idea who that is.14:08
jpichd0ugal: lol14:08
*** akuznetsov has quit IRC14:09
*** florianf has joined #tripleo14:12
EmilienMpanda: still timeouting :(14:13
EmilienMinteresting regular HA job doesn't timeout much comparing to ipv614:13
*** mbozhenko has quit IRC14:16
*** rajinir has joined #tripleo14:17
pandaEmilienM: but they fail for memory error ... I don't know if it's related. I will launch again a local test, I added some swap to undercloud so maybe I'll be able to complete deploy and see what happens14:18
pandaEmilienM: where do you see the time out ?14:19
d0ugalEmilienM: Do we use https://github.com/openstack/puppet-mistral?14:19
*** saneax is now known as saneax-_-|AFK14:20
*** snecklifter has joined #tripleo14:21
snecklifterhi, is there a reason why os-net-config is determined to dhcp an interface despite me using use_dhcp: false14:21
snecklifterdriving me nuts14:21
socialrhallisey: is it doing anything? I don't think it's doing anything14:22
snecklifterthis is on mitaka14:22
weshaysshnaidm, can you please add the latest swift error from the ha periodic job to the etherpad.. http://logs.openstack.org/periodic/periodic-tripleo-ci-centos-7-ovb-ha/9d0c76f/logs/overcloud-controller-0/var/log/messages14:23
shardyd0ugal: yes we use it to configure mistral on the undercloud and overcloud (when the composable mistral patch lands)14:24
*** athomas has quit IRC14:24
sshnaidmweshay, I reran it and it passed http://logs.openstack.org/15/359215/12/check-tripleo/gate-tripleo-ci-centos-7-ovb-ha/35feaca/14:25
d0ugalshardy: I am trying to understand the log files - I don't think we log everything from Mistral14:26
sshnaidmweshay, I'm not sure it's swift issue, seems like it's memory issue again14:26
d0ugalshardy: I think we should have a log file for each of these: https://github.com/openstack/puppet-mistral/tree/master/templates14:27
pandaEmilienM: now I see it ...14:29
EmilienMd0ugal: yes sir14:29
*** jaosorior has quit IRC14:29
d0ugalEmilienM: Do you know where the mistral log file is configured? I'm trying to track that down.14:32
EmilienMd0ugal: https://github.com/openstack/puppet-mistral/blob/master/manifests/init.pp#L23414:34
EmilienMcheck https://github.com/openstack/puppet-mistral/blob/master/manifests/init.pp#L23614:35
EmilienMis debug setup on the undercloud manifest?14:35
EmilienMhttps://github.com/openstack/instack-undercloud/blob/master/elements/puppet-stack-config/puppet-stack-config.yaml.template#L48714:35
EmilienMI don't see mistral::debug::true14:35
EmilienMso no logs14:35
honzajtomasek: looks like i messed up your patch :(14:36
jpichd0ugal: May I pick your brain about deprecation warnings for the new node management commands? ( https://bugs.launchpad.net/tripleo/+bug/1595205 )14:36
openstackLaunchpad bug 1595205 in tripleo "[tripleoclient] Additional "overcloud node" commands" [High,Fix released] - Assigned to Julie Pichon (jpichon)14:36
d0ugalEmilienM: hrm, there is one log file /var/log/mistral/mistral-server.log14:36
EmilienMd0ugal: good so debug is true by default14:37
*** athomas has joined #tripleo14:37
d0ugalEmilienM: but I would like to make sure we are logging everything and have three logs - api, engine and executor.14:37
EmilienMindeed, it's weird we don't have 3 logs14:37
honzajtomasek: i failed to rebase properly; this is the most frustrating part of gerrit :(14:37
EmilienMit looks like a packaging thing14:37
d0ugalEmilienM: and no, we don't have debug logging - it is set to INFO14:37
d0ugaljpich: Sure!14:37
EmilienMd0ugal: sounds like it's managed by packaging script14:38
jpichd0ugal: In the end only introspect and provide got a deprecation warning. At this point I feel like it'd be better to remove the warnings and add them back during the next release, rather than deprecate the other commands because the docs, CI, etc are still using/referencing them... I know you thought it'd be better to do it anyway to encourage people to actually use them and update the docs, etc though :/14:39
EmilienMhttps://github.com/rdo-packages/mistral-distgit/blob/rpm-master/openstack-mistral-api.service14:39
EmilienMso it takes default in mistral.conf14:39
jpichd0ugal: There's also kind of a similar discussion going on for https://review.openstack.org/#/c/337676/14:39
d0ugalEmilienM: aha, thanks!14:40
EmilienMd0ugal: wait, I didn't help yet14:40
EmilienMd0ugal: something is weird here14:41
d0ugalEmilienM: That much is expected :)14:41
d0ugaljpich: Yeah, I don't have a strong view.14:41
d0ugaljpich: I just know the sooner we mark them as deprecated the sooner we can actually delete them14:42
EmilienMd0ugal: to me, it sounds like we run mistral-server process and by default mistral create mistral-server.log file14:42
d0ugaljpich: but the actual code overhead is fairly small14:42
EmilienMd0ugal: do you see engine/api/... in the mistral-server.log?14:42
EmilienMif yes, it makes all sense14:42
socialrhallisey: you are testing on RDO newton/master?14:42
rhalliseysocial, yes14:43
d0ugalEmilienM: Yeah, it seems they all log to the same file14:43
rhalliseynewton14:43
EmilienMd0ugal: it's a bug in msitral I think14:43
socialrhallisey: do you have issues with nova-compute during deploy?14:43
EmilienMhttps://github.com/rdo-packages/mistral-distgit/blob/rpm-master/openstack-mistral-executor.service#L814:43
d0ugalEmilienM: (which then means I don't know what happened to the logging I added on my local checkout of mistral :/)14:43
EmilienMmistral should create a logfile per "server" option given14:43
jpichd0ugal: Right. So I'm gonna remove the deprecation warnings for now, and I'll propose a patch to add them back to the relevant commands early in Ocata, together with some docs updates14:43
d0ugalEmilienM: Right, so mistral is ignoring that. I'll take a look.14:43
socialrhallisey: I have issue in oslo.messaging that I'll probably have to track down14:43
jpichd0ugal: And take it from there. Thanks!14:44
*** chem` has joined #tripleo14:44
d0ugaljpich: Sounds good!14:44
EmilienMd0ugal: that's the root cause14:44
openstackgerritHonza Pokorny proposed openstack/tripleo-ui: Stacks and Resources data storing in app state  https://review.openstack.org/37422714:44
*** chem has quit IRC14:45
rhalliseysocial, no, I've been able to deploy ok14:45
rhalliseylast time I did it was yerterday afternoon14:45
socialrhallisey: yes the issue is actually race, if compute deploys later than controllers it should work fine14:46
rhalliseyinteresting14:46
socialI'll give it another run today to debug/fix it14:46
rhalliseysocial, what error do you see?  Is it a timeout?14:47
rhalliseymessage timeout?14:47
socialrhallisey: 90% it's this https://bugs.launchpad.net/oslo.messaging/+bug/158114814:47
openstackLaunchpad bug 1581148 in oslo.messaging "Constant exceptions "NotFound: Basic.consume: (404) NOT_FOUND - no queue abc in vhost '/'" in log" [Undecided,Fix released] - Assigned to Kirill Bespalov (k-besplv)14:47
rhalliseysocial, a few times I've had the heatclient return14:48
rhallisey: message timeout <hash>14:48
rhalliseythat could be it14:49
*** ayoung has joined #tripleo14:49
socialI have reproducer now so I only need to apply patch update images and rerund deploy14:49
*** zoli|lunch is now known as zoli|wfh14:50
d0ugalEmilienM: so, it looks like we can set the log file by passing --log-file14:51
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates: Update gnocchi database during M/N upgrade.  https://review.openstack.org/37488414:51
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Move keystone::auth into service_config_settings  https://review.openstack.org/37057314:51
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Tolerate missing keys from role_data in service templates  https://review.openstack.org/37423714:51
EmilienMd0ugal: patch the distgit :)14:51
EmilienMhttps://github.com/rdo-packages/mistral-distgit14:51
EmilienMand problem solved!14:51
d0ugalEmilienM: I can't find where the default comes from, maybe oslo.config uses the process name?14:51
d0ugalEmilienM: will do :)14:51
openstackgerritAthlan-Guyot sofer proposed openstack/tripleo-heat-templates: Update gnocchi database during M/N upgrade.  https://review.openstack.org/37488414:54
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Keystone credentials needs to be set with the overcloud password  https://review.openstack.org/37489214:57
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Move keystone::auth into service_config_settings  https://review.openstack.org/37057314:58
*** akuznetsov has joined #tripleo14:59
*** rcernin has quit IRC15:07
EmilienMd0ugal: yes oslo config I think15:08
b00tcatwhen deploying an overcloud using `openstack overcloud deploy --templates`, what's the default value for TEMPLATES?15:08
b00tcaton `/usr/share/tripleo/templates/` I can only find 2 XMLs15:08
b00tcat:/15:08
*** pkovar has quit IRC15:09
shardyb00tcat: /usr/share/openstack-tripleo-heat-templates/15:10
*** ebarrera has quit IRC15:11
shardywe should output that in the help text15:11
shardy(but we don't)15:11
*** yamahata has joined #tripleo15:11
*** zigo_ is now known as zigo15:14
b00tcatshardy: thanks shardy15:15
b00tcatoops, too much shardy15:15
jtomasekhonza: checking15:15
*** dhill_ has joined #tripleo15:17
jtomasekhonza: it is no big deal, just a few bits, I am going to send a new update15:17
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: Stacks and Resources data storing in app state  https://review.openstack.org/37422715:20
*** yolanda has joined #tripleo15:20
openstackgerritBrent Eagles proposed openstack/tripleo-heat-templates: Neutron metadata agent worker count fix  https://review.openstack.org/37491515:21
*** tremble has quit IRC15:21
*** jcoufal has quit IRC15:21
hewbroccashardy: apropos of nothing at all, are there unit tests for the composable services stuff?15:22
shardyhewbrocca: No, we need to add CI scenarios which cover both composable services and custom roles15:23
hewbroccathanks15:23
shardyhewbrocca: we're somewhat using composable services for the multinode job tho15:23
shardybecause we use it to do an all-in-one deployment15:23
hewbroccasure15:23
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Swift puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491615:23
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491615:24
EmilienMhewbrocca: i'm working o nit15:24
EmilienMshardy: ^15:24
openstackgerritMerged openstack/tripleo-quickstart: Use the proper private keys for ssh config file  https://review.openstack.org/37091915:24
hewbroccaEmilienM: excellent15:24
EmilienMpuppet unit tests?15:24
shardyEmilienM: Ah, I was assuming CI coverage vs unit tests for puppet-tripleo15:25
EmilienMI'm working on both scenarios and unit tests actually, and mwhahaha also started to add unit tests15:25
shardyI know we're improving things which is great15:25
openstackgerritJiri Tomasek proposed openstack/tripleo-ui: When deploy finishes, show overcloud info  https://review.openstack.org/37076515:25
EmilienMshardy, hewbrocca: it's documented here: https://github.com/openstack-infra/tripleo-ci#service-testing-matrix15:25
EmilienMand we're working on adding more services but we wait for the release15:25
*** dsneddon has joined #tripleo15:26
jtomasekhonza: I've just updated both patches15:26
jtomasekhonza: here is the diff of the changes I did to your patch: https://review.openstack.org/#/c/370765/6..715:26
*** jcoufal has joined #tripleo15:27
jtomasekhonza: I am going to review it tomorrow. In addition, I am working on DeploymentDetail component now, which is going to render DeploymentConfirmation, DeploymentProgress, DeploymentSuccess, DeploymentFailure components depending what state the deployment is in15:27
*** dmacpher has joined #tripleo15:29
*** lucasagomes is now known as lucas-hungry15:30
*** jcoufal_ has joined #tripleo15:31
*** panda is now known as panda|break15:31
*** pkovar has joined #tripleo15:32
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Always configure ipv6 address with net-iso  https://review.openstack.org/37492215:32
*** jcoufal has quit IRC15:33
*** aufi has quit IRC15:35
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491615:35
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Keystone credentials needs to be set with the overcloud password  https://review.openstack.org/37489215:35
honzajtomasek: if anyone could explain to me how gerrit works, that would be great :)15:37
honzajtomasek: thanks15:37
*** ccamacho is now known as ccamacho|afk15:39
ccamacho|afkhonza http://docs.openstack.org/infra/manual/developers.html15:40
*** akuznetsov has quit IRC15:40
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Use low-memory-usage.yaml in ci  https://review.openstack.org/37493115:40
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Remove the get_hiera_key function  https://review.openstack.org/36736715:40
honzaccamacho|afk: yes, yes, yes, but it never does what i want, it has a mind of its own :)15:41
* hewbrocca thinks gerrit is a regression15:41
gfidenteping tbarron15:42
*** dsariel has quit IRC15:45
*** ebarrera has joined #tripleo15:45
tbarrongfidente: pong15:46
gfidentetbarron, wonder if you still have the environment where you tested https://review.openstack.org/#/c/358525 ?15:46
gfidenteI would like to look into what is failing, seems like we could see what failed from os-collect-config logs15:47
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491615:48
EmilienMbnemec, panda|break: I think swift/ipv6 is fixed15:48
EmilienMmemcache_servers = [fd00:fd00:fd00:2000::18]:11211,[fd00:fd00:fd00:2000::14]:11211,[fd00:fd00:fd00:2000::1c]:1121115:48
EmilienMI saw it in CI15:48
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Use low-memory-usage.yaml in ci  https://review.openstack.org/37493115:48
tbarrongfidente: yeah, I'll pm you access if you'd like15:49
*** abehl has quit IRC15:50
gfidentetbarron, ah nice, yes we could do tmux15:50
tbarrongfidente: i left it in that state15:50
bnemecEmilienM: Okay, I had no luck yesterday.15:50
*** Ryjedo has quit IRC15:50
gfidenteJokke_, ^^15:50
EmilienMbnemec: http://logs.openstack.org/74/363674/25/experimental-tripleo/gate-tripleo-ci-centos-7-ovb-ha-ipv6/92d07bf/logs/15:50
EmilienMlogs are here, if you look into swift config, we're good now15:50
tbarrongfidente: it's a beaker box, i asked for extended reservation and so far no one has taken it back :)15:50
EmilienMbut now it timeouts on something I haven't figured yet15:50
gfidentetbarron++15:51
*** kjw3 has quit IRC15:51
bnemecEmilienM: Yeah, that does look right.15:51
openstackgerritJulie Pichon proposed openstack/python-tripleoclient: Remove deprecation warning for bulk introspection  https://review.openstack.org/37493515:51
d0ugalWhen are we releasing RC2 again?15:51
EmilienMd0ugal: next week15:51
d0ugaldamn :)15:52
EmilienMd0ugal: maximum Thursday I think15:52
EmilienMif shardy is ok15:52
EmilienMand we're branching puppet modules next week15:52
bandinimarios: https://bugs.launchpad.net/tripleo/+bug/162662815:54
openstackLaunchpad bug 1626628 in tripleo "M/N Upgrade - major-upgrade-pacemaker times out" [Critical,New]15:54
bandiniI have the system live in case15:54
bandinibbiab supper time15:54
EmilienMlet's make puppet-tripleo great again15:55
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491615:56
mariosbandini: thanks very much /me eod will have a look tomorrow15:57
beagles+1 on that15:57
beaglesre: puppet-tripleo that is15:57
jistrchem`: hi, i see you reported the gnocchi not starting bug, i think we're just missing: gnocchi-upgrade --config-file /etc/gnocchi/gnocchi.conf in the upgrade15:57
jristjtomasek: do you see what I'm saying with my -1 on https://review.openstack.org/#/c/374227/6/src/js/constants/StacksConstants.js ?15:58
*** dmacpher is now known as dmacpher-afk15:59
shardyEmilienM: +1 on next week for RC216:00
jistrchem`: commented on the upstream bug with a code pointer too16:00
EmilienMso we see the timeout http://logs.openstack.org/74/363674/25/experimental-tripleo/gate-tripleo-ci-centos-7-ovb-ha-ipv6/92d07bf/logs/undercloud/var/log/heat/heat-api.txt.gz#_2016-09-22_14_11_55_27116:00
jristwhat day for rc2?16:00
EmilienMjrist: I'll do it on Thursday16:00
EmilienMat evening16:00
EmilienMso we have 5 days16:01
d0ugalPanic!16:01
jristthx16:01
EmilienMwell no16:01
EmilienMwe'll have stable/newton in place16:01
d0ugal:)16:01
EmilienMand we'll be able to backport bugfixes16:01
shardyhttps://releases.openstack.org/newton/schedule.html16:01
shardyYeah we still have the possibility of making some fixes before declaring the release final16:02
d0ugalSo we technically have until summit for bug fixing?16:02
EmilienMshardy: do you have 2 min to help in looking why we timeout in ipv6 jobs?16:02
EmilienMshardy: I have found the timeout in heat api logs but not the actual resource16:02
shardyd0ugal: bug fixes can always be backported, but we're aiming to declare the release final very close to the main Newton release16:02
shardywhich is w/c 3rd October16:02
shardyso 2 weeks16:02
d0ugalshardy: k, thanks.16:02
shardyEmilienM: sure16:03
shardyEmilienM: FYI I think one of the reasons we're using more memory is more heavy usage of yaql in the templates16:04
EmilienMok16:04
shardyit may be we can optimize that, but probably not before the release16:04
EmilienMI wasn't sure it was related16:04
hewbroccao NOES the yaql parser is a memory hog16:04
shardyI'd like to convert a lot of the yaql calls to native heat functions, which should be a lot less expensive16:04
*** myoung is now known as myoung|biab16:04
shardylet(root => $) -> $.data.map.items().where($[0] in $root.data.services).select($[1]).reduce($1.mergeWith($2), {})16:05
*** jpich has quit IRC16:05
shardyhow could that not eat memory? :D16:05
EmilienMsounds like using python in heat would help16:05
shardyEmilienM: yeah, that's what I'd like to do16:05
EmilienMshardy: do we have WIP in CI to reduce failures?16:06
EmilienMI saw a patch from bnemec to use jistr's template for low memory setups16:06
jistrEmilienM: it will not work though, i'm just reviewing it16:06
EmilienMbut iiuc we're already using low worker #16:06
bnemecEmilienM: That's basically a noop though.16:06
bnemecYeah16:06
EmilienMyeah16:06
bnemecYeah!16:06
bnemec:-)16:06
jristshardy or EmilienM - how do I connect two series together? i.e. rc1 -> rc2 in launchpad16:07
shardyEmilienM: panda|break started a ML thread, we have to decide if we tune for lower memory usage or accept an even bigger undercloud16:07
shardyjrist: wha?16:07
shardyjrist: not sure what you mean16:07
jristyou have newton-1 -> newton-2 ... https://launchpad.net/tripleo16:07
*** masco has joined #tripleo16:07
*** b00tcat has quit IRC16:07
jtomasekjrist: hmm, I don't16:08
jristis it based on branch?16:08
jristjtomasek: the inconsistency in the sentences16:08
bnemecGah, that password prompt bug is still there.16:08
shardyhttps://launchpad.net/tripleo/newton16:08
* bnemec rages16:08
EmilienMbnemec: yes16:09
shardyjrist: sorry, I still don't follow - I just created the milestones in LP, and we targetted stuff to them16:09
EmilienM:(16:09
shardyjrist: they're not linked, other than by being part of the same series, e.g newton16:09
jristshardy: oh16:09
EmilienMshardy: how we could tune more?16:09
jristlet me see, thanks shardy16:09
jtomasekjrist: I don't see any comment there. I extracted those from DeploymentStatus component. I am ok with changing them as you like, just let me know what to change16:10
shardyEmilienM: I suspect we could tune rabbit and mysql better for memory constrained systems16:10
shardythey're the two biggest memory users after heat-engine IME16:10
jristshardy: I think I'm doing something wrong https://launchpad.net/tripleo-ui16:10
jristjtomasek: https://review.openstack.org/#/c/374227/1/src/js/constants/StacksConstants.js16:11
shardyjrist: you've created multiple series instead of milestones against one newton series16:11
jtomasekjrist: oh, I see it now, sorry.... I am blind16:11
jristshardy: but don't bother looking since we're going to tripleo proper16:11
mwhahahashardy: no it's all heat engine by a factor of like 416:11
*** jlinkes has quit IRC16:11
mwhahahamysql/rabbit only use like 500M max16:11
d0ugalFun, the tripleoclient unit tests are broken.16:11
mwhahahacombined16:11
jristshardy: so series should be newton16:11
jristand milestones should be in the series16:11
shardyjrist: yes16:12
shardymwhahaha: Ok, in my previous testing all three were taking over 1G16:13
shardyprobably need to look at it again16:13
mwhahahashardy: i've got a basic deploy going and heat-engine is near 2g16:13
mwhahahasorry 3G math is hard16:14
*** egafford has quit IRC16:15
shardymwhahaha: Ok, we may have an issue then, I reported some heat bugs earlier in the cycle and we got the memory usage down way below that16:16
shardyat least for small developer deployments16:16
shardyhttp://people.redhat.com/~shardy/heat/plots/heat_before_after_filesfix_plot.png16:16
mwhahahayea it's way more than that now16:17
shardythat was a 2 node nonha deployment before/after the fixes16:17
openstackgerritGiulio Fidente proposed openstack/puppet-tripleo: Move inclusion of ::manila::db::mysql in manila/api profile  https://review.openstack.org/37496116:17
*** masco has quit IRC16:18
*** dtantsur is now known as dtantsur|afk16:19
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Use low-memory-usage.yaml in ci  https://review.openstack.org/37493116:20
mwhahahawe also may want to put a cap on workers for the services16:20
*** rcernin has joined #tripleo16:20
weshaypanda|break, fyi https://review.openstack.org/#/c/374922/16:21
shardymwhahaha: Yeah, we already do for the overcloud in CI, but perhaps we can reduce them further on the undercloud16:21
*** rbrady is now known as rbrady|afk16:23
*** fultonj has quit IRC16:24
ayoungjrist, where do I find the files that describe how to configure Keystone for the undercloud?16:26
ayoungI know where in the puppet module to look, but not what calls that16:26
panda|breakbnemec: you think your v6 environments may be used to test ipv6 locally ? my loca test hangs at NetworkDeployment16:26
mwhahahaayoung: https://github.com/openstack/instack-undercloud/tree/master/elements/puppet-stack-config16:27
jristayoung: did you mean jistr?16:27
*** fultonj has joined #tripleo16:27
ayoungjrist, I means mwhahaha obviously16:27
bnemecpanda|break: Make sure network-environment.yaml actually matches your configuration.16:27
mwhahaha:D16:27
bnemecpanda|break: Usually when that happens to me it's because I used a non-default cidr on the undercloud and forgot to change network-environment.yaml to match.16:28
ayoungI really just asked someone I thoght might be able to point me in the right direction, or point me at someone that could point me, or ....16:28
mwhahahaayoung: i was in there looking for something else, thought i'd share16:28
*** pkovar has quit IRC16:28
panda|breakweshay: thanks, I also wonder why the NETISO_V4/6 condition exist16:28
panda|breakbnemec: generally or on vlan10 ?16:28
bnemecpanda|break: Oh, and that will only work if your local environment is ovb.  You may need to change the nic-configs to match in other environments.16:29
ayoungmwhahaha, is that instack?16:29
shardymwhahaha: Ok, I can confirm your observations, heat is using a huge amount more memory for me too16:29
mwhahahaayoung: yea openstack undercloud install calls instack-undercloud-setup or whatever it's called16:29
panda|breakbnemec: no, libvirt ...16:29
ayoungok16:30
bnemecpanda|break: Okay, then those templates probably won't work.  Try https://github.com/openstack/tripleo-heat-templates/blob/master/environments/net-multiple-nics-v6.yaml instead.16:30
panda|breakbnemec: replace both with this one ?16:33
*** hewbrocca is now known as hewbrocca-afk16:34
bnemecpanda|break: Not network-isolation-v6.yaml.  You'll still need that one.16:34
panda|breakbnemec: great, thanks.16:34
*** lucas-hungry is now known as lucasagomes16:35
bnemecpanda|break: And you may still need some of network-environment.yaml.  In CI we used http://git.openstack.org/cgit/openstack-infra/tripleo-ci/tree/test-environments/net-iso.yaml16:35
bnemecpanda|break: Although looking now, I think those are all the default, so I'm not sure why we did that. :-/16:35
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Change the level of mocking for the wait_for_stack_ready test  https://review.openstack.org/37496816:37
d0ugal^ That fixes the tripleoclient tests, reviews please!16:38
panda|breakbnemec: usually that means that they converged gradually :)16:38
d0ugal(and by fixes, I mean they are slightly less terrible, I'd like to rm -rf them in Ocata and start over)16:38
*** mcornea has quit IRC16:39
*** derekh has quit IRC16:40
*** zoli|wfh is now known as zoli|gone16:41
*** zigo has quit IRC16:41
*** zoli|gone is now known as zoli_gone-proxy16:41
*** cylopez has quit IRC16:42
*** bana_k has joined #tripleo16:42
*** pkovar has joined #tripleo16:43
bnemecd0ugal: Reviewed, but I think you changed the test's behavior.16:43
d0ugalbnemec: Yeah, I stopped it testing heatclient? maybe I missed something else.16:44
shardypanda|break: can you link a failing CI job with OOM (or tell me the heat version from a failing job)?16:45
bnemecd0ugal: See my inline comment.  I explained how the test flow changed.16:45
d0ugalk, thanks16:45
shardyit looks like we promoted yesterday, but I just want to ensure we're running the latest heat16:45
*** nyechiel has quit IRC16:45
ccamacho|afkshardy https://review.openstack.org/#/c/374660/16:46
bnemechttp://logstash.openstack.org/#/dashboard/file/logstash.json?query=build_name:%20*tripleo-ci*%20AND%20build_status:%20FAILURE%20AND%20message:%20%5C%22503%20Service%20Unavailable%5C%2216:46
ccamacho|afk^OOM16:46
bnemec^OOM query16:46
bnemecIRC race condition. :-)16:46
bnemecLogstash appears to have quit indexing at 19:00 two days ago though.16:47
bnemecIt claims there are no OOM messages since then, which is definitely bogus.16:47
shardyDo we aggregate the dstat data anywhere?16:49
shardyI know there's Dan's graphite server, but that seems to only contain timing stats16:49
shardyit'd be super useful to identify when the memory usage went up significantly16:50
*** zigo has joined #tripleo16:51
*** zigo is now known as Guest1865616:52
panda|breakshardy: the OOM was in a local job I started this morning, and I overwrote it. I relaunched with the same version, yum info reports this release: 0.20160921092642.38a4afa.el7.centos, on the second run, with swap on, memory used on swap is 800M16:52
*** ohamada has quit IRC16:54
openstackgerritBen Nemec proposed openstack/tripleo-docs: Add IP Assignment to node_placement  https://review.openstack.org/37497716:54
*** Guest18656 has quit IRC16:56
*** zigo_ has joined #tripleo16:59
gfidentetbarron, that line in scheduler.pp is defining a refresh17:02
gfidenteif db sync happens, -share is notified17:02
gfidentebut it does not enforce a dependency on db sync17:03
gfidenteit's this guy pulling it in https://github.com/openstack/puppet-manila/blob/master/manifests/api.pp#L18517:03
tbarrongfidente: i see, so puppet-manila is doing it from api and THT had it from api after all,17:04
*** jpena is now known as jpena|off17:05
tbarronno and puppet-manila had it from api after all, I can't type17:05
tbarrongfidente: ^^17:05
tbarrongfidente: now i understand why your proposed fix s/b sufficient17:05
EmilienMshardy: we don't have dstat but I can add it17:06
EmilienMshardy: we have it in puppet ci17:06
* tbarron is doing too many things at once, you OOO guys don't know what that's like :-P17:07
gfidenteahahah17:07
*** zigo_ has quit IRC17:07
tbarrongfidente: thanks for all the help, i'll be back in an hour or less17:07
EmilienMmhh we already have WORKSPACE/logs/dstat-csv.log17:09
EmilienMhttp://logs.openstack.org/74/363674/25/experimental-tripleo/gate-tripleo-ci-centos-7-ovb-ha-ipv6/92d07bf/logs/dstat-csv.txt.gz17:09
EmilienMthe format is really broken though or I'm dumb?17:09
*** zigo_ has joined #tripleo17:11
bnemecEmilienM: Yeah, but we don't have a way to track changes in dstat output over time.17:11
bnemecI'm also not sure dstat by itself is enough.  We need it broken down per-process.17:12
bnemecUnless we want to just assume heat is always the culprit when memory increases. ;-)17:12
shardywell as a first step, just knowing when the entire host memory usage went up would help17:12
mwhahahahave you guys used atop17:12
shardybut yeah, ideally per-process graphs would be great17:12
bkeroThat's another top-like util with some more verbose defaults, right?17:12
EmilienMbnemec: dstat can track per process17:13
shardythen maybe push all the maximums to the graphite server with the timings or something17:13
mwhahahait captures proc/cpu/disk over time and provided a replayable file17:13
EmilienMnot sure what you meant17:13
EmilienMbut let me show how we have dstat in puppet ci17:13
mwhahahafuel uses it to capture what's happening and it's useful if you want to know where the utilization is occuring17:13
EmilienMhttp://logs.openstack.org/04/374604/1/check/gate-puppet-openstack-integration-3-scenario003-tempest-centos-7/ca8202f/logs/dstat.txt.gz17:13
bnemecEmilienM: Okay, well _our_ dstat output doesn't appear to.17:13
EmilienMbnemec: I'm proposing a change17:13
EmilienMmwhahaha: looks like a good idea17:14
mwhahahahttps://linux.die.net/man/1/atop17:14
mwhahahait's a little funky in file rotation, but would be super useful for at least ci17:14
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: dstat: improve output to track high cpu process  https://review.openstack.org/37498217:15
*** rbrady|afk is now known as rbrady17:15
EmilienMmwhahaha: do you have an example of output?17:16
mwhahahayea let me go pull a file out of a fuel log dump17:16
EmilienMthanks17:16
mwhahahait's one of those things like sar where you have to use it to parse the output17:16
*** egafford has joined #tripleo17:18
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491617:18
*** bana_k has quit IRC17:18
*** florianf has quit IRC17:19
*** social has quit IRC17:19
mwhahahaso not sure the best way to provide an atop file17:20
EmilienMon the undercloud at least17:21
*** ebarrera has quit IRC17:21
EmilienMlike we have in dstat17:21
*** trown is now known as trown|lunch17:21
mwhahahayea there's an atopsar you can run on the file after the fact17:21
mwhahahabut the true usefulness is running atop after the fact because it's a top like interface that allows you to go back and forth through time in the file17:21
mwhahahaso if you know something happened at like 13:45, you can see what cpu/mem is like and what processes are doing what17:22
mwhahahabut like i said there is also atopsar which lets you dump the same info out for a file, https://linux.die.net/man/1/atopsar17:22
mwhahahaso you can get like the top3 memory consumers over time, http://paste.openstack.org/show/582632/17:23
pradkcan i get some reviews on https://review.openstack.org/#/c/360004/ please17:23
openstackgerritmathieu bultel proposed openstack/python-tripleoclient: Keystone credentials and CephClusterFSID needs to be set with the overcloud password  https://review.openstack.org/37489217:24
mwhahahaor most cpu http://paste.openstack.org/show/582633/17:24
EmilienMinteresting17:24
mwhahahaso if you install it and it's running, we'd just need to capture the atop file after the fact17:24
EmilienMpradk: have you tested it?17:25
EmilienMmwhahaha: a first good step would be to install it, run it and save the file in workspace. is it big?17:25
mwhahahaif you grab https://ci.fuel-infra.org/job/master.fuel-library.pkgs.ubuntu.smoke_neutron/7920/artifact/logs/7920/fail_error_deploy_neutron_tun-fuel-snapshot-2016-09-22_15-50-45.tar17:26
mwhahahaand extract the logs-fuel.tar.gz17:26
mwhahahathere's an atop file in fuel/var/log/atop17:26
mwhahahait's about 7M17:26
mwhahahayou can yum install atop and use atop or atopsar on it17:26
EmilienM2016-09-22 17:25:42.011604 | Gem::RemoteFetcher::UnknownHostError: timed out17:26
EmilienMsigh17:27
EmilienMI'll push again to have a gem mirror in OpenStack Infra17:27
EmilienMfor the record, I started this work almost a year ago https://review.openstack.org/#/c/253616/17:27
pradkEmilienM, in the process, jumping through some workarounds to get upgrade going17:30
beaglespradk: I read it, but sadly I think my upgrade-fu is too shaky to weigh in.17:31
pradkmy unercloud upgrade fails .. so looking at workarounds now17:31
* beagles is working on that as it is going to be something that is "always going to be there"17:31
pradkyea17:32
shardyhttp://people.redhat.com/~shardy/heat/plots/heat_before_after_end_newton.png17:34
shardyouch17:34
shardyzaneb: ^^ FYI17:35
shardyso we definitely have another heat memory leak :(17:35
beaglesshardy: out of curiousity, what is the time scale in?17:36
beaglesruns?17:36
slaglebeagles: increments of 2217:36
slaglewhatever that means :)17:37
beagleslol17:37
shardybeagles: seconds I think17:37
slagleevery 22 of something....we leak memory17:37
shardyI have a script which measures the sum of all heat-engine memory usage every second17:37
panda|breakshardy: nice.17:37
beaglesooohh.. okay I thought this was over a larger time scale17:37
beaglesnice17:37
shardybeagles: it's one deployment17:37
shardyone 2 node nonha overcloud deployment17:37
shardythe lower lines are the same test done earlier in the cycle before/after some fixes17:38
beaglesshardy: right.. got it. The different sets of lines are for different points in the cycle...17:38
beaglesyeah17:38
beaglescool17:38
shardywe're now in much worse shape than even before those fixes :(17:38
beaglesbut sucky17:38
shardyya17:38
*** pradk has quit IRC17:39
*** pradk has joined #tripleo17:42
*** bana_k has joined #tripleo17:44
*** tosky has quit IRC17:47
*** pkovar has quit IRC17:48
EmilienMshardy: do we know what in Heat could cause it? Is it also related to YAQL work in THT?17:50
openstackgerritEmilien Macchi proposed openstack/puppet-tripleo: Switch puppet-tripleo to use puppet-openstack_spec_helper  https://review.openstack.org/37491617:51
EmilienMmwhahaha: it pass my local tests :) ^17:51
d0ugalbnemec: replied to your comment17:51
shardyEmilienM: Not yet, I suspect it's not related to the yaql work, because those steps show big jumps in memory usage during the deployment, but all of the yaql evaluations happen at the start17:52
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: dstat: improve output to track high cpu process  https://review.openstack.org/37498217:53
shardyEmilienM: what we need ideally is some historical data, so we can estimate when the memory usage started to get bad again17:53
shardyalternatively, I'll have to spend a day bisecting with different heat builds and/or memory profiling heat17:53
shardyI've done that before, several times, and it's going to be time consuming :(17:53
EmilienMright17:54
EmilienMI'm wondering if we can automate it17:54
EmilienMusing delorean builds17:54
shardyhttps://bugs.launchpad.net/heat/+bug/162667517:54
openstackLaunchpad bug 1626675 in heat "Further memory usage issues with big stacks" [Undecided,New]17:54
shardyI raised that, so if anyone collects any data, please add it there17:54
EmilienMlike a loop that would deploy tripleo using old delorean builds17:54
shardyEmilienM: Yeah, that would be ideal, I'm sure it can be scripted17:54
d0ugalAny other input on this would be good. The tripleoclient tests are currently broken. https://review.openstack.org/#/c/374968/17:55
shardywe'd have to back out a couple of tht patches using very new heat features, such as the conditionals one beagles added a few days ago17:55
EmilienMshardy: I even think we don't have to redeploy the whole undercloud17:55
shardyEmilienM: No, we don't, I test locally with delorean builds of heat all the time17:55
bnemecd0ugal: Ah, so part of the problem is that we moved the polling out of tripleoclient17:55
shardyit's just the time to build them, then redeploy the overcloud17:56
EmilienMright17:56
EmilienMPuppet CI noticed some heat timeouts a few weeks ago17:56
EmilienMlet me find the history17:56
EmilienMI remember /me complaining about it17:56
bnemecd0ugal: And I see we actually have a test for the create_complete case (although it looks sketchy to me at first glance, but that's a problem for another time).17:56
EmilienMshardy: let me find it17:56
EmilienMit's in http://status.openstack.org/openstack-health/#/ somewhere17:57
shardyhttps://review.openstack.org/#/c/370467/ is one fix which was mentioned, but we have that17:57
shardyI suspect there's some other circular reference issue that's crept in17:57
bnemecd0ugal: Okay, +2.17:58
EmilienMhttp://status.openstack.org/openstack-health/#/job/gate-puppet-openstack-integration-3-scenario003-tempest-centos-7?resolutionKey=day17:58
EmilienMtempest.api.orchestration.stacks.test_stacks.StacksTestJSON17:59
slagleEmilienM: did you mean to drop the -csv from the dstat logfile name too?17:59
slaglejust noticed17:59
EmilienMslagle: yes17:59
EmilienMthe format will be like puppet CI17:59
EmilienMhttp://logs.openstack.org/04/374604/1/check/gate-puppet-openstack-integration-3-scenario003-tempest-centos-7/ca8202f/logs/dstat.txt.gz17:59
EmilienMit will be mor ehelpful than a csv format17:59
EmilienMlet me find in logstash where heat started to be slow in puppet CI18:00
slagleEmilienM: you are still using --output though, so it will still be in csv format18:01
EmilienMdamn18:01
EmilienMok18:01
EmilienMa sec18:01
openstackgerritSteven Hardy proposed openstack/tripleo-heat-templates: Move keystone::auth into service_config_settings  https://review.openstack.org/37057318:01
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: dstat: improve output to track high cpu process  https://review.openstack.org/37498218:02
*** shardy has quit IRC18:09
*** dmsimard is now known as dmsimard|afk18:10
EmilienMgoo.gl/5xsr7i18:11
EmilienMthat's the query I'm using to find when in puppet CI we had the heat bug18:12
EmilienMand all started on September 12th18:12
EmilienMand the promotion wat on 12th, I'll check timestamps but maybe something in Heat between 9th and 12th broke (9th was the previous promotion)18:13
*** myoung|biab is now known as myoung18:13
d0ugalbnemec: thanks18:14
EmilienMok logstash is still computing the query and it happenned before18:14
*** yamahata has quit IRC18:15
EmilienMfirst time on 2016-09-13T21:45:38.058Z18:15
EmilienMso now I'm investigating commits between 12 and 13th september in heat18:21
EmilienMbecause that's where it started to be unstable for us18:21
EmilienM(in puppet CI)18:21
EmilienMmy first feeling is about https://github.com/openstack/heat/commit/f18e57e004e65faf0ed2d043384709007f83b2b018:22
EmilienMzaneb: ^ wdyt?18:22
*** milan has quit IRC18:23
EmilienMbnemec: when do you see the timeout thing in logstash for tripleo jobs?18:23
EmilienMbnemec: when does it start i mean18:23
*** trown|lunch is now known as trown18:24
*** ayoung has quit IRC18:25
* EmilienM running the query agai for tripleo18:25
EmilienMbnemec: do you think there is a better way to find this problem rather than message: "503 Service Unavailable" ?18:25
pradkhas anyone tried upgrades using upstream repos?18:26
pradkbandini, jistr, ^^?18:26
EmilienMso even in bnemec's query, failures started on September 12th18:26
bandinipradk: I am, currently stuck on the major-upgrade-pacemaker step18:27
bandinipradk: here is my diary https://etherpad.openstack.org/p/tripleo-mitaka-newton-upgrades18:28
pradkbandini, what repos are you using when you switch for undercloud upgrades18:28
pradkbandini, thx i'll check18:28
pradkbandini, so instead of sudo rhos-release -P 10 -r 7.3 .. which upstream repo are you using18:29
pradkbandini, that could be why my upgrade is failing, perhaps i have an older snapshot repo18:29
bnemecEmilienM: Timeout or oom?18:29
bandinipradk: https://paste.fedoraproject.org/432843/74568987/ this is what I am doing18:30
bandinithe undercloud upgrade has worked pretty much all the time in the last weeks18:30
bandinipradk: what issue are you seeing?18:30
pradkk lemme try that.. i was getting traceback on some dib18:31
pradkbandini, you still had to turn off openstack-* and neutron-* services before upgrade?18:32
bandinipradk: I do it because I had it in my scripts from the previous upgrade cycle, not sure they are strictly still needed18:33
bandinipradk: let's say they don't hurt ;)18:33
EmilienMbnemec: anything that says heat timeout18:33
EmilienMbnemec: I documented my little research https://bugs.launchpad.net/heat/+bug/1626675/comments/118:34
openstackLaunchpad bug 1626675 in heat "Further memory usage issues with big stacks" [Undecided,New]18:34
EmilienMtherve, zaneb: when you get a moment, please look https://bugs.launchpad.net/heat/+bug/1626675/comments/118:34
pradkk cool18:34
*** rasca has quit IRC18:36
*** snecklifter has quit IRC18:36
*** fzdarsky has quit IRC18:37
zanebEmilienM: why is that patch suspicious?18:41
EmilienMzaneb: because it talks about performance a little18:41
EmilienMzaneb: my only assumption is that something broke us on 11/12/13 th18:41
zanebright, it should have improved performance if anything18:42
*** coolsvap has quit IRC18:42
zanebEmilienM: what's the 503 the result of? OOM killer?18:43
EmilienMbnemec ^18:45
openstackgerritJames Slagle proposed openstack/instack-undercloud: Update default VM memory  https://review.openstack.org/37505418:45
*** sshnaidm is now known as sshnaidm|afk18:45
EmilienMhttps://github.com/openstack/heat/compare/b9d1e30...a9e9b3118:45
EmilienMmwhahaha, zaneb^ that's the diff between when it used to work fine and when it broke18:45
EmilienM(still for puppet CI)18:46
EmilienMhttps://github.com/openstack/heat/commit/873a40851dd7807c6de0ee73affb7af2be875519 is also suspicious18:48
openstackgerritJames Slagle proposed openstack/tripleo-docs: Update minimum specs for virt setup  https://review.openstack.org/37505718:48
EmilienMzaneb: wdyt ^?18:48
zanebEmilienM: https://github.com/openstack/heat/commit/d79236468931db780fe90e4297f5033fe9db24cf is by far the most suspicious of that lot18:49
*** yamahata has joined #tripleo18:49
EmilienMzaneb: why18:50
EmilienMwe don't use it in RDO18:50
EmilienMwe use oslo deps from latest tag18:50
EmilienMin uc18:50
zanebthe rest just seems really unlikely18:51
EmilienM¯\_(ツ)_/¯18:51
mwhahahawe used the same oslodb18:51
zanebstuff related to specific resource types that you don't use18:51
mwhahahain puppet18:51
EmilienMmwhahaha: yes I just checked, that's not oslo db18:51
zanebreverting a patch from a couple of days before18:51
zanebstuff related to convergence that doesn't do anything yet, and anyway we don't use convergence in TripleO (although we might in this gate)18:52
EmilienMi don't know if we use it in puppet gate18:52
zanebit's the default, so if you don't *not* use it then you do18:52
EmilienMso in puppet gate we use it18:54
openstackgerritMerged openstack/tripleo-docs: Add IP Assignment to node_placement  https://review.openstack.org/37497718:55
bnemecI thought we explicitly turned off convergence on the undercloud?18:56
EmilienMyes we do18:57
EmilienMheat::engine::convergence_engine: false18:57
mwhahahaare we assuming the puppet timeout issue is related to the current memory thing?18:59
*** absubram has joined #tripleo19:01
zanebgood question19:01
mwhahahacause i don't think they are related :D19:02
mwhahahai think the puppet one is https://github.com/openstack/heat/commit/873a40851dd7807c6de0ee73affb7af2be87551919:02
EmilienMindeed19:02
*** absubram has quit IRC19:03
zanebEmilienM: the puppet one is https://bugs.launchpad.net/heat/+bug/1622979 ?19:04
openstackLaunchpad bug 1622979 in heat "Stack DELETE IN_PROGRESS: Unhandled error in asynchronous task" [Undecided,New]19:04
*** [1]cdearborn has joined #tripleo19:05
*** absubram has joined #tripleo19:06
EmilienMzaneb: yes19:07
dtrainorI have a stubborn deployment that seems to be stuck.  I don't see any errors on either my 1 controller or 1 compute except for an authentication failure for ro_snmp_user (I don't think that's related, but I'll look in to it later).  Here's some output from os-collect-config https://paste.fedoraproject.org/432883/57119314/19:07
zanebEmilienM: ok, I just closed that as a duplicate of https://bugs.launchpad.net/heat/+bug/1626173 - it's the exact same stack trace19:07
openstackLaunchpad bug 1626173 in heat "stack failed to reach DELETE_COMPLETE status (timeout)" [High,Fix released] - Assigned to Crag Wolfe (cwolfe)19:07
*** ayoung has joined #tripleo19:14
*** ayoung has quit IRC19:14
*** cdearborn has quit IRC19:15
*** r-mibu has quit IRC19:17
*** r-mibu has joined #tripleo19:17
openstackgerritChris Jones proposed openstack/tripleo-quickstart: Add ssh option IdentitiesOnly.  https://review.openstack.org/37476919:22
*** jprovazn has quit IRC19:24
*** zigo_ is now known as zigo19:30
openstackgerritBen Nemec proposed openstack/diskimage-builder: Shorten DHCP timeout in dhcp-all-interfaces  https://review.openstack.org/37507319:30
*** cdearborn has joined #tripleo19:35
openstackgerritPradeep Kilambi proposed openstack/tripleo-heat-templates: Ceilometer Wsgi Mitaka->Newton upgrades  https://review.openstack.org/36000419:42
*** [1]cdearborn has quit IRC19:49
bnemecThat moment when you realize you just wiped the vm that had all of your custom heat templates. :-(19:50
*** snecklifter has joined #tripleo19:50
*** ayoung has joined #tripleo19:51
*** jeckersb is now known as jeckersb_gone19:51
*** snecklifter has quit IRC19:54
mwhahahaoh noes19:57
*** ayoung has quit IRC19:57
*** chem` has quit IRC19:57
openstackgerritMerged openstack/tripleo-common: Fix the default plan creation  https://review.openstack.org/37134720:02
*** chem` has joined #tripleo20:09
EmilienMslagle: what happenned to gate-tripleo-ci-centos-7-nonha-multinode ?20:11
EmilienM44 min :-O20:11
slaglei regularly see runs under 45 minutes :)20:11
slagleit's all over the map though20:11
EmilienMso dstat20:12
EmilienMhttp://logs.openstack.org/82/374982/3/check/gate-tripleo-ci-centos-7-nonha-multinode/7c1a7e9/logs/dstat.txt.gz20:12
slaglefaster i have ever seen it run is 39 minutes20:12
EmilienMthat's super fast20:12
*** chem` has quit IRC20:13
*** nyechiel has joined #tripleo20:14
bnemecThe osic nodes are really fast, from what I can tell.20:15
bnemecOn a semi-related note, does anyone know why we still have an ovb-nonha job in the experimental pipline?20:16
bnemec*pipeline even20:16
EmilienMto test nonha on 2 nodes I guess?20:17
EmilienMthe multinode job test nonha on a single overcloud node20:17
EmilienMI'm not sure of the benefit to keep the ovb non ha indeed20:17
EmilienMI know it not high priority but here are some easy/small patches to support recent version of puppet: https://review.openstack.org/#/q/topic:tripleo/puppet420:18
openstackgerritMerged openstack/python-tripleoclient: Change the level of mocking for the wait_for_stack_ready test  https://review.openstack.org/37496820:18
bnemecOh wait, I know this actually.20:20
openstackgerritMerged openstack-infra/tripleo-ci: Add myself to the planet  https://review.openstack.org/37438520:20
bnemecI'm pretty sure that's there so we can do check experimental on other projects like heat and ironic.20:20
openstackgerritOpenStack Proposal Bot proposed openstack/tripleo-common: Updated from global requirements  https://review.openstack.org/37372220:21
EmilienMbnemec: also20:22
EmilienMbnemec: but I'm not sure the reason of running it in tripleo projects20:22
pradkEmilienM, can we merge this.. https://review.openstack.org/#/c/371591/20:22
bnemecEmilienM: We've only got one experimental-tripleo queue.  We'd have to create a separate one to not run it on tripleo projects.20:22
bnemecWhich, meh.20:23
bnemecExperimental jobs get lowest priority on test envs anyway.20:23
EmilienMpradk: done20:23
pradkthx20:23
*** panda|break is now known as panda20:25
*** Goneri has quit IRC20:28
*** absubram_ has joined #tripleo20:32
*** bfournie has quit IRC20:33
*** jayg is now known as jayg|g0n320:33
*** absubram has quit IRC20:33
*** absubram_ is now known as absubram20:33
*** nyechiel has quit IRC20:41
*** lucasagomes is now known as lucas-afk20:42
*** paramite has quit IRC20:43
*** fpan has quit IRC20:46
*** cylopez has joined #tripleo20:47
slaglescale down is busted20:49
slaglehttps://bugs.launchpad.net/tripleo/+bug/162673620:49
openstackLaunchpad bug 1626736 in tripleo " Unable to delete overcloud node" [Critical,New]20:49
*** jeckersb_gone is now known as jeckersb20:51
slaglejrist: does the UI intend to support scaling down?20:53
slagleb/c we don't have a workflow for that. i don't think anyone is working on it20:54
jrist"intend"?20:56
jristI haven't seen any request for that20:56
jristin general20:56
slagleyea, intend meaning is it a requirement20:57
slaglei guess not :)20:57
jristis it something the CLI supports?20:57
slagleyes, although it's currently broken. due to moving the templates to swift20:58
jristso20:58
slagleit looks like that part would be an easy fix though20:58
jristthe UI should use the same workflow as the CLI20:58
jristtheoretically...20:58
slaglewell, there is no mistral workflow20:58
rbradythere could be20:58
rbrady:)20:58
slagleyes :), that's what i was trying to ascertain20:59
rbradylet's file a bug on it so it doesn't get lost20:59
slagleif we needed one20:59
slaglerbrady: i think we could just use https://bugs.launchpad.net/tripleo/+bug/162673620:59
openstackLaunchpad bug 1626736 in tripleo " Unable to delete overcloud node" [Critical,Confirmed] - Assigned to Carlos Camacho (ccamacho)20:59
slaglerbrady: feel free to reassign it if you'd like20:59
jristah that's the fancy name for scaling down?20:59
*** mburned is now known as mburned_out21:00
jristdelete?21:00
jrist:)21:00
*** dmsimard|afk is now known as dmsimard21:00
slagleyea b/c that's what the cli argument is called21:00
jristaffects UI or effects UI21:00
jristaffects21:01
*** fpan has joined #tripleo21:01
rbradyslagle: I've tagged it with workflows and we'll get to it21:01
slaglerbrady: cool, thanks21:01
*** rhallisey has quit IRC21:02
*** dsneddon has quit IRC21:08
*** myoung is now known as myoung|gone21:10
openstackgerritDougal Matthews proposed openstack/python-tripleoclient: Remove the get_hiera_key function  https://review.openstack.org/36736721:10
openstackgerritDougal Matthews proposed openstack/tripleo-common: Remove the old, deprecated Mistral action names  https://review.openstack.org/36652921:10
openstackgerritDougal Matthews proposed openstack/tripleo-common: Remove the unused service_host arg from node registration  https://review.openstack.org/32603621:10
*** dsneddon has joined #tripleo21:13
openstackgerritOpenStack Proposal Bot proposed openstack/python-tripleoclient: Updated from global requirements  https://review.openstack.org/37512521:16
*** jbadiapa has quit IRC21:32
*** rcernin has quit IRC21:34
*** fpan has quit IRC21:36
*** bkopilov has quit IRC21:37
*** liverpooler has quit IRC21:37
*** nijaba has quit IRC21:37
*** adarazs has quit IRC21:37
*** zoli_gone-proxy has quit IRC21:37
*** onovy has quit IRC21:37
*** akrzos has quit IRC21:37
*** athomas has quit IRC21:37
*** rdopiera has quit IRC21:37
*** dbecker has quit IRC21:37
*** tdasilva has quit IRC21:37
*** colonwq has quit IRC21:37
*** myoung|gone has quit IRC21:37
*** slagle has quit IRC21:37
*** greghaynes has quit IRC21:37
*** dobson has quit IRC21:37
*** shadower has quit IRC21:37
*** dmanchad has quit IRC21:37
*** bswartz has quit IRC21:37
*** mwhahaha has quit IRC21:37
*** timothyb89 has quit IRC21:37
*** rodrigods has quit IRC21:37
*** jpeeler has quit IRC21:37
*** bandini has quit IRC21:37
*** kbyrne has quit IRC21:37
*** hewbrocca-afk has quit IRC21:37
*** ansiwen has quit IRC21:37
*** bkero has quit IRC21:37
*** mandre has quit IRC21:37
*** lazy_prince has quit IRC21:37
*** toure has quit IRC21:37
*** markmc has quit IRC21:37
*** rajinir has quit IRC21:37
*** tzumainn has quit IRC21:37
*** oshvartz has quit IRC21:37
*** ccamacho|afk has quit IRC21:37
*** jroll has quit IRC21:37
*** eggmaster has quit IRC21:37
*** CaptTofu has quit IRC21:37
*** sirushti has quit IRC21:37
*** jayg|g0n3 has quit IRC21:37
*** HenryG has quit IRC21:37
*** mrunge has quit IRC21:37
*** andreaf has quit IRC21:37
*** mgagne has quit IRC21:37
*** athomas has joined #tripleo21:38
*** rdopiera has joined #tripleo21:38
*** dbecker has joined #tripleo21:38
*** tdasilva has joined #tripleo21:38
*** colonwq has joined #tripleo21:38
*** myoung|gone has joined #tripleo21:38
*** slagle has joined #tripleo21:38
*** greghaynes has joined #tripleo21:38
*** dobson has joined #tripleo21:38
*** bswartz has joined #tripleo21:38
*** shadower has joined #tripleo21:38
*** dmanchad has joined #tripleo21:38
*** mwhahaha has joined #tripleo21:38
*** timothyb89 has joined #tripleo21:38
*** gregwork has quit IRC21:40
*** rajinir has joined #tripleo21:40
*** tzumainn has joined #tripleo21:40
*** oshvartz has joined #tripleo21:40
*** ccamacho|afk has joined #tripleo21:40
*** jroll has joined #tripleo21:40
*** eggmaster has joined #tripleo21:40
*** CaptTofu has joined #tripleo21:40
*** sirushti has joined #tripleo21:40
*** jayg|g0n3 has joined #tripleo21:40
*** HenryG has joined #tripleo21:40
*** mrunge has joined #tripleo21:40
*** andreaf has joined #tripleo21:40
*** mgagne has joined #tripleo21:40
*** mwhahaha has quit IRC21:40
*** fpan has joined #tripleo21:41
*** bkopilov has joined #tripleo21:41
*** liverpooler has joined #tripleo21:41
*** nijaba has joined #tripleo21:41
*** adarazs has joined #tripleo21:41
*** zoli_gone-proxy has joined #tripleo21:41
*** cylopez has quit IRC21:41
*** CaptTofu has quit IRC21:41
*** onovy has joined #tripleo21:41
*** akrzos has joined #tripleo21:41
*** NachoDuck has quit IRC21:41
*** rodrigods has joined #tripleo21:42
*** jpeeler has joined #tripleo21:42
*** bandini has joined #tripleo21:42
*** kbyrne has joined #tripleo21:42
*** hewbrocca-afk has joined #tripleo21:42
*** bkero has joined #tripleo21:42
*** ansiwen has joined #tripleo21:42
*** mandre has joined #tripleo21:42
*** lazy_prince has joined #tripleo21:42
*** toure has joined #tripleo21:42
*** markmc has joined #tripleo21:42
EmilienMgreat now irc is broken21:43
EmilienMreally a good day today21:43
*** rajinir has quit IRC21:44
*** hrybacki has quit IRC21:45
pandaEmilienM: wait until it starts to rain.21:46
EmilienMit's raining here21:48
*** jcoufal_ has quit IRC21:49
*** jcoufal has joined #tripleo21:50
*** jcoufal has quit IRC21:50
pandalol21:52
*** ayoung has joined #tripleo21:56
*** gregwork has joined #tripleo22:02
openstackgerritBen Nemec proposed openstack/diskimage-builder: Shorten DHCP timeout in dhcp-all-interfaces  https://review.openstack.org/37507322:03
*** NachoDuck has joined #tripleo22:03
bnemec^should be 9 minutes off every CI run.22:04
bnemecYou're welcome. :-)22:04
EmilienMit's a lot, thanks22:05
jristnice bnemec22:06
*** rajinir has joined #tripleo22:10
openstackgerritAlex Schultz proposed openstack/tripleo-docs: Update minimum memory requirements  https://review.openstack.org/37513622:12
*** hrybacki has joined #tripleo22:13
pandabnemec: I'm reading http://docs.openstack.org/developer/tripleo-docs/advanced_deployment/network_isolation.html#create-network-environment-file as a night reading .. but still have a lot of doubts .. do you have 10 minutes tomorrow to walk me through a configuration for my test env ?22:18
*** CaptTofu has joined #tripleo22:18
*** cdearborn has quit IRC22:19
*** jrist has quit IRC22:19
bnemecpanda: Yeah, I should be able to.  Do you generally join the DF scrum?  We could maybe stay on after that talk it over.22:22
pandabnemec: I always intend to join, then miss it for a reason or another. Bt scrum comes late in my day, and I wanted to set up the env by the end of the week.22:25
bnemecpanda: Okay, we can figure something out tomorrow.  We can always jump on my bluejeans call too.22:26
*** egafford has quit IRC22:27
pandabnemec: thanks a lot! ping you tomorrow then. Have a nice evening22:28
*** panda is now known as panda|zZ22:28
dsneddonbnemec, Do you know of anywhere where I could find documentation on setting per-node ExtraConfig options? I think I've seen an example where a JSON map was passed to a script, but I can't find one now.22:40
*** mwhahaha has joined #tripleo22:55
*** social has joined #tripleo22:57
*** ayoung has quit IRC23:13
*** HenryG has quit IRC23:28
*** HenryG has joined #tripleo23:28
*** mburned_out is now known as mburned23:29
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: WIP - Deploy TripleO with Puppet 4  https://review.openstack.org/37120923:48
bnemecdsneddon: http://docs.openstack.org/developer/tripleo-docs/advanced_deployment/node_config.html23:53
bnemecOr http://docs.openstack.org/developer/tripleo-docs/advanced_deployment/extra_config.html23:53
bnemecdsneddon: Actually maybe what you're looking for is http://docs.openstack.org/developer/tripleo-docs/advanced_deployment/node_specific_hieradata.html23:54
bnemecThat one uses a JSON map.23:54

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!