*** chlong_ has quit IRC | 00:03 | |
openstackgerrit | Merged openstack/sahara: Making health verification periodics distributed https://review.openstack.org/284688 | 00:49 |
---|---|---|
openstackgerrit | Merged openstack/sahara: implement sending health notifications https://review.openstack.org/281194 | 01:23 |
*** chlong_ has joined #openstack-sahara | 01:49 | |
*** links has joined #openstack-sahara | 03:26 | |
*** Poornima has joined #openstack-sahara | 04:26 | |
openstackgerrit | Jaxon Wang proposed openstack/sahara: Refine the code for CDH PluginUtils class https://review.openstack.org/250810 | 05:08 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: CDH plugin config helper refactoring https://review.openstack.org/255825 | 05:46 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: Update CDH user doc for CDH 5.5.0 https://review.openstack.org/281670 | 05:46 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: CDH plugin edp engine code refactoring https://review.openstack.org/257309 | 05:46 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: Add CDH 5.5 support https://review.openstack.org/279964 | 05:46 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: Update CDH user doc for CDH 5.5.0 https://review.openstack.org/281670 | 06:01 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: CDH plugin edp engine code refactoring https://review.openstack.org/257309 | 06:01 |
openstackgerrit | Jaxon Wang proposed openstack/sahara: Add CDH 5.5 support https://review.openstack.org/279964 | 06:01 |
*** sgotliv has joined #openstack-sahara | 06:14 | |
*** links has quit IRC | 06:26 | |
*** sgotliv has quit IRC | 06:33 | |
*** sgotliv has joined #openstack-sahara | 06:33 | |
*** nkrinner has joined #openstack-sahara | 06:37 | |
openstackgerrit | Jaxon Wang proposed openstack/sahara-specs: Add new spec for NFS-as-a-data-source blueprint https://review.openstack.org/210839 | 07:03 |
*** rcernin has joined #openstack-sahara | 07:05 | |
*** sgotliv has quit IRC | 07:10 | |
openstackgerrit | Jaxon Wang proposed openstack/sahara-specs: Add new spec for NFS-as-a-data-source blueprint https://review.openstack.org/210839 | 07:16 |
*** Haomeng has quit IRC | 07:19 | |
*** Haomeng has joined #openstack-sahara | 07:20 | |
*** chlong_ has quit IRC | 07:26 | |
*** vgridnev has joined #openstack-sahara | 08:01 | |
*** vgridnev has quit IRC | 08:05 | |
*** witlessb has joined #openstack-sahara | 08:21 | |
openstackgerrit | Nikita Konovalov proposed openstack/sahara: HA for NameNode and ResourceManager in HDP 2.2 https://review.openstack.org/197551 | 08:28 |
openstackgerrit | Luigi Toscano proposed openstack/sahara: Use the integrated tempest.lib module https://review.openstack.org/285159 | 08:39 |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara-tests: Add check of scaling for CDH and Ambari https://review.openstack.org/274675 | 08:47 |
*** vgridnev has joined #openstack-sahara | 08:56 | |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara-tests: Add pylint to tox https://review.openstack.org/285173 | 09:10 |
*** esikachev has joined #openstack-sahara | 09:12 | |
*** rcernin has quit IRC | 09:14 | |
*** tellesnobrega is now known as tellesnobrega_af | 09:20 | |
*** esikachev has quit IRC | 09:24 | |
*** rcernin has joined #openstack-sahara | 09:28 | |
openstackgerrit | Luigi Toscano proposed openstack/sahara-ci-config: Temporary install tempest-lib in tox venv https://review.openstack.org/285186 | 09:35 |
openstackgerrit | Merged openstack/sahara: Updated from global requirements https://review.openstack.org/285069 | 09:55 |
*** _degorenko|afk is now known as degorenko | 10:16 | |
openstackgerrit | Luigi Toscano proposed openstack/sahara: Use the integrated tempest.lib module https://review.openstack.org/285159 | 10:25 |
*** vgridnev has quit IRC | 10:27 | |
*** vgridnev has joined #openstack-sahara | 10:31 | |
openstackgerrit | Merged openstack/sahara-dashboard: Changing name of "Data image registry" https://review.openstack.org/283259 | 10:33 |
openstackgerrit | Merged openstack/sahara-dashboard: Use oslo timeutils for durations https://review.openstack.org/284697 | 10:36 |
openstackgerrit | Merged openstack/sahara-dashboard: Job duration column added https://review.openstack.org/273981 | 10:36 |
*** tosky has joined #openstack-sahara | 10:43 | |
tosky | sorry for the mess with the reviews, just trying to find the most reliable way to adapt to the changes in tempest/tempest-lib | 10:43 |
*** sgotliv has joined #openstack-sahara | 10:49 | |
*** sgotliv has quit IRC | 11:08 | |
*** Poornima has quit IRC | 11:25 | |
*** vgridnev has quit IRC | 11:44 | |
*** vgridnev has joined #openstack-sahara | 11:51 | |
*** krotscheck_dcm is now known as krotscheck | 12:13 | |
openstackgerrit | Nikita Konovalov proposed openstack/sahara-dashboard: Uptime column for clusters https://review.openstack.org/285262 | 12:13 |
*** raildo-afk is now known as raildo | 12:32 | |
openstackgerrit | Merged openstack/python-saharaclient: Updated from global requirements https://review.openstack.org/285067 | 12:53 |
*** itisha has joined #openstack-sahara | 13:15 | |
openstackgerrit | Andrey Pavlov proposed openstack/python-saharaclient: Adding "health verification --show" CLI call https://review.openstack.org/285295 | 13:22 |
*** AndreyPavlov has joined #openstack-sahara | 13:22 | |
*** _crobertsrh is now known as crobertsrh | 13:25 | |
*** Akanksha08 has quit IRC | 13:42 | |
openstackgerrit | Chad Roberts proposed openstack/sahara: Updating dashboard user guide post-reorg https://review.openstack.org/284886 | 13:46 |
*** nkrinner has quit IRC | 14:26 | |
*** tmckay has joined #openstack-sahara | 14:33 | |
rickflare | morning Sahara studs!!!!! | 14:38 |
rickflare | I tried several times last night to deploy my HDP cluster and I keep getting the following error from heat Error Authentication Failed Heat_Include_Password=1 | 14:40 |
vgridnev | hey, rickflare | 14:40 |
vgridnev | what version of sahara you are using? | 14:40 |
rickflare | morning vgridnev how are you? | 14:40 |
rickflare | let me check | 14:40 |
rickflare | I am running liberty | 14:40 |
vgridnev | Ah, got it | 14:41 |
vgridnev | I'm nice | 14:41 |
tmckay | rickflare, hmm, could it be the role? | 14:41 |
rickflare | openstack-sahara-common-3.0.0-5.cc218ddgit.el7.noarch | 14:41 |
tmckay | I have seen problems when the user I am using to launch the sahara cluster does not have "heat_owner" | 14:41 |
vgridnev | so, it looks like heat stack was not moved to CREATE_COMPLETE in one hour | 14:41 |
tmckay | I forget exactly how to spell heat owner | 14:41 |
rickflare | role? | 14:42 |
rickflare | tmckay | 14:42 |
rickflare | I am using the admin account under the admin project | 14:42 |
rickflare | if that helps | 14:42 |
* tmckay checks something on his liberty packstack install | 14:42 | |
vgridnev | rickflare, could you please execute heat stack-list --show-hidden? | 14:43 |
rickflare | http://pastebin.com/jxt7dvEA | 14:44 |
rickflare | not sure why my hdp cluster does not show up here | 14:44 |
tosky | rickflare: openstack user role list -> does admin have heat_stack_owner? | 14:44 |
tmckay | rickflare, just a guess based on errors I've seen -- if you source the admin rc file and do "keystone user-role-list" does it say the admin user has "heat_stack_owner" and "heat_stack_user" ? | 14:44 |
tmckay | or what tosky said :) | 14:45 |
tosky | tmckay: future-proof change: "keystone" CLI client is deprecated :) | 14:45 |
tmckay | well, just heat_stack_owner for me ... | 14:45 |
tmckay | tosky, yes, yes, you're right :) | 14:45 |
rickflare | http://pastebin.com/L1M2yUDQ | 14:45 |
tmckay | I don't like change ;-) wrong business | 14:46 |
tmckay | ok, that looks alright | 14:46 |
rickflare | I dont know what to do | 14:50 |
tosky | rickflare: can you please post more from your heat logs (for api and engine), paying attention to sanitize them from confidential details? Especially when you start the cluster | 14:50 |
rickflare | not sure what to grap tosky | 14:51 |
tosky | rickflare: stop heat, clean heat logs (/var/log/heat/heat-*.log), restart heat, run a cluster, wait for failure, get the heat logs | 14:53 |
*** vgridnev has quit IRC | 14:54 | |
elmiko | tmckay: lol, don't like change, works on cutting edge software ;) | 14:54 |
tosky | reading around, that error could be just a red herring, coming from the timeout, and the real issue could happen before | 14:54 |
rickflare | whoa | 14:55 |
rickflare | just got some strange errors | 14:55 |
elmiko | you mean, more strange errors ;) | 14:55 |
rickflare | this is on the engine restart | 14:55 |
rickflare | http://pastebin.com/p8ZQ5FP6 | 14:55 |
rickflare | elmiko! | 14:55 |
rickflare | its Friday | 14:55 |
rickflare | WHOOOOOOOOOO! | 14:55 |
elmiko | \o/ | 14:55 |
elmiko | wow, zaqar, mistral, *and* designate. going for a full openstack buffet eh? | 14:56 |
tosky | I wonder why the import error in heat - are there references in the configuration? Maybe you installed them, then removed but the entrypoint stayed | 14:57 |
*** vgridnev has joined #openstack-sahara | 14:57 | |
rickflare | those python errors | 14:59 |
rickflare | though? | 14:59 |
*** vgridnev has quit IRC | 14:59 | |
elmiko | those are stevedore errors though, so it might be something related to the dynamic loading of those plugins | 15:00 |
elmiko | like, some config file is calling for their inclusion | 15:00 |
*** vgridnev has joined #openstack-sahara | 15:00 | |
rickflare | so the package python-zaqarclient is not installed on my system | 15:01 |
rickflare | i just installed it | 15:01 |
tosky | did you install it at some point in the past, through package or pip? | 15:02 |
*** vgridnev has quit IRC | 15:04 | |
rickflare | what python rpm provides mistralclient.api | 15:05 |
rickflare | tosky nope | 15:05 |
elmiko | presumably python-mistralclient, but i'm not sure if rdo has packaged that one. tosky you know? | 15:05 |
rickflare | its not in yum | 15:06 |
rickflare | and heat is bitching about it | 15:06 |
elmiko | seems weird for rdo to configure these extra packages | 15:06 |
tosky | not here http://cbs.centos.org/repos/cloud7-openstack-liberty-release/x86_64/os/Packages/ | 15:06 |
elmiko | i'm not even sure if any of those(zaqar,mistral,designate) have been included for GA yet | 15:06 |
tmckay | elmiko, btw, this one needs another +2, on the ff list https://review.openstack.org/#/c/284653/ | 15:06 |
tmckay | looks like all of the health check stuff has passed except for "tempest tests", failing on multiple (probably broken) | 15:07 |
tosky | rickflare: I wonder what happened then; how the heat initialization tries to find them | 15:07 |
elmiko | tmckay: yup, it's on my list just didn't get to it last night :/ | 15:07 |
tmckay | k, thanks :) | 15:07 |
tosky | tmckay: I have at least two patches to fix the client scenario tests | 15:07 |
elmiko | i also want to hit the review you had up, sadly though, i need to go heads down at some point and do something *besides* reviews.... | 15:08 |
tosky | but the gates are horribly slow, so I don't have the results yet | 15:08 |
tmckay | yay! File "/usr/local/lib/python2.7/dist-packages/sahara/tests/tempest/scenario/data_processing/client_tests/test_job_binaries.py", line 16, in <module> from tempest_lib.common.utils import data_utils ImportError: No module named tempest_lib.common.utils | 15:08 |
tosky | check my reviews, it's all explained there | 15:08 |
tmckay | tosky :) will do. | 15:08 |
tosky | one for sahara, one for sahara-ci-config | 15:08 |
rickflare | rebuilding the cluster again | 15:13 |
rickflare | this should brighting everyones Friday! https://www.youtube.com/watch?v=xk8mm1Qmt-Y | 15:16 |
rickflare | man HDP takes for ever to spawn | 15:20 |
tosky | rickflare: do you see other exceptions? And can you run a simple cirros instance? | 15:22 |
rickflare | yea | 15:22 |
rickflare | i have stood up a spark cluster | 15:22 |
rickflare | no prob | 15:22 |
*** vgridnev has joined #openstack-sahara | 15:24 | |
*** vgridnev has quit IRC | 15:28 | |
*** vgridnev has joined #openstack-sahara | 15:28 | |
*** vgridnev has quit IRC | 15:31 | |
rickflare | this is so strange | 15:33 |
rickflare | if i boot the hdp image | 15:33 |
rickflare | as a instance | 15:33 |
rickflare | it boot just fin | 15:33 |
rickflare | run it though the cluster | 15:33 |
rickflare | and its like it hangs | 15:33 |
elmiko | weird | 15:35 |
rickflare | oh shit | 15:35 |
rickflare | DUHHHHHHHH | 15:35 |
rickflare | i figured it out | 15:35 |
elmiko | pbkac? | 15:35 |
rickflare | this is so dumb on my part | 15:35 |
rickflare | i some how selected my centos image | 15:35 |
rickflare | let me test this | 15:36 |
rickflare | i know my ubuntu image works | 15:36 |
rickflare | but the centos one | 15:36 |
rickflare | not sure | 15:36 |
openstackgerrit | Chad Roberts proposed openstack/sahara: Updating dashboard user guide post-reorg https://review.openstack.org/284886 | 15:37 |
rickflare | so i am not sure the mirantis hdp images work | 15:41 |
tosky | rickflare: but hdp 2.0.6 is centos only, do you talk about ubuntu image for another plugin? | 15:43 |
rickflare | so I used ubuntu for spark | 15:44 |
rickflare | and that image works fine | 15:44 |
rickflare | the image I downloaded from mirantis | 15:44 |
rickflare | boots | 15:44 |
rickflare | but when it gets to the dhcp ip part | 15:44 |
rickflare | it hands | 15:44 |
rickflare | and just sits there | 15:44 |
tosky | iirc it takes a while for resizing | 15:45 |
rickflare | resizing the disk? | 15:45 |
rickflare | but why would it take 20times the time the ubuntu image does | 15:45 |
rickflare | well | 15:45 |
rickflare | it is 1.6gb | 15:45 |
rickflare | but | 15:45 |
rickflare | still | 15:45 |
rickflare | seems long | 15:45 |
rickflare | to me | 15:46 |
tosky | it should not take so long | 15:47 |
rickflare | trying again | 15:47 |
tosky | if you did run the instance manually (outside the cluster) can you please paste the logs from the console? | 15:47 |
rickflare | strange | 15:48 |
rickflare | the instance finally booted | 15:48 |
rickflare | im going to make my clust smaller | 15:48 |
rickflare | all the instances are getting written to a raid 5 arrasy | 15:48 |
rickflare | maybe the large cluster i just overwhelming the IO | 15:49 |
rickflare | and its hanging | 15:49 |
*** vgridnev has joined #openstack-sahara | 15:49 | |
*** esikachev has joined #openstack-sahara | 15:50 | |
*** vgridnev has quit IRC | 15:51 | |
*** vgridnev has joined #openstack-sahara | 15:53 | |
openstackgerrit | Andrey Pavlov proposed openstack/sahara: Adding cluster scaling and changing CLI calls in grenade https://review.openstack.org/250422 | 16:03 |
*** vgridnev has quit IRC | 16:10 | |
*** logan- has quit IRC | 16:14 | |
*** logan- has joined #openstack-sahara | 16:14 | |
openstackgerrit | Michael Ionkin proposed openstack/sahara-dashboard: Added Base Image field on Node Group Template form https://review.openstack.org/259535 | 16:16 |
*** sgotliv has joined #openstack-sahara | 16:17 | |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara-tests: Fix using proxy node for checks https://review.openstack.org/279447 | 16:17 |
*** rcernin has quit IRC | 16:31 | |
rickflare | yup | 16:31 |
rickflare | that was it | 16:31 |
rickflare | it was timing out | 16:31 |
rickflare | reduced the cluster size | 16:31 |
rickflare | and seeming to be moving along now | 16:31 |
rickflare | in the waiting stage | 16:31 |
openstackgerrit | Nikita Konovalov proposed openstack/sahara-dashboard: Import reorg and cleanup https://review.openstack.org/285411 | 16:34 |
*** tellesnobrega_af is now known as tellesnobrega | 16:34 | |
tosky | rickflare: how big was the cluster? | 16:35 |
*** AndreyPavlov has quit IRC | 16:40 | |
rickflare | 12 nodes | 16:41 |
rickflare | x.large instances | 16:41 |
rickflare | im now 8 | 16:41 |
openstackgerrit | Merged openstack/sahara-tests: Add more infomation when create cluster failed for scenario test https://review.openstack.org/281095 | 16:55 |
openstackgerrit | Merged openstack/sahara-tests: Use ostestr instead of the custom pretty_tox.sh https://review.openstack.org/284831 | 16:55 |
rickflare | geeze | 16:59 |
rickflare | still waiting | 16:59 |
tosky | uhm | 16:59 |
*** tmckay is now known as _tmckay | 16:59 | |
tosky | many compute nodes/ | 17:00 |
tosky | ? | 17:00 |
rickflare | 6 | 17:00 |
rickflare | 1 master | 17:00 |
*** AndreyPavlov has joined #openstack-sahara | 17:02 | |
tosky | just to be clear, I was asking about the compute nodes from openstack point of view, not the workers in the hadoop world | 17:02 |
rickflare | its one massive server | 17:03 |
rickflare | 64 cores | 17:03 |
rickflare | 15TB | 17:03 |
rickflare | 2TB ssd | 17:03 |
rickflare | 500GB ram | 17:03 |
rickflare | I have no idea | 17:07 |
rickflare | what its doing | 17:07 |
rickflare | at this point | 17:07 |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara-tests: Add ability export results to file https://review.openstack.org/284611 | 17:08 |
*** ekarlso- has quit IRC | 17:20 | |
*** krotscheck has quit IRC | 17:20 | |
*** bapalm has quit IRC | 17:20 | |
*** krotscheck has joined #openstack-sahara | 17:20 | |
*** bapalm has joined #openstack-sahara | 17:21 | |
*** esikachev has quit IRC | 17:24 | |
*** esikachev has joined #openstack-sahara | 17:26 | |
*** ekarlso- has joined #openstack-sahara | 17:27 | |
openstackgerrit | Merged openstack/sahara-tests: Add pylint to tox https://review.openstack.org/285173 | 17:31 |
openstackgerrit | Merged openstack/sahara-tests: Fix scenario tests for correct output to swift https://review.openstack.org/276624 | 17:31 |
*** esikachev has quit IRC | 17:45 | |
openstackgerrit | Luigi Toscano proposed openstack/python-saharaclient: Use ostestr instead of the custom pretty_tox.sh https://review.openstack.org/285467 | 17:54 |
openstackgerrit | Luigi Toscano proposed openstack/python-saharaclient: Use ostestr instead of the custom pretty_tox.sh https://review.openstack.org/285467 | 17:56 |
*** tosky has quit IRC | 17:59 | |
*** degorenko is now known as _degorenko|afk | 18:01 | |
*** sgotliv has quit IRC | 18:02 | |
*** _tmckay is now known as tmckay | 18:13 | |
rickflare | its still waiting | 18:25 |
rickflare | wtf | 18:25 |
tmckay | rickflare, the sahara status "waiting" I believe is sahara waiting to be able to log in to each of the instances. In the case of hdp, it might be trying to connect to the ambari server. | 18:44 |
tmckay | rickflare, I'd look at the sahara-engine logs and try to see what sahara thinks it's doing | 18:45 |
tmckay | one thing to check is whether or not you can ssh into each of the boxes from the controller using the key your provided | 18:45 |
tmckay | elmiko, do you remember if "waiting" includes connecting to the ambari server? ^^ | 18:46 |
elmiko | hmm | 18:46 |
elmiko | it could | 18:46 |
rickflare | i can ssh into each node | 18:46 |
elmiko | i concur with tmckay's advice though, look at the sahara api controller logs to see where it thinks it is stuck at | 18:47 |
tmckay | rickflare, okay, so somewhere in the sahara-engine log it should say something like "all instances available" | 18:47 |
elmiko | you should be able to get a good picture of what stage it is in | 18:47 |
tmckay | you could try grepping for "instances.*available", I think | 18:47 |
rickflare | doing a tiny cluster now | 18:48 |
rickflare | 4 nodes | 18:48 |
tmckay | k. you have debug level logging in sahara? | 18:48 |
tmckay | that would be good | 18:48 |
rickflare | naw | 18:48 |
rickflare | I dont think its turned on | 18:48 |
tmckay | doh! yeah, I would turn that on | 18:48 |
elmiko | +1 | 18:48 |
tmckay | especially for this | 18:48 |
rickflare | how do I turn it on | 18:49 |
tmckay | should just be "debug = true" in sahara.conf, in the DEFAULT section | 18:49 |
tmckay | then systemctl to restart the sahara processes | 18:49 |
tmckay | should be logging to /var/log/sahara, I believe | 18:50 |
rickflare | ok | 18:50 |
rickflare | will do | 18:50 |
rickflare | soon as this finished | 18:50 |
tmckay | ack | 18:50 |
rickflare | https://bugs.launchpad.net/sahara/+bug/1547653 | 18:51 |
openstack | Launchpad bug 1547653 in Sahara "Sahara Image Elements Fails to build Centos or Fedora images using Cloudera CDH 5.4" [High,Confirmed] | 18:51 |
rickflare | anyone know what the status is with this? | 18:51 |
elmiko | i have not looked into it | 18:52 |
crobertsrh | rickflare: doesn't seem to be assigned to anyone yet. | 18:52 |
crobertsrh | Not my forte, otherwise I'd have grabbed it. | 18:52 |
elmiko | likewise :/ | 18:53 |
crobertsrh | brb...errand | 18:54 |
*** agireud has quit IRC | 18:55 | |
*** agireud has joined #openstack-sahara | 18:57 | |
tmckay | rickflare, trying on my F21 box. Jaxon (owner of this review) has been doing a lot of work with cdh lately -- might be the person to contact for ideas https://review.openstack.org/#/c/279964/ | 19:04 |
rickflare | try | 19:05 |
rickflare | hdp | 19:05 |
rickflare | cdh worked fine | 19:05 |
tmckay | oh, I mean the bug | 19:05 |
tmckay | bug was cdh, wasn't it? | 19:05 |
rickflare | cdh | 19:06 |
rickflare | and | 19:06 |
rickflare | hdp | 19:06 |
*** dencaval has joined #openstack-sahara | 19:13 | |
*** Alex____ has joined #openstack-sahara | 19:29 | |
*** Alex____ has quit IRC | 19:30 | |
elmiko | tmckay: just workflow'd the spark 1.0.0 removal reviews | 19:34 |
tmckay | elmiko, thanks! | 19:34 |
tmckay | elmiko, sorting through details of proxy command and gateway host related to use_floating_ip logical tests :) | 19:35 |
elmiko | cool | 19:35 |
elmiko | i'm almost done with reviews for the week though, i'm cooked and have a bunch of other stuff to get to | 19:35 |
tmckay | lol, trying to keep my gray matter contained | 19:35 |
tmckay | elmiko, ack, I hear you | 19:36 |
elmiko | heh, wish i could do that... ;) | 19:36 |
openstackgerrit | Michael McCune proposed openstack/sahara: remove hdp from the default plugin list https://review.openstack.org/284922 | 19:40 |
openstackgerrit | Michael McCune proposed openstack/sahara: remove hdp from the default plugin list https://review.openstack.org/284922 | 19:41 |
tmckay | elmiko, you're just going to rebase when the ambari change merges? | 19:48 |
elmiko | i'm rebasing now | 19:49 |
rickflare | man | 19:49 |
rickflare | i think hdp | 19:49 |
elmiko | sigh... except it's fighting with me | 19:49 |
rickflare | is a no go | 19:49 |
openstackgerrit | Michael McCune proposed openstack/sahara: remove hdp from the default plugin list https://review.openstack.org/284922 | 19:50 |
openstackgerrit | Michael McCune proposed openstack/sahara: enable ambari plugin by default https://review.openstack.org/284719 | 19:50 |
elmiko | sigh... | 19:51 |
tmckay | rickflare, btw, I just successfully built tox -e venv -- sahara-image-create -i centos -p cloudera -v 5.4 -j oracle-java on Fedora 21 | 19:51 |
tmckay | I'll add that note to the bug | 19:51 |
rickflare | wait | 19:52 |
tmckay | crobertsrh, ^^, worked with sie master on F21 | 19:52 |
rickflare | do centos7 on 2.7.1 vanilla | 19:52 |
tmckay | k, in progress tox -e venv -- sahara-image-create -i centos7 -p vanilla -v 2.7.1 | 19:54 |
rickflare | NOICE! | 19:54 |
tmckay | I left off the -j oracle on this one | 19:54 |
tmckay | we'll see if it builds. sounds from crobertsrh comment that there might be a delta between F21 and F23, or if not then it's environmental | 19:56 |
tmckay | or I suppose there could have been a recent sie commit | 19:56 |
tmckay | hmm, no recent commits | 19:58 |
*** sgotliv has joined #openstack-sahara | 19:59 | |
*** itisha has quit IRC | 20:09 | |
crobertsrh | Not sure if the f21 -> f23 difference is key or not. Did you get it going? | 20:12 |
openstackgerrit | Merged openstack/sahara: Remove support for spark 1.0.0 https://review.openstack.org/282528 | 20:14 |
*** sgotliv has quit IRC | 20:37 | |
*** esikachev has joined #openstack-sahara | 20:42 | |
crobertsrh | elmiko, tmckay: any chance you guys have seen this? http://paste.openstack.org/show/488427/ | 20:49 |
crobertsrh | I get it when trying to run tools/get_auth_token.py (in a test script I have) | 20:49 |
tmckay | hmm, haven't seen that before. first guess would be stale venv? | 20:51 |
rickflare | so | 20:51 |
rickflare | I notice | 20:51 |
rickflare | the correct key was aplied to my hdp instances | 20:51 |
crobertsrh | tmckay: that was my guess, but fresh venv didn't fix it | 20:51 |
rickflare | but I can ssh in | 20:51 |
rickflare | i get a permission denied | 20:51 |
tmckay | k, centos7 vanilla 2.71 build failed, complaining that grub2 was already installed. | 20:53 |
rickflare | yup | 20:53 |
rickflare | I dont know what the hell is going on | 20:54 |
*** dave-mccowan has quit IRC | 20:56 | |
tmckay | rickflare, did you build the images with the debug flag, so you can log in as root? | 20:56 |
openstackgerrit | Merged openstack/sahara: Add default templates for spark plugin, version 1.6.0 https://review.openstack.org/282530 | 21:02 |
rickflare | no | 21:02 |
rickflare | these are direct from mirantis | 21:02 |
tmckay | rickflare, okay. hmm, so how do you know for sure that the right key was applied? (other silly possibility is using the wrong user to log in?) | 21:03 |
elmiko | crobertsrh: not sure if i've seen that specific one, but for me it either mean old venv or old version of tox | 21:03 |
rickflare | well | 21:03 |
crobertsrh | elmiko: I have a bead on it | 21:04 |
rickflare | i could be using the wrong user | 21:04 |
elmiko | crobertsrh: what is it? | 21:04 |
crobertsrh | Looks like keystoneclient.middleware is now keystonemiddleware | 21:04 |
rickflare | its the centos image | 21:04 |
rickflare | so i assumes | 21:04 |
crobertsrh | filing a bug/fix now | 21:04 |
rickflare | centos is the user | 21:04 |
elmiko | ah.. right | 21:04 |
elmiko | crobertsrh: good spot | 21:04 |
tmckay | rickflare, try cloud-user | 21:04 |
tmckay | I think it changed somewhere along the line ... | 21:04 |
openstackgerrit | Evgeny Sikachev proposed openstack/sahara-tests: Fix using proxy node for checks https://review.openstack.org/279447 | 21:05 |
rickflare | oh geeze | 21:06 |
rickflare | lmao | 21:06 |
rickflare | FAIL | 21:06 |
tmckay | https://stackops.zendesk.com/hc/en-us/articles/201923327-Centos-6-X-and-7-0-default-username-and-password | 21:06 |
rickflare | im in | 21:06 |
tmckay | centos 6 is cloud-user, 7 is centos | 21:06 |
tmckay | rickflare, not your fault :) | 21:06 |
*** esikachev has quit IRC | 21:07 | |
rickflare | ok | 21:07 |
rickflare | so im in the master node | 21:08 |
rickflare | and its like | 21:08 |
rickflare | nothing is going on | 21:08 |
tmckay | is this during cluster launch? | 21:08 |
rickflare | yea | 21:09 |
tmckay | rickflare, hmm, just a guess -- what if you gave sahara the wrong user name also when you launched the image? I wonder if it's trying to use that user as well for ssh commands | 21:10 |
tmckay | launched the cluster, I mean | 21:10 |
rickflare | fuck | 21:10 |
rickflare | LMAO | 21:10 |
rickflare | yup | 21:10 |
rickflare | sorry for the cussing | 21:10 |
tmckay | well, you *could* hack it | 21:10 |
rickflare | ill rebuild | 21:10 |
rickflare | thats it | 21:10 |
rickflare | one sec | 21:10 |
tmckay | rickflare, if you want to, and you generate your own images, you can throw -d on create and it will setup up root login through the console for you. | 21:12 |
tmckay | that can help sometimes | 21:12 |
rickflare | man | 21:14 |
rickflare | this def | 21:14 |
rickflare | would have kept it from working | 21:14 |
tmckay | growing pains | 21:14 |
openstackgerrit | Merged openstack/sahara-dashboard: Import reorg and cleanup https://review.openstack.org/285411 | 21:15 |
*** AndreyPavlov has quit IRC | 21:15 | |
openstackgerrit | Chad Roberts proposed openstack/sahara: Updating get_auth_token to use keystonemiddleware https://review.openstack.org/285543 | 21:15 |
rickflare | spawning | 21:16 |
rickflare | again | 21:16 |
* tmckay hopes for the best ... | 21:17 | |
rickflare | dude | 21:18 |
rickflare | that was it | 21:18 |
rickflare | way more cpu activity | 21:19 |
rickflare | now | 21:19 |
tmckay | excellent! | 21:19 |
tmckay | if you tail the sahara log, and you have debug set, you should see it doing a bunch of stuff to the machine | 21:19 |
tmckay | and it should come out of "waiting" and go to "configuring" and "starting" | 21:19 |
tmckay | rickflare, yeah, why centos 6 used "cloud-user", I don't know | 21:20 |
tmckay | it was the outlier | 21:20 |
rickflare | so | 21:20 |
rickflare | now waiting on the disked to be filled | 21:20 |
rickflare | so | 21:20 |
rickflare | with nova | 21:20 |
rickflare | I just made a big raid 10 partitiion | 21:20 |
rickflare | i wonder I should have just left them a bunch of small disks | 21:21 |
tmckay | I don't know, now you're out of my domain :) I know little about openstack/hadoop volume optimization | 21:21 |
rickflare | i know for hadoop this is not optimal | 21:22 |
rickflare | i need to read more on swift | 21:22 |
*** crobertsrh is now known as _crobertsrh | 21:28 | |
tmckay | ah, an oldie but a goodie "An admin context does not possess the service catalog and therefore does not allow neutron interaction (or interaction with nova etc). So the neutron info is recorded in the job execution to allow that repeated task to execute successfully in a neutron environment." | 21:34 |
tmckay | went and tracked this down from 2013, wondering if it was necessary to keep extra['neutron'] in the job_execution | 21:35 |
tmckay | hmm, I wonder if this situation has changed ... | 21:35 |
_crobertsrh | probably not changed is my best guess | 21:36 |
tmckay | _crobertsrh, I'm trying to factor out use of use_floating_ips and ran into this quirky neutron info storage in the job manager, with logic duplicated in ssh_remote.py | 21:36 |
_crobertsrh | At least there was a comment hanging around to 'splain it | 21:37 |
tmckay | would be nice if it could disappear ... sounds like it's easy to kick the tires, though -- just erase and see if periodics fail :) | 21:37 |
tmckay | yeah, comment should go in the code | 21:37 |
_crobertsrh | oh, I was assuming the comment was in the code | 21:37 |
tmckay | it's in an old review I had to find by successive git blame, git checkout commit-1, check review, repeat | 21:37 |
tmckay | back to december 2013 | 21:38 |
_crobertsrh | wow, there's no quit in tmckay on a Friday afternoon....blame or bust! | 21:38 |
tmckay | lol, I should have been an archeologist | 21:39 |
_crobertsrh | That level of work gets you my nod for employee of the week. | 21:39 |
_crobertsrh | it's not worth anything really, but well done. | 21:39 |
tmckay | \o/ | 21:40 |
tmckay | thx :) I may add the comment as part of my refactor | 21:40 |
*** agireud has quit IRC | 21:41 | |
_crobertsrh | +a bunch for commenting something like that | 21:43 |
*** jamielennox is now known as jamielennox|away | 21:43 | |
*** agireud has joined #openstack-sahara | 21:43 | |
rickflare | still building | 21:48 |
rickflare | I made the cluster much much bigger | 21:48 |
rickflare | 30 nodes | 21:48 |
rickflare | all x.large | 21:48 |
rickflare | 16GB of ram each | 21:48 |
tmckay | that's bigger than anything I've made ... | 21:50 |
_crobertsrh | tmckay: maybe we should get a new test machine so we can make such a monstrosity of a cluster....purely for testing purposes, mind you. | 21:51 |
* rickflare believes go hard or go home | 21:51 | |
rickflare | seriously | 21:51 |
rickflare | when I get to TX | 21:51 |
rickflare | we have to get some drinks | 21:51 |
rickflare | i owe you guys | 21:51 |
rickflare | should be a good time! | 21:51 |
* _crobertsrh looks forward to that. | 21:51 | |
rickflare | if you cant already tell | 21:51 |
rickflare | im a fun time | 21:51 |
rickflare | LOL | 21:51 |
tmckay | crobertsrh is also fun | 21:52 |
rickflare | I was even more fun when I was younger | 21:52 |
tmckay | me, not so much | 21:52 |
* rickflare was the life of the party | 21:52 | |
rickflare | tmckay you are fun bro | 21:52 |
_crobertsrh | whaaa?? tmckay is plenty of fun. | 21:52 |
rickflare | i can tell | 21:52 |
tmckay | wonder if misc will be there .. now there's some fun | 21:52 |
_crobertsrh | I still owe him a beating at billiards from the Atlanta openstack summit. | 21:52 |
_crobertsrh | pro tip rickflare....do NOT play pool against tmckay. | 21:52 |
tmckay | oh, yeah, I've got to practice | 21:53 |
elmiko | lol | 21:53 |
tmckay | muscle memory from when I was 11, that's all it is | 21:53 |
_crobertsrh | elmiko witnessed the greatness of tmckay | 21:53 |
elmiko | yup | 21:53 |
* _crobertsrh felt the greatness | 21:53 | |
elmiko | haha, let's not go too far ;) | 21:53 |
rickflare | listen dont challange me to a drinking contest | 21:53 |
elmiko | oh, i won't. but _crobertsrh might ;) | 21:53 |
tmckay | I think I must have played a lot when I was a kid, and just don't remember how much | 21:53 |
* rickflare has blown past 20 shots of tequila | 21:54 | |
_crobertsrh | Heh. It's important for us to lay out what NOT to do when we meet. | 21:54 |
elmiko | hahaha | 21:54 |
tmckay | we're supposed to get work done, remember | 21:54 |
tmckay | 20 shots of tequila != good design decisions | 21:55 |
rickflare | lol | 21:55 |
rickflare | truth | 21:55 |
* rickflare sets a 3 shot max | 21:55 | |
rickflare | lol | 21:55 |
_crobertsrh | "Yeah, let's rewrite sahara in pascal" | 21:55 |
tmckay | pascal, haskell, anything that rhymes | 21:55 |
rickflare | haha | 21:55 |
rickflare | or in Ada | 21:55 |
rickflare | woot woot | 21:55 |
tmckay | I did Ada for 13 years | 21:56 |
rickflare | i did it for 6 | 21:56 |
rickflare | horrible times | 21:56 |
rickflare | in my life | 21:56 |
* rickflare was a vms guy as well | 21:56 | |
tmckay | oh, yeah, vms | 21:57 |
tmckay | been there | 21:57 |
_crobertsrh | ok, I'm actually bailing now. Getting thirsty. Have a good weekend everyone. | 21:58 |
rickflare | ok have a good one crobertsrh | 21:59 |
tmckay | bye, I need to head out too in few minutes | 22:01 |
elmiko | hey, haskell is actually pretty cool | 22:01 |
elmiko | later _crobertsrh | 22:01 |
rickflare | wooooo | 22:02 |
rickflare | its working | 22:02 |
rickflare | naw | 22:04 |
rickflare | spoke too soon | 22:04 |
rickflare | yum is running now though | 22:05 |
tmckay | that's good. post in here later if it succeeds, I'll peek occasionally | 22:05 |
*** tmckay is now known as _tmckay | 22:06 | |
openstackgerrit | Merged openstack/sahara: Await start datanodes in Spark plugin https://review.openstack.org/279105 | 22:09 |
rickflare | time out error | 22:12 |
rickflare | how do I extend the timeout | 22:12 |
elmiko | which timeout? there are several values you can control in the conf file | 22:22 |
*** witlessb has quit IRC | 22:43 |
Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!