Wednesday, 2016-02-17

*** sgotliv has quit IRC00:56
*** ekarlso has quit IRC01:30
*** ekarlso has joined #openstack-sahara01:45
openstackgerritOpenStack Proposal Bot proposed openstack/python-saharaclient: Updated from global requirements  https://review.openstack.org/28100702:08
*** egafford has quit IRC02:38
*** crobertsrh is now known as _crorh02:40
*** coolsvap|away is now known as coolsvap02:52
*** dave-mccowan has quit IRC04:22
*** links has joined #openstack-sahara04:41
*** links has quit IRC04:57
*** itisha has quit IRC05:17
*** Poornima has joined #openstack-sahara05:46
*** apavlov has joined #openstack-sahara05:48
*** nkrinner has joined #openstack-sahara05:56
*** sgotliv has joined #openstack-sahara06:25
*** apavlov has quit IRC06:45
-openstackstatus- NOTICE: A problem with the mirror used for CI jobs in the rax-iad region has been corrected. Please recheck changes that recently failed jobs on nodes in rax-iad.06:49
*** akuznetsov has joined #openstack-sahara07:01
*** akuznetsov has quit IRC07:04
*** akuznetsov has joined #openstack-sahara07:04
*** akuznetsov has quit IRC07:04
*** akuznetsov has joined #openstack-sahara07:05
*** akuznetsov has quit IRC07:10
openstackgerritJaxon Wang proposed openstack/sahara-tests: Add CDH 5.5.0 scenario test  https://review.openstack.org/28109207:32
*** rcernin has joined #openstack-sahara07:42
openstackgerritJaxon Wang proposed openstack/sahara-tests: Add more infomation when create cluster failed for scenario test  https://review.openstack.org/28109507:42
openstackgerritlu huichun proposed openstack/sahara: [EDP] Add suspend_job() for sahara edp engine(oozie implementation)  https://review.openstack.org/20144808:00
openstackgerritMerged openstack/sahara: Adding doc about distributed periodics  https://review.openstack.org/27668208:27
*** _degorenko|afk is now known as degorenko08:28
*** pcaruana has joined #openstack-sahara08:42
openstackgerritMichael Ionkin proposed openstack/sahara: Added scaling support for HDP 2.2 / 2.3  https://review.openstack.org/19308108:42
*** nkrinner has quit IRC09:22
*** nkrinner has joined #openstack-sahara09:22
openstackgerritMichael Ionkin proposed openstack/sahara: Added scaling support for HDP 2.2 / 2.3  https://review.openstack.org/19308109:46
*** pino|work has quit IRC09:52
*** dmitryme has quit IRC09:52
*** zhiyan has quit IRC09:52
*** dmitryme has joined #openstack-sahara09:53
*** pino|work has joined #openstack-sahara09:53
*** zhiyan has joined #openstack-sahara09:58
*** zhiyan has quit IRC10:03
*** rcernin has quit IRC10:03
*** rcernin has joined #openstack-sahara10:04
*** Poornima has quit IRC10:08
*** tmckay has quit IRC10:08
*** chlong has quit IRC10:08
*** aignatov has quit IRC10:08
*** Poornima has joined #openstack-sahara10:08
*** chlong has joined #openstack-sahara10:08
*** aignatov has joined #openstack-sahara10:09
*** tmckay has joined #openstack-sahara10:10
*** zhiyan has joined #openstack-sahara10:12
openstackgerritXi Yang proposed openstack/sahara: Remove cinder v1 api support  https://review.openstack.org/27062310:25
openstackgerritEvgeny Sikachev proposed openstack/sahara-ci-config: Add CDH 5.5.0 to sahara-ci  https://review.openstack.org/28118010:39
*** esikachev has joined #openstack-sahara10:43
openstackgerritEvgeny Sikachev proposed openstack/sahara-ci-config: Add CDH 5.5.0 to sahara-ci  https://review.openstack.org/28118010:46
*** esikachev has quit IRC10:49
*** dmitryme has quit IRC10:49
*** DuncanT has quit IRC10:49
*** al_indig_ has quit IRC10:49
*** logan- has quit IRC10:49
*** al_indigo has joined #openstack-sahara10:49
*** dmitryme has joined #openstack-sahara10:50
*** logan- has joined #openstack-sahara10:52
*** raildo-afk has quit IRC10:54
*** raildo-afk has joined #openstack-sahara10:57
*** DuncanT has joined #openstack-sahara11:02
*** rcernin has quit IRC11:03
*** al_indigo has quit IRC11:04
*** zhiyan has quit IRC11:04
*** Poornima has quit IRC11:04
*** witlessb has quit IRC11:04
*** Erming__ has quit IRC11:04
*** _crorh has quit IRC11:04
*** elmiko has quit IRC11:04
*** NikitaKonovalov has quit IRC11:04
*** NikitaKonovalov has joined #openstack-sahara11:04
*** al_indigo has joined #openstack-sahara11:04
*** Erming has joined #openstack-sahara11:04
*** Poornima has joined #openstack-sahara11:04
*** witlessb has joined #openstack-sahara11:05
*** crobertsrh has joined #openstack-sahara11:07
*** elmiko has joined #openstack-sahara11:09
openstackgerritMerged openstack/sahara-ci-config: Add CDH 5.5.0 to sahara-ci  https://review.openstack.org/28118011:12
openstackgerritVitaly Gridnev proposed openstack/sahara: [wip] implement sending health notifications  https://review.openstack.org/28119411:12
*** zhiyan has joined #openstack-sahara11:16
*** esikachev has joined #openstack-sahara11:22
*** crobertsrh has quit IRC11:39
*** logan- has quit IRC11:39
*** aignatov has quit IRC11:39
*** krotscheck has quit IRC11:39
*** alazarev has quit IRC11:39
*** _mattf has quit IRC11:39
*** alazarev has joined #openstack-sahara11:40
*** _mattf has joined #openstack-sahara11:40
*** aignatov has joined #openstack-sahara11:40
*** crobertsrh has joined #openstack-sahara11:40
*** logan- has joined #openstack-sahara11:42
*** krotscheck has joined #openstack-sahara11:42
*** egafford has joined #openstack-sahara11:46
*** rcernin has joined #openstack-sahara11:49
*** aignatov has quit IRC11:50
*** tmckay has quit IRC11:50
*** bapalm has quit IRC11:50
*** degorenko has quit IRC11:50
*** bapalm has joined #openstack-sahara11:50
*** aignatov has joined #openstack-sahara11:50
*** tmckay has joined #openstack-sahara11:53
*** degorenko has joined #openstack-sahara11:53
*** openstack has joined #openstack-sahara12:04
*** htruta has joined #openstack-sahara12:06
*** hogepodge has quit IRC12:07
openstackgerritEvgeny Sikachev proposed openstack/sahara-tests: [wip]Fix using proxy node for checks  https://review.openstack.org/27944712:10
*** coolsvap is now known as coolsvap|away12:11
openstackgerritEvgeny Sikachev proposed openstack/sahara-tests: [wip] put input datasources to hdfs  https://review.openstack.org/28070112:13
*** vgridnev has joined #openstack-sahara12:21
*** witlessb has quit IRC12:24
*** DuncanT has quit IRC12:24
*** nkrinner has quit IRC12:24
*** kgalanov has quit IRC12:24
*** sreshetn1ak has quit IRC12:24
*** jamielennox has quit IRC12:24
*** zigo has quit IRC12:24
*** sreshetnyak has joined #openstack-sahara12:25
*** nkrinner has joined #openstack-sahara12:25
*** nkrinner has quit IRC12:25
*** nkrinner has joined #openstack-sahara12:25
*** zigo has joined #openstack-sahara12:26
*** witlessb has joined #openstack-sahara12:27
*** jamielennox has joined #openstack-sahara12:30
*** elmiko has quit IRC12:32
*** NikitaKonovalov has quit IRC12:32
*** agireud has quit IRC12:32
*** openstackgerrit_ has quit IRC12:32
*** openstackgerrit has quit IRC12:32
*** SergeyLukjanov has quit IRC12:32
*** egafford has quit IRC12:32
*** elmiko has joined #openstack-sahara12:32
*** elmiko has quit IRC12:32
*** elmiko has joined #openstack-sahara12:32
*** NikitaKonovalov has joined #openstack-sahara12:32
*** openstackgerrit has joined #openstack-sahara12:33
*** openstackgerrit_ has joined #openstack-sahara12:34
*** vgridnev_ has joined #openstack-sahara12:36
*** SergeyLukjanov has joined #openstack-sahara12:37
*** agireud has joined #openstack-sahara12:38
*** vgridnev has quit IRC12:38
*** DuncanT has joined #openstack-sahara12:41
*** kgalanov has joined #openstack-sahara12:43
*** raildo-afk is now known as raildo13:09
*** n-anzen has quit IRC13:10
*** kgalanov has quit IRC13:20
*** kgalanov has joined #openstack-sahara13:22
*** raildo is now known as raildo-afk13:24
*** esikachev has quit IRC13:25
*** egafford has joined #openstack-sahara13:27
*** raildo-afk is now known as raildo13:29
*** vgridnev_ has quit IRC13:32
*** vgridnev has joined #openstack-sahara13:32
*** dave-mccowan has joined #openstack-sahara13:33
*** esikachev has joined #openstack-sahara13:34
openstackgerritMerged openstack/sahara: CDH plugin versionhandler refactoring  https://review.openstack.org/26119213:34
openstackgerritMerged openstack/sahara: Add test cases for CDH plugin config_helper  https://review.openstack.org/25349413:35
openstackgerritVitaly Gridnev proposed openstack/sahara: base cluster verifications implementation  https://review.openstack.org/27358713:36
*** vgridnev_ has joined #openstack-sahara13:39
*** vgridnev has quit IRC13:42
*** dhellmann has quit IRC13:44
*** dhellmann has joined #openstack-sahara13:47
openstackgerritVitaly Gridnev proposed openstack/sahara: cloudera health checks implementation  https://review.openstack.org/27900713:47
openstackgerritVitaly Gridnev proposed openstack/sahara: base cluster verifications implementation  https://review.openstack.org/27358713:47
openstackgerritVitaly Gridnev proposed openstack/sahara: ambari health check implementation  https://review.openstack.org/28020313:50
openstackgerritVitaly Gridnev proposed openstack/sahara: implement sending health notifications  https://review.openstack.org/28119413:50
*** vgridnev_ has quit IRC13:58
*** hogepodge has joined #openstack-sahara14:00
*** crobertsrh1 has joined #openstack-sahara14:06
*** vgridnev_ has joined #openstack-sahara14:07
*** egafford has quit IRC14:09
openstackgerritMerged openstack/sahara: Replace assertNotEqual(None,) with assertIsNotNone  https://review.openstack.org/28078814:09
openstackgerritVitaly Gridnev proposed openstack/sahara: cloudera health checks implementation  https://review.openstack.org/27900714:11
*** Poornima has quit IRC14:17
openstackgerritEvgeny Sikachev proposed openstack/sahara-tests: Fix using proxy node for checks  https://review.openstack.org/27944714:26
openstackgerritEvgeny Sikachev proposed openstack/sahara-tests: Disable ssl_verify as default  https://review.openstack.org/28076214:28
*** vgridnev__ has joined #openstack-sahara14:30
*** vgridnev_ has quit IRC14:33
*** witlessb has quit IRC14:43
*** witlessb has joined #openstack-sahara14:44
openstackgerritVitaly Gridnev proposed openstack/sahara: ambari health check implementation  https://review.openstack.org/28020314:47
openstackgerritEvgeny Sikachev proposed openstack/sahara-tests: Add autoregistering of image  https://review.openstack.org/28131514:50
*** tmckay has quit IRC14:51
*** vgridnev__ has quit IRC14:51
*** vgridnev__ has joined #openstack-sahara14:52
*** vgridnev__ has quit IRC14:55
*** vgridnev__ has joined #openstack-sahara14:56
*** vgridnev__ has quit IRC14:58
*** dhellmann has quit IRC15:05
*** dhellmann has joined #openstack-sahara15:05
*** vgridnev__ has joined #openstack-sahara15:08
rickflaremorning15:15
elmikohi15:16
rickflarehey elmiko how are you?15:16
elmikosleepy...15:16
elmikoyou?15:16
rickflaresame15:16
rickflarehoping tmckay is back15:16
rickflareso we can finish this spark stuff15:17
rickflarei think I have finally gotten over my neutron networking hurdle15:17
elmikoi'm sure he'll be around at some point15:17
elmiko\o/15:17
crobertsrhrickflare:  where did you leave off?15:19
crobertsrhtmckay sent me a text saying that you guys ran into something with a spark 1.6 classpath?  Is that right?15:20
*** Erming has quit IRC15:20
elmikovgridnev__: saw this yesterday, thought you might find it interesting: http://blog.kortar.org/?p=27915:20
*** Erming has joined #openstack-sahara15:20
rickflareso15:21
rickflarewe got the job15:21
rickflarebut it kept saying done with errors15:21
rickflarei also found a bug with horizon15:21
rickflarewhen you add the binary path in switf15:21
rickflareswift15:22
rickflareif you add swift://container/binary15:22
rickflarehorizon will put it in as swift://swift://container/binary15:22
crobertsrhThat rings a bell.  It may have been written up already.  I'll double check in launchpad.15:22
rickflareand then we will not be able to delete it15:22
crobertsrhIt might have already been fixed for a future release.15:22
elmikothat bug has been fixed15:23
crobertsrhthanks elmiko:  it was sounding really familiar15:24
rickflareok15:25
rickflareso its in mikata15:25
crobertsrhYes15:25
* rickflare feels like he is actually contributing now.15:25
crobertsrhAbsolutely15:26
rickflareyea so we were having a class path issue15:26
crobertsrhAny bugs that you do find, definitely write-up on launchpad:  https://bugs.launchpad.net/sahara15:26
rickflarei was telling tmckay15:27
rickflareI hope to prep to make a big sell to my customers on using sahara15:27
crobertsrhExcellent15:27
rickflarethe next thing im going to really need to do is find out how using sahara image elements15:27
crobertsrhthat'll be a piece of cake15:28
rickflareto be able to produce harden images that actually work15:28
rickflareright now these images are far to loose15:28
rickflareand open15:28
rickflarethey must be locked down15:28
rickflareand some may even need to have selinux enabled15:28
crobertsrhAh, I see15:28
*** vgridnev__ has quit IRC15:28
crobertsrhMight be an interesting bit of work.15:29
rickflareso I salt formulas15:29
rickflarethat can do a lot of this15:29
rickflarebut getting salt into the images was not the bad15:30
rickflarehowever it seems like this is something that would be best handled by heat15:30
rickflareand I really need to understand more the order in which things happen and where15:30
rickflareso I can certain I am injecting in the correct locations15:30
rickflareif that makes sense15:30
*** vgridnev__ has joined #openstack-sahara15:31
crobertsrhRight.  There is a fair amount of documentation for Sahara.  Some of it might help you figure out the flow.15:31
rickflareyea I am about to start really digging into it15:31
rickflareive been so slowed down by install issues15:31
rickflarebut I feel they have been resolved15:32
rickflareI did another install15:32
crobertsrhYeah, seems like you're up and running nicely15:32
rickflareand I now know how to get this integrated now15:32
rickflarevlans and more will come later15:32
rickflarebut for now this is fine15:32
*** vgridnev__ has quit IRC15:35
*** vgridnev__ has joined #openstack-sahara15:37
*** vgridnev__ has quit IRC15:42
*** vgridnev__ has joined #openstack-sahara15:42
*** nkrinner has quit IRC15:43
*** esikachev has quit IRC15:48
*** vgridnev__ has quit IRC15:52
crobertsrhrickflare:  Here is the Sahara blueprint page:  https://blueprints.launchpad.net/sahara15:52
*** vgridnev__ has joined #openstack-sahara15:52
rickflareok15:52
*** tmckay has joined #openstack-sahara15:52
*** vgridnev__ has quit IRC15:52
rickflareso I can just type up what I like to see done and submit it?15:52
crobertsrhYeah.  If you have ideas on the "how", feel free to add them there as well.15:53
rickflareok15:53
rickflareill do this soon as I finish that reading15:53
rickflareand understand the flow more15:54
crobertsrhOnce the idea is a bit better baked, you or someone else will write up a "spec", which is much more detailed.15:54
crobertsrhHere's an example of what our specs look like:  https://review.openstack.org/#/c/245571/10/specs/mitaka/edp-log-enhancement.rst15:54
rickflareunderstood15:56
openstackgerritMerged openstack/sahara-tests: Define variables via args in scenario tests  https://review.openstack.org/27069916:00
elmikodid tmckay end up finding a bug for rickflare to fix?16:02
tmckayelmiko, no16:03
tmckaysomething weird with tox, crobertsrh and I can see errors but not everywhere16:03
elmikok, i'll try to work up a softball today ;)16:03
tmckayso we need something simple. I'm looking into one -- we discovered yesterday that rickflare submitted a spark job without a main class and it let him. That should be an error16:04
crobertsrhtmckay:  I upgraded tox and pep8, but still see the same errors16:04
elmikoi was gonna format another bandit fix for a bug, should be simple16:04
elmikoi'm scared to try the tests now...16:05
tmckayrickflare, also I need to track down that spark 1.6 issue. What version of sahara are you running? Is it from the git, or a package?16:05
*** degorenko is now known as _degorenko|afk16:05
elmikocrobertsrh, tmckay, fyi, i've started running tox from a venv as the fedora versions have caused issues for me16:06
tmckaycrobertsrh, another bug we found -- he added swift:// to a url in horizon, and it came through as swift://swift://16:06
crobertsrhoh, tmckay:  re pep8, I still see the errors on my ubuntu (14) machine, but NOT on my fedora 23 machine.16:06
elmikotmckay: that was fixed16:06
crobertsrhelmiko:  probably wise16:06
rickflarehey hey16:06
rickflarethis was from using the sahara image element build16:06
tmckayelmiko, when? so he hust have an older version16:06
rickflarei did a tox build of the image16:06
elmikotmckay: it was fixed a month or two ago16:07
tmckayrickflare, but your sahara binaries, sahara-engine and sahara-api -- how do you install them? from packstack?  what package?16:07
tmckayI need to see if I can patch your spark edp engine. Or, you might need to kill the cluster and launch a spark 1.3.1 cluster instead (rickflare)16:08
tmckayelmiko, heh, when I was on review vacation. doh16:08
tmckayelmiko, when you say it was fixed a few  months ago, you mean the swift://swift:// or being able to run a job without a main class?16:11
elmikotmckay: swift://swift://16:12
elmikolooking for the fix now16:12
rickflarebrb guys16:12
crobertsrhmight have been fixed before we left the horizon repo16:12
elmikoi think it was16:12
tmckaywas thinking about usability after helping rickflare, I was wondering if an additional field on a job binary to store a list of the runnable classes in a jar would be a nice option16:13
tmckayso when you create the binary, you tag it with a list of class names optionally16:14
tmckaythen when someone goes to run it, they don't have to guess, or go dig up the binary and run "jar tf" on it to see what's in there16:14
crobertsrhMight be useful.  Partially solved by the job template interface stuff that was added.16:14
tmckaythat's always a pain point for me16:14
tmckaycrobertsrh, yeah, but does it have a list of main class values?  It could be added there, maybe16:15
crobertsrhNo, nothing about a "list", but it does include a field for the one that you'll need16:15
tmckaydidn't show rickflare the job interface stuff yet16:16
crobertsrhHow common is it for a jar to have multiple runnables crammed inside?16:16
tmckayI'll take a look16:16
crobertsrhIs it only something that we tend to see in the example jars?16:16
crobertsrhOr do people do that "for realz"?16:16
*** esikachev has joined #openstack-sahara16:17
tmckayI don't know, it happens to me all the time. I never remember what the class is inside a jar.  It should be autodetectable imho (maybe it is, I don't know much bout Java)16:17
tmckaykind of annoyed that hadoop is written in it to begin with16:18
tmckay:)16:18
crobertsrhYeah, I see what you're saying.  I've felt that pain more than once.16:18
*** esikachev has quit IRC16:24
*** apavlov has quit IRC16:33
*** pcaruana has quit IRC16:35
*** vgridnev__ has joined #openstack-sahara17:03
*** vgridnev__ has quit IRC17:04
elmikotmckay, crobertsrh, fyi, just ran a fresh tox from a venv with python3.4 and tox 2.3.1 on f23. everything passed17:25
crobertsrhpython3.4, eh?17:25
elmikoyea, i'm trying to use py3 for more stuff these days17:25
crobertsrhThings pass nicely on my f23 box, just not on my other (ubuntu) machine17:26
elmikoweird...17:26
elmikofirst thing i check these days in the tox version17:26
crobertsrhactually, I think I see a few bashate warnings, but that's it...still "succeeded"17:26
crobertsrhYeah, tox version is often key17:26
elmikoi get bashates too17:26
*** apavlov has joined #openstack-sahara17:32
tmckayelmiko, I found a patch for rickflare, a real bug17:33
* tmckay lunch17:33
elmikotmckay: ack17:34
*** DuncanT has quit IRC17:34
*** zigo has quit IRC17:34
*** jamielennox has quit IRC17:35
*** zigo has joined #openstack-sahara17:36
rickflaretmckay sorry17:37
rickflarei am back17:37
rickflarehad a fire I had to tend to17:37
*** DuncanT has joined #openstack-sahara17:39
rickflarelmk when you are back and we can resume working on that job17:40
*** vgridnev__ has joined #openstack-sahara17:42
*** jamielennox has joined #openstack-sahara17:44
*** thumpba has joined #openstack-sahara17:51
*** esikachev has joined #openstack-sahara18:10
*** Erming has quit IRC18:12
*** Erming_ has joined #openstack-sahara18:12
*** egafford has joined #openstack-sahara18:13
tmckayrickflare, back. so, question of the day is what version of sahara are you running? Where did it come from?18:21
* tmckay looks for that spark 1.6 classpath issue18:22
tmckayrickflare, also have a simple bug for you to fix18:22
rickflareok18:26
rickflareawesome18:26
rickflareim back!18:26
rickflareI got my tunes on18:27
rickflareim ready to rock18:27
tmckayrickflare, ok, to get that spark job to run that we launched yesterday (the last one), I believe you need this patch https://review.openstack.org/#/c/276734/418:30
tmckaythat should make the spark.xml file available on the classpath for spark-submit18:31
tmckaythis is why I was asking where your sahara came from -- to make sure you don't already have it18:31
*** egafford has quit IRC18:32
rickflareah18:33
rickflareis this patch18:33
rickflareapplied in the instance18:33
rickflareor on the openstack host18:33
rickflareccccccejlbrtdgklbjdhunvtjiuhlubiitdgdvcngcee18:33
rickflarewhoops18:33
tmckaythe openstack host, sahara controller. If you're running spark 1.6, though, I'm a little confused, it should already be there. unless of course you're using a spark 1.6 image and sahara cluster thinks it's a 1.3.1 cluster18:34
rickflarethats what it is18:35
rickflarei bet18:35
rickflarebecause i have not seen 1.6.018:35
rickflareanywhere in horizon18:35
rickflareand I have 1.3.1 selected18:35
tmckayokay, it's a little clearer now. So, two choices -- in this case, I don't think there is really much difference from a sahara perspective between spark 1.3 and spark 1.618:36
rickflarek18:36
*** rcernin has quit IRC18:36
tmckayso, we can 1) delete the cluster, generate a spark 1.3.1 image, relaunch the cluster, and you should be good or 2) patch your sahara in /usr/lib to have the fix so the classpath works for 1.618:37
tmckay#1 is the "right" thing to do18:37
tmckay#2 is a hack18:37
tmckaybut relatively simple18:37
tmckayyour choice18:37
rickflarelet hack for now18:37
rickflarelets hack18:37
rickflarenow where do I patch>18:38
rickflare?18:38
tmckayokay :) so you should be able to download that patch from gerrit as a patch file, then go to /usr/lib/pythonX.X/site-packages/sahara and apply the patch with "patch -p1 < whatever.patch"18:39
* tmckay that should be where packstack put it18:39
* tmckay double checks18:40
tmckayyeah, I think that's right18:40
rickflareforgive me but how do i download the patch18:40
tmckayk, doing it alongside you, hold on18:40
rickflareim a gerrit newb18:40
tmckaydownload link up in the right-hand corner, you can get it as a zip18:41
rickflarepatch file? it looks like a diff18:41
tmckayyeah, that's it18:42
tmckayhmm, maybe the format patch link works better, hold on18:43
rickflareyea18:44
rickflarebecause18:44
rickflareim getting a file to patch prompt18:44
rickflarewhen I run the patch command18:44
rickflarewhich means its not working18:44
tmckayyeah, I bumped up a level to site-packages and did patch -p1 < sahara/blah.patch, but the changes field. if you're using RDO packages it may be too old. what do you have for rpm -qa | grep sahara18:47
rickflareopenstack-sahara-api-3.0.0-5.cc218ddgit.el7.noarch18:47
rickflareopenstack-sahara-common-3.0.0-5.cc218ddgit.el7.noarch18:47
rickflarepython-saharaclient-0.11.1-1.el7.noarch18:47
rickflareopenstack-sahara-engine-3.0.0-5.cc218ddgit.el7.noarch18:47
tmckayok, so, liberty. shouldn't be too different.18:49
*** egafford has joined #openstack-sahara18:55
*** egafford has left #openstack-sahara18:55
*** esikachev has quit IRC18:57
openstackgerritGrigoriy Rozhkov proposed openstack/sahara: Remove unsupported MapR plugin versions  https://review.openstack.org/26644419:04
*** rcernin has joined #openstack-sahara19:06
*** vgridnev__ has quit IRC19:07
*** vgridnev__ has joined #openstack-sahara19:11
*** vgridnev__ has quit IRC19:14
*** vgridnev__ has joined #openstack-sahara19:18
*** vgridnev__ has quit IRC19:19
*** vgridnev__ has joined #openstack-sahara19:22
*** vgridnev__ has quit IRC19:25
*** egafford has joined #openstack-sahara19:26
*** vgridnev__ has joined #openstack-sahara19:32
*** egafford has quit IRC19:33
*** vgridnev__ has quit IRC19:38
*** vgridnev__ has joined #openstack-sahara19:38
*** esikachev has joined #openstack-sahara19:42
*** vgridnev__ has quit IRC19:44
*** apavlov has quit IRC19:45
tmckayrickflare, btw, here is the bug I have for you https://bugs.launchpad.net/sahara/+bug/154670119:53
openstackLaunchpad bug 1546701 in Sahara "Validation for main class checks if the key is present but does not check non-null" [High,Triaged] - Assigned to Trevor McKay (tmckay)19:53
*** vgridnev__ has joined #openstack-sahara20:02
tmckayvgridnev__, hi. so, we don't think swift works with spark 1.6?20:04
vgridnev__it should be working, as I know. michael inonkin got a fix, as I know20:05
vgridnev__tmckay, ^^20:05
vgridnev__hm, so many __ at the end of nickname20:05
tmckayvgridnev__, ah, the classpath fix with working dir?20:05
*** apavlov has joined #openstack-sahara20:06
vgridnev__yep20:06
tmckayhmm, ok. I was helping rickflare, he was running a spark 1.6 image an sahara from liberty (we patched it for the classpath fix), job started running but got a socket timeout trying to authenticate to keystone20:07
tmckayweird, since he could ping it20:07
tmckaybut, this is maybe unsupported -- running a 1.6 image under liberty.20:07
vgridnev__I've got an idea20:07
tmckayI had him generate a 1.3.1 image and he's going to launch a new cluster20:07
vgridnev__https://bugs.launchpad.net/sahara/+bug/148617320:08
openstackLaunchpad bug 1486173 in Sahara "SocketTimeoutException on multi domain enviroment" [Undecided,New]20:08
tmckayah, interesting20:08
tmckaythat is the same error we got20:10
tmckaynot sure what it means by "running the same job on the default domain worked as expected"20:10
vgridnev__maybe we should configure node_domain in default section?20:11
vgridnev__https://github.com/openstack/sahara/blob/e0e20b2e33349373568c01493614a4388f4ab10c/sahara/config.py#L7320:12
tmckayhmm, maybe20:12
vgridnev__tmckay, also for reference: https://bugs.launchpad.net/sahara/+bug/119219320:15
openstackLaunchpad bug 1192193 in Sahara "Savanna should determine domain name dynamically" [Low,Incomplete]20:15
tmckaythanks, vgridnev__20:16
*** vgridnev__ has quit IRC20:21
*** vgridnev__ has joined #openstack-sahara20:28
*** vgridnev__ has quit IRC20:28
*** vgridnev has joined #openstack-sahara20:28
*** vgridnev has quit IRC20:32
rickflareinteresting20:33
*** krotscheck is now known as krotscheck_dcm20:34
tmckayrickflare, also, here is a debug hint for you20:39
tmckaybecause of the way the job_launch.log is written, you can open up that log in vi, copy the job launch command, and execute from the command line in the job run dir on the master node20:40
tmckayand re-run the job manually without launching from sahara20:40
tmckayoutout from spark will stream on your console20:41
tmckayThis is a good way to play with arguments, or if you're tweaking network config, etc20:42
rickflareahh20:44
rickflareok good to know20:44
rickflarespawning my new cluster as we speak20:44
tmckaygot another debug hint for you, too. With a tweak to the hadoop core site, you can do "hadoop fs -ls swift://demo.sahara/myfile" and that will let you know if hadoop can access swift20:47
tmckayIf it works, you get a dir listing. If not, you'll get the same exception that you would have gotten from a job run20:48
tmckayrickflare, really fast test when debugging network config issues ^^20:48
tmckayI can tell you how to modify the core-site.xml, simple20:48
* tmckay anticipating that maybe 1.3.1 cluster will have the same problem20:48
tmckayNetworking, by Charles Dickens.  It was the best of times, it was the worst of times.20:49
rickflareok20:55
rickflarecluster is backup20:55
tmckayalright, give the relaunch of that same job a try and let's see what happens20:55
rickflareand I am running 1.3.120:55
rickflarelike a idiot20:55
rickflareI deleted the job20:55
tmckayheh!20:55
rickflarei got to relaunch it20:55
tmckayokay. just remember, main class, swift configs, input output args20:56
*** thumpba has quit IRC20:57
rickflarecan you message me20:57
rickflarethose args again20:57
rickflarefor the admin20:57
rickflareand password20:57
*** raildo is now known as raildo-afk21:00
*** esikachev has quit IRC21:18
tmckayelmiko, crobertsrh, egafford, anyone else around, you ever see this as an error trying to write to swift from spark?  Read works fine, write to local hdfs works fine21:32
tmckayhttps://cryptbin.com/i92#72387cfa10b2a429c9e6b5659588306e21:32
crobertsrhlookin'21:32
tmckaycraziest thing, this is liberty, with a fresh spark 1.3.1 image21:33
tmckayeverything is working but this last silly error -- spark.xml is there, etc etc21:33
crobertsrhI don't think I've seen that.  I haven't tried spark jobs recently though.  I'll try a quick one.  I think I can get a cluster up quickly.21:33
crobertsrhoh, bonus...I have a cluster already21:34
elmikocryptbin, that's a new one for me21:34
tmckayworse than that, the container got created ...21:34
tmckaywonder if there is something in it21:35
elmikomy first guess would be to check the swift logs21:35
crobertsrhyeah, +1 to swift logs21:35
elmikomay some sort of issue with acls or the user groups or something21:35
elmikoalso, you should be able to check the core-site.xml (i think) to double check the credentials that are used to validate with swift21:36
elmikothat would be on the cluster node21:36
tmckayrickflare ^^, great idea. mine the swift logs on the controller21:36
elmikoand log in to the cluster master node to dbl check creds, imo21:36
tmckayelmiko, we've got the creds right, we can hadoop fs -ls from the cluster node21:36
tmckaywe hacked core-site21:37
elmikoack21:37
elmikocould be an issue with write acls on swift?21:37
tmckaymaybe, it created the output container, and create a _temporary file successfully.  huh?21:38
rickflareguys ill be back21:38
tmckayelmiko, then some mysterious permission denied. makes no sense21:38
rickflarei got to bug out now21:38
elmikotmckay: weird...21:39
rickflareive seen this before with hadoop21:39
rickflarellll be be back21:39
elmikotake care rickflare21:39
tmckayalright, well, we're doing everything right at this point21:39
tmckayI am officially clueless21:39
* elmiko really hopes rickflare wears a cape at summit21:39
tmckayoh yea, gotta confirm his bug21:40
tmckayelmiko, https://bugs.launchpad.net/sahara/+bug/154670121:40
openstackLaunchpad bug 1546701 in Sahara "Validation for main class checks if the key is present but does not check non-null" [High,Triaged] - Assigned to Trevor McKay (tmckay)21:40
tmckayverified for spark, 100% it crashes java too but I haven't gotten a generic cluster up21:40
elmikocrazy...21:41
elmikogood find though21:41
tmckayelmiko, crobertsrh, either of you have a vanilla/cdh/hdp cluster up?21:41
tmckayelmiko, it's like a 2 line fix :)21:41
crobertsrhI don't, just spark atm21:41
elmikolet me look21:41
tmckayk, I'll try. I tried an hdp2 but it hung in configure forever21:41
elmikoyea, i have a vanilla 2.7.1 up21:42
tmckayooo, oooo, can you run a java wordcount from edp-examples?21:42
elmikoi can try, yea21:42
tmckayjust want to run it, and leave the main class blank on purpose21:42
elmikook21:43
tmckaythe dynamite should go boom21:43
elmikoso, setup a new job, but leave the main class blank?21:43
tmckayyep21:43
tmckayit should launch, and oozie should barf. It should not fail on the sahara side, or at least not during validation21:43
elmikok21:44
tmckayvalidation should be catching it but isn't (at least for spark)21:44
tmckayooo, lookie there, I have a centos7 vanilla 2.6 image lying around21:47
elmikoman, we need to update the docs in those edp-examples21:47
tmckayit's like Christmas21:47
elmikolol21:47
tmckayyeah, EDP needs some love21:47
tmckayelmiko, rickflare has made it abundantly clear to me indirectly that log aggregation has to happen21:48
tmckayreally21:48
tmckayfor N, that is all I'm going to do, unless someone with authority makes me do otherwise ;-)21:48
tmckayit is crazy stupid that I've got him poking around in /tmp/spark-edp to debug21:49
elmiko+121:49
elmikoi think we've all agreed that logs could be better21:49
tmckaybeen deferred long enoug, I thin it is now "the most important thing"21:49
elmikowell, it's on our short list of "most important things" ;)21:50
tmckaylol, yeah21:50
tmckayit's just big, and hard21:50
elmikoyup21:50
elmikoalso, for the regex help stuff, we should totally create these little context "?" icons. it is super helpful in the job tempate creation form21:51
tmckaysounds good. I am unfamiliar with the "?" icons21:51
tmckayI wanted this cycle to be all about usability, then I got wooed by baremetal21:51
elmikolook at job template create21:52
elmikolol!21:52
tmckaydarn you baremetal. you're dead to me21:52
elmikook, job launched without issue (no main lib specified)21:53
tmckayk, should get an error21:55
crobertsrhtmckay:  Are you still trying to get the wordcount job to run?  Or did I miss that great success?21:55
elmikoyup, it got killed21:55
tmckayelmiko, main class you mean, not main lib, right?21:55
elmikoyea, main class21:55
elmikoi only specified the args21:55
tmckaycrobertsrh, no, ended with that permission denied. but it created the ouput dir in swift, and it created the temporary file21:56
tmckaythen choked21:56
crobertsrhOk, just making sure I haven't missed anything.21:56
crobertsrhI did crank up my cluster and run wordcount.  Managed to run for me.21:57
elmikowhat *is* the main class for word count?21:57
crobertsrhsahara.edp.spark.SparkWordCount21:58
elmikothanks21:58
crobertsrhof course :)21:58
elmikoi'm not confident that this will work even with the main class21:58
*** crobertsrh1 has quit IRC21:58
elmikobut, i have terrible luck running jobs21:59
tmckayelmiko, crobertsrh gave you the main class for spark22:00
tmckayfor java, it's different. This is a java job you're running, right?22:00
crobertsrhah...my bad :)22:01
* tmckay checks for it22:01
crobertsrhI was indeed in spark land22:01
elmikoha! i typed that and didn't even think about it22:01
elmikoyes, this is java22:01
tmckayorg/openstack/sahara/examples/WordCount22:01
tmckaywith . for /, of course22:02
tmckayso, guys, another interesting usability wrinkle in this22:02
elmikoand do i add arguments in the arguments section on the configure tab of launch job or do i use the interface arguments tab?22:02
tmckayapparently you can specify main class with a MANIFEST22:02
tmckayelmiko, you can just use arguments on the configure tab to keep it simple22:03
elmikobut either will work?22:03
tmckaybut, even though Java/spark may allow main class in a MANIFEST, we currently are requiring a --class argument for spark, and I'm not sure Oozie will handle a manifest specification (but it might)22:03
tmckayelmiko, yeah22:04
tmckaywouldn;t it be cool if users could build their jars so that they didn't need a main class value?22:04
tmckaythat would be awesome22:04
elmikohuh... i'm just terrible at actually operating sahara... sigh22:04
tmckayyeah, me too, you get rusty after a while. This excercise with rf has been good22:04
elmikoand of course, now that the job has failed i have no clue why it failed...22:04
tmckayback in the trenches22:05
elmikoyup22:05
elmikojust like when i joined ;)22:05
tmckayelmiko, you can follow the Oozie console link and find out ...22:05
elmikoyea, i need to setup the routes and everything though. my devstack box is another machine22:05
tmckayelmiko, only if you want to -- I think you've proven what I wanted. I'm going to try to spin up my own22:06
elmikoit can wait, i'm in the middle of reviewing crobertsrh stuff atm22:06
tmckaythanks for checking22:06
elmikonp22:06
openstackgerritMerged openstack/python-saharaclient: Updated from global requirements  https://review.openstack.org/28100722:14
openstackgerritMerged openstack/sahara: Add support running Sahara as wsgi app  https://review.openstack.org/26249222:19
openstackgerritVitaly Gridnev proposed openstack/sahara: honor api_insecure parameters  https://review.openstack.org/27999622:23
openstackgerritVitaly Gridnev proposed openstack/sahara: implement sending health notifications  https://review.openstack.org/28119422:24
*** tmckay has left #openstack-sahara22:44
*** apavlov has quit IRC22:48

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!