Thursday, 2018-06-21

*** links has joined #openstack-sahara01:33
*** brad[] has joined #openstack-sahara01:55
*** rcernin has quit IRC02:52
*** rcernin has joined #openstack-sahara03:10
*** rcernin has quit IRC03:55
*** rcernin has joined #openstack-sahara04:14
*** prasen has quit IRC04:22
*** pgadiya has joined #openstack-sahara04:40
*** pgadiya has quit IRC04:40
*** prasen has joined #openstack-sahara04:55
*** links has quit IRC05:32
*** links has joined #openstack-sahara05:56
*** zhuli__ has quit IRC06:49
*** Bhujay has joined #openstack-sahara07:03
*** zhuli__ has joined #openstack-sahara07:04
*** tesseract has joined #openstack-sahara07:11
*** whooliganface has joined #openstack-sahara07:17
*** links has quit IRC07:21
*** whooligan has quit IRC07:21
*** links has joined #openstack-sahara07:21
*** pcaruana has joined #openstack-sahara07:22
*** tosky has joined #openstack-sahara07:33
*** links has quit IRC07:42
*** rcernin has quit IRC07:47
*** links has joined #openstack-sahara07:55
*** Bhujay has quit IRC08:23
*** Bhujay has joined #openstack-sahara09:14
openstackgerritsunqingliang proposed openstack/sahara-dashboard master: add a link to release notes in README file  https://review.openstack.org/57710409:21
*** rcernin has joined #openstack-sahara09:39
*** rcernin has quit IRC09:53
*** rcernin has joined #openstack-sahara10:55
*** rcernin has quit IRC11:19
*** rcernin has joined #openstack-sahara11:39
*** rcernin has quit IRC11:44
openstackgerritMerged openstack/sahara-dashboard master: add a link to release notes in README file  https://review.openstack.org/57710411:46
*** prasen has quit IRC12:16
*** links has quit IRC12:43
openstackgerritLuigi Toscano proposed openstack/sahara-tests master: DNM Experimental scenario job based on RadosGW (not Swift)  https://review.openstack.org/57692913:00
*** zchkun has joined #openstack-sahara13:47
*** jeremyfreudberg has joined #openstack-sahara13:56
*** links has joined #openstack-sahara14:17
*** links has quit IRC14:23
openstackgerritwangqi proposed openstack/puppet-sahara master: add release note link in readme  https://review.openstack.org/57720114:50
*** pcaruana has quit IRC14:56
tellesnobregajeremyfreudberg, tosky please continue15:00
toskysooo15:00
toskywe were at (more or less) cloudera jar (and the S3 support that they advertise) is not enough15:01
jeremyfreudbergyes, let me collect my thoughts for a minute15:01
jeremyfreudbergso, let's talk about one issue, which is the problems with hadoop-aws.jar:15:02
jeremyfreudbergbackground knowledge, there are two ways of addressing the real S3 hosted by amazon15:02
jeremyfreudbergbucket.host.com15:03
jeremyfreudbergor host.com/bucket15:03
jeremyfreudbergradosgw and all other object store solutions which are not amazon itself only support host.com/bucket15:03
jeremyfreudberghowever, in hadoop-aws.jar bundled with hadoop before 2.8, it defaults to bucket.host.com and cannot be configured (it will fallback to host.com/bucket only if the endpoint is an ip address and not a domain name)15:04
jeremyfreudbergso, we need to "backport" the support for host.com/bucket to hadoop-aws for hadoop 2.6 and 2.715:05
jeremyfreudbergevery plugin except Vanilla 2.8.2 uses this old jar15:06
toskyI see15:06
toskyand I guess that vendor plugins didn't patch this themselves15:07
jeremyfreudbergooh, well that's interesting, i didn't think to check if the vendors did that15:07
toskygiven that "speed" that they move, I would not bet too much15:07
toskybut still15:07
toskyworth checking15:07
tellesnobregayes15:08
tellesnobregaI'm heading out for lunch, I will read the docs when I'm back15:09
tellesnobregas/docs/logs15:10
jeremyfreudbergno problem, enjoy the lunch15:10
tellesnobregathanks15:10
tellesnobregajust an update, storm 1.2.0 image just created15:10
toskynice!15:10
tellesnobreganeed to test the cluster15:10
tellesnobregabut that is one step closer15:10
tellesnobregaplease continue with the hard issues15:11
toskyjeremyfreudberg: another question: if I remember correctly, you mentioned that we use s3a, but isn't that only for Hadoop 2.7+?15:11
jeremyfreudberghadoop 2.6 was the initial implementation15:11
toskybut not s3a, do we use the other one when using the older hadoop?15:14
jeremyfreudbergno, 2.6 has s3a still, i just peeked in the upstream and cdh jars15:15
jeremyfreudbergmaybe it's not very good, but its there15:15
jeremyfreudbergtosky: the cdh jar has the necesssary patch already, from my inspection!15:15
toskyI see15:15
toskyoh!15:15
toskynuce15:16
toskynice*15:16
jeremyfreudbergi will look in the other vendor jars later15:17
jeremyfreudberglet me just touch on one other issue now15:17
jeremyfreudbergso, you'll recall i just said, the hadoop-aws.jar in hadoop>=2.8 fixes the bucket-in-path problem15:17
jeremyfreudbergBUT15:17
jeremyfreudbergthat creates a problem for spark15:18
jeremyfreudbergon vanilla15:18
jeremyfreudbergbecause spark manages its own installation of the hadoop common libraries15:19
jeremyfreudbergand when running a spark job hadoop-aws will look in Spark's installation of those15:19
jeremyfreudbergthe problem is, spark is distrubited only with hadoop common 2.6 or 2.715:19
jeremyfreudbergsee: http://archive.apache.org/dist/spark/spark-2.2.0/15:19
toskyso another patch?15:20
jeremyfreudbergyes15:20
jeremyfreudbergbecause hadoop aws 2.8, which is what vanilla 2.8, provides, is incompatible with hadoop-common 2.715:20
jeremyfreudbergdue to a one word change in the code15:20
jeremyfreudbergjava private -> public15:20
toskybut can't spark 2.2/2.3 work with hadoop 2.8?15:21
toskyare there other incompatibilities? Just untested by ustream?15:21
jeremyfreudbergit should work, but nothing is there upstream15:21
toskyat least someone tried: https://gist.github.com/momijiame/fefb9831e9f032ef264d8d517df57cb415:21
jeremyfreudbergit is possible to download spark "without hadoop", and copy in hadoop 2.8 ourselves15:22
toskyI know someone who is working with spark a lot15:22
* tosky summons elmiko as usual15:22
jeremyfreudbergindeed15:22
toskyany other issues that we can discuss about before the summoning triggers properly? :)15:24
jeremyfreudbergjust that testing all this stuff is going to be a pain15:24
jeremyfreudbergwe have so many job types, and so many plugins15:24
toskyoh, ok, that's the "usual" complicated part15:25
jeremyfreudbergand setting up the CI on my hardware is so far down my backlog...15:25
toskydepending on how difficult is to patch and fix S3, it may go up :)15:25
toskylet's see15:25
jeremyfreudbergthat's a bit of tangent15:26
jeremyfreudbergoh, i remember what the other problem is now15:27
jeremyfreudbergor not "the other", "another"15:27
jeremyfreudbergwe will still have oozie classpath problems on mapr and hdp15:28
jeremyfreudbergactually, we might not15:28
jeremyfreudbergdepends how dumb oozie is and how smart those plugins are15:29
jeremyfreudbergregardless, that's enough of a reason to maybe push those plugins till later15:29
*** Bhujay has quit IRC15:29
toskyuh, what was the oozie classpath issue?15:30
jeremyfreudbergoozie has trouble finding hadoop-aws15:30
jeremyfreudbergi had actually addressed this in my first s3_hadoop patch, but it was totally lost in the s3_hadoop revision15:31
elmikohello15:31
jeremyfreudbergbasically oozie does not care what the hadoop classpath really is15:31
toskyoki15:31
jeremyfreudbergand has its own ideas15:31
toskyelmiko: hi! Do you (or one of your spark friends :) know if Spark (any latest version) can work with Hadoop 2.815:31
toskydespite being shipped only with Hadoop 2.6 and 2.7?15:31
elmikoi _think_ it can, but you have to load the 2.8 jars manually. someone was just looking at this, let me ask him15:32
toskyI don't care too much how, as long as it's possible (without patching stuff)15:32
toskythanks!15:32
jeremyfreudbergyes, i just care if someone has already tested it with success15:33
elmikonot successfully yet15:33
elmikosomeone in our chat is working on it, but they have not been able to make it work yet15:33
toskyuh, interesting15:33
elmikoso, it's /theoretically/ possible15:34
jeremyfreudbergelmiko: okay ,good to know15:34
elmikoi can keep you all updated when i hear more15:34
elmikoit might require a recompile of spark though to use the new dependencies15:34
toskysure, thanks, it would be really useful15:34
jeremyfreudbergyes: thanks elmiko!15:34
toskyso not a simple drop-in?15:34
toskyeven without using new features15:34
elmikoi'm not sure about the full extent, it sounded like a simple drop-in but then the person working on it ran into trouble15:35
jeremyfreudbergtosky: it looks like hdp might have the patch too, but i think i'll have to actually boot an image to be sure15:44
toskyack15:45
toskystill, good news15:45
toskywhich version of HDP? Or Ambari? Or both?15:45
jeremyfreudbergi did a spot check on a few of these: http://repo.hortonworks.com/content/repositories/releases/org/apache/hadoop/hadoop-aws/15:46
*** zchkun has quit IRC15:54
*** zchkun has joined #openstack-sahara16:01
*** tesseract has quit IRC16:09
*** jeremyfreudberg has quit IRC16:21
*** raopajay has joined #openstack-sahara16:45
openstackgerritLuigi Toscano proposed openstack/sahara-tests master: DNM Experimental scenario job based on RadosGW (not Swift)  https://review.openstack.org/57692916:50
tellesnobregajust read the logs17:04
tellesnobregahdp has the patches to work with s3? is that correct? spark may be a pain17:05
tellesnobregadid I get it all correct?17:05
toskyand cdh may have the patches too17:05
tellesnobregacool17:06
tellesnobregamapr issue is only with oozie?17:06
*** jeremyfreudberg has joined #openstack-sahara17:23
jeremyfreudbergmapr remains a mystery17:24
jeremyfreudbergtosky, actually, for spark, forget about hadoop 2.817:24
jeremyfreudbergi can just patch the hadoop 2.7.3 aws jar, instead patching hadoop-common17:24
jeremyfreudbergwhat i mean is, the "hadoop2.7" distro here http://archive.apache.org/dist/spark/spark-2.2.0/ is 2.7.317:25
toskyI see17:26
jeremyfreudbergprobably somebody (tosky) will think, wouldn't it be nice with we had hadoop 2.8 in spark anyway, to match the vanilla version17:28
jeremyfreudbergs/if/with17:28
jeremyfreudbergargh17:28
jeremyfreudbergs/with/if17:28
toskywe have an active watch on someone working on that (thanks to elmiko :)17:30
jeremyfreudberghere's an unrelated question17:31
jeremyfreudbergif i did one of those "Submitted Together" things on gerrit, what's the best way to edit the earlier commit?17:31
tellesnobregaI think you need to update the first and update the second all together17:32
toskymultiple patches? Heavy use of git interactive rebase17:32
jeremyfreudbergyeah, i guess it's interactive rebase17:33
toskydepending on how the fix touches the later patches, you can simply apply the fix on top and the rebase interactivly changing the order and merging two patches,17:33
toskyor interactivly rebase by stopping on the first patch and manually editing it17:33
toskyetc17:33
jeremyfreudbergtosky: the patch series here is the keystone thing17:33
toskythe fun part of git, where you can shot yourself in the foot and lose your work :D17:34
elmikotosky: i'll poke the person more on monday18:58
*** jeremyfreudberg has quit IRC19:19
toskythanks!19:22
openstackgerritLuigi Toscano proposed openstack/sahara-tests master: DNM Experimental scenario job based on RadosGW (not Swift)  https://review.openstack.org/57692922:16
*** rcernin has joined #openstack-sahara22:26
*** tosky has quit IRC23:34

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!