*** rcernin_ has joined #openstack-sahara | 00:02 | |
*** rcernin has quit IRC | 00:03 | |
*** rcernin has joined #openstack-sahara | 00:29 | |
*** rcernin has quit IRC | 00:29 | |
*** rcernin has joined #openstack-sahara | 00:30 | |
*** rcernin_ has quit IRC | 00:32 | |
*** rickflare2 has joined #openstack-sahara | 01:04 | |
*** Bhujay has joined #openstack-sahara | 03:04 | |
*** Bhujay has quit IRC | 03:40 | |
*** dave-mccowan has quit IRC | 04:12 | |
*** Bhujay has joined #openstack-sahara | 04:21 | |
*** links has joined #openstack-sahara | 04:43 | |
*** pcaruana has joined #openstack-sahara | 06:44 | |
*** rcernin has quit IRC | 07:02 | |
*** openstackgerrit has joined #openstack-sahara | 07:12 | |
openstackgerrit | wutao proposed openstack/openstack-ansible-os_sahara master: Drop un-used packages from role https://review.openstack.org/591569 | 07:12 |
---|---|---|
*** Bhujay has quit IRC | 08:10 | |
*** openstackstatus has quit IRC | 08:12 | |
*** Bhujay has joined #openstack-sahara | 08:34 | |
*** openstackstatus has joined #openstack-sahara | 09:42 | |
*** ChanServ sets mode: +v openstackstatus | 09:42 | |
*** hoonetorg has quit IRC | 09:43 | |
*** hoonetorg has joined #openstack-sahara | 09:57 | |
*** tellesnobrega has joined #openstack-sahara | 12:27 | |
*** dave-mccowan has joined #openstack-sahara | 12:43 | |
Bhujay | tellesnobrega: the cluster verification passed when RM is with HA . Cluster start event failed. Which is usual and normally i could recover it from the ambari dashbord by reinsllating and restarting services. However , with RM and HA , the RM service is not starting | 13:02 |
tellesnobrega | Bhujay, this is both good and bad | 13:04 |
tellesnobrega | did you check why it didn't start? | 13:05 |
*** Bhujay has quit IRC | 13:07 | |
*** Bhujay has joined #openstack-sahara | 13:07 | |
Bhujay | tellesnobrega: yah , the good new is verfication logic works . thanks for your info yesterday. I will go through the logs for RM/YARN startup problem and update you | 13:13 |
Bhujay | hdfs , zookeeper , knox and ambari merics have started OK | 13:14 |
tellesnobrega | cool | 13:15 |
tellesnobrega | thanks | 13:15 |
Bhujay | yarn , mapreduce2 , hive , hbase , oozie and spark histry srv fails to start . Need to understand the interdependency , will take some time. In case you gys have any idea please share | 13:16 |
tellesnobrega | Bhujay, sure. Did all those services started without HA? | 13:17 |
tellesnobrega | If we bring the java issue here, it might explain the failure to start those services, but I'm not certain of it | 13:17 |
tellesnobrega | it could be an issue | 13:17 |
Bhujay | namenode was with HA, that have started ok, The resource manager ( one of them ) starts and fails within a minute . I will check the server log for any java issue | 13:20 |
tellesnobrega | cool | 13:21 |
*** Bhujay has quit IRC | 13:40 | |
*** Bhujay has joined #openstack-sahara | 14:52 | |
openstackgerrit | Alex Schultz proposed openstack/puppet-sahara master: Prepare Rocky RC1 https://review.openstack.org/591721 | 14:58 |
*** links has quit IRC | 15:41 | |
Bhujay | tellenobrega , RM log after starting the service shows it gets selected as active node after the ActiveStandbyElector process and then starts failing after recovery.RMStateStore (RMStateStore.java:checkVersion(634)) - Loaded RM state version info 1.2 | 15:51 |
Bhujay | the detail log is here http://paste.openstack.org/show/728020/ | 15:51 |
Bhujay | tellesnobrega | 15:52 |
Bhujay | is there any special settings /parameters need to be passed to the cluster while enabling RM HA? | 15:53 |
Bhujay | could it be due to the fact that we have used HA for name node ? https://community.hortonworks.com/questions/167395/operation-category-read-is-not-supported-in-state.html | 16:00 |
*** pcaruana has quit IRC | 16:02 | |
openstackgerrit | Merged openstack/puppet-sahara master: Add Sahara API WSGI support https://review.openstack.org/590354 | 16:15 |
tellesnobrega | Bhujay, looking into it now | 16:44 |
Bhujay | thanks | 16:45 |
tellesnobrega | Bhujay, can you try this https://community.cloudera.com/t5/Storage-Random-Access-HDFS/ls-Operation-category-READ-is-not-supported-in-state-standby/m-p/61578 | 16:49 |
tellesnobrega | using nameservices to connect | 16:49 |
tellesnobrega | maybe you could try was well with a single namenode, no HA on it | 16:50 |
tellesnobrega | if it starts it should work properly | 16:50 |
tellesnobrega | this can be helpful too | 16:51 |
tellesnobrega | https://community.cloudera.com/t5/Storage-Random-Access-HDFS/Cannot-start-an-HA-namenode-with-name-dirs-that-need/td-p/61468 | 16:51 |
Bhujay | sure , i will try that . thanks | 16:51 |
tellesnobrega | no problem | 16:51 |
*** Bhujay has quit IRC | 17:22 | |
openstackgerrit | Merged openstack/puppet-sahara master: Make providers use auth_url for authentication https://review.openstack.org/588521 | 21:34 |
openstackgerrit | Merged openstack/puppet-sahara stable/queens: Make providers use auth_url for authentication https://review.openstack.org/589097 | 21:36 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!