Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Added copying the XML files for BI Server
Wiki Markup
{scrollbar}

h1. Client Configuration

These instructions are for Hadoop distros other than MapR.  If you are using MapR go to the [Configure Pentaho for MapR] page.  If you are using Cloudera CDH4 MRv1 go to the [BAD:Configure Pentaho for Cloudera CDH4] page.

h2. Kettle Client

# Download and extract Kettle CE from the [Downloads] page.
_The Kettle Client comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure PDI Client for a different version of Hadoop
## Delete $PDI_HOME/libext/bigdata/hadoop-0.20.2-core.jar
## For all other distributions you should replace this core jar with the one from your cluster. For example, if you are using Cloudera CDHu3:
Copy $HADOOP_HOME/hadoop-core-0.20.2-cdh3u3.jar to $PDI_HOME/libext/bigdata
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $PDI_HOME/libext/commons
## For Cloudera CDH3 Update 3 you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PDI_HOME/libext/bigdata.
# Apply Hadoop client configuration files by placing the core-site, hdfs-site, and mapred-site.xml files in the $PDI_HOME directory.

h2. Pentaho Report Designer (PRD)

# Download and extract PRD from the [Downloads] page.
_PRD comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure PRD for a different version of Hadoop
## Delete $PRD_HOME/lib/bigdata/hadoop-0.20.2-core.jar
## Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $PRD_HOME/lib/bigdata
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $PRD_HOME/lib/bigdata
## For Cloudera CDH3 Update 3 you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PRD_HOME/lib.

h2. Pentaho BI Server

# Download and extract The BI Server from the [Downloads] page.
_The BI Server comes pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required._
# Configure BI Server for a different version of Hadoop
## Delete $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/hadoop-0.20.2-core.jar
## Copy $HADOOP_HOME/hadoop-core.jar from your distribution into $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/
## For Hadoop 0.20.205 you also need to have Apache Commons Configuration included in your set of PDI libraries.  In that case copy [commons-configuration-1.7.jar|http://commons.apache.org/configuration/download_configuration.cgi] to $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib
## For Cloudera CDH3 Update 3) you also need to copy $HADOOP_HOME/lib/guava-r09-jarjar.jar to $PDI_HOME/libext/pentaho.
# Place the Hadoop configuration files (hdfs-site.xml, core-site.xml, mapred-site.xml) into ${BI$BI_SERVER_HOME}/tomcat/webapps/pentaho/WEB-INF/classes
{include:Known Configuration Issues}