Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin
Wiki Markup
{scrollbar}

h1. Client Configuration

These instructions are for Hadoop distros other than MapR.  If you are using MapR go to the [Configure Pentaho for MapR] page.  If you are using Cloudera CDH4 MRv1 go to the [BAD:Configure Pentaho for Cloudera CDH4] page.

h2. Kettle Client

# Download and extract Kettle CE from the [Downloads] page.
_The Kettle Client comes pre-configured for Apache Hadoop 
Include Page
Archive Warning - 4.3
Archive Warning - 4.3

Client Configuration

These instructions are for Hadoop distros other than MapR.  If you are using MapR go to the Configure Pentaho for MapR page.  If you are using Cloudera CDH4 MRv1 go to the Configure Pentaho for Cloudera CDH4 page.

Kettle Client

  1. Download and extract Kettle CE from the Downloads page.
    The Kettle Client comes pre-configured for Apache Hadoop 0.20.2.

...

  1. If

...

  1. you

...

  1. are

...

  1. using

...

  1. this

...

  1. distro

...

  1. and

...

  1. version,

...

  1. no

...

  1. further

...

  1. configuration

...

  1. is

...

  1. required.

...

  1. Configure

...

  1. PDI

...

  1. Client

...

  1. for

...

  1. a

...

  1. different

...

  1. version

...

  1. of

...

  1. Hadoop

...

    1. Delete

...

    1. $PDI_HOME/libext/bigdata/hadoop-0.20.2-core.jar

...

    1. For

...

    1. all

...

    1. other

...

    1. distributions

...

    1. you

...

    1. should

...

    1. replace

...

    1. this

...

    1. core

...

    1. jar

...

    1. with

...

    1. the

...

    1. one

...

    1. from

...

    1. your

...

    1. cluster.

...

    1. For

...

    1. example,

...

    1. if

...

    1. you

...

    1. are

...

    1. using

...

    1. Cloudera

...

    1. CDHu3:

...


    1. Copy

...

    1. $HADOOP_HOME/hadoop-core-0.20.2-cdh3u3.jar

...

    1. to

...

    1. $PDI_HOME/libext/bigdata

...

    1. For

...

    1. Hadoop

...

    1. 0.20.205

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. have

...

    1. Apache

...

    1. Commons

...

    1. Configuration

...

    1. included

...

    1. in

...

    1. your

...

    1. set

...

    1. of

...

    1. PDI

...

    1. libraries.

...

    1.  In that case copy commons-configuration-1.7.jar

...

    1. to

...

    1. $PDI_HOME/libext/commons

...

    1. For

...

    1. Cloudera

...

    1. CDH3

...

    1. Update

...

    1. 3

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. copy

...

    1. $HADOOP_HOME/lib/guava-r09-jarjar.jar

...

    1. to

...

    1. $PDI_HOME/libext/bigdata.

...

  1. Apply Hadoop client configuration files by placing the core-site, hdfs-site, and mapred-site.xml files in the $PDI_HOME directory.

Pentaho Report Designer (PRD)

  1. Download and extract PRD from the Downloads page.
    PRD comes pre-configured for Apache Hadoop 0.20.2.

...

  1. If

...

  1. you

...

  1. are

...

  1. using

...

  1. this

...

  1. distro

...

  1. and

...

  1. version,

...

  1. no

...

  1. further

...

  1. configuration

...

  1. is

...

  1. required.

...

  1. Configure

...

  1. PRD

...

  1. for

...

  1. a

...

  1. different

...

  1. version

...

  1. of

...

  1. Hadoop

...

    1. Delete

...

    1. $PRD_HOME/lib/bigdata/hadoop-0.20.2-core.jar

...

    1. Copy

...

    1. $HADOOP_HOME/hadoop-core.jar

...

    1. from

...

    1. your

...

    1. distribution

...

    1. into

...

    1. $PRD_HOME/lib/bigdata

...

    1. For

...

    1. Hadoop

...

    1. 0.20.205

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. have

...

    1. Apache

...

    1. Commons

...

    1. Configuration

...

    1. included

...

    1. in

...

    1. your

...

    1. set

...

    1. of

...

    1. PDI

...

    1. libraries.

...

    1.  In that case copy commons-configuration-1.7.jar

...

    1. to

...

    1. $PRD_HOME/lib/bigdata

...

    1. For

...

    1. Cloudera

...

    1. CDH3

...

    1. Update

...

    1. 3

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. copy

...

    1. $HADOOP_HOME/lib/guava-r09-jarjar.jar

...

    1. to

...

    1. $PRD_HOME/lib.

...

Pentaho

...

BI

...

Server

...

  1. Download

...

  1. and

...

  1. extract

...

  1. The

...

  1. BI

...

  1. Server

...

  1. from

...

  1. the

...

  1. Downloads

...

  1. page.

...


  1. The

...

  1. BI

...

  1. Server

...

  1. comes

...

  1. pre-configured

...

  1. for

...

  1. Apache

...

  1. Hadoop

...

  1. 0.20.2.

...

  1. If

...

  1. you

...

  1. are

...

  1. using

...

  1. this

...

  1. distro

...

  1. and

...

  1. version,

...

  1. no

...

  1. further

...

  1. configuration

...

  1. is

...

  1. required.

...

  1. Configure

...

  1. BI

...

  1. Server

...

  1. for

...

  1. a

...

  1. different

...

  1. version

...

  1. of

...

  1. Hadoop

...

    1. Delete

...

    1. $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/hadoop-0.20.2-core.jar

...

    1. Copy

...

    1. $HADOOP_HOME/hadoop-core.jar

...

    1. from

...

    1. your

...

    1. distribution

...

    1. into

...

    1. $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib/

...

    1. For

...

    1. Hadoop

...

    1. 0.20.205

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. have

...

    1. Apache

...

    1. Commons

...

    1. Configuration

...

    1. included

...

    1. in

...

    1. your

...

    1. set

...

    1. of

...

    1. PDI

...

    1. libraries.

...

    1.  In that case copy commons-configuration-1.7.jar

...

    1. to

...

    1. $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/lib

...

    1. For

...

    1. Cloudera

...

    1. CDH3

...

    1. Update

...

    1. 3)

...

    1. you

...

    1. also

...

    1. need

...

    1. to

...

    1. copy

...

    1. $HADOOP_HOME/lib/guava-r09-jarjar.jar

...

    1. to

...

    1. $PDI_HOME/libext/pentaho

...

    1. .
  1. Place the Hadoop configuration files (hdfs-site.xml, core-site.xml, mapred-site.xml) into $BI_SERVER_HOME/tomcat/webapps/pentaho/WEB-INF/classes
    Include Page
    Known Configuration Issues
    Known Configuration Issues