
How to set up and configure Kettle for your specific Hadoop distribution.

This page applies only to Kettle (PDI) 4.4 and BA Suite 4.8. For 5.0, go here.

The Pentaho applications come pre-configured for Apache Hadoop 0.20.2. If you are using this distro and version, no further configuration is required.

Documentation for configuring Pentaho for distros other than Apache Hadoop 0.20.2 is now located on the Pentaho Infocenter here.

Currently supported Hadoop distributions:

Pentaho uses an abstraction layer, called a shim, to keep up with the rapid and never-ending stream of distribution version updates. The following list shows the current known support status of the various distributions. A minor or patch version change in a distribution generally does not require a shim update.
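The tables on this page write version ranges with a trailing wildcard, such as 0.20.x. As a minimal sketch of how that notation can be read, the hypothetical helper below checks whether a concrete version falls under such a pattern. The function name and matching rule are illustrative assumptions, not part of Kettle or the Big Data Plugin.

```python
def matches(pattern: str, version: str) -> bool:
    """Return True if a dotted version satisfies a pattern such as '0.20.x'.

    Hypothetical helper for reading the tables on this page.
    """
    p_parts = pattern.lower().split(".")
    v_parts = version.lower().split(".")
    for i, p in enumerate(p_parts):
        # An 'x' is a wildcard for this component and everything after it.
        if p == "x":
            return True
        if i >= len(v_parts) or p != v_parts[i]:
            return False
    # No wildcard seen: require an exact match.
    return len(p_parts) == len(v_parts)


print(matches("0.20.x", "0.20.2"))
print(matches("0.20.x", "1.0.4"))
```

Under this reading, a patch release such as 0.20.2 stays within the 0.20.x row, which is consistent with the statement that patch-level changes generally do not need a new shim.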

Upgrade your Big Data Plugin to version 1.3.3.1

The Big Data plugin has been updated to version 1.3.3.1 and is available for download. This upgrade works with PDI 4.4 (Suite 4.8) and is compatible with both EE and CE editions of Pentaho.

Additional shims that were not shipped with the updated plugin are available on the Additional Shims download page.

Important information about supported Hadoop versions

Pentaho does not ship all available shims with the product. Shims that support older distributions, as well as new shims created after a release, are available for download. If a note says that a later shim also supports your version, Pentaho recommends using the later shim.

Click Install Hadoop Distribution Shim for installation instructions.
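After a shim is installed, it usually has to be selected as the active one. The sketch below assumes that the Big Data Plugin reads the active shim from a property named active.hadoop.configuration in its plugin.properties file. Neither the property name nor the file name is stated on this page, so verify both against the installation instructions. The helper only rewrites text and does not touch any file.

```python
import re

# Assumed property name; confirm it in your own plugin.properties.
ACTIVE_KEY = "active.hadoop.configuration"


def set_active_shim(properties_text: str, shim: str) -> str:
    """Return the properties text with the active shim set to `shim`."""
    pattern = re.compile(
        r"^[ \t]*" + re.escape(ACTIVE_KEY) + r"[ \t]*=.*$", re.MULTILINE
    )
    new_line = f"{ACTIVE_KEY}={shim}"
    if pattern.search(properties_text):
        # Replace the first existing assignment in place.
        return pattern.sub(lambda _m: new_line, properties_text, count=1)
    # Key not present: append it on its own line.
    needs_newline = bool(properties_text) and not properties_text.endswith("\n")
    return properties_text + ("\n" if needs_newline else "") + new_line + "\n"


# Hypothetical file contents, used only for illustration.
sample = "# example contents\nactive.hadoop.configuration=hadoop-20\n"
print(set_active_shim(sample, "cdh42"))
```

The shim name passed in (cdh42 here) should match the Shim column of the tables on this page.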

Wiki Markup
{composition-setup}{composition-setup}
Wiki Markup
{deck:id=MyDeck|class=tan}

Wiki Markup
{card:label=Apache}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| 0.20.x | hadoop-20 | 4.8+ | included | |
| 1.0.x | NS* | | | No support planned. See this blog post |
| 1.1.x | NS* | | | Not likely to be done in favor of 1.2.x |
| 1.2.x | NS* | | | Possibly in patch post 5.0 but not committed |
| 2.x.x | NS* | | | Distro is Alpha |

Go to Apache releases

Wiki Markup
{card}
Wiki Markup
{card:label=Cloudera}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| CDH3u3, u4 and u5 | cdh3U4 | 4.8+ | download | Support will be dropped in 5.0 |
| CDH4.0, 4.0.1, 4.1, 4.1.1 | cdh4 | 4.8+ | download | The cdh42 shim also supports this configuration |
| CDH4.1.2 | cdh412 | 4.8 + BD Plugin 1.3.2+ | download | The cdh42 shim also supports this configuration |
| CDH4.1.3 | cdh413 | 4.8 + BD Plugin 1.3.2+ | download | The cdh42 shim also supports this configuration |
| CDH4.2 | cdh42 | 4.8 + BD Plugin 1.3.2+ | included | Backward compatible with all earlier cdh4.x distros |
| CDH4.2.1 | cdh42 | 4.8 + BD Plugin 1.3.3.1+ | included | |
| CDH4.3 | cdh42 | 4.8 + BD Plugin 1.3.3.1+ | included | |
| CDH4.4.x | cdh42 | 4.8 + BD Plugin 1.3.3.1+ | included | |

Go to Cloudera releases

NOTE: the cdh42 shim supports all versions of CDH from 4.0 through 4.4.x

Wiki Markup
{card}
Wiki Markup
{card:label=DataStax}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| DSE 3.0.x | NS* | | | Possibly in patch post 5.0 but not committed |
| DSE 2.2.x | NS* | | | No current plans to support |

Go to DataStax releases

Wiki Markup
{card}
Wiki Markup
{card:label=Hortonworks}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| HDP 1.2.x | hdp12 | 4.8 + BD Plugin 1.3.2+ | included | |
| HDP 1.3.x | hdp13 | 4.8 + BD Plugin 1.3.2+ | download | |
| HDP 2.x | NS* | | | In patch post 5.0 |
| HDP 1.1 for Win | NS* | | | In patch post 5.0 |

Go to Hortonworks releases

Wiki Markup
{card}
Wiki Markup
{card:label=Intel}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| IDH 2.3 | ihd23 | 4.8 + BD Plugin 1.3.2+ | | |

Go to Intel releases

Wiki Markup
{card}
Wiki Markup
{card:label=MapR}
|| Hadoop Version || Shim || Pentaho Suite Ver || Download || Notes ||
| 1.1.3, 1.2.0 | mapr | 4.8+ | download | |
| 2.0.x | NS* | | | No support planned (PDI-9648) |
| 2.1.x | mapr21 | 4.8 + BD Plugin 1.3.2+ | included | |
| 3.0.x | NS* | | | Planned for immediately post 5.0 |

Go to MapR releases

Wiki Markup
{card}
Wiki Markup
{deck}

* NS - Not supported. See Hadoop Configurations for information on how to create or modify a shim to support your configuration.

+ Pentaho Ver is the earliest version of the Pentaho suite that supports this shim. Subsequent Pentaho versions will also support this shim unless otherwise noted.
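The recommendation to prefer the later shim when more than one supports your version can be sketched as a small lookup. The data below is transcribed from the Cloudera card, with the note that cdh42 covers CDH 4.0 through 4.4.x folded in. The structure and function names are illustrative assumptions, and keys are the row labels as written in the table (so CDH4.4.x is a literal label here, not a wildcard).

```python
# Shims listed oldest first; each entry is (shim, supported table labels).
SHIMS_OLDEST_FIRST = [
    ("cdh3U4", ("CDH3u3", "CDH3u4", "CDH3u5")),
    ("cdh4", ("CDH4.0", "CDH4.0.1", "CDH4.1", "CDH4.1.1")),
    ("cdh412", ("CDH4.1.2",)),
    ("cdh413", ("CDH4.1.3",)),
    ("cdh42", ("CDH4.0", "CDH4.0.1", "CDH4.1", "CDH4.1.1", "CDH4.1.2",
               "CDH4.1.3", "CDH4.2", "CDH4.2.1", "CDH4.3", "CDH4.4.x")),
]


def candidate_shims(version):
    """Return every shim whose row (or note) lists this version."""
    return [shim for shim, versions in SHIMS_OLDEST_FIRST if version in versions]


def recommended_shim(version):
    """Prefer the latest shim that supports the version, or None if NS."""
    candidates = candidate_shims(version)
    return candidates[-1] if candidates else None


print(candidate_shims("CDH4.1.2"))
print(recommended_shim("CDH4.1.2"))
```

A result of None corresponds to the NS marker: no listed shim supports that version.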

Wiki Markup
{HTMLComment}
{tip}
The Pentaho support policy for Hadoop is available on the [Pentaho Support Plan for Hadoop Distributions] page.
{tip}
{HTMLComment}

Open JIRA cases for Distro Support

{jiraissues:url=http://jira.pentaho.com/sr/jira.issueviews:searchrequest-xml/temp/SearchRequest.xml?jqlQuery=labels+%3D+BD_Distro+AND+status+in+%28Open%2C+%22In+Progress%22%2C+%22Resolved%22%2C+Reopened%2C+%22Ready+For+Test%22%2C+%22Ready+for+Publishing%22%29&tempMax=1000|columns=key;fixVersion;summary;status;assignee;updated|anonymous=true}


Release resources
