Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The pentaho-hadoop-shims-api project provides the API/SPI for developing a shim implementation. A Hadoop configuration is a combination of shim implementation and supporting metadata and libraries. The following SPIs exist for interfacing with Hadoop-related libraries:

  • org.pentaho.hadoop.shim.spi.HadoopShim: Hadoop-related functions including HDFS, Hadoop Configuration, and Hive JDBC driver
  • org.pentaho.hadoop.shim.spi.SqoopShim: Ability to execute Sqoop tools
  • org.pentaho.hadoop.shim.spi.PigShim: Simple interface for executing Pig scripts

Defauilt implementations are provided for all shims as well as supporting objects.

...