Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Introduction to Spoon

What is Spoon?

Kettle is an acronym for "Kettle E.T.T.L. Environment." Kettle is designed to help you with your ETTL needs, which include the Extraction, Transformation, Transportation and Loading of data.

Spoon is a graphical user interface that allows you to design transformations and jobs that can be run with the Kettle tools — Pan and Kitchen. Pan is a data transformation engine that performs a multitude of functions such as reading, manipulating, and writing data to and from various data sources. Kitchen is a program that executes jobs designed by Spoon in XML or in a database repository. Jobs are usually scheduled in batch mode to be run automatically at regular intervals.

Note: For a complete description of Pan or Kitchen, see the Pan and Kitchen user guides.

Transformations and Jobs can describe themselves using an XML file or can be put in a Kettle database repository. Pan or Kitchen can then read the data to execute the steps described in the transformation or to run the job. In summary, Pentaho Data Integration makes data warehouses easier to build, update, and maintain.

Installing Spoon

Follow the instructions below to install Spoon:

  1. Install the Sun Microsystems Java Runtime Environment version 1.4 or higher. You can download a JRE for free at http://www.javasoft.com/.
  2. Unzip the zip-file: Kettle-3.0.zip in a directory of your choice.
  3. Under Unix-like environments (Solaris, Linux, MacOS, for example), you must make the shell scripts executable. Execute these commands to make all shell scripts in the Kettle directory executable:
Code Block

cd Kettle
chmod +x *.sh

Launching Spoon

The scripts below allow you to launch Spoon on different platforms:

  • Spoon.bat: launch Spoon on the Windows platform.
  • spoon.sh: launch Spoon on a Unix-like platform such as Linux, Apple OSX, Solaris

To make a shortcut under the Windows platform, an icon is provided. Use "spoon.ico" to set the correct icon. Point the shortcut to the spoon.bat file.

Supported platforms

The Spoon GUI is supported on the following platforms:

  • Microsoft Windows: all platforms since Windows 95, including Vista
  • Linux GTK: on i386 and x86_64 processors, works best on Gnome
  • Apple's OSX: works both on PowerPC and Intel machines
  • Solaris: using a Motif interface (GTK optional)
  • AIX: using a Motif interface
  • HP-UX: using a Motif interface (GTK optional)
  • FreeBSD: preliminary support on i386, not yet on x86_64

Known Issues

Below is a list of known issues associated with Spoon.

Linux
Occasional JVM crashes running SuSE Linux and KDE. Running under Gnome presents no problems. (detected on SUSE Linux 10.1 but earlier versions also have the same problem)

FreeBSD
Problems with drag and drop. Use the right click pop up menu on the canvas as a workaround.

Note: (Insert new step)

Check the Tracker lists at http://kettle.javaforge.com for up-to-date information about
recently discovered issues.

User Interface Overview

The Main tree in the upper-left panel of Spoon allows you to browse connections associated with the jobs and transformations you have open. When designing a transformation, the Core Objects palate in the lower left-panel contains the available steps used to build your transformation including input, output, lookup, transform, joins, scripting steps and more. When designing a job, the Core objects palate contains the available job entries. The Core Objects bar contains a variety of job entry types. These items are described in detail in the following chapters: 3.0 Database Connections, 6.0 Hops, 9.0 Transformation Steps, 11.0 Job Entries, 12.0 Graphical View.

Command line options

Below are the command line options that you can use when starting the Spoon application:

Code Block

-file=filename

The option below runs the specified transformation (.ktr : Kettle Transformation).

Code Block

-logfile=Logging Filename

The option below allows you to specify the location of the log file. The default is the standard output.

Code Block

-level=Logging Level

The level option sets the log level for the transformation being run. Below are the possible values:

Nothing:

Do not display any output

Error:

Only display errors

Minimal:

Use minimal logging

Basic:

This is the default basic logging level

Detailed:

Give detailed logging output

Debug:

Show detailed output for debugging purposes.

Rowlevel:

Detailed logging at a row level. Warning - this generates a lot of data.

Connect to the repository with name "Repository name" as shown below:

Code Block

-rep=Repository name

Note: You must specify the options -user, -pass and -trans described below. The repository details are loaded from the file repositories.xml in the local directory or in the Kettle directory:

Code Block

$HOME/.kettle/ or C:\Documents and Settings\<username>\.kettle on Windows.

Below is the user name with which you want to connect to the repository.

Code Block

-user=Username

Below is the password used to connect to the repository.

Code Block

-pass=Password

Use the option below to select the transformation to run from the repository.

Code Block

-trans=Transformation Name

Use the option below to select the job to run from the repository.

Code Block

-job=Job Name

Importants:

  1. On Windows, Pentaho advises you to use the /option:value format to avoid command line parsing problems by the MS-DOS shell.
  2. Fields in italic represent the values that the options use.
  3. Use quotes or double quotes if there are spaces in option values; quotes ensure that option values stay together.

Note: Examine the examples below for more information (what examples?)

Repository

Spoon allows you to store transformation and job files to the local file system or in the Kettle repository. The Kettle repository can be housed in any common relational database. To load a transformation from a database repository, you must connect to this repository. Define a database connection to the repository when you start Spoon, as shown below:
Image Removed
The information associated with repositories is stored in "repositories.xml". This file is located in the hidden directory ".kettle" in your default home directory. On Windows, the file is located in C:\Documents and Settings\<username>\.kettle

Note: The complete path and filename associated with repositories.xml file is displayed on the Spoon console.

If you don't want Welcome dialog to appear each time Spoon starts up, disable the 'Present this dialog at startup' check box or use the Options dialog under the Edit / Options menu. See also 2.14. Options.

Note: The default password for the administrator is admin. Pentaho strongly recommends that you change this default password using the Repository Explorer or the "Repository/Edit User" menu.

Repository Auto-Login

You can have Spoon automatically log into the repository by setting the following environment variables: KETTLE_REPOSITORY, KETTLE_USER and KETTLE_PASSWORD.

(warning) Important: Because logging automatically poses a security risk, Pentaho strongly recommends that you always lock your computer to prevent unauthorized access to the repository.

License

Beginning with version 2.2.0, Kettle was released into the public domain under the LGPL license. Please refer to Appendix A for the full text of this license.

Note: Pentaho Data Integration is referred to as "Kettle" below.

No Format

Copyright (C) 2006 Pentaho Corporation

Kettle is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.

Kettle is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with the Kettle distribution; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA

Definitions

Transformation Definitions

...