Community Wiki Home
Spaces
Apps
Templates
Create
Pentaho Data Integration
All content
Space settings
Content
Results will update as you type.
Show more above
•
Monitoring SWT Graphics Resources with Sleak
Data Quality Integration Home
•
Partitioning data with PDI
•
Import User Documentation
•
Configuring log tables for concurrent access
Pentaho Data Integration (aka Kettle) Concepts, Best Practices and Solutions
•
Pig Script Executor
•
Marketplace
The Thin Kettle JDBC driver
•
Database transactions in jobs and transformations
•
Job checkpoints and restartability
•
Carte Configuration
•
Column Format
•
MongoDB Output IC
•
NuoDB
•
Documentation Template for Steps and Job Entries
•
MongoDB Input IC
•
Services_Yarn_Documentation
Alfresco Output Plugin for Kettle
Pentaho Data Integration Steps
•
Closure Generator
•
Data Validator
•
Excel Input Step
•
Switch-Case
•
XML Join
•
Metadata Structure
•
Add XML
•
Text File Output (Deprecated)
•
Generate Random Value
•
Text File Input
•
Table Input
•
Get System Info
•
Generate Rows
•
De-serialize from file
•
XBase Input
•
Excel Input (XLS, XLSX) including OpenOffice Workbooks (ODS)
•
XML Input
•
Get File Names
•
Table Output
•
Insert - Update
•
Update
•
Delete
•
Serialize to file
•
XML Output
•
Excel Output
•
Access Output
•
Database lookup
•
Stream Lookup
•
Call DB Procedure
•
HTTP Client
•
Select Values
•
Filter Rows
•
Sort rows
•
Add sequence
•
Dummy (do nothing)
•
Row Normaliser
•
Split Fields
•
Unique Rows
•
Group By
•
Null If
•
Calculator
•
XML Add
•
Add Constants
•
Row denormaliser
•
Flattener
•
Value Mapper
•
Blocking step
•
Join Rows (Cartesian product)
•
Database Join
•
Merge rows
•
Sorted Merge
•
Merge Join
•
JavaScript Values
•
Modified Java Script Value
•
Execute SQL script
•
Dimension Lookup-Update
•
Combination lookup-update
•
Mapping
•
Get rows from result
•
Copy rows to result
•
Set Variables
•
Get Variable
•
Get files from result
•
Set files in result
•
Injector
•
Socket reader
•
Socket writer
•
Aggregate Rows
•
Streaming XML Input
•
Abort
•
Oracle Bulk Loader
•
Append
•
Regex Evaluation
•
CSV File Input
•
Fixed File Input
•
Access Input
•
LDAP Input
•
Mondrian Input
•
Get Files Rows Count
Get Data From XML
•
LDIF Input
•
Property Input
•
Mail Validator
•
Property Output
•
SQL File Output
•
Add a checksum
•
Append streams
•
Clone row
•
Delay row
•
Split field to rows
•
XSD Validator
•
XSL Transformation
•
Check if a column exists
•
File exists
•
Table Exists
•
Web services lookup
•
Mapping Input
•
Mapping Output
•
PostgreSQL Bulk Loader
•
Analytic Query
•
User Defined Java Expression
•
Google Analytics
•
Google Docs Input
•
HTTP Post
•
Execute a process
•
Formula
•
If field value is null
•
Process files
•
Execute row SQL script
•
RSS Input
•
Synchronize after merge
•
Dynamic SQL row
•
GZIP CSV Input
•
MySQL Bulk Loader
•
Salesforce Input
•
User Defined Java Class
•
Replace in String
•
SAP Input (Deprecated)
•
OLAP Input
•
Greenplum Load
•
HBase Input
•
HBase Output
•
Get File Name Step
•
Ingres VectorWise Bulk Loader
•
Get ID from Slave Server
•
XML Input Stream (StAX)
•
Automatic Documentation Output
•
Mail (step)
•
Check if webservice is available
•
Rule Executor
•
Rule Accumulator
•
Job Executor
•
Metadata Structure of Stream
•
OpenERP Object Input (Deprecated)
•
OpenERP Object Delete (Deprecated)
•
OpenERP Object Output (Deprecated)
•
Run SSH commands
•
Palo Dimension Input (Deprecated)
•
Palo Cell Input (Deprecated)
•
Palo Dimension Output (Deprecated)
•
Palo Cell Output (Deprecated)
Show more below
Blogs
Pentaho Data Integration
/
Pentaho Data Integration Steps
/
Set Variables
Summarize
Set Variables