Community Wiki Home
Spaces
Apps
Templates
Create
Pentaho Data Integration
All content
Space settings
Content
Results will update as you type.
Show more above
•
Feature checkboxes
Frequently Asked Questions
•
Getting Started
•
Kitchen User Documentation
•
Launching job entries in parallel
•
My transformation is running slow, what do I do?!
•
Named Parameters
•
Pan User Documentation
PDI Developer information
Pentaho Data Integration (Kettle) Tutorial
Pentaho Data Integration 3.0 migration guide
•
Pentaho Data Integration Case Studies
•
Pentaho Data Integration - Java API Examples
Pentaho Data Integration Job Entries
•
Pentaho Data Integration Screenshots
Pentaho Data Integration Recorded Demos
Pentaho Data Integration v3.2. Job Entries
Slave servers and clustering
Special database issues and experiences
Spoon User Guide
•
Step performance monitoring
•
What's new in PDI version 3.1
•
What's new in PDI version 3.2
Special Operating System issues and experiences
Writing your own Pentaho Data Integration Plug-In
Documenting Pentaho Data Integration (Kettle) Projects
•
Kettle dependency management
Kettle Exchange
•
Monitoring SWT Graphics Resources with Sleak
Data Quality Integration Home
•
Partitioning data with PDI
•
Import User Documentation
•
Configuring log tables for concurrent access
Pentaho Data Integration (aka Kettle) Concepts, Best Practices and Solutions
•
Pig Script Executor
•
Marketplace
The Thin Kettle JDBC driver
•
Database transactions in jobs and transformations
•
Job checkpoints and restartability
•
Carte Configuration
•
Column Format
•
MongoDB Output IC
•
NuoDB
•
Documentation Template for Steps and Job Entries
•
MongoDB Input IC
•
Services_Yarn_Documentation
Alfresco Output Plugin for Kettle
Pentaho Data Integration Steps
•
Closure Generator
•
Data Validator
•
Excel Input Step
•
Switch-Case
•
XML Join
•
Metadata Structure
•
Add XML
•
Text File Output (Deprecated)
•
Generate Random Value
•
Text File Input
•
Table Input
•
Get System Info
•
Generate Rows
•
De-serialize from file
•
XBase Input
•
Excel Input (XLS, XLSX) including OpenOffice Workbooks (ODS)
•
XML Input
•
Get File Names
•
Table Output
•
Insert - Update
•
Update
•
Delete
•
Serialize to file
•
XML Output
•
Excel Output
•
Access Output
•
Database lookup
•
Stream Lookup
•
Call DB Procedure
•
HTTP Client
•
Select Values
•
Filter Rows
•
Sort rows
•
Add sequence
•
Dummy (do nothing)
•
Row Normaliser
•
Split Fields
•
Unique Rows
•
Group By
•
Null If
•
Calculator
•
XML Add
•
Add Constants
•
Row denormaliser
•
Flattener
•
Value Mapper
•
Blocking step
•
Join Rows (Cartesian product)
•
Database Join
•
Merge rows
•
Sorted Merge
•
Merge Join
•
JavaScript Values
•
Modified Java Script Value
•
Execute SQL script
•
Dimension Lookup-Update
•
Combination lookup-update
•
Mapping
•
Get rows from result
•
Copy rows to result
•
Set Variables
•
Get Variable
•
Get files from result
•
Set files in result
•
Injector
•
Socket reader
•
Socket writer
•
Aggregate Rows
•
Streaming XML Input
•
Abort
•
Oracle Bulk Loader
•
Append
•
Regex Evaluation
•
CSV File Input
•
Fixed File Input
•
Access Input
•
LDAP Input
•
Mondrian Input
•
Get Files Rows Count
Get Data From XML
•
LDIF Input
•
Property Input
•
Mail Validator
•
Property Output
•
SQL File Output
•
Add a checksum
•
Append streams
•
Clone row
•
Delay row
•
Split field to rows
•
XSD Validator
•
XSL Transformation
•
Check if a column exists
•
File exists
•
Table Exists
•
Web services lookup
•
Mapping Input
•
Mapping Output
•
PostgreSQL Bulk Loader
•
Analytic Query
•
User Defined Java Expression
•
Google Analytics
•
Google Docs Input
•
HTTP Post
•
Execute a process
•
Formula
•
If field value is null
•
Process files
•
Execute row SQL script
•
RSS Input
•
Synchronize after merge
•
Dynamic SQL row
•
GZIP CSV Input
Show more below
Blogs
Pentaho Data Integration
/
Pentaho Data Integration Steps
/
Sort rows
Summarize
Sort rows
Former user (Deleted)
Virginia Agnew
Former user (Deleted)
+2
Owned by
Former user (Deleted)
Last updated:
Aug 13, 2021
by
Virginia Agnew
2 min read
Loading data...