Community Wiki Home
Spaces
Apps
Templates
Create
Pentaho Data Integration
All content
Space settings
Content
Results will update as you type.
Show more above
•
Closure Generator
•
Data Validator
•
Excel Input Step
•
Switch-Case
•
XML Join
•
Metadata Structure
•
Add XML
•
Text File Output (Deprecated)
•
Generate Random Value
•
Text File Input
•
Table Input
•
Get System Info
•
Generate Rows
•
De-serialize from file
•
XBase Input
•
Excel Input (XLS, XLSX) including OpenOffice Workbooks (ODS)
•
XML Input
•
Get File Names
•
Table Output
•
Insert - Update
•
Update
•
Delete
•
Serialize to file
•
XML Output
•
Excel Output
•
Access Output
•
Database lookup
•
Stream Lookup
•
Call DB Procedure
•
HTTP Client
•
Select Values
•
Filter Rows
•
Sort rows
•
Add sequence
•
Dummy (do nothing)
•
Row Normaliser
•
Split Fields
•
Unique Rows
•
Group By
•
Null If
•
Calculator
•
XML Add
•
Add Constants
•
Row denormaliser
•
Flattener
•
Value Mapper
•
Blocking step
•
Join Rows (Cartesian product)
•
Database Join
•
Merge rows
•
Sorted Merge
•
Merge Join
•
JavaScript Values
•
Modified Java Script Value
•
Execute SQL script
•
Dimension Lookup-Update
•
Combination lookup-update
•
Mapping
•
Get rows from result
•
Copy rows to result
•
Set Variables
•
Get Variable
•
Get files from result
•
Set files in result
•
Injector
•
Socket reader
•
Socket writer
•
Aggregate Rows
•
Streaming XML Input
•
Abort
•
Oracle Bulk Loader
•
Append
•
Regex Evaluation
•
CSV File Input
•
Fixed File Input
•
Access Input
•
LDAP Input
•
Mondrian Input
•
Get Files Rows Count
Get Data From XML
•
Get Data from XML - Handling Large Files
•
LDIF Input
•
Property Input
•
Mail Validator
•
Property Output
•
SQL File Output
•
Add a checksum
•
Append streams
•
Clone row
•
Delay row
•
Split field to rows
•
XSD Validator
•
XSL Transformation
•
Check if a column exists
•
File exists
•
Table Exists
•
Web services lookup
•
Mapping Input
•
Mapping Output
•
PostgreSQL Bulk Loader
•
Analytic Query
•
User Defined Java Expression
•
Google Analytics
•
Google Docs Input
•
HTTP Post
•
Execute a process
•
Formula
•
If field value is null
•
Process files
•
Execute row SQL script
•
RSS Input
•
Synchronize after merge
•
Dynamic SQL row
•
GZIP CSV Input
•
MySQL Bulk Loader
•
Salesforce Input
•
User Defined Java Class
•
Replace in String
•
SAP Input (Deprecated)
•
OLAP Input
•
Greenplum Load
•
HBase Input
•
HBase Output
•
Get File Name Step
•
Ingres VectorWise Bulk Loader
•
Get ID from Slave Server
•
XML Input Stream (StAX)
•
Automatic Documentation Output
•
Mail (step)
•
Check if webservice is available
•
Rule Executor
•
Rule Accumulator
•
Job Executor
•
Metadata Structure of Stream
•
OpenERP Object Input (Deprecated)
•
OpenERP Object Delete (Deprecated)
•
OpenERP Object Output (Deprecated)
•
Run SSH commands
•
Palo Dimension Input (Deprecated)
•
Palo Cell Input (Deprecated)
•
Palo Dimension Output (Deprecated)
•
Palo Cell Output (Deprecated)
•
SAS Input
•
Single Threader
•
Cassandra Input
•
Cassandra Output
•
Microsoft Excel Writer
•
LucidDB bulk loader
•
MongoDB Input
•
MongoDB Output
•
HL7 Input
•
ETL Metadata Injection
•
Teradata Fastload Bulk Loader
•
Edi to XML
•
Avro Input (Deprecated)
•
Add value fields changing sequence
•
Block this step until steps finish
•
Change file encoding
•
Sample rows
•
Pentaho Reporting Output
•
Check if file is locked
Show more below
Blogs
Pentaho Data Integration
/
/
Get Data From XML
/
Get Data from XML - Handling Large Files
Summarize
Get Data from XML - Handling Large Files
Former user (Deleted)
Owned by
Former user (Deleted)
Last updated:
Sept 18, 2008
Version comment
Loading data...