...
- Start PDI on your desktop. Once it is running choose 'File' -> 'New' -> 'Transformation' from the menu system or click on the 'New file' icon on the toolbar and choose the 'Transformation' option.
- Create a Database Connection: You need to create a database connection so you can query hive, so right click on the 'Database connections' in the View palette and select New. If you already have a database connection for your RDBMS database you may skip this step.
The Database Connection window will appear. Enter the following information:- Connection Name: Enter 'RDBMS'
- Connection Type: Select 'MySQL'
- Host Name and Port Number: Your connection information for the MySQL Server. For a local MySQL database Host Name is 'localhost' and Port Number is '3306'.
- Database Name: Enter your database name. For a local MySQL database use 'test'
- User Name and Password: Your database username and password.
When you are done your window should look like:
Notice that there are lots of connection types that you could have used.
Click 'Test' to verify your connection is working properly. If the test fails verify your Hive RDBMS server is running and you have entered the correct connection information.
Click 'OK' to close the Database Connection window.
- Share the RDBMS Connection: You will want to use your RDBMS connection in future transformations, so share the connection by expanding 'Database Connections' in the View Palette, right clicking on 'RDBMS', and selecting 'Share'.
Sharing the connection will prevent you from having to recreate the connection every time you want to access the RDBMS in a transformation.