Greenplum's external tables and its parallel file server, gpfdist, make efficient data loads possible. Spoon provides a convenient loading mechanism, illustrated here with a simple two-step job.

The "Create External Table" step, shown below, creates an external table, external_samples_customer2. The data is provided by two locations on the same ETL server, etl1. Two instances of gpfdist are running on this server, one on port 9080 and the other on port 9081.
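As a rough illustration of the kind of statement such a step issues, the following Python sketch assembles a CREATE EXTERNAL TABLE statement with two gpfdist locations. The table name, host, ports and file name come from the description above; the column list, the '|' delimiter and the helper name build_external_table_ddl are assumptions made for illustration only, since the real layout of customers-100.txt is not given here.

```python
def build_external_table_ddl(table, columns, locations, delimiter="|"):
    """Return a CREATE EXTERNAL TABLE statement that reads from gpfdist URLs."""
    column_sql = ",\n    ".join(f"{name} {sql_type}" for name, sql_type in columns)
    location_sql = ",\n    ".join(f"'{url}'" for url in locations)
    return (
        f"CREATE EXTERNAL TABLE {table} (\n    {column_sql}\n)\n"
        f"LOCATION (\n    {location_sql}\n)\n"
        f"FORMAT 'TEXT' (DELIMITER '{delimiter}');"
    )


# Hypothetical column layout (assumption): the real columns are not documented here.
columns = [("id", "integer"), ("name", "text")]

# One gpfdist URL per running instance on etl1, using the ports named above.
locations = [f"gpfdist://etl1:{port}/customers-100.txt" for port in (9080, 9081)]

ddl = build_external_table_ddl("external_samples_customer2", columns, locations)
print(ddl)
```

Listing both URLs in a single LOCATION clause is what lets Greenplum read from the two gpfdist instances in parallel.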

The customers-100.txt files served by the two gpfdist servers share an identical layout but not the same data: they are partitioned. The server at port 9081 provides customer data where the customer id is in the range 1 to 49. The 9082 server provides customers 50 through 100.
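The partitioning described above can be sketched in a few lines of Python. This is only an illustration of the id-range split, not the procedure used to prepare the actual files; the row shape (id, name), the generated sample rows and the helper name split_by_id are assumptions.

```python
def split_by_id(rows, boundary=50):
    """Split rows into ids below the boundary and ids at or above it."""
    low = [row for row in rows if row[0] < boundary]
    high = [row for row in rows if row[0] >= boundary]
    return low, high


# Hypothetical sample data: 100 customers with ids 1 through 100.
rows = [(i, f"customer-{i}") for i in range(1, 101)]

low, high = split_by_id(rows)
print(len(low), len(high))
```

Each partition would then be written to its own customers-100.txt in the directory served by one of the gpfdist instances, so both files keep the same layout while holding disjoint id ranges.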

...