Accessing HDFS Data with gphdfs

A newer version of this documentation is available. Click here to view the most up-to-date release of the Greenplum 5.x documentation.

Accessing HDFS Data with gphdfs

Greenplum Database leverages the parallel architecture of a Hadoop Distributed File System to read and write data files efficiently using the gphdfs protocol.

There are three steps to using the gphdfs protocol with HDFS: