Accessing HDFS Data with gphdfs (Deprecated)

A newer version of this documentation is available. Click here to view the most up-to-date release of the Greenplum 5.x documentation.

Accessing HDFS Data with gphdfs (Deprecated)

Greenplum Database leverages the parallel architecture of a Hadoop Distributed File System to read and write data files efficiently using the gphdfs protocol.

Note: The gphdfs external table protocol is deprecated and will be removed in the next major release of Greenplum Database. Consider using the Greenplum Platform Extension Framework (PXF) pxf external table protocol to access data stored in a Hadoop file system.

There are three steps to using the gphdfs protocol with HDFS: