Greenplum Platform Extension Framework (PXF)

The Greenplum Platform Extension Framework (PXF) provides parallel, high throughput data access and federated queries across heterogeneous data sources via built-in connectors that map a Greenplum Database external table definition to an external data source. PXF has its roots from Apache HAWQ project.

  • PXF Architecture

    This topic describes the architecture of PXF and its integration with Greenplum Database.

  • About the PXF Installation

    The PXF installation directories are identified in this topic.

  • Configuring PXF

    This topic details the PXF configuration, initialization, and startup procedures.

  • Upgrading PXF

    This topic describes the procedure that you must perform to upgrade PXF when you install a new version of Greenplum Database.

  • Using PXF to Read and Write External Data

    This topic describes important PXF procedures and concepts, including enabling PXF for use in a database and PXF protocol and external table definitions.

  • Reading Data from HDFS

    This topic describes how to use the PXF HDFS connector and related profiles to read Text and Avro format HDFS files.

  • Writing Data to HDFS

    This topic describes how to use the PXF HDFS connector and related profiles to write Text and SequenceFile format binary data to HDFS files.

  • Accessing Hive Table Data

    This topic describes how to use the PXF Hive connector and related profiles to read Hive tables stored in TextFile, RCFile, Parquet, and ORC storage formats.

  • Accessing HBase Table Data

    This topic describes how to use the PXF HBase connector to read HBase table data.

  • Accessing an External SQL Database

    This topic describes how to use the PXF JDBC connector to read from and write to an external SQL database such as Postgres or MySQL.

  • Troubleshooting PXF

    This topic details the service- and database- level logging configuration procedures for PXF. It also identifies some common PXF errors and describes how to address PXF memory issues.

  • Using the PXF Java SDK

    The PXF SDK provides the Java classes and interfaces that you use to add support for external data stores and new data formats and data access APIs to Greenplum Database. This topic describes how to set up your PXF development environment, use the PXF API, and deploy your extension.