Greenplum Platform Extension Framework (PXF)
The Greenplum Platform Extension Framework (PXF) provides parallel, high-throughput data access and federated queries across heterogeneous data sources. Its built-in connectors map a Greenplum Database external table definition to an external data source. PXF has its roots in the Apache HAWQ project.
This topic describes the architecture of PXF and its integration with Greenplum Database.
This topic describes the PXF installation and configuration directories.
This topic details the PXF configuration, initialization, and startup procedures.
This topic describes the procedure that you must perform to upgrade PXF when you install a new version of Greenplum Database.
This topic describes important PXF procedures and concepts, including enabling PXF for use in a database and PXF protocol and external table definitions.
This topic describes how to use the PXF HDFS connector and related profiles to read Text and Avro format HDFS files.
This topic describes how to use the PXF HDFS connector and related profiles to write Text and SequenceFile format binary data to HDFS files.
This topic describes how to use the PXF Hive connector and related profiles to read Hive tables stored in TextFile, RCFile, Parquet, and ORC storage formats.
This topic describes how to use the PXF HBase connector to read HBase table data.
This topic describes how to use the PXF JDBC connector to read from and write to an external SQL database such as Postgres or MySQL.
This topic details the service- and database-level logging configuration procedures for PXF. It also identifies some common PXF errors and describes how to address PXF memory issues.
This topic provides the PXF utility reference.
The PXF SDK provides the Java classes and interfaces that you use to add support for external data stores, new data formats, and new data access APIs to Greenplum Database. This topic describes how to set up your PXF development environment, use the PXF API, and deploy your extension.