Installing Greenplum Loader
Installing Greenplum Loader
This section contains information for installing the Greenplum data loading programs on your Windows machine and for enabling Greenplum Database to accept remote client connections:
- Installing Python
- Running the Loader Installer
- Configuring Greenplum Loader
- Enabling Greenplum Database for Remote Client Connections
See the Greenplum Database Release Notes for the list of currently supported platforms for the Load Tools.
Installing Python
The Greenplum loader program (gpload.py) for Windows requires Python 2.5 (32-bit version) to also be installed on your machine. If you do not have an installation of Python, you can get one from www.python.org.
To install Python
- Download the Python 2.5 installer for Windows from https://www.python.org/downloads/.
- Locate, then double-click on the python Load Tools for Windows-2.5.x.msi package to launch the installer.
- Select Install for all users and click Next.
- By default, Python will be installed into C:\Pythonxx. Click Up or New if you want to choose another location. Click Next.
- Click Next to install the selected Python components.
- Click Finish to complete your installation.
Running the Loader Installer
The Greenplum loader installer installs the following loader programs:
- gpload.py (loader program)
- gpfdist.exe (parallel file distribution program used by gpload.py)
To install Greenplum loader
- Download the greenplum-clients-4.3.x.x-WinXP-x86_32.msi package from Pivotal Network.
- Double-click on the greenplum-clients-4.3.x.x-WinXP-x86_32.msi package to launch the installer.
- Click Next on the Welcome screen.
- Click I Agree on the License Agreement screen.
- By default, Greenplum loader will be installed into greenplum-db-4.3.x.x. Click Browse to choose another location.
- Click Next.
- Click Install to begin the installation.
- Click Finish to exit the installer.
About Your Installation
Your Greenplum loader installation contains the following files and directories:
- bin — loader command-line utilities (gpload.py and gpfdist.exe)
- bin/lib — additional Python libraries needed by gpload.py
- greenplum_loaders_path.bat — sets the required environment variables
Configuring Greenplum Loader
Greenplum provides a batch program (greenplum_loaders_path.bat) to set the required environment settings for Greenplum loader (located in greenplum-db-4.3.x.x by default).
To set the required environment settings
- Open a Windows command prompt (Start > Run and type cmd).
- At the command prompt, go to the directory where you installed Greenplum
loader. For
example:
cd \"Program Files"\Greenplum\greenplum-loaders-4.3.x.x dir
- Execute the greenplum_loaders_path.bat
program:
greenplum_loaders_path.bat
- Verify that you can execute the gpload.py
program:
gpload.py -?
The loader command-line tools also require several connection parameters in order to be able to connect to a Greenplum database. In order to save some typing on the command-line, you can optionally create the following environment variables in your Windows Control Panel.
- PGDATABASE — The name of the default Greenplum database to connect to.
- PGHOST — The Greenplum Database master host name or IP address.
- PGPORT — The port number that the Greenplum master instance (postmaster process) is running on.
- PGUSER — The default database role name to use for login.
To add a new user environment variable on Windows XP
- In Windows Explorer, go to C:\Control Panel.
- Double-click the System icon.
- On the Advanced tab, click Environment Variables (bottom).
- Click New.
- Define the new environment variable. For example:
- Click OK.
Enabling Greenplum Database for Remote Client Connections
In order for Greenplum Database to be able to accept remote client connections, you must configure your Greenplum Database master so that connections are allowed from the client hosts and database users that will be connecting to Greenplum Database.
To enable remote client connections
- Make sure that the pg_hba.conf file of the Greenplum
Database master is correctly configured to allow connections from the users to the
database(s) using the authentication method you want. For details, see "Editing the
pg_hba.conf File" in the Greenplum Database Administration Guide, and also see
the Greenplum Database Security Configuration Guide.
Make sure the authentication method you choose is supported by the client tool you are using.
- If you edited pg_hba.conf file, the change requires a server reload (using the gpstop -u command) to take effect.
- Make sure that the databases and roles you are using to connect exist in the system and that the roles have the correct priviliges to the database objects.