Ansible is a common utility for deploying and configuring infrastructure and cluster resources. This Ansible collection serves that same purpose for pgEdge: it provides a series of Ansible roles which, when executed together, build a full pgEdge cluster.
Each role is described here, along with any variables that modify its behavior or configuration.
These roles are intended to eventually be available on Ansible Galaxy. Until then, install them this way:
```
git clone git@github.com:pgEdge/pgedge-ansible.git
cd pgedge-ansible
make install
```
It will then be possible to use these roles as the user who installed them. Simply include two lines like this in any playbook:
```yaml
collections:
  - pgedge.platform
```
For example:
```yaml
- hosts: all
  collections:
    - pgedge.platform
  roles:
    - init_server
```
All of the roles are meant to operate in conjunction. Each is deliberately kept simple, which reduces complexity while leaving room for more elaborate cluster deployments. The roles collectively recognize the following configuration parameters:
Parameter | Default | Description |
---|---|---|
`cluster_name` | `demo` | Canonical name for the cluster, primarily used for descriptive items and generated values. |
`repo_name` | `download` | Can be one of `download`, `upstream`, or `devel`. This controls which pgEdge repository is used for software installation. |
`repo_prefix` | None | If set, makes it possible to install specific custom or automated builds based on repository prefix. Consult a member of pgEdge staff for valid settings here. |
`zone` | `1` | Zone or region for a node. This helps organize HA clusters. It also doubles as the snowflake ID of a node. For non-HA clusters, just use one node per zone. |
`pg_version` | `16` | Postgres version to install. This is left at 16 to facilitate upgrade tests. |
`spock_version` | `4.0.9` | Version of the Spock extension to install. |
`db_names` | `demo` | List of database names to use for the Spock cluster. At least one database name is expected to initialize the cluster. Any missing databases will be created and owned by `db_user`. |
`db_user` | `admin` | Database username. Must be something other than the OS username performing the installation, because the CLI creates a database user named after the OS user for its own purposes as part of the installation and setup process. |
`db_password` | `secret` | Password for the `db_user` user. |
`is_ha_cluster` | `false` | If true, install etcd and Patroni on all nodes in the `pgedge` group. If HAProxy nodes exist, they will reflect nodes in the same zone. Subscriptions from other pgEdge nodes will also pass through the zone HAProxy. |
`proxy_node` | None | If specified in an HA cluster, will be used as the pgEdge Spock subscription target instead of the inventory node itself. Otherwise, subscriptions will target the first available HAProxy node in the same zone as the inventory node, or the inventory node if no other options are available. |
`replication_user` | `replicator` | User dedicated specifically to Patroni replication purposes. |
`replication_password` | `secret` | Password for the `replication_user` user. |
`synchronous_mode` | `false` | Enable to allow Patroni to manage the `synchronous_commit` and `synchronous_standby_names` parameters based on HA cluster state. |
`synchronous_mode_strict` | `false` | When synchronous mode is enabled, Patroni will normally disable synchronous replication if no synchronous replicas are available. Enable this parameter to always enforce synchronous commit. |
`debug_pgedge` | `true` | When enabled, various kernel settings will be configured to retain all core files produced during a process crash. |
`manage_host_file` | `true` | When enabled, all hosts in the cluster will be listed in the `/etc/hosts` file of every other host. Set to false if external DNS is in use, or inventory hostnames are IP addresses. |
`haproxy_extra_routes` | See description | Additional HAProxy routes recognized by the Patroni REST interface. Each key should match the type of check being performed, with sub-keys of `port` for the HAProxy listening port and `lag` for the maximum allowed lag for viable routing. The `port` field must be supplied. Default: `{ replica: { port: 5433 } }` |
`backup_host` | `''` | Hostname where pgBackRest backups should be transmitted. If left empty and `backup_repo_type` is `ssh`, defaults to the first node in the `backup` node group in the same zone. |
`backup_repo_type` | `ssh` | The type of pgBackRest backup repository to use. Using `ssh` means backups are stored on a dedicated backup server as defined in the `backup` host group. Using `s3` will use a remote AWS S3 bucket and requires `backup_repo_params` to be set. Other options will be added as requested. |
`backup_repo_path` | `$cluster_path/data/backrest` | Full path to the storage location of the pgBackRest repository. If using S3 as a repo type, set this to something simple like `/backrest`. |
`backup_repo_cipher_type` | `aes-256-cbc` | Encryption algorithm to use for backup files stored in the pgBackRest repository. |
`backup_repo_cipher` | (random) | Encryption passphrase used for backup files stored in the pgBackRest repository. This should be specified explicitly; if not defined, it will be set to a deterministic 20-character string derived from the backup repository name. |
`full_backup_count` | `1` | How many full backups to retain in the backup repository. |
`diff_backup_count` | `6` | How many differential backups to retain in the backup repository. |
`full_backup_schedule` | `10 0 * * 0` | Cron-style schedule for automating full pgBackRest backup operations. The default runs every Sunday at 10 minutes after midnight. |
`diff_backup_schedule` | `10 0 * * 1-6` | Cron-style schedule for automating differential pgBackRest backup operations. The default runs every Monday through Saturday at 10 minutes after midnight. |
`backup_repo_params` | See description | Parameters for remote backup repositories (S3 for now). Should be a dictionary with the following fields (defaults listed): `{ region: us-east-1, endpoint: s3.amazonaws.com, bucket: pgbackrest, access_key: '', secret_key: '' }`. |
`exception_behaviour` | `transdiscard` | Defines what Spock should do when it encounters an exception. The default of `transdiscard` skips the offending transaction. See the Spock documentation for other values. |
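These parameters are typically supplied as inventory or group variables. The following `group_vars` sketch is illustrative only; every key is one of the documented parameters above, but the values (and the Ansible Vault references) are hypothetical:

```yaml
# group_vars/all.yml -- hypothetical example; each key shown here is one of
# the documented parameters, set to illustrative values.
cluster_name: acme
pg_version: 16
spock_version: 4.0.9

# Databases and credentials
db_names:
  - inventory
  - billing
db_user: admin
db_password: "{{ vault_db_password }}"   # assumes an Ansible Vault variable

# HA behavior
is_ha_cluster: true
synchronous_mode: true

# Backups: S3 repository instead of the default ssh type
backup_repo_type: s3
backup_repo_path: /backrest
backup_repo_params:
  region: us-east-1
  endpoint: s3.amazonaws.com
  bucket: acme-pgbackrest
  access_key: "{{ vault_s3_access_key }}"
  secret_key: "{{ vault_s3_secret_key }}"
```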
Modifying other parameters will have no effect on the cluster.
The roles described here often make use of shortcut variables internally. If you intend to modify these roles, these variables may prove useful. To improve configurability, some of them may eventually become role defaults rather than internal variables.
Notable items are listed here:
Variable | Value | Description |
---|---|---|
`repo_url` | `https://pgedge-$repo.s3.amazonaws.com/REPO` | Based on the sanitized value of the `repo_name` role parameter. |
`cluster_path` | `$HOME/pgedge` | In most cases, this is `/home/pgedge/pgedge`. This is the default location where the CLI will install itself. |
`pg_path` | `$cluster_path/pg${version}` | For a Postgres 16 install, this will likely be `/home/pgedge/pgedge/pg16`. |
`pg_data` | `$cluster_path/data/pg${version}` | For a Postgres 16 install, this will likely be `/home/pgedge/pgedge/data/pg16`. |
`nodes_in_zone` | Node list | A list of all nodes in the `pgedge` group which are in the same zone as this node. Used in several roles for service configuration. |
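As a worked example, assuming the default `repo_name` of `download`, the default `pg_version` of 16, and a `pgedge` OS user, these variables would resolve to roughly the following values:

```yaml
# Hypothetical resolved values under default settings:
repo_url: https://pgedge-download.s3.amazonaws.com/REPO
cluster_path: /home/pgedge/pgedge
pg_path: /home/pgedge/pgedge/pg16
pg_data: /home/pgedge/pgedge/data/pg16
```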
The full list of roles is as follows, in the expected order of execution:
- `init_server` - Prepares each server in the cluster to operate in the stack. It also retrieves the public SSH key for each server into a `host-keys` directory on the Ansible controller for potential use by other roles. This should be executed on every available server.
- `install_pgedge` - Only installs the `pgedge` CLI software, and does nothing else.
- `setup_postgres` - Uses the CLI software's `setup` command to create a Postgres instance on each node. This will also install the snowflake and spock extensions. In HA clusters, multiple nodes can be assigned to each zone. Of these, all but the first will have Postgres stopped and the data directory wiped, as a preparation step for Patroni.
- `install_etcd` - Only used for HA clusters. Uses the CLI to download and install the version of etcd packaged by pgEdge. The service is not yet configured or started.
- `install_patroni` - Only used for HA clusters. Uses the CLI to download and install the version of Patroni packaged by pgEdge. The service is not yet configured or started.
- `setup_etcd` - Only used for HA clusters. Fully configures etcd on each node. Nodes in the `pgedge` group in the same zone are configured to be part of the same etcd quorum. In zones with fewer than 3 nodes, etcd will still function, but will require all nodes to be operational to maintain quorum.
- `setup_patroni` - Only used for HA clusters. Fully configures Patroni on each node. Nodes in the `pgedge` group in the same zone are configured to be part of the same Patroni cluster. The first node in the list is used to bootstrap Patroni itself, and Patroni will rebuild all other Postgres instances in the zone from this node.
- `setup_haproxy` - Only used for HA clusters. This should be executed before `setup_pgedge`, as HA clusters are intended to communicate through the proxy layer so subscriptions survive failover events.
- `setup_pgedge` - Establishes a pgEdge node for all nodes in the `pgedge` group, and subscribes each node to every other node. It will additionally set the snowflake node ID to be the same as the zone, so use the zone as a logical node identifier. In HA clusters, node creation only takes place once per zone, and subscriptions target either the first HAProxy node in the same zone as the remote pgEdge node, or the first pgEdge node in that zone if no HAProxy node exists. This allows for "hybrid" clusters for simplified testing, where a single pgEdge node interacts with a Patroni-managed sub-cluster.
- `install_backrest` - Installs the pgBackRest backup software on nodes where the role is defined.
- `setup_backrest` - Configures pgBackRest on each configured node with the parameter-defined repository target for long-term backup and WAL storage. Additionally:
  - Configures Postgres to transmit archived WAL files to the pgBackRest repository, and to retrieve WAL files from this location during recovery.
  - Initializes a backup repository for each defined zone.
  - Invokes a single backup to bootstrap the repository for each defined zone. As a result, this role is best used on new clusters or those without large amounts of data.
  - Defines full and differential backup automation.
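Tying the order above together, a complete playbook might look like the following sketch. This is a hedged illustration, not one of the shipped sample playbooks: the `pgedge`, `haproxy`, and `backup` group names are assumptions to be matched against your own inventory, and the HA-only roles can simply be omitted for non-HA clusters:

```yaml
# site.yml -- hypothetical playbook running the roles in the documented
# order. Group names are assumptions; match them to your own inventory.
- hosts: all
  collections:
    - pgedge.platform
  roles:
    - init_server

- hosts: pgedge
  collections:
    - pgedge.platform
  roles:
    - install_pgedge
    - setup_postgres
    - install_etcd     # HA clusters only
    - install_patroni  # HA clusters only
    - setup_etcd       # HA clusters only
    - setup_patroni    # HA clusters only

- hosts: haproxy
  collections:
    - pgedge.platform
  roles:
    - setup_haproxy    # HA clusters only; must precede setup_pgedge

- hosts: pgedge
  collections:
    - pgedge.platform
  roles:
    - setup_pgedge

- hosts: "backup:pgedge"   # backup server plus the nodes it protects
  collections:
    - pgedge.platform
  roles:
    - install_backrest
    - setup_backrest
```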
Files in the `sample-playbooks` directory provide sample inventories and playbooks illustrating how these roles should be used:
- `simple-cluster` - Produces a standard three-node pgEdge cluster.
- `ultra-ha` - Produces an Ultra-HA cluster with two zones, three pgEdge nodes in each zone, and one HAProxy node per zone. This is a total of eight nodes.
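For reference, a minimal non-HA inventory along the lines of `simple-cluster` might look like the sketch below; the hostnames are hypothetical, and the files in `sample-playbooks` remain the authoritative reference:

```yaml
# inventory.yml -- hypothetical three-node, non-HA inventory. One node per
# zone, as recommended for non-HA clusters; the zone doubles as the
# snowflake node ID.
all:
  children:
    pgedge:
      hosts:
        pg-node-1:
          zone: 1
        pg-node-2:
          zone: 2
        pg-node-3:
          zone: 3
  vars:
    cluster_name: demo
    pg_version: 16
```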
These roles have been verified as compatible with the following Linux platforms:
- Debian 12 / Bookworm
- Rocky 9
They may also work with other Debian or RHEL variants as-is, but this has not been validated.
These roles are very early in their life and will likely undergo heavy revision as they mature. They are not fully re-entrant; if they encounter an unexpected error, re-running the playbook after the failure may result in further problems. This situation will improve with time as development continues.