Releases: aws/aws-parallelcluster
AWS ParallelCluster v3.1.4
We're excited to announce the release of AWS ParallelCluster 3.1.4
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Add validation for
DirectoryService/PasswordSecretArn
to fail in case the secret does not exist.
CHANGES
- Upgrade Slurm to version 21.08.8-2.
- Build Slurm with JWT support.
- Do not require
PlacementGroup/Enabled
to be set totrue
when passing an existingPlacementGroup/Id
. - Add
lambda:TagsResource
toParallelClusterUserRole
used by ParallelCluster API stack for cluster creation and image creation.
BUG FIXES
- Fix the ability to export cluster's logs when using
export-cluster-logs
command with the--filters
option. - Fix AWS Batch Docker entrypoint to use
/home
shared directory to coordinate Multi-node-Parallel job execution.
AWS ParallelCluster v2.11.6
We're excited to announce the release of AWS ParallelCluster 2.11.6
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.6
ENHANCEMENTS
- Improve exception management in case of missing networking.
CHANGES
- OS package updates and security fixes.
AWS ParallelCluster v3.1.3
We're excited to announce the release of AWS ParallelCluster 3.1.3
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Execute SSH key creation alongside with the creation of HOME directory, i.e.
during SSH login, when switching to another user and when executing a command as another user. - Add support for both FQDN and LDAP Distinguished Names in the configuration parameter
DirectoryService/DomainName
. The new validator now checks both the syntaxes. - New
update_directory_service_password.sh
script deployed on the head node supports the manual update of the Active Directory password in the SSSD configuration.
The password is retrieved by the AWS Secrets Manager as from the cluster configuration. - Add support to deploy API infrastructure in environments without a default VPC.
- Add validation for
DirectoryService/AdditionalSssdConfigs
to fail in case of invalid overrides.
CHANGES
- Disable deeper C-States in x86_64 official AMIs and AMIs created through
build-image
command, to guarantee high performance and low latency. - OS package updates and security fixes.
- Change Amazon Linux 2 base images to use AMIs with Kernel 5.10.
BUG FIXES
- Fix build-image stack in
DELETE_FAILED
after image built successful, due to new EC2ImageBuilder policies. - Fix the configuration parameter
DirectoryService/DomainAddr
conversion toldap_uri
SSSD property when it contains multiples domain addresses.
AWS ParallelCluster v2.11.5
We're excited to announce the release of AWS ParallelCluster 2.11.5
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.5
ENHANCEMENTS
- Add support for
NEW_CHANGED_DELETED
as value of FSx for LustreAutoImportPolicy
option.
CHANGES
- Drop support for SGE and Torque schedulers.
- Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid incurring in potential performance degradation.
- Upgrade Intel MPI Library to 2021.4.0.441.
- Upgrade NVIDIA driver to version 470.103.01.
- Upgrade CUDA library to version 11.4.4.
- Upgrade NVIDIA Fabric manager to version 470.103.01.
- Extend head node creation timeout to 1h.
BUG FIXES
- Fix DCV connection through browsers.
- Fix YAML quoting to prevent custom Tags being parsed as numbers.
AWS ParallelCluster v3.1.2
We're excited to announce the release of AWS ParallelCluster 3.1.2
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
CHANGES
- Upgrade Slurm to version 21.08.6.
BUG FIXES
- Fix the update of
/etc/hosts
file on computes nodes when a cluster is deployed in subnets without internet access. - Fix compute nodes bootstrap by waiting for ephemeral drives initialization before joining the cluster.
AWS ParallelCluster v3.1.1
We're excited to announce the release of AWS ParallelCluster 3.1.1
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Add support for multiple users cluster environments by integrating with Active Directory (AD) domains managed via AWS Directory Service.
- Enable cluster creation in subnets with no internet access.
- Add abbreviated flags for
cluster-name
(-n),region
(-r),image-id
(-i) andcluster-configuration
/image-configuration
(-c) to the cli. - Add support for multiple compute resources with same instance type per queue.
- Add support for
UseEc2Hostnames
in the cluster configuration file. When set totrue
, use EC2 default hostnames (e.g. ip-1-2-3-4) for compute nodes. - Add support for GPU scheduling with Slurm on ARM instances with NVIDIA cards. Install NVIDIA drivers and CUDA library for ARM.
- Add
parallelcluster:compute-resource-name
tag to LaunchTemplates used by compute nodes. - Add support for
NEW_CHANGED_DELETED
as value of FSx for LustreAutoImportPolicy
option. - Explicitly set cloud-init datasource to be EC2. This save boot time for Ubuntu and CentOS platforms.
- Improve Security Groups created within the cluster to allow inbound connections from custom security groups when
SecurityGroups
parameter is specified for head node and/or queues. - Build Slurm with
slurmrestd
support.
CHANGES
- Upgrade Slurm to version 21.08.5.
- Upgrade NICE DCV to version 2021.3-11591.
- Upgrade NVIDIA driver to version 470.103.01.
- Upgrade CUDA library to version 11.4.4.
- Upgrade NVIDIA Fabric manager to version 470.103.01.
- Upgrade Intel MPI Library to 2021.4.0.441.
- Upgrade PMIx to version 3.2.3.
- Disable package update at instance launch time on Amazon Linux 2.
- Enable possibility to suppress
SlurmQueues
andComputeResources
length validators. - Use compute resource name rather than instance type in compute fleet Launch Template name.
- Disable EC2 ImageBuilder enhanced image metadata when building ParallelCluster custom images.
- Remove dumping of failed compute nodes to
/home/logs/compute
. Compute nodes log files are available in CloudWatch
and in EC2 console logs.
BUG FIXES
- Redirect stderr and stdout to CLI log file to prevent unwanted text to pollute the
pcluster
CLI output. - Fix exporting of cluster logs when there is no prefix specified, previously exported to a
None
prefix. - Fix rollback not being performed in case of cluster update failure.
- Do not configure GPUs in Slurm when NVIDIA driver is not installed.
- Fix
ecs:ListContainerInstances
permission inBatchUserRole
. - Fix
RootVolume
schema for theHeadNode
by raising an error if unsupportedKmsKeyId
is specified. - Fix
EfaSecurityGroupValidator
. Previously, it may produce false failures when custom security groups were provided and EFA was enabled. - Fix FSx metrics not displayed in Cloudwatch Dashboard.
AWS ParallelCluster v3.0.3
We're excited to announce the release of AWS ParallelCluster 3.0.3
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
CHANGES
- Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid incurring in potential performance degradation.
AWS ParallelCluster v2.11.4
We're excited to announce the release of AWS ParallelCluster 2.11.4
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.4
CHANGES
- CentOS 8 is no longer supported (EOL on December 31st, 2021).
- Upgrade Slurm to version 20.11.8.
- Upgrade Cinc Client to version 17.2.29.
- Upgrade NICE DCV to version 2021.2-11190.
- Upgrade NVIDIA driver to version 470.82.01.
- Upgrade CUDA library to version 11.4.3.
- Upgrade NVIDIA Fabric manager to 470.82.01.
- Disable packages update at instance launch time on Amazon Linux 2.
- Disable unattended packages update on Ubuntu.
- Install Python 3 version of
aws-cfn-bootstrap
scripts on CentOS 7 and Ubuntu 18.04, aligning with Ubuntu 20.04 and Amazon Linux 2.
BUG FIXES
- Disable update of
ec2_iam_role
parameter. - Fix
CpuOptions
configuration in LaunchTemplate for t2 instances.
AWS ParallelCluster v3.0.2
We're excited to announce the release of AWS ParallelCluster 3.0.2
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
3.0.2
CHANGES
- Upgrade EFA installer to version 1.14.1. Thereafter, EFA enables GDR support by default on supported instance type(s).
ParallelCluster does not reinstall EFA during node start. Previously, EFA was reinstalled ifGdrSupport
had been
turned on in the configuration file. TheGdrSupport
parameter has no effect and should no longer be used.- EFA configuration:
efa-config-1.9-1
- EFA profile:
efa-profile-1.5-1
- EFA kernel module:
efa-1.14.2
- RDMA core:
rdma-core-37.0
- Libfabric:
libfabric-1.13.2
- Open MPI:
openmpi40-aws-4.1.1-2
- EFA configuration:
BUG FIXES
- Fix issue that is preventing cluster names to start with
parallelcluster-
prefix.
AWS ParallelCluster v2.11.3
We're excited to announce the release of AWS ParallelCluster 2.11.3
Upgrade
How to upgrade?
sudo pip3 install "aws-parallelcluster<3.0" --upgrade --user
2.11.3
CHANGES
- Upgrade EFA installer to version 1.14.1. Thereafter, EFA enables GDR support by default on supported instance type(s). ParallelCluster does not reinstall EFA during node start. Previously, EFA was reinstalled if
enable_efa_gdr
had been
turned on in the configuration file.- EFA configuration:
efa-config-1.9-1
- EFA profile:
efa-profile-1.5-1
- EFA kernel module:
efa-1.14.2
- RDMA core:
rdma-core-37.0
- Libfabric:
libfabric-1.13.2
- Open MPI:
openmpi40-aws-4.1.1-2
- EFA configuration:
- Include tags from cluster configuration file in the RunInstances dry runs performed during configuration validation.
BUG FIXES
- Fix the create custom AMI functionality issues:
- SGE download URL no more reachable. Use Debian repository to download SGE source archive.
- Outdated CA certificates used by Cinc. Update ca-certificates package during AMI build time.
- Fix cluster update when using proxy setup.