title | author |
---|---|
Simple RKE2, Longhorn, and Rancher Install - AIR GAPPED |
Andy Clemenko, @clemenko, andy.clemenko@rancherfederal.com |
This guide is very similar to Simple RKE2, Longhorn, and Rancher Install , except in one major way. This guide will lay everything out for an air gapped install.
Throughout my career there has always been a disconnect between the documentation and the practical implementation. The Kubernetes (k8s) ecosystem is no stranger to this problem. This guide is a simple approach to installing Kubernetes and some REALLY useful tools. We will walk through installing all the following.
- RKE2 - Security focused Kubernetes
- Rancher - Multi-Cluster Kubernetes Management
- Longhorn - Unified storage layer
We will need a few tools for this guide. We will walk through how to install helm
and kubectl
.
Table of Contents:
Just a geek - Andy Clemenko - @clemenko - andy.clemenko@rancherfederal.com
The prerequisites are fairly simple. We need 4 Rocky Linux servers with one of the servers having access to the internet. To be fair we are going to use the internet to get the bits. They can be bare metal, or in the cloud provider of your choice. I prefer Digital Ocean. We need an ssh
client to connect to the servers. And finally DNS to make things simple. Ideally we need a URL for the Rancher interface. For the purpose of the this guide let's use rancher.dockr.life
. We will need to point that name to the first server of the cluster. While we are at it, a wildcard DNS for your domain will help as well.
Because we are moving bit across an air gap we need a server on the internet. Because I am using a cloud provider I am going to spin a 4th Rocky Linux server name rancher4
. Most of the challenge of air gaps is getting all the bits. Don't ask me how I know. Let's ssh into rancher4
to start the downloading process. Since we are connected to the internet we can install a few tools like Skopeo. Once we have all the tars, and images we will run a docker registry for installing Rancher and Longhorn. We are going to assume root access since this is a throw away server.
Skopeo is a great tool to inspect and interact with registries. We can use it to download the images in a clean manor.
dnf -y install skopeo
# create install directory
mkdir /root/rke2/
cd /root/rke2/
# download rke, rancher and longhorn
curl -#OL https://github.com/rancher/rke2/releases/download/v1.23.4%2Brke2r2/rke2-images.linux-amd64.tar.zst
curl -#OL https://github.com/rancher/rke2/releases/download/v1.23.4%2Brke2r2/rke2.linux-amd64.tar.gz
curl -#OL https://github.com/rancher/rke2/releases/download/v1.23.4%2Brke2r2/sha256sum-amd64.txt
# get the install script
curl -sfL https://get.rke2.io --output install.sh
# create helm dir
mkdir /root/helm/
cd /root/helm/
# get helm
curl -#L https://raw.githubusercontent.com/helm/helm/main/scripts/get-helm-3 | bash
# add repos
helm repo add jetstack https://charts.jetstack.io
helm repo add rancher-latest https://releases.rancher.com/server-charts/latest
helm repo add longhorn https://charts.longhorn.io
helm repo update
# get charts
helm pull jetstack/cert-manager
helm pull rancher-latest/rancher
helm pull longhorn/longhorn
# get the cert-manager crt
curl -#LO https://github.com/jetstack/cert-manager/releases/download/v1.7.2/cert-manager.crds.yaml
Please be patient. Downloading images will take some time!
# create image dir
mkdir -p /root/images/{cert,rancher,longhorn}
cd /root/images/
# rancher image list
curl -#L https://github.com/rancher/rancher/releases/download/v2.6.4/rancher-images.txt -o ./rancher/orig_rancher-images.txt
# shorten rancher list with a sort
sed -i -e '0,/busybox/s/busybox/library\/busybox/' -e 's/registry/library\/registry/g' rancher/orig_rancher-images.txt
for i in $(cat rancher/orig_rancher-images.txt|awk -F: '{print $1}'); do
grep -w $i rancher/orig_rancher-images.txt | sort -Vr| head -1 >> rancher/version_unsorted.txt
done
grep -x library/busybox rancher/orig_rancher-images.txt | sort -Vr| head -1 > rancher/rancher-images.txt
cat rancher/version_unsorted.txt | sort -u >> rancher/rancher-images.txt
# We need to add the cert-manager images
helm template /root/helm/cert-manager-*.tgz | awk '$1 ~ /image:/ {print $2}' | sed s/\"//g > ./cert/cert-manager-images.txt
# longhorn image list
curl -#L https://raw.githubusercontent.com/longhorn/longhorn/v1.2.4/deploy/longhorn-images.txt -o ./longhorn/longhorn-images.txt
# skopeo cert-manager
for i in $(cat cert/cert-manager-images.txt); do
skopeo copy docker://$i docker-archive:cert/$(echo $i| awk -F/ '{print $3}'|sed 's/:/_/g').tar:$(echo $i| awk -F/ '{print $3}')
done
# skopeo - longhorn
for i in $(cat longhorn/longhorn-images.txt); do
skopeo copy docker://$i docker-archive:longhorn/$(echo $i| awk -F/ '{print $2}'|sed 's/:/_/g').tar:$(echo $i| awk -F/ '{print $2}')
done
# skopeo - Rancher - This will take time getting all the images
for i in $(cat rancher/rancher-images.txt); do
skopeo copy docker://$i docker-archive:rancher/$(echo $i| awk -F/ '{print $2}'|sed 's/:/_/g').tar:$(echo $i| awk -F/ '{print $2}') &
done
# in order to server out the images we need a utility called nerctl
mkdir /root/nerdctl/
cd /root/nerdctl/
curl -#LO https://github.com/containerd/nerdctl/releases/download/v0.18.0/nerdctl-0.18.0-linux-amd64.tar.gz
# for later
chmod 755 /usr/local/bin/nerdctl
ln -s /run/k3s/containerd/containerd.sock /run/containerd/containerd.sock
ln -s /usr/local/bin/nerdctl /usr/local/bin/docker
For this guide we do not actually need to package and move all the bits. We are going to install from this location. If this was an actual air gapped situation we would need to sneaker net the tarball over.
# cd /root
cd /root
# compress all the things
tar -zvcf rke2_rancher_longhorn.tgz helm rke2 images nerdctl
----- STOP ------
Move the tar to Rancher1. We will use this server as the "registry" for the entire cluster.
# untar
cd /root/
tar -zxvf rke_rancher_longhorn.tgz
For the sake of this guide we are going to use Ubuntu. Our goal is a simple deployment. The recommended size of each node is 4 Cores and 8GB of memory with at least 80GB of storage. One of the nice things about Longhorn is that we do not need to attach additional storage. Here is an example list of servers. Please keep in mind that your server names can be anything. Just keep in mind which ones are the "server" and "agents".
name | ip | memory | core | disk | os |
---|---|---|---|---|---|
rancher1 | 142.93.189.52 | 8192 | 4 | 160 | Ubuntu 21.10 x64 |
rancher2 | 68.183.150.214 | 8192 | 4 | 160 | Ubuntu 21.10 x64 |
rancher3 | 167.71.188.101 | 8192 | 4 | 160 | Ubuntu 21.10 x64 |
For Kubernetes we will need to "set" one of the nodes as the control plane. Rancher1 looks like a winner for this. First we need to ssh
into all three nodes and make sure we have all the updates and add a few things. For the record I am not a fan of software firewalls. Please feel free to reach to me to discuss. :D
Rocky / Centos / RHEL:
# Rocky instructions
# stop the software firewall
systemctl stop firewalld
systemctl disable firewalld
# get updates, install nfs, and apply
yum install -y nfs-utils cryptsetup iscsi-initiator-utils container-selinux iptables libnetfilter_conntrack libnfnetlink libnftnl policycoreutils-python-utils
# enable iscsi for Longhorn
systemctl start iscsid.service
systemctl enable iscsid.service
# update all the things
yum update -y
# clean up
yum clean all
Cool, lets move on to getting the bits.
Now that we have all the nodes up to date, let's focus on rancher1
. While this might seem controversial, curl | bash
does work nicely. For the air gapped install we are going use the tarball we downloaded in the previous section. Here are the rke2 docs and air install for reference.
# On rancher1
curl -sfL https://get.rke2.io | INSTALL_RKE2_TYPE=server sh -
# start and enable for restarts -
systemctl enable rke2-server.service
systemctl start rke2-server.service
Here is what the Ubuntu version should look like:
Let's validate everything worked as expected. Run a systemctl status rke2-server
and make sure it is active
.
Perfect! Now we can start talking Kubernetes. We need to install the kubectl
cli on rancher1
. FYI, We can install a newer kubectl
without any issues.
# curl all the things - kubectl
curl -L# https://dl.k8s.io/release/v1.23.0/bin/linux/amd64/kubectl -o /usr/local/bin/kubectl
# make it executable
chmod 755 /usr/local/bin/kubectl
# add kubectl conf
export KUBECONFIG=/etc/rancher/rke2/rke2.yaml
# check node status
kubectl get node
Hopefully everything looks good! Here is an example.
For those that are not TOO familiar with k8s, the config file is what kubectl
uses to authenticate to the api service. If you want to use a workstation, jump box, or any other machine you will want to copy /etc/rancher/rke2/rke2.yaml
. You will want to modify the file to change the ip address. We will need one more file from rancher1
, aka the server, the agent join token. Copy /var/lib/rancher/rke2/server/node-token
, we will need it for the agent install.
Side note on Tokens. RKE2 uses the TOKEN has a way to authenticate the agent to the server service. This is a much better system than "trust on first use". The goal of the token process is to setup a control plane Mutual TLS (mtls) certificate termination.
The agent install is VERY similar to the server install. Except that we need an agent config file before starting. We will start with rancher2
. We need to install the agent and setup the configuration file.
# we add INSTALL_RKE2_TYPE=agent
curl -sfL https://get.rke2.io | INSTALL_RKE2_TYPE=agent sh -
# create config file
mkdir -p /etc/rancher/rke2/
# change the ip to reflect your rancher1 ip
echo "server: https://$RANCHER1_IP:9345" > /etc/rancher/rke2/config.yaml
# change the Token to the one from rancher1 /var/lib/rancher/rke2/server/node-token
echo "token: $TOKEN" >> /etc/rancher/rke2/config.yaml
# enable and start
systemctl enable rke2-agent.service
systemctl start rke2-agent.service
What should this look like:
Rinse and repeat. Run the same install commands on rancher3
. Next we can validate all the nodes are playing nice by running kubectl get node -o wide
on rancher1
.
Huzzah! RKE2 is fully installed. From here on out we will only need to talk to the kubernetes api. Meaning we will only need to remain ssh'ed into rancher1
.
For more information about the Rancher versions, please refer to the Support Matrix. We are going to use the latest version. For additional reading take a look at the Rancher docs.
For Rancher we will need Helm. We are going to live on the edge! Here are the install docs for reference.
# on the server rancher1
# add helm
curl -#L https://raw.githubusercontent.com/helm/helm/main/scripts/get-helm-3 | bash
# add needed helm charts
helm repo add rancher-latest https://releases.rancher.com/server-charts/latest
helm repo add jetstack https://charts.jetstack.io
Quick note about Rancher. Rancher needs jetstack/cert-manager to create the self signed TLS certificates. We need to install it with the Custom Resource Definition (CRD). Please pay attention to the helm
install for Rancher. The URL will need to be changed to fit your FQDN. Also notice I am setting the bootstrapPassword
and replicas. This allows us to skip a step later. :D
# still on rancher1
# add the cert-manager CRD
kubectl apply -f https://github.com/jetstack/cert-manager/releases/download/v1.6.1/cert-manager.crds.yaml
# helm install jetstack
helm upgrade -i cert-manager jetstack/cert-manager --namespace cert-manager --create-namespace
# helm install rancher
helm upgrade -i rancher rancher-latest/rancher --create-namespace --namespace cattle-system --set hostname=rancher.dockr.life --set bootstrapPassword=bootStrapAllTheThings --set replicas=1
Here is what it should look like.
root@rancher1:~# helm repo add rancher-latest https://releases.rancher.com/server-charts/latest
"rancher-latest" has been added to your repositories
root@rancher1:~# helm repo add jetstack https://charts.jetstack.io
"jetstack" has been added to your repositories
root@rancher1:~# kubectl apply -f https://github.com/jetstack/cert-manager/releases/download/v1.6.1/cert-manager.crds.yaml
customresourcedefinition.apiextensions.k8s.io/certificaterequests.cert-manager.io created
customresourcedefinition.apiextensions.k8s.io/certificates.cert-manager.io created
customresourcedefinition.apiextensions.k8s.io/challenges.acme.cert-manager.io created
customresourcedefinition.apiextensions.k8s.io/clusterissuers.cert-manager.io created
customresourcedefinition.apiextensions.k8s.io/issuers.cert-manager.io created
customresourcedefinition.apiextensions.k8s.io/orders.acme.cert-manager.io created
root@rancher1:~# helm upgrade -i cert-manager jetstack/cert-manager --namespace cert-manager --create-namespace
Release "cert-manager" does not exist. Installing it now.
NAME: cert-manager
LAST DEPLOYED: Mon Mar 21 14:14:47 2022
NAMESPACE: cert-manager
STATUS: deployed
REVISION: 1
TEST SUITE: None
NOTES:
cert-manager v1.7.1 has been deployed successfully!
In order to begin issuing certificates, you will need to set up a ClusterIssuer
or Issuer resource (for example, by creating a 'letsencrypt-staging' issuer).
More information on the different types of issuers and how to configure them
can be found in our documentation:
https://cert-manager.io/docs/configuration/
For information on how to configure cert-manager to automatically provision
Certificates for Ingress resources, take a look at the `ingress-shim`
documentation:
https://cert-manager.io/docs/usage/ingress/
root@rancher1:~# helm upgrade -i rancher rancher-latest/rancher --create-namespace --namespace cattle-system --set hostname=rancher.dockr.life --set bootstrapPassword=bootStrapAllTheThings --set replicas=1
Release "rancher" does not exist. Installing it now.
NAME: rancher
LAST DEPLOYED: Mon Mar 21 14:15:08 2022
NAMESPACE: cattle-system
STATUS: deployed
REVISION: 1
TEST SUITE: None
NOTES:
Rancher Server has been installed.
NOTE: Rancher may take several minutes to fully initialize. Please standby while Certificates are being issued, Containers are started and the Ingress rule comes up.
Check out our docs at https://rancher.com/docs/
If you provided your own bootstrap password during installation, browse to https://rancher.dockr.life to get started.
If this is the first time you installed Rancher, get started by running this command and clicking the URL it generates:
\```
echo https://rancher.dockr.life/dashboard/?setup=$(kubectl get secret --namespace cattle-system bootstrap-secret -o go-template='{{.data.bootstrapPassword|base64decode}}')
\```
To get just the bootstrap password on its own, run:
\```
kubectl get secret --namespace cattle-system bootstrap-secret -o go-template='{{.data.bootstrapPassword|base64decode}}{{ "\n" }}'
\```
Happy Containering!
root@rancher1:~#
We can also run a kubectl get pod -A
to see if everything it running. Keep in mind it may take a minute or so for all the pods to come up. GUI time...
Assuming DNS is pointing to first server, rancher1
for me, we should be able to get to the GUI. The good news is that be default rke2 installs with the nginx
ingress controller. Keep in mind that the browser may show an error for the self signed certificate. In the case of Chrome you can type "thisisunsafe" and it will let you proceed.
Once past that you should see the following screen asking about the password. Remember the helm install? bootStrapAllTheThings
is the password.
We need to validate the Server URL and accept the terms and conditions. And we are in!
Let's take a second and talk about Ranchers Multi-cluster design. Bottom line, Rancher can operate in a Spoke and Hub model. Meaning one k8s cluster for Rancher and then "downstream" clusters for all the workloads. Personally I prefer the decoupled model where there is only one cluster per Rancher install. This allows for continued manageability during networks outages. For the purpose of the is guide we are concentrate on the single cluster deployment. There is good documentation on "importing" downstream clusters.
There are two methods for installing. Rancher has Chart built in.
Now for the good news, Longhorn docs show two easy install methods. Helm and kubectl
. Let's stick with Helm for this guide.
# get charts
helm repo add longhorn https://charts.longhorn.io
# update
helm repo update
# install
helm upgrade -i longhorn longhorn/longhorn --namespace longhorn-system --create-namespace
Fairly easy right?
One of the benefits of Rancher is it's ability to adjust to whats installed. Meaning the Rancher GUI will see Longhorn is installed and provide a link. Click it.
This brings up the Longhorn GUI.
One of the other benefits of this integration is that rke2 also knows it is installed. Run kubectl get sc
to show the storage classes.
root@rancher1:~# kubectl get sc
NAME PROVISIONER RECLAIMPOLICY VOLUMEBINDINGMODE ALLOWVOLUMEEXPANSION AGE
longhorn (default) driver.longhorn.io Delete Immediate true 3m58s
Now we have a default storage class for the cluster. This allows for the automatic creation of Physical Volumes (PVs) based on a Physical Volume Claim (PVC). The best part is that "it just works" using the existing, unused storage, on the three nodes. Take a look around in the gui. Notice the Volumes on the Nodes. For fun, here is a demo flask app that uses a PVC for Redis. kubectl apply -f https://raw.githubusercontent.com/clemenko/k8s_yaml/master/flask_simple_nginx.yml
Yes we can automate all the things. Here is the repo I use automating the complete stack https://github.com/clemenko/rke2. This repo is for entertainment purposes only. There I use tools like pdsh to run parallel ssh into the nodes to complete a few tasks. Ansible would be a good choice for this. But I am old and like bash. Sorry the script is a beast. I need to clean it up.
There may be a container that can get all the bits for you. Just going to put this here : https://github.com/clemenko/rke_airgap_install/blob/3ccba9b7990e3cbdc3a2e1597a1e01892072e3b2/Dockerfile#L11.
As we can see, setting up RKE2, Rancher and Longhorn is not that complicated. We can get deploy Kubernetes, a storage layer, and a management gui in a few minutes. Simple, right? One of the added benefits of using the Suse / Rancher stack is that all the pieces are modular. Use only what you need, when you need it. Hope this was helpful. Please feel free reach out, or open any issues at https://github.com/clemenko/rke_install_blog.
thanks!