Bottlerocket node in EKS unable to join the cluster, userData not passed to kubelet that fails to load the cluster certificate #4585
Unanswered
lorenzophys
asked this question in
Q&A
Replies: 1 comment
-
This question has also been opened as a bug: #4586 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello Bottlerocket community. In my team we are exploring the Bottlerocket AMIs for our EKS cluster, so we got an AMI from our platform team (we are not using a public one, but the custom ami is just a copy-paste of this variant:
aws-k8s-1.29
, which has id:ami-04040aebb8892ed24
).Maybe a bug report would be more suitable? #4586
Problem
The Bottlerocket (custom) AMI is unable to join the cluster: the
userData
passed via Karpenter'sEC2NodeClass
seems to be ignored and as a consequence the kubelet is unable to start:In particular:
Which is telling me that some configuration is applied correctly, but not the one passed via
userData
Which indicates that the cluster certificate is not recognized. In fact the
/etc/kubernetes/pki/ca.crt
is empty:The kubernetes settings I get form querying the apiclient:
If I try to
curl
the cluster endpoint I get403
, which is expected since there's no certificate, but this also means that the endpoint is reachable:Setup
We use Karpenter as autoscaler and we configured the
EC2NodeClass
this way:aws eks describe-cluster --name mlops-sandbox --query 'cluster' --output text --region eu-central-1
instanceProfile: BottlerocketKarpenterAndS3
, contains these policies and, except for the S3 one, they're the same as all the other normally functioning nodes:Version of the main components
1.29
EKSv1.0.6
v1.19.2
Notes
Everything else in the cluster is working normally, including Karpenter and all the non-Bottlerocket nodes. I have a AWS support ticket open, so if someone from AWS is reading this and wants to take a look at the full logs and have additional info, please reach out.
I will be grateful for any help, suggestion or pointers on how to debug these type of issues.
Beta Was this translation helpful? Give feedback.
All reactions