Skip to content

Updated coredns troubleshooting steps #9727

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions docs/content/en/docs/troubleshooting/troubleshooting.md
Original file line number Diff line number Diff line change
Expand Up @@ -651,10 +651,14 @@ DNS resolution is managed by coreDNS, a DNS server deployed as a pod within the
[ERROR] plugin/errors: 2 example-service.default.svc.cluster.local. A: read udp 10.0.0.2:59857->10.96.0.10:53: i/o timeout
```
- Incorrect DNS server can cause issues in cluster lifecycle workflows like create and update.
- When coredns pods are deployed on Kubernetes cluster, kubelet agent running on worker node will cache the upstream DNS Server IPs (Eg: DNS Server deployed in vSphere) with the coredns pod. When these upstream DNS Server IPs are changed, they are not automatically reflected in coredns pod cache. Restart coredns pods on management cluster so that new DNS Server IPs can be update in the cache. And then, Restart eksa-controller pod on Management cluster so that it can successfully resolve DNS queries and proceed with curated packages installation on workload clusters.

#### DNS Debugging

##### Verify CoreDNS deployment

Note: For clusters with Management and Workload clusters, below checks should be performed on both Management and Workload Clusters.

1. Check if CoreDNS pods are running
```
kubectl get pods --namespace=kube-system -l k8s-app=kube-dns
Expand Down