GitHub - jahangir842/kubernetes-notes: kubernetes-notes

Cloud Native Computing Foundation (CNCF)

Cloud Native Computing Foundation (CNCF): A major project under the Linux Foundation, CNCF accelerates the adoption of containers, microservices, and cloud-native apps.
Project Categories: Projects are categorized by maturity levels—Sandbox, Incubating, and Graduated. Over a dozen projects have achieved Graduated status, including Kubernetes, Helm, and Prometheus.
Popular Graduated Projects: Examples include Kubernetes, Argo, CoreDNS, Fluentd, Linkerd, and Envoy.
Key Incubating Projects: Notable incubating projects include Buildpacks.io, Knative, KubeVirt, and Contour.
Dynamic Sandbox Projects: New projects in areas like metrics, monitoring, and serverless are progressing toward higher maturity levels, while some, like rkt and Brigade, have been archived.
Full Lifecycle Support: CNCF projects cover the entire cloud-native app lifecycle, from container runtimes to monitoring and logging.

CNCF and Kubernetes

The Cloud Native Computing Foundation (CNCF) supports Kubernetes in several key ways:

Provides a neutral home for the Kubernetes trademark and ensures proper usage.
Conducts license scanning for both core and vendor code.
Offers legal guidance on patent and copyright matters.
Develops and maintains open-source learning materials, training, and certifications, such as KCNA, CKA, CKAD, and CKS.
Oversees a software conformance working group to ensure standards.
Actively promotes Kubernetes through marketing.
Supports ad hoc initiatives and events.
Sponsors Kubernetes-related conferences and meetups.

CNCF Landscape

Explore the Cloud Native Computing Foundation (CNCF) landscape:
https://landscape.cncf.io

Roots of Kubernetes

The evolution of Kubernetes started from Borg, Google's very own distributed workload manager.

Cloud Native Computing Foundation (CNCF) currently hosts the Kubernetes project, along with other popular cloud native projects, such as Argo, Cilium, Prometheus, Fluentd, etcd, CoreDNS, cri-o, containerd, Helm, Envoy, Istio, and Linkerd, just to name a few.

What Is Kubernetes?

According to the Kubernetes website,

Kubernetes is an open-source system for automating deployment, scaling, and management of containerized applications

Kubernetes comes from the Greek word κυβερνήτης, which means helmsman or ship pilot. With this analogy in mind, we can think of Kubernetes as the pilot on a ship of containers.

Kubernetes is also referred to as k8s (pronounced Kate's), as there are 8 characters between k and s.

Kubernetes is highly inspired by the Google Borg system, a container and workload orchestrator for its global operations, Google has been using for more than a decade. It is an open source project written in the Go language and licensed under the License, Version 2.0.

Kubernetes was started by Google and, with its v1.0 release in July 2015, Google donated it to the Cloud Native Computing Foundation (CNCF), one of the largest sub-foundations of the Linux Foundation.

New Kubernetes versions are released in 4 month cycles. The current stable version is 1.29 (as of December 2023). https://kubernetes.io/releases/

Kubernetes Features

Kubernetes provides a comprehensive set of features for container orchestration, including:

Automatic Bin Packing: Automatically schedules containers based on resource needs and constraints, maximizing resource utilization without compromising availability.
Extensibility: Allows extending a Kubernetes cluster with custom features without altering the core code.
Self-Healing: Detects and replaces failed containers, reschedules them from failed nodes, and restarts unresponsive containers according to health checks and policies. Prevents routing traffic to unhealthy containers.
Horizontal Scaling: Offers manual or automatic scaling of applications based on CPU usage or custom metrics.
Service Discovery & Load Balancing: Assigns IP addresses to containers and provides a DNS name for a set of containers to facilitate load balancing across them.

Additional Features

Automated Rollouts & Rollbacks: Seamlessly handles application updates and configuration changes, monitoring the application’s health to avoid downtime.
Secret & Configuration Management: Manages sensitive information (like credentials) separately from container images, ensuring secure handling without embedding secrets in code repositories.
Storage Orchestration: Automates the mounting of storage solutions, including local, cloud, distributed, and network storage, to containers.
Batch Execution: Supports batch processing, long-running jobs, and automatic replacement of failed containers.
IPv4/IPv6 Dual-Stack: Supports both IPv4 and IPv6 addressing for network communication.

Kubernetes also integrates common Platform as a Service (PaaS) features such as deployment, scaling, and load balancing, with flexible options for adding monitoring, logging, and alerting via plugins.

Additionally, many Kubernetes features evolve through alpha or beta phases, such as stable support for Role-Based Access Control (RBAC) since version 1.8 and cronjobs since version 1.21. These features bring even more value as they mature in stability.

Why Use Kubernetes?

Portability: Kubernetes can be deployed in various environments, including local or remote VMs, bare metal, and public/private/hybrid/multi-cloud setups.
Extensibility: Kubernetes supports integration with 3rd-party open-source tools and has a modular, pluggable architecture. It can orchestrate microservices-based applications and extend functionality through custom resources, operators, APIs, scheduling rules, or plugins.
Thriving Community: Kubernetes has a large and active community, with over 3,500 contributors and 120,000+ commits. It is supported by Special Interest Groups (SIGs) focusing on different topics like scaling, networking, and storage.

Container Orchestrators

Most container orchestrators can be deployed on the infrastructure of our choice - on bare metal, Virtual Machines, on-premises, on public and hybrid clouds. Kubernetes, for example, can be deployed on a workstation, with or without an isolation layer such as a local hypervisor or container runtime, inside a company's data center, in the cloud on AWS Elastic Compute Cloud (EC2) instances, Google Compute Engine (GCE) VMs, DigitalOcean Droplets, IBM Virtual Servers, OpenStack, etc.

In addition, there are turnkey cloud solutions which allow production Kubernetes clusters to be installed, with only a few commands, on top of cloud Infrastructures-as-a-Service. These solutions paved the way for the managed container orchestration as-a-Service, more specifically the managed Kubernetes as-a-Service (KaaS) solution, offered and hosted by the major cloud providers. Examples of KaaS solutions are Amazon Elastic Kubernetes Service (Amazon EKS), Azure Kubernetes Service (AKS), DigitalOcean Kubernetes, Google Kubernetes Engine (GKE), IBM Cloud Kubernetes Service, Oracle Container Engine for Kubernetes, or VMware Tanzu Kubernetes Grid.

Orchestration Tools:

Below is a list of popular container orchestration tools and services available today, though it’s not exhaustive:

Amazon Elastic Container Service (ECS)
- A hosted service from Amazon Web Services (AWS) that allows running containers at scale on AWS infrastructure.
Azure Container Instances (ACI)
- A basic container orchestration service provided by Microsoft Azure for simple container deployments.
Azure Service Fabric
- An open-source container orchestrator by Microsoft Azure, designed for distributed applications and microservices.
Kubernetes
- An open-source container orchestration platform originally developed by Google and now managed by CNCF.
Marathon
- A container orchestration framework built on Apache Mesos and DC/OS for running containers at scale.
Nomad
- A flexible container and workload orchestrator provided by HashiCorp, designed for multi-cloud environments.
Docker Swarm
- A native container orchestrator built into Docker Engine, enabling clustering and scaling of Docker containers.

Kubernetes Cloud Solutions

If you're considering Kubernetes cloud solutions, several managed services are available that take care of infrastructure management, scaling, and operations, allowing you to focus on deploying and managing your applications. Here's a breakdown of the major cloud-based Kubernetes solutions:

1. Amazon EKS (Elastic Kubernetes Service)

Provider: Amazon Web Services (AWS)
Description: A managed Kubernetes service that makes it easy to run Kubernetes on AWS without needing to manage the Kubernetes control plane. AWS handles the control plane, scaling, and availability.
Features:
- Fully managed control plane with automatic upgrades.
- Integrated with AWS services like IAM, VPC, and ALB.
- Supports Fargate (serverless) for running pods without managing EC2 instances.
Pros:
- Deep integration with AWS services.
- Flexible scaling with support for auto-scaling groups and spot instances.
Cons:
- Tightly coupled with AWS ecosystem, which may increase lock-in.
Use Case: Best for those already using AWS or looking for a tightly integrated solution with other AWS services.

Getting Started:

Create a Kubernetes cluster using the AWS CLI:
```
eksctl create cluster --name my-cluster
```

2. Google Kubernetes Engine (GKE)

Provider: Google Cloud Platform (GCP)
Description: GKE is a fully managed Kubernetes service on GCP, offering strong integration with Google’s cloud services, including networking, security, and AI/ML tools.
Features:
- Automated operations like upgrades, scaling, and backups.
- Integration with Google Cloud services (e.g., Cloud Run, Cloud Functions).
- Support for Anthos, a multi-cloud Kubernetes platform.
Pros:
- Leading-edge Kubernetes service with early access to new features.
- Excellent security and scalability features.
- Efficient cluster management tools like GKE Autopilot for a more hands-off experience.
Cons:
- Pricing can get complex depending on usage.
Use Case: Ideal for users leveraging Google Cloud services or looking for an advanced Kubernetes service with the latest features.

Getting Started:

Create a GKE cluster using the GCloud CLI:

gcloud container clusters create my-cluster

3. Azure Kubernetes Service (AKS)

Provider: Microsoft Azure
Description: AKS is Microsoft Azure’s managed Kubernetes service. It offers deep integration with Azure services, making it easier to deploy Kubernetes clusters and integrate with Azure’s cloud resources like storage, networking, and identity.
Features:
- Built-in monitoring and log collection with Azure Monitor.
- Integration with Azure Active Directory (AAD) for authentication.
- Support for both Linux and Windows containers.
Pros:
- Great for hybrid cloud setups with on-premise and cloud integration.
- Excellent integration with Azure DevOps and pipelines.
Cons:
- Azure pricing can be complex.
Use Case: Best for organizations already leveraging the Microsoft Azure ecosystem or looking to integrate with Azure’s DevOps tools and enterprise features.

Getting Started:

Create a Kubernetes cluster using Azure CLI:

az aks create --resource-group myResourceGroup --name myCluster

4. IBM Cloud Kubernetes Service

Provider: IBM Cloud
Description: A managed Kubernetes service by IBM, designed for hybrid cloud deployments and built with enterprise-grade security and scalability features.
Features:
- Full lifecycle management of Kubernetes clusters.
- Integration with IBM’s AI, blockchain, and analytics services.
- Compliance with enterprise security standards (e.g., GDPR, HIPAA).
Pros:
- Great for hybrid and multi-cloud environments.
- Strong enterprise security features and compliance support.
Cons:
- Fewer integrations compared to AWS or GCP.
Use Case: Ideal for enterprises focused on hybrid cloud environments and using IBM services like AI and analytics.

Getting Started:

Create a cluster using the IBM Cloud CLI:

ibmcloud ks cluster-create --name myCluster

5. Oracle Container Engine for Kubernetes (OKE)

Provider: Oracle Cloud Infrastructure (OCI)
Description: A fully managed Kubernetes service from Oracle, designed for running enterprise workloads on Oracle Cloud. OKE focuses on high performance and security, making it suitable for business-critical applications.
Features:
- Fully integrated with Oracle Cloud’s compute, networking, and storage.
- Built-in security features like Oracle Cloud Guard.
- Autoscaling, monitoring, and automated upgrades.
Pros:
- Strong for Oracle-based applications and databases.
- Good for performance and low-latency use cases.
Cons:
- Smaller ecosystem compared to AWS or GCP.
Use Case: Best for enterprises using Oracle databases or applications in production.

Getting Started:

Deploy a cluster using OCI CLI:
```
oci ce cluster create --name myCluster
```

6. DigitalOcean Kubernetes

Provider: DigitalOcean
Description: A simple, affordable managed Kubernetes service targeted toward developers and small businesses. It offers a straightforward experience for deploying Kubernetes with minimal setup complexity.
Features:
- Simple interface with one-click cluster creation.
- Easy integration with DigitalOcean services like databases, storage, and load balancers.
- Cost-effective for small and medium-sized businesses.
Pros:
- Easy to set up and manage, ideal for smaller teams and projects.
- Affordable pricing compared to other providers.
Cons:
- Not as feature-rich as larger cloud providers like AWS or GCP.
Use Case: Best for startups, small businesses, or developers who want a simple, cost-effective way to run Kubernetes in production.

Getting Started:

Deploy a cluster via the DigitalOcean CLI:

doctl kubernetes cluster create my-cluster

7. Linode Kubernetes Engine (LKE)

Provider: Linode (now part of Akamai)
Description: A managed Kubernetes service by Linode, designed for developers and small businesses. It provides a cost-effective, simple-to-use platform for Kubernetes.
Features:
- Fast setup and cluster deployment with Linode infrastructure.
- Simple and affordable pricing with no hidden fees.
- Integration with Linode services like Block Storage and NodeBalancers.
Pros:
- Budget-friendly.
- Easy to use for small-scale deployments.
Cons:
- Limited advanced features compared to AWS or GCP.
Use Case: Ideal for developers and small businesses looking for a straightforward Kubernetes deployment with transparent pricing.

Getting Started:

Create a cluster using the Linode CLI:

linode-cli lke cluster-create --name my-cluster

8. Alibaba Cloud Container Service for Kubernetes (ACK)

Provider: Alibaba Cloud
Description: A fully managed Kubernetes service that runs on Alibaba Cloud, offering comprehensive integration with Alibaba’s services and designed for running containerized workloads in China and globally.
Features:
- Support for enterprise-grade security and compliance.
- Deep integration with Alibaba Cloud services like Object Storage Service (OSS) and Elastic Compute Service (ECS).
- Flexible scaling and management of clusters.
Pros:
- Ideal for businesses operating in China or using Alibaba’s global infrastructure.
- Great for scaling workloads across multiple regions.
Cons:
- Less documentation and community support compared to AWS or GCP.
Use Case: Ideal for companies operating in China or leveraging Alibaba Cloud for international expansion.

Getting Started:

Create a Kubernetes cluster using Alibaba CLI:

aliyun cs CREATE-KUBERNETES-CLUSTER --name my-cluster

Summary of Cloud Kubernetes Options:

For AWS integration: Use Amazon EKS.
For Google Cloud services: Use Google GKE.
For Azure environments: Use Azure AKS.
For hybrid and enterprise environments: Consider IBM Cloud Kubernetes or Oracle OKE.
For simpler, cost-effective options: Use DigitalOcean Kubernetes or Linode Kubernetes.
For scaling in China: Consider Alibaba Cloud ACK.

Each of these managed services handles much of the operational overhead, letting you focus on deploying and scaling applications without having to manage the control plane or underlying infrastructure manually.

Kubernetes On-Prem Solutions:

If you want to run Kubernetes in production without relying on cloud providers, there are several on-premise and self-managed options that allow you to deploy and manage Kubernetes clusters. Below are some alternatives you can consider:

1. Kubeadm

Description: A tool that helps you bootstrap a secure Kubernetes cluster on bare-metal or virtual machines. It's part of Kubernetes and is widely used for setting up production clusters.
How it works: Kubeadm initializes the control plane and joins worker nodes to the cluster.
Pros:
- Direct control over configuration and infrastructure.
- Highly customizable.
- Lightweight and uses native Kubernetes components.
Cons:
- Requires you to manage infrastructure, networking, and storage manually.
- No built-in monitoring or dashboards (these need to be set up separately).
Use Case: Best for those who want to self-manage Kubernetes on their own hardware or VMs.

Getting Started:

Install Kubernetes using kubeadm on multiple nodes:
```
kubeadm init
kubeadm join <control-plane-node>
```

2. Rancher

Description: Rancher is a complete container management platform built on Kubernetes. It provides an intuitive UI for managing Kubernetes clusters and also simplifies multi-cluster deployments.
How it works: You can deploy Rancher on any bare-metal or virtualized infrastructure, and it helps you set up and manage Kubernetes clusters.
Pros:
- Offers a user-friendly UI and multi-cluster management.
- Integrates monitoring, logging, and storage solutions out of the box.
- Support for HA deployments.
Cons:
- Requires some learning to manage large, production-level clusters.
- Adds an extra layer of complexity on top of Kubernetes.
Use Case: Ideal for teams that want a Kubernetes management platform and need features like multi-cluster management and a visual UI.

Getting Started:

Install Rancher on a node:

docker run -d --restart=unless-stopped -p 80:80 -p 443:443 rancher/rancher

3. K3s (Lightweight Kubernetes)

Description: K3s is a lightweight Kubernetes distribution designed for IoT, edge, and resource-constrained environments. It’s a production-ready Kubernetes distribution from Rancher but uses less memory and has fewer dependencies.
How it works: K3s simplifies Kubernetes installation by packaging it into a single binary and includes built-in components like a local storage provider and simplified networking.
Pros:
- Lightweight and easy to set up.
- Ideal for resource-limited environments or small-scale deployments.
- Simplified version of Kubernetes, making it easier to manage.
Cons:
- Limited functionality for large-scale, complex deployments.
- Doesn’t have all the advanced features of full Kubernetes distributions.
Use Case: Great for small clusters, edge computing, or environments with fewer resources.

Getting Started:

Install K3s on your system:
```
curl -sfL https://get.k3s.io | sh -
```

4. MicroK8s

Description: A lightweight, single-package Kubernetes distribution maintained by Canonical (the makers of Ubuntu). It's ideal for both local development and small production environments.
How it works: MicroK8s installs Kubernetes with a minimal footprint and includes optional add-ons (e.g., Istio, Knative, and Prometheus).
Pros:
- Fast and simple to install.
- Minimal system requirements, ideal for development or small production clusters.
- Managed by Canonical, with official Ubuntu support.
Cons:
- Not suitable for very large production environments.
- Lacks some features present in larger distributions.
Use Case: Best for small-scale Kubernetes clusters or single-node setups that need to be easy to manage.

Getting Started:

Install MicroK8s:
```
sudo snap install microk8s --classic
```

5. OpenShift (OKD)

Description: OKD (OpenShift Kubernetes Distribution) is the open-source version of Red Hat’s OpenShift. It's a Kubernetes distribution with enterprise features such as integrated CI/CD pipelines, developer tools, and security enhancements.
How it works: OKD adds extra management tools and features on top of Kubernetes, providing a more enterprise-friendly solution.
Pros:
- Provides a rich set of built-in features (e.g., monitoring, CI/CD).
- Enhanced security features like multi-tenancy and advanced RBAC.
- Managed through a user-friendly web interface.
Cons:
- Complex to set up and maintain.
- Resource-heavy and overkill for smaller clusters.
Use Case: Suited for enterprise environments needing a robust Kubernetes distribution with built-in tools for development and security.

Getting Started:

Deploy OKD via the official OKD documentation.

6. Tanzu Kubernetes Grid (VMware)

Description: VMware’s Kubernetes distribution, integrated with vSphere, designed for on-premise and hybrid cloud Kubernetes environments.
How it works: Tanzu manages Kubernetes clusters and provides lifecycle management, monitoring, and scaling, all tightly integrated with VMware's vSphere.
Pros:
- Seamless integration with VMware environments.
- Built for enterprise use with advanced management tools.
- Lifecycle management and enterprise-grade support.
Cons:
- Primarily focused on VMware environments.
- Complex to set up for non-VMware users.
Use Case: Perfect for enterprises already using VMware that want to integrate Kubernetes into their infrastructure.

Getting Started:

More information can be found on VMware Tanzu.

7. Bare Metal Kubernetes

Description: Deploying Kubernetes directly on physical servers (bare metal) gives you full control over your infrastructure. You'll have to manage everything manually, from the underlying hardware to networking and storage.
How it works: You can use tools like Kubeadm, Kuberspray, or MetalLB to manage load balancing, networking, and other components.
Pros:
- Full control over hardware and configurations.
- No abstraction layers like VMs, resulting in better performance.
Cons:
- Requires in-depth knowledge of infrastructure, networking, and storage.
- Complex to manage, monitor, and maintain.
Use Case: Ideal for performance-critical applications where you want full control over the environment and hardware.

8. Kuberspray (Kubernetes the Hard Way)

Description: Kuberspray is an open-source project that uses Ansible to deploy production-ready Kubernetes clusters on any environment (bare metal, VMs, or cloud).
How it works: You use Ansible playbooks to automate the installation and configuration of Kubernetes, networking, and security.
Pros:
- Automates Kubernetes deployments across various platforms.
- Can be used for both small and large-scale production environments.
Cons:
- More complex than tools like K3s or MicroK8s.
- Requires knowledge of Ansible for configuration.
Use Case: Suitable for production environments where you need flexible, automated Kubernetes deployments.

Getting Started:

Follow Kuberspray installation guide from the official GitHub repository.

Conclusion:

If you prefer self-managed Kubernetes environments without cloud dependencies, the best choices include Kubeadm, Rancher, K3s, or MicroK8s. For large, enterprise-level setups, consider OpenShift (OKD) or Tanzu Kubernetes Grid. For complete control and performance, Bare Metal Kubernetes or Kuberspray are great options.

Each solution offers a different level of complexity and feature set depending on your production needs.

Control Plane Node

Function: The control plane node is the central management component of a Kubernetes cluster, overseeing its operational state.
Communication: Users interact with the control plane using:
- Command Line Interface (CLI)
- Web User Interface (Web UI)
- Application Programming Interface (API)
Importance: Maintaining a functional control plane is critical to avoid downtime and service disruption, which can lead to business losses.
High Availability (HA):
- Control plane replicas can be added for fault tolerance.
- Only one node actively manages the cluster, but all nodes remain in sync.
State Persistence: Cluster configuration data is stored in a distributed key-value store, which is separate from client workload data.
Key-Value Store Configurations:
- Stacked Topology: The key-value store is hosted on the control plane node, benefiting from HA of control plane replicas.
- External Topology: The key-value store is hosted separately, requiring its own HA setup, which increases operational costs.

A control plane node hosts several essential components and agents, including the

API server
scheduler
controller managers
key-value data store.

Additionally, it runs:

Container runtime
Node agent (kubelet)
Proxy (kube-proxy)
Optional observability add-ons, such as:
- Dashboard
- Cluster-level monitoring
- Logging tools

API Server:

The kube-apiserver is a central control plane component responsible for coordinating all administrative tasks within a Kubernetes cluster. Here’s a concise overview of its functions:

Request Handling: The API Server intercepts RESTful calls from users, administrators, developers, operators, and external agents, validating and processing these requests.
State Management: It reads the current state of the Kubernetes cluster from a key-value store. After executing a call, the resulting state is saved back to the key-value store for persistence.
Single Point of Interaction: The API Server is the only control plane component that interacts directly with the key-value store, serving as the intermediary for other control plane agents querying the cluster's state.
Configurability: It is highly configurable and can be customized to suit specific needs.
Scalability: The API Server can scale horizontally and supports the addition of custom secondary API Servers. In this setup, the primary API Server acts as a proxy, routing incoming RESTful calls to the appropriate secondary API Servers based on custom-defined rules.

Scheduler:

The kube-scheduler assigns new workloads, such as pods, to nodes based on the Kubernetes cluster's state and workload requirements. Key functions include:

Workload Assignment: Assigns pods to worker nodes by evaluating resource usage and workload constraints.
Decision Factors: Considers Quality of Service (QoS), data locality, affinity/anti-affinity rules, taints, tolerations, and cluster topology.
Scheduling Algorithm: Filters potential node candidates and scores them to determine the best fit for the workload.
Communication: Reports decisions back to the API Server for deployment coordination.
Configurability: Highly configurable through scheduling policies and supports custom schedulers.
Complexity: More complex in multi-node clusters; simpler in single-node setups for learning and development.

Controller Managers:

Controller managers are key components of the control plane that ensure the Kubernetes cluster maintains its desired state. Here’s how they function:

Watch-Loop Process: Continuously compares the desired state (from configuration data) with the current state (from the key-value store via the API Server).
Corrective Actions: Takes action to rectify any discrepancies between the desired and current states.
Kube-Controller-Manager:
- Manages node availability.
- Maintains expected pod counts.
- Creates endpoints, service accounts, and API access tokens.
Cloud-Controller-Manager:
- Interacts with cloud provider infrastructure.
- Manages storage volumes provided by cloud services.
- Oversees load balancing and routing.

Together, these managers ensure the Kubernetes cluster operates effectively and meets defined configurations.

Key-Value Data Store (etcd)

etcd is an open-source, distributed key-value data store under the Cloud Native Computing Foundation (CNCF), essential for persisting the state of a Kubernetes cluster. Here are the key points about etcd:

Data Management:
- Data is appended, never replaced, and obsolete data is periodically compacted to reduce size.
- Only the API Server communicates with the etcd data store.
Management Tool:
- The CLI tool etcdctl offers snapshot save and restore capabilities, useful for single-instance clusters in development.
High Availability (HA):
- In production environments, etcd should be replicated in HA mode for data resiliency.
- Supports both stacked (running on the same control plane node) and external topologies (isolated on a separate host).
Raft Consensus Algorithm:
- Ensures that a group of machines can function cohesively and tolerate node failures, including leader node failures.
- One node acts as the leader while others are followers; leader elections are handled gracefully.
Usage:
- Besides cluster state, etcd stores configuration details like subnets, ConfigMaps, and Secrets.

By utilizing etcd, Kubernetes ensures a reliable and consistent state management system for the cluster.

Worker Nodes:

Running Environment: Worker nodes provide the environment for running client applications, which are packaged as containers inside Pods.
Pods: The smallest scheduling unit in Kubernetes, encapsulating one or more containers. Pods are managed by control plane agents and scheduled on worker nodes.
Resource Management: Worker nodes offer compute, memory, storage, and networking resources for Pods to run and communicate.
Network Traffic: In multi-worker clusters, network traffic between users and containerized applications is handled directly by worker nodes, bypassing the control plane node.

Worker nodes are essential for executing and maintaining application workloads in a Kubernetes cluster.

Worker Node Components

A Kubernetes worker node hosts the components that allow it to run and manage Pods, which contain your application containers. These components includes:

Container runtime
Node agent (kubelet)
CRI shims
Network proxy (kube-proxy)
Add-ons to extend functionality

Here’s a detailed breakdown of each component:

1. Container Runtime

Kubernetes is a container orchestration engine but cannot manage or run containers directly. To handle container lifecycle management, each node in a Kubernetes cluster (both control plane and worker nodes) needs a container runtime. The container runtime is responsible for pulling container images, starting, stopping, and managing containers on the node. Kubernetes supports several container runtimes through the Container Runtime Interface (CRI).

Note The recommendation is to run the Kubernetes control plane components as containers, hence the necessity of a runtime on the control plane nodes. Kubernetes supports several container runtimes:

CRI-O: A lightweight container runtime specifically designed for Kubernetes. It supports image registries like quay.io and Docker Hub, providing a minimal and efficient solution for running Open Container Initiative (OCI)-compatible containers.
containerd: This is a simple, powerful, and portable container runtime. Originally a lower-level component of Docker, it has now become an independent runtime that directly integrates with Kubernetes via the CRI.
Docker Engine: Once the most popular runtime for Kubernetes, Docker uses containerd under the hood to manage containers. However, the direct integration of Docker with Kubernetes via the dockershim has been deprecated in Kubernetes v1.24.
Mirantis Container Runtime (MCR): Previously known as Docker Enterprise Edition, MCR is a commercial container platform that also integrates with Kubernetes via the new cri-dockerd adapter, ensuring Docker Engine compatibility even after dockershim's removal.

Each node in the Kubernetes cluster must have a container runtime installed, and for the control plane nodes, the control plane components themselves typically run as containers.

2. Node Agent - Kubelet

The kubelet is a critical agent that runs on every node in the Kubernetes cluster, including both control plane and worker nodes. Its primary responsibility is to communicate with the control plane components, particularly the API Server, and ensure that the containers specified in Pod definitions are running correctly on the node. The kubelet does this by:

Receiving Pod definitions: The kubelet receives Pod specifications from the API Server, which is part of the control plane, and ensures that the containers specified in those Pods are running as expected on the node.
Monitoring the health of containers: The kubelet continuously monitors the state of the containers running on its node. If a container crashes or becomes unhealthy, the kubelet can restart it based on the configuration.
Interfacing with the container runtime: The kubelet works with the container runtime to handle operations such as pulling container images, creating containers, and managing their lifecycle.

The kubelet does not manage containers directly; instead, it relies on the container runtime (such as containerd or CRI-O) through the Container Runtime Interface (CRI).

3. Kubelet - CRI Shims

The kubelet communicates with the container runtime through the Container Runtime Interface (CRI), which abstracts the differences between various container runtimes, allowing Kubernetes to work with any runtime that implements the CRI. The kubelet does this by using CRI shims, which are implementations or adapters that allow the kubelet to interface with specific container runtimes. Examples include:

cri-containerd: A CRI shim that integrates directly with containerd, allowing kubelet to manage containers using containerd without requiring direct interaction with the runtime.
CRI-O: This CRI shim enables kubelet to use any OCI-compliant runtime, such as runC. It’s particularly lightweight and optimized for Kubernetes use cases.
cri-dockerd: After the deprecation of dockershim in Kubernetes v1.24, the cri-dockerd shim was introduced by Docker Inc. and Mirantis to maintain support for the Docker Engine. This shim ensures that Docker can still be used as a container runtime in Kubernetes clusters by adhering to the CRI standard.

The introduction of CRI allowed Kubernetes to decouple the kubelet from specific runtimes like Docker, providing more flexibility and standardization.

4. Proxy - Kube-proxy

The kube-proxy is a network component that runs on every node, including both control plane and worker nodes. Its role is to manage network traffic to and from Pods, ensuring that services running in the cluster are accessible to internal and external clients. Kube-proxy handles:

Service discovery and load balancing: Kube-proxy routes traffic between services in the cluster by maintaining network rules and forwarding traffic to the appropriate Pods. It ensures that requests are forwarded to healthy Pods based on user-defined Services.
Network rule management: Kube-proxy dynamically updates the networking rules on the node to reflect the current state of the cluster. It can manage rules for forwarding TCP, UDP, and SCTP traffic.
Integration with iptables: On Linux nodes, kube-proxy works closely with iptables, a built-in firewall utility that manages packet filtering and NAT (Network Address Translation) rules. Kube-proxy uses iptables to implement forwarding and load-balancing rules across multiple Pods.

Kube-proxy abstracts the underlying network details, making it easier to manage communication between Pods and the external world.

5. Add-ons

Add-ons provide additional features and functionalities that are not part of the core Kubernetes platform but are essential for many production environments. These components are usually implemented through third-party plugins or services and include:

DNS: A core add-on that provides DNS records for Kubernetes resources such as services and Pods. This allows applications within the cluster to communicate with each other using friendly DNS names instead of IP addresses.
Dashboard: A web-based user interface that allows administrators to manage and monitor their Kubernetes clusters visually. The dashboard offers functionality like managing workloads, viewing logs, and monitoring cluster health.
Monitoring: Cluster-level monitoring solutions collect metrics from nodes, containers, and Pods, providing insights into performance, resource usage, and potential issues. Prometheus is a popular monitoring tool used in Kubernetes clusters.
Logging: A centralized logging solution aggregates logs from containers running across all nodes in the cluster. This is critical for troubleshooting, auditing, and monitoring application behavior. Popular tools include Elasticsearch and Fluentd.
Device Plugins: Device plugins allow nodes to advertise hardware resources (such as GPUs, FPGAs, or specialized NICs) to the cluster. Pods can request and utilize these resources for performance-intensive applications, such as machine learning and data processing.

These add-ons extend Kubernetes' capabilities and help meet the operational needs of complex and scalable applications.

These worker node components work together to ensure that Kubernetes can effectively schedule, run, and manage containerized workloads while providing the necessary networking, resource management, and observability for applications.

Networking Challenges in Kubernetes

In microservices-based applications, networking is critical to achieving the same level of interaction that was once inherent in monolithic architectures. Microservices rely heavily on efficient networking to maintain communication between loosely coupled components. Kubernetes, as a containerized microservices orchestrator, must address a series of unique networking challenges:

Container-to-Container communication within Pods
Pod-to-Pod communication on the same node and across different nodes in the cluster
Service-to-Pod communication within and across namespaces in the cluster
External-to-Service communication for client access to applications running in the cluster

Each of these networking challenges must be effectively managed by a Kubernetes cluster and its associated networking plugins.

Networking Challenges Breakdown

1. Container-to-Container Communication Inside Pods

Kubernetes utilizes the container runtime to create isolated network spaces, often referred to as network namespaces in Linux. These namespaces allow for isolation between containers, enabling them to have their own virtualized network interfaces, IP addresses, and routing tables.

In Kubernetes, containers within the same Pod must be able to communicate seamlessly. To achieve this, Kubernetes uses a special infrastructure container, often called the Pause container. The Pause container is initialized when a Pod starts, and it creates a network namespace that all containers in the Pod share. This ensures that containers within the same Pod can communicate using localhost as if they were running on the same physical machine, without needing explicit networking configurations.

This approach simplifies intra-Pod communication, as all containers share the same IP address and port space, making it easy for applications running in the same Pod to interact directly.

2. Pod-to-Pod Communication Across Nodes

In a Kubernetes cluster, Pods are distributed across multiple nodes in a dynamic and often unpredictable manner. However, Pods must still be able to communicate with each other, regardless of where they are scheduled. To ensure seamless communication, Kubernetes implements an IP-per-Pod networking model, where each Pod is assigned a unique IP address, just like a virtual machine in a traditional network.

This unique IP allows Pods to communicate directly with one another without the need for Network Address Translation (NAT). Kubernetes expects that every Pod can talk to any other Pod in the cluster, regardless of their physical location on different nodes. This model simplifies communication and ensures that developers don't have to deal with complex network routing.

However, while containers inside a Pod can communicate via localhost, Pods across nodes rely on the Container Network Interface (CNI), which uses various CNI plugins to assign IP addresses and configure networking. CNI abstracts away the complexities of container networking by enabling third-party Software Defined Networking (SDN) solutions such as Flannel, Weave, Calico, and Cilium to integrate with Kubernetes and provide advanced networking features, including Network Policies.

These SDN solutions not only provide the basic Pod-to-Pod communication but can also enforce fine-grained network security policies and other advanced functionalities within the cluster.

3. Service-to-Pod Communication

Kubernetes introduces the concept of Services to manage how Pods communicate both within the same namespace and across different namespaces in a cluster. A Service is a logical abstraction that defines a policy for accessing a group of Pods, typically through a stable virtual IP address and a set of rules that allow requests to be forwarded to the appropriate Pods.

The primary function of a Service is to provide load balancing and service discovery. When a client or another Pod makes a request to a Service, Kubernetes ensures that the request is routed to one of the Pods associated with the Service, even if the Pods themselves might be running on different nodes. This abstraction helps to decouple the client from the specifics of the individual Pods, providing resilience and scalability.

Within the same namespace or across namespaces, Kubernetes ensures that Pods can interact with Services via the internal DNS provided by the cluster. As Pods are ephemeral and may be recreated with different IP addresses, the Service's stable virtual IP makes it easier to route traffic consistently.

4. External-to-Service Communication

Once a containerized application is deployed inside a Kubernetes cluster, it often needs to be accessible by users or external systems outside of the cluster. Kubernetes handles external accessibility through Services, specifically through types like NodePort, LoadBalancer, or Ingress.

Kubernetes uses kube-proxy, a network proxy running on each node, to manage the routing of external traffic to the appropriate Services. kube-proxy creates and maintains a set of network rules, typically stored in iptables on the nodes, which handle the forwarding of traffic to the correct Pods.

When a Service is exposed to the external world, it is assigned a virtual IP address and a dedicated port number. kube-proxy uses this virtual IP and port to forward traffic from external clients to the Pods behind the Service, ensuring that applications are accessible from outside the cluster.

Additionally, Kubernetes supports advanced configurations like Ingress controllers, which provide fine-grained control over external traffic, including SSL termination and routing based on HTTP hostnames and paths.

Container Network Interface (CNI) Core Plugins

Kubernetes relies on the Container Network Interface (CNI) to manage the networking for Pods. CNI plugins are responsible for assigning IP addresses to Pods, setting up network routing, and enforcing any network policies. The container runtime offloads the responsibility of IP assignment to CNI, which uses a selected plugin (like Bridge or MACvlan) to provide an IP address and configure networking.

These plugins play a critical role in ensuring that Pods can communicate both within the node and across the cluster, adhering to Kubernetes' networking model. While Kubernetes provides some built-in networking options, many production environments prefer third-party CNI plugins due to their additional features, scalability, and support for custom network policies.

For more in-depth information, you can refer to the Kubernetes documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 297 Commits
articles		articles
docs		docs
k3s-cluster		k3s-cluster
kubeadm-cluster		kubeadm-cluster
kubernetes-hardway		kubernetes-hardway
minikube		minikube
mlflow		mlflow
monitoring		monitoring
nginx-demo		nginx-demo
README.md		README.md

jahangir842/kubernetes-notes

Folders and files

Latest commit

History

Repository files navigation

Cloud Native Computing Foundation (CNCF)

CNCF and Kubernetes

CNCF Landscape

Roots of Kubernetes

What Is Kubernetes?

Kubernetes Features

Additional Features

Why Use Kubernetes?

Container Orchestrators

Orchestration Tools:

Kubernetes Cloud Solutions

1. Amazon EKS (Elastic Kubernetes Service)

2. Google Kubernetes Engine (GKE)

3. Azure Kubernetes Service (AKS)

4. IBM Cloud Kubernetes Service

5. Oracle Container Engine for Kubernetes (OKE)

6. DigitalOcean Kubernetes

7. Linode Kubernetes Engine (LKE)

8. Alibaba Cloud Container Service for Kubernetes (ACK)

Summary of Cloud Kubernetes Options:

Kubernetes On-Prem Solutions:

1. Kubeadm

2. Rancher

3. K3s (Lightweight Kubernetes)

4. MicroK8s

5. OpenShift (OKD)

6. Tanzu Kubernetes Grid (VMware)

7. Bare Metal Kubernetes

8. Kuberspray (Kubernetes the Hard Way)

Conclusion:

Control Plane Node

API Server:

Scheduler:

Controller Managers:

Key-Value Data Store (etcd)

Worker Nodes:

Worker Node Components

1. Container Runtime

2. Node Agent - Kubelet

3. Kubelet - CRI Shims

4. Proxy - Kube-proxy

5. Add-ons

Networking Challenges in Kubernetes

Networking Challenges Breakdown

1. Container-to-Container Communication Inside Pods

2. Pod-to-Pod Communication Across Nodes

3. Service-to-Pod Communication

4. External-to-Service Communication

Container Network Interface (CNI) Core Plugins

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages