Interactive Kubernetes Guide

The Kubernetes Landscape

In 2025, Kubernetes is the de facto operating system for the cloud. This dashboard provides a high-level overview of the most impactful trends, feature updates, and ecosystem tools shaping the future of cloud-native development. Explore the sections to dive deeper into core concepts, commands, and best practices.

Key Industry Trends

🗃️

Rise of Stateful Containers

Running databases and other stateful workloads directly in Kubernetes is now a mainstream practice, simplifying management but requiring robust disaster recovery plans.
엣

Kubernetes at the Edge

The platform is expanding beyond data centers to manage fleets of devices in remote locations like retail stores and banks for real-time local data processing.
🤝

Convergence of Containers & VMs

A growing need to manage both containerized and traditional VM workloads from a single, unified platform to reduce operational overhead.

Ecosystem Tooling Dominance

The Kubernetes ecosystem continues to be dominated by powerful open-source tools. This chart shows the leading technologies across key categories based on industry adoption rates.

What's New: Key Feature Updates

Feature	Status	Impact
StatefulSet PVC Auto-Deletion	GA in v1.32	Automatically cleans up Persistent Volume Claims, preventing orphaned storage resources.
AppArmor Support	GA in v1.31	Enhances cluster security with fine-grained control over container permissions.
NFTables Backend for kube-proxy	Beta in v1.31	Improves network performance and scalability, especially for large-scale clusters.

Core Architecture & Objects

Understanding the fundamental architecture is key to mastering Kubernetes. A cluster is divided into a Control Plane (the brain) and a Data Plane (the workers). Interact with the diagram below to learn about each component and its role in maintaining your application's desired state.

Select a Component

Click on a component in the diagram to see its description here.

Kubernetes Architecture

Control Plane

API Server

etcd

Scheduler

Controller Manager

↕️

Data Plane (Worker Nodes)

Kubelet

Kube-proxy

Container Runtime

kubectl Cheatsheet

A comprehensive, searchable reference for the most common `kubectl` commands. Use the search box to quickly filter commands by name or description.

Command	Description

Deployment Strategies

Choosing the right deployment strategy is crucial for application availability and reliability. Kubernetes supports several methods, each with distinct trade-offs. Select a strategy below to see how it works and when to use it.

Rolling Update (Default)

Incrementally replaces old Pods with new ones, ensuring zero downtime. It's the default and most common strategy, ideal for stateless applications that can run multiple versions simultaneously.

🔵🔵🔵

Version 1

→

🔵🟢🔵

Updating...

→

🟢🟢🟢

Version 2

Pros:

Zero downtime
Gradual rollout reduces risk
Simple to configure

Cons:

Slow rollout for large clusters
Complex to manage stateful apps
Rollbacks can be slow

Security Best Practices: Zero Trust

A security-first mindset is paramount. The most effective posture is a **Zero Trust architecture**: "never trust, always verify." This assumes no entity, inside or outside the network, is trusted by default. Below are the key pillars for implementing this model in Kubernetes.

Role-Based Access Control (RBAC)

Implement granular permissions to ensure users and services are granted only the minimum access necessary to perform their tasks. This limits the potential for an attacker to move laterally within the cluster.

Network Policies

Use network policies as a firewall to restrict communication between Pods. By default, all Pods can communicate. Policies provide microsegmentation, drastically reducing the attack surface.

Secure Secrets Management

By default, Kubernetes Secrets are only base64 encoded, not encrypted. It is a critical best practice to configure encryption-at-rest to protect sensitive data like passwords and API keys from unauthorized access.

Helm & Kustomize Cheatsheet

Searchable reference for common Helm and Kustomize commands and patterns.

Command	Description

Advanced Operations

Command-specific techniques for advanced Kubernetes operations including custom resources, admission control, certificate management, and cluster maintenance.

Custom Resource Management

kubectl get crd — List all Custom Resource Definitions.
kubectl describe crd — Inspect CRD schema and status.
kubectl get — List custom resource instances.
kubectl explain --recursive — Show custom resource API documentation.

Certificate & Security Management

kubectl get certificatesigningrequests — List pending certificate requests.
kubectl certificate approve — Approve certificate signing requests.
kubectl get secrets --field-selector type=kubernetes.io/tls — Find TLS secrets.
kubectl label namespace pod-security.kubernetes.io/enforce=restricted — Apply Pod Security Standards.

Resource Management & Quotas

kubectl describe resourcequota -n — Check quota usage.
kubectl get priorityclass — List Pod priority classes.
kubectl get horizontalpodautoscaler — Monitor HPA status.
kubectl get limitrange -n — Check namespace resource limits.

Advanced Networking & Monitoring

kubectl describe networkpolicy — Inspect network segmentation rules.
kubectl get endpointslices -l kubernetes.io/service-name= — Debug service endpoints.
kubectl get --raw /metrics — Access raw cluster metrics.
kubectl top nodes --sort-by=cpu — Monitor resource usage trends.

Backup & Plugin Management

kubectl get all,pv,pvc,secrets,configmaps -o yaml > backup.yaml — Export cluster state.
kubectl plugin list — List installed kubectl plugins.
kubectl krew search — Find plugins with Krew.
kubectl tree — Show resource ownership hierarchy.

Validation & Debugging

kubectl apply --dry-run=server -f — Server-side validation.
kubectl get events --field-selector involvedObject.kind=Pod — Filter events by resource type.
kubectl get events --field-selector reason=Failed — Show only failure events.
kubectl get lease -n kube-system — Check leader election status.

Troubleshooting Guide

When things go wrong, a systematic approach is key. This guide covers common Pod lifecycle issues and their solutions. The most valuable command is `kubectl describe pod `, which reveals events and status details.

Common Pod Status Issues

CrashLoopBackOff ▼

Meaning:

The container is repeatedly starting, crashing, and restarting. Kubernetes is trying to run it, but it fails immediately.

Common Solutions:

Check logs for application errors: `kubectl logs `
Verify configuration in ConfigMaps or Secrets.
Ensure the application isn't running out of memory (check for `OOMKilled` in `kubectl describe pod`).
Check liveness/readiness probes for misconfiguration.

ImagePullBackOff ▼

Meaning:

The cluster was unable to pull the container image from the specified registry.

Common Solutions:

Verify the image name and tag are correct in your manifest.
Check for typos in the registry URL.
Ensure the cluster has network connectivity to the registry.
If it's a private registry, confirm that an `imagePullSecret` is correctly configured and valid.

Exit Code 137 (OOMKilled) ▼

Meaning:

The container was forcefully terminated (`SIGKILL`) because it exceeded its memory limit.

Common Solutions:

Run `kubectl describe pod ` and look for the `Reason: OOMKilled` message.
Increase the memory `limit` and `request` for the container in your deployment manifest.
Analyze your application for memory leaks.

Pending ▼

Meaning:

The Pod has been accepted by the cluster, but one or more of the containers has not been started. Often due to unschedulable conditions.

Common Solutions:

Check scheduling events: `kubectl describe pod | grep -A3 -i events`.
Verify resource requests/limits are not too high for any node.
Inspect node taints and Pod tolerations.
Confirm required PersistentVolumes can bind to PVCs.

ContainerCreating ▼

Meaning:

Kubelet is pulling the image and setting up volumes and networks. Long waits can indicate CNI or image issues.

Common Solutions:

Check image pull progress/events and registry connectivity.
Verify CNI plugin Pods are healthy in `kube-system` namespace.
Ensure volume mounts exist and permissions are correct.

ImagePullBackOff (Private Registry) ▼

Meaning:

Authentication to the private registry failed.

Common Solutions:

Create a pull secret: `kubectl create secret docker-registry regcred --docker-server= --docker-username= --docker-password= --docker-email=`.
Reference it in your Pod/Deployment: `imagePullSecrets: [ { name: regcred } ]`.
Ensure the secret exists in the same namespace as the workload.

Init:CrashLoopBackOff ▼

Meaning:

One of the init containers is failing repeatedly, preventing main containers from starting.

Common Solutions:

Inspect init container logs: `kubectl logs -c `.
Validate that dependent services/volumes the init step needs are reachable/mounted.
Check init container resource limits; they can also be OOMKilled.

NodeNotReady ▼

Meaning:

A worker node has failed its readiness checks. Pods already scheduled there may stay running, but no new Pods will land until the node recovers.

Common Solutions:

Review node conditions: `kubectl describe node | grep -A4 Conditions`.
Check kubelet and systemd service status on the node; restart kubelet if required.
Validate CNI/CSI Pods on that node by inspecting `kubectl get pods -n kube-system -o wide`.
If the node is lost, `kubectl cordon` and `kubectl drain` it, then remove with `kubectl delete node`.

RunContainerError ▼

Meaning:

The container failed to start after being created. Startup commands, permissions, or image entrypoints are common culprits.

Common Solutions:

Inspect events with `kubectl describe pod ` to see detailed error messages.
Review container logs (if any were written) using `kubectl logs --previous`.
Ensure the `command`/`args` in the manifest match the container image expectations.
Use `kubectl debug --image=busybox` for an ephemeral shell to inspect mounted volumes and permissions.

CrashLoopBackOff with Custom Resources ▼

Meaning:

Pods that depend on custom resources (like cert-manager, operators) are failing due to missing or misconfigured CRDs.

Common Solutions:

Check if CRDs are properly installed: `kubectl get crd | grep `.
Verify custom resource status: `kubectl describe `.
Check operator logs: `kubectl logs -n -l app=`.
Ensure proper RBAC permissions for service accounts: `kubectl auth can-i '*' '*' --as system:serviceaccount::`.

PersistentVolumeClaim Pending ▼

Meaning:

A PVC cannot bind to an available PersistentVolume, often due to storage class issues or resource constraints.

Common Solutions:

Check available PVs: `kubectl get pv` and verify if any match the PVC requirements.
Verify storage class exists: `kubectl get storageclass` and check if it's set as default.
Review PVC events: `kubectl describe pvc ` for detailed binding failures.
Check if dynamic provisioner is running: `kubectl get pods -n kube-system | grep `.

Service Not Accessible ▼

Meaning:

A Service exists but traffic cannot reach the underlying Pods, indicating selector or networking issues.

Common Solutions:

Verify service endpoints: `kubectl get endpoints ` - should list Pod IPs.
Check label selectors match: `kubectl get pods --show-labels` and compare with service selector.
Test from within cluster: `kubectl run tmp-shell --rm -it --image=busybox -- nslookup `.
Check NetworkPolicies: `kubectl get networkpolicy` and verify they allow required traffic.

Certificate/TLS Issues ▼

Meaning:

TLS handshake failures, expired certificates, or missing certificate secrets causing connection issues.

Common Solutions:

Check certificate secrets: `kubectl get secrets --field-selector type=kubernetes.io/tls`.
Verify certificate validity: `kubectl get secret -o jsonpath='{.data.tls\.crt}' | base64 -d | openssl x509 -text`.
Review certificate signing requests: `kubectl get certificatesigningrequests`.
Check cert-manager logs if using it: `kubectl logs -n cert-manager -l app=cert-manager`.

Dashboard & Trends

The Kubernetes Landscape

Key Industry Trends

Rise of Stateful Containers

Kubernetes at the Edge

Convergence of Containers & VMs

Ecosystem Tooling Dominance

What's New: Key Feature Updates

Core Architecture & Objects

Select a Component

Kubernetes Architecture

Control Plane

Data Plane (Worker Nodes)

kubectl Cheatsheet

Deployment Strategies

Rolling Update (Default)

Pros:

Cons:

Blue/Green Deployment

Pros:

Cons:

Canary Deployment

Pros:

Cons:

Security Best Practices: Zero Trust

Role-Based Access Control (RBAC)

Network Policies

Secure Secrets Management

Helm & Kustomize Cheatsheet

Advanced Operations

Custom Resource Management

Certificate & Security Management

Resource Management & Quotas

Advanced Networking & Monitoring

Backup & Plugin Management

Validation & Debugging

Troubleshooting Guide

Common Pod Status Issues