I have worked as a Site Reliability Engineer (SRE) at Charles Schwab since 2024. Here, I am taking on the #100dayschallenge
to document important SRE topics and resources.
I have planned the content for the next 100 days, and I will be publishing blog posts under the hashtag #100daysofSRE. ✌️
- #100daysofSRE (Day 01): Introduction to Site Reliability Engineering
- #100daysofSRE (Day 02): History of SRE and its Evolution
- #100daysofSRE (Day 03): SLAs, SLOs, and SLIs — understanding the metrics of reliability
- #100daysofSRE (Day 04): Chaos Engineering and SRE - Techniques and Tools to Break Things on Purpose
- #100daysofSRE (Day 05): Automation Benefits, Techniques, and Tools in SRE
- #100daysofSRE (Day 06): Incident Management and Response for Site Reliability Engineers
- #100daysofSRE (Day 07): Effective Communication during Incidents for Better Incident Response
- #100daysofSRE (Day 08): Root Cause Analysis and Post-Incident Reviews for SRE
- #100daysofSRE (Day 09): Monitoring and Observability in SRE
- #100daysofSRE (Day 10): Grafana vs Splunk for Monitoring Systems and Applications
- #100daysofSRE (Day 11): Logging and Log Analysis in Site Reliability Engineering: Techniques, Tools, and Best Practices
- #100daysofSRE (Day 12): Alerting and Notification Strategies and Best Practices in SRE
- #100daysofSRE (Day 13): Capacity Planning and Management in Site Reliability Engineering
- #100daysofSRE (Day 14): Load Testing and Stress Testing in Site Reliability Engineering
- #100daysofSRE (Day 15): Disaster Recovery Planning and Testing in SRE
- #100daysofSRE (Day 16): High Availability and Redundancy Strategies for Data
- #100daysofSRE (Day 17): Techniques, Tools, and Best Practices for Performance Optimization and Tuning in Site Reliability Engineering
- #100daysofSRE (Day 18): 25 Intermediate-level Linux Commands useful for SysAdmin, DevOps, and SRE
- #100daysofSRE (Day 19): Simplifying Log Analysis with Linux sed Command: Basics and Templates
- #100daysofSRE (Day 20): Simplifying Log Analysis with Linux awk Command: Basics and Templates
- #100daysofSRE (Day 21): How to use Supervisor to manage a script on Linux
- #100daysofSRE (Day 22): Essential /var/log Files for SREs and How to Analyze Them
- #100daysofSRE (Day 23): Modernize and Containerize your Applications or Microservices using Docker
- #100daysofSRE (Day 24): Writing a Dockerfile – Best Practices & Enhancements
- #100daysofSRE (Day 25): Writing a Production-Grade Dockerfile for Legacy Applications
- #100daysofSRE (Day 26): Docker Compose - Simplifying Multi-Container Deployments
- #100daysofSRE (Day 27): Building a Hacking Lab with Docker Compose
- #100daysofSRE (Day 28): Deploying an AI Chatbot with Docker Compose
- #100daysofSRE (Day 29): Kubernetes over Docker Compose – Why It’s Better for Production
- #100daysofSRE (Day 30): Learn Kubernetes Commands and Operations using Minikube
- #100daysofSRE (Day 31): How to Write Kubernetes Manifest Files: Kubernetes vs Docker Compose
- #100daysofSRE (Day 32): Advanced Kubernetes: Ingress, ConfigMaps, Secrets & Helm
- #100daysofSRE (Day 33): Monitoring Kubernetes Apps with Prometheus & Grafana
- #100daysofSRE (Day 34): Automating Kubernetes Deployments with ArgoCD & GitOps
- #100daysofSRE (Day 35): Kubernetes CI/CD Pipeline with GitHub Actions & ArgoCD
- #100daysofSRE (Day 36): Kubernetes Helm Charts – Package & Deploy Applications