DevOps and Cloud
From servers and Linux to fully automated cloud delivery and observability. Duration: twelve to sixteen weeks. Target outcome: ship and operate production services with infrastructure as code, CI and CD, containers, orchestration, and monitoring.
Overview
DevOps is the practice of building, shipping, and running software reliably and repeatably. This track assumes you can write code in one language and use a terminal. You move from operating system fundamentals through automation, containers, orchestration, cloud, and finally reliability engineering. Build something real at every stage so the tools stick.
Month 1: Foundations and automation
Week 1: Linux and networking
- bash: the default shell for automation
- systemd: service management
- curl and dig: inspect HTTP and DNS
- The Linux Command Line by William Shotts: free book
- roadmap.sh DevOps: structured reference
- A bash script that backs up a directory, rotates old backups, and runs from cron with logging
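The rotation logic in this project is easy to get subtly wrong, so here is a minimal sketch of it in Python, not the bash deliverable itself. The function names, the backup-YYYYMMDD-HHMMSS.tar.gz naming scheme, and the default of seven retained archives are all assumptions made for illustration.

```python
import logging
import shutil
from datetime import datetime
from pathlib import Path
from typing import List, Optional

log = logging.getLogger("backup")


def backups_to_delete(names: List[str], keep: int) -> List[str]:
    """Return the archive names that fall outside the newest `keep` entries.

    Assumes every name embeds a sortable timestamp, so lexical order
    matches chronological order.
    """
    ordered = sorted(names, reverse=True)  # newest first
    return ordered[keep:]


def run_backup(source: Path, dest: Path, keep: int = 7,
               now: Optional[datetime] = None) -> Path:
    """Archive `source` into `dest`, then remove archives beyond `keep`."""
    if keep < 1:
        raise ValueError("keep must be at least 1")
    dest.mkdir(parents=True, exist_ok=True)
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    # make_archive appends the .tar.gz suffix and returns the full path.
    archive = Path(shutil.make_archive(str(dest / f"backup-{stamp}"),
                                       "gztar", root_dir=str(source)))
    log.info("created %s", archive.name)

    existing = [p.name for p in dest.glob("backup-*.tar.gz")]
    for name in backups_to_delete(existing, keep):
        (dest / name).unlink()
        log.info("rotated out %s", name)
    return archive
```

Keeping the "which archives go" decision in a pure function makes it testable without touching the filesystem. The same split works in bash: one function that lists the archives to delete, and one that deletes them.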
Week 2: Git, GitHub, and trunk-based flow
- git: version control
- pre-commit: local quality gates
- A repository with branch protection, a pull request template, and a pre-commit hook that lints
Week 3: Cloud fundamentals
- One primary cloud (AWS recommended, GCP or Azure equivalents noted)
- the cloud CLI for that provider
- The official free tier docs for your chosen cloud
- AWS Skill Builder or GCP free courses
- Provision a virtual machine, an object store, and a managed database by hand, then write down every step to automate later
Week 4: Infrastructure as code
- Terraform: cloud-agnostic infrastructure as code
- tflint and checkov: lint and security scan
- Recreate last week's manual setup entirely in Terraform with remote state
Month 2: Containers, pipelines, and orchestration
Week 5: Docker and containers
- Docker: container runtime and build
- Docker Compose: local multi-service orchestration
- Containerize a web app plus a database and a cache with Docker Compose
Week 6: CI and CD pipelines
- GitHub Actions: pipelines as code
- GitLab CI: alternative with the same concepts
- A pipeline that lints, tests, builds a Docker image, pushes it, and deploys to a staging environment
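The pipeline in this project is a chain of stages where a failure stops everything after it, so a broken test never reaches the deploy step. The sketch below models that fail-fast ordering in Python. The stage names and the run_pipeline helper are hypothetical, and a real pipeline would be declared in the CI system's own configuration format rather than in code like this.

```python
from typing import Callable, List, Tuple

# A stage is a name plus a callable that returns True on success.
Stage = Tuple[str, Callable[[], bool]]


def run_pipeline(stages: List[Stage]) -> Tuple[bool, List[str]]:
    """Run stages in order and stop at the first failure.

    Returns (succeeded, names of the stages that actually ran).
    """
    ran: List[str] = []
    for name, step in stages:
        ran.append(name)
        if not step():
            return False, ran
    return True, ran


if __name__ == "__main__":
    stages = [
        ("lint", lambda: True),
        ("test", lambda: False),   # simulate a failing test suite
        ("build", lambda: True),
        ("push", lambda: True),
        ("deploy", lambda: True),
    ]
    ok, ran = run_pipeline(stages)
    print(ok, ran)
```

With the simulated test failure, the build, push, and deploy stages never run. That is the property to verify in your real pipeline.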
Week 7: Kubernetes fundamentals
- Kubernetes: container orchestration
- kind or minikube: a local cluster
- kubectl: the cluster CLI
- Deploy your containerized app to a local Kubernetes cluster with a Deployment and a Service
Week 8: Kubernetes in production
- Helm: package manager for Kubernetes
- a managed cluster (EKS, GKE, or AKS)
- Package your app as a Helm chart and deploy it to a managed cluster with autoscaling and an ingress
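To reason about the autoscaling part of this project, it helps to know the replica calculation that the Kubernetes Horizontal Pod Autoscaler documentation describes: desired replicas are the current replicas scaled by the ratio of the observed metric to the target metric, rounded up. The Python sketch below follows that formula. The function name and the min/max defaults are assumptions, and the real controller also handles unready pods, missing metrics, and stabilization windows.

```python
import math


def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     min_replicas: int = 1,
                     max_replicas: int = 10,
                     tolerance: float = 0.1) -> int:
    """Approximate the documented HPA rule:
    desired = ceil(current_replicas * current_metric / target_metric).
    """
    ratio = current_metric / target_metric
    # Skip scaling when the metric is close enough to the target.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    # Clamp to the bounds configured on the autoscaler.
    return max(min_replicas, min(max_replicas, desired))
```

For example, four replicas averaging 90 percent CPU against a 60 percent target give a ratio of 1.5, so the sketch asks for six replicas.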
Month 3: Reliability, security, and operations
Week 9: Observability
- Prometheus: metrics
- Grafana: dashboards
- Loki: logs
- OpenTelemetry: tracing
- Add metrics, logs, and traces to your app and build a dashboard with alerts on latency and error rate
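Before wiring alerts into a dashboard, it is worth being precise about what "latency" and "error rate" mean. The sketch below computes a 5xx error rate and a nearest-rank 95th percentile latency from a batch of requests and reports which alerts would fire. The Request type and the thresholds (1 percent errors, 500 ms p95) are illustrative assumptions. In the real project these would be Prometheus queries and alert rules, not Python.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Request:
    latency_ms: float
    status: int


def error_rate(requests: List[Request]) -> float:
    """Share of requests that returned a 5xx status."""
    if not requests:
        return 0.0
    failed = sum(1 for r in requests if r.status >= 500)
    return failed / len(requests)


def percentile(values: List[float], pct: int) -> float:
    """Nearest-rank percentile; `pct` is an integer from 1 to 100."""
    ordered = sorted(values)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def firing_alerts(requests: List[Request],
                  max_error_rate: float = 0.01,
                  max_p95_ms: float = 500.0) -> List[str]:
    """Return the names of the alerts whose threshold is exceeded."""
    alerts: List[str] = []
    if not requests:
        return alerts
    if error_rate(requests) > max_error_rate:
        alerts.append("high_error_rate")
    if percentile([r.latency_ms for r in requests], 95) > max_p95_ms:
        alerts.append("high_p95_latency")
    return alerts
```

Alerting on a high percentile rather than the mean keeps a handful of very slow requests from hiding behind many fast ones.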
Week 10: GitOps and progressive delivery
- Argo CD or Flux: GitOps controllers
- Drive deployments through GitOps so a merge to main updates the cluster automatically
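The core idea behind a GitOps controller is a reconciliation loop: compare the desired state in Git with the live state in the cluster and act on the difference. The sketch below shows only that comparison step. The plan_sync name and the name-to-version maps are hypothetical simplifications. Argo CD and Flux compare full manifests and also deal with ordering, health checks, and pruning policy.

```python
from typing import Dict, List, Tuple


def plan_sync(desired: Dict[str, str],
              live: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (action, resource) pairs that bring `live` in line with `desired`.

    Both maps go from a resource name to a version or content hash.
    """
    actions: List[Tuple[str, str]] = []
    for name in sorted(desired):
        if name not in live:
            actions.append(("create", name))
        elif live[name] != desired[name]:
            actions.append(("update", name))
    # Anything running that Git no longer declares is a prune candidate.
    for name in sorted(live):
        if name not in desired:
            actions.append(("prune", name))
    return actions


if __name__ == "__main__":
    desired = {"deployment/web": "v2", "service/web": "v1"}
    live = {"deployment/web": "v1", "configmap/old": "v1"}
    for action, resource in plan_sync(desired, live):
        print(action, resource)
```

When the two maps match, the plan is empty. That is the steady state a merge to main should eventually produce.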
Week 11: Security and secrets
- a secrets manager (cloud-native or HashiCorp Vault)
- Trivy: image and dependency scanning
- OPA or Kyverno: policy enforcement
- Add image scanning to the pipeline and move all secrets into a manager with rotation
Week 12: Capstone and reliability
- A complete platform: infrastructure as code, a CI and CD pipeline, a Kubernetes deployment with GitOps, full observability, scanned images, and managed secrets, documented with a runbook
Resource master reference
Books
The Phoenix Project by Gene Kim: the why of DevOps as a novel
The DevOps Handbook by Gene Kim and others
Site Reliability Engineering by Google: free online
Repositories
roadmap.sh DevOps roadmap
awesome devops curated lists
Courses and channels
KodeKloud for hands-on labs
TechWorld with Nana on YouTube
Tools master list
Linux, bash, git, Terraform, Docker, Docker Compose, Kubernetes, Helm, GitHub Actions, Argo CD, Prometheus, Grafana, Loki, OpenTelemetry, Trivy, Vault
Interview focus
Explain a deployment pipeline end to end
Design a zero-downtime deployment strategy
How do you debug a service that is slow in production?
Trade-offs of Kubernetes versus serverless
How do you manage secrets across environments?
What are your service level objectives and how do you alert on them?
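For the service level objective question, it helps to have the error budget arithmetic at hand. The sketch below assumes an availability SLO expressed as a fraction (for example 0.999) over a 30-day period. The function names are illustrative, and the burn-rate approach follows the general idea described in Google's SRE material rather than any specific tool's API.

```python
def error_budget(slo: float) -> float:
    """Fraction of requests (or time) allowed to fail under the SLO."""
    return 1.0 - slo


def allowed_downtime_minutes(slo: float, period_days: int = 30) -> float:
    """Total downtime the budget permits over the period."""
    return error_budget(slo) * period_days * 24 * 60


def burn_rate(observed_error_rate: float, slo: float) -> float:
    """How fast the budget is being spent; 1.0 means exactly on budget."""
    return observed_error_rate / error_budget(slo)


def budget_consumed(rate: float, window_hours: float,
                    period_days: int = 30) -> float:
    """Share of the whole period's budget spent in one window at `rate`."""
    return rate * window_hours / (period_days * 24)


if __name__ == "__main__":
    slo = 0.999
    print(allowed_downtime_minutes(slo))      # roughly 43 minutes per 30 days
    print(burn_rate(0.0144, slo))             # roughly 14.4 times budget pace
    print(budget_consumed(14.4, 1))           # about 2 percent in one hour
```

A good answer ties alerts to burn rate rather than raw error counts: page when a short window is spending the budget many times faster than planned, and open a ticket when a long window is spending it slightly too fast.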