Job Description
Opportunity Alert 👉 Site Reliability Engineer (SRE)
Experience: 5–10 years
Location: Toronto (Onsite), Full-Time
Key Responsibilities
📌 Lead and deliver SRE/DevOps projects end-to-end
📌 Define and monitor SLAs/SLOs/SLIs
📌 Drive incident reviews, automation, and observability
📌 Collaborate with teams to improve reliability and reduce toil
📌 Maintain and optimize Kubernetes clusters and containerized workloads
Must-Have Skills
📌 Strong Linux and Shell/Python scripting skills
📌 Expertise in AWS or GCP, Kubernetes, and Docker
📌 Experience with monitoring tools (Prometheus, Grafana, CloudWatch)
📌 Familiarity with infrastructure as code (Terraform, CloudFormation)
📌 Experience with CI/CD tools (Jenkins, CircleCI, etc.)
📌 Exposure to systems such as Kafka, Cassandra, PostgreSQL, and Redis
Ready to build scalable, reliable systems? Send your resume to madhuri.rane@techdoquest.com or DM me (Madhuri Rane) 👍