Sr Site Reliability Engineering – SRE

Job Description

  • Contractor
  • Anywhere

🚀 #Hiring: Sr Site Reliability Engineering – SRE | 🏢 Work Model: Remote (Canada)
📅 Duration: 6–12 months |👤 Experience: 8+ Years

⚠️[ Note : Only Candidates holding Canadian PR / OWP / Citizenship will be considered ]

🔹 Skills Required:
Strong experience in Observability, SRE, and DevOps practices
Deep expertise with Dynatrace, ELK, Splunk, and PagerDuty
Strong understanding of observability principles, instrumentation, correlation IDs, and SLI/SLO frameworks
Hands-on experience with Azure Kubernetes Service (AKS)
Advanced proficiency with Terraform and Infrastructure as Code (IaC)
Experience with Azure managed services including SQL MI, Redis, Functions, and Event Grid
Strong experience with distributed tracing, metrics collection, and log aggregation
Experience supporting Node.js and .NET applications in microservices and event-driven architectures
Strong troubleshooting and root cause analysis skills across distributed systems, APIs, databases, and caches
Experience with incident management tools such as PagerDuty and ServiceNow
Knowledge of incident, problem, and change management processes
Familiarity with CI/CD pipelines, automation, and operational resilience practices
Experience with chaos engineering and blameless postmortems
Strong communication, leadership, and cross-functional collaboration skills

📩 DM me or Email to apply!
Email: Rishabh.Yadav@Varite.com