Consultant – SRE, Cloud Platforms

Job Description

  • Contractor
  • Anywhere

We’re Hiring | Consultant – SRE, Cloud Platforms (Remote | Canada)
📍 Location: Ontario (Remote)
⏳ Duration: 6 Months
⭐ Client: Public Sector

About Subtility
Subtility is a leader in digital transformation, helping enterprises accelerate their journey in DevOps & Cloud Operations, Observability & ITOM, Application Modernization, and Data Engineering. We combine deep platform expertise with our proprietary Agentic AI platform to deliver intelligent, scalable solutions that solve complex business challenges.

Role Overview
We are seeking a senior-level SRE to serve as the strategic bridge between our Center of Excellence and our clients’ technical leadership. In this role, you will transition beyond traditional monitoring to architect resilient, self-healing cloud ecosystems. You will support our Technical Pre-Sales and Delivery teams by designing high-performance platforms that leverage Site Reliability Engineering (SRE) principles and Agentic AI to drive operational excellence.

Key Responsibilities
• Architectural Discovery: Lead deep-dive sessions to evaluate client cloud maturity, container strategies, and automation bottlenecks.
• Hybrid & Multi-Cloud Design: Develop scalable architectures across GCP and AWS, ensuring seamless integration between GKE and on-premises container platforms.
• SRE & Telemetry Strategy: Design robust telemetry pipelines covering the “Three Pillars” (Metrics, Logs, and Traces) using modern stacks like OpenSearch and Prometheus.
• CI/CD Pipeline Engineering: Own the end-to-end automation strategy using Harness, Git/Bitbucket, and Infrastructure-as-Code (IaC) to ensure rapid, reliable delivery.
• Technical Leadership: Lead complex POCs (6-8 weeks) that demonstrate the value of automated SRE practices and AI-driven operations to stakeholders.

Requirements
• Experience: 10+ years of overall IT experience, with at least 5 years in customer-facing roles (Solutions Engineering or Technical Consulting).
• Cloud Mastery: Proven proficiency in GCP and AWS services, with deep expertise in Kubernetes (GKE and on-premises) and container orchestration.
• SRE Toolset: Extensive hands-on experience with OpenSearch, Splunk, Grafana, Prometheus, and OpenTelemetry.
• DevOps & Automation: Expert knowledge of Harness, CI/CD workflows, Git-based version control (Bitbucket), and Infrastructure-as-Code (IaC).
• Data Literacy: Deep understanding of telemetry data structures (Metrics, Logs, and Traces) and how to derive actionable insights from complex datasets.
• Communication: Exceptional ability to lead whiteboarding sessions and present complex technical architectures to both C-level executives and engineering teams.

📩 Interested candidates can send their resume directly to:
mishra.neha@smsoftconsulting.com