~/zad

Engineering Manager, DevOps & Site Reliability

Mohammed Zadnoor

I lead a globally distributed DevOps & SRE team at a U.S. fintech SaaS platform, building reliability engineering practices for mission-critical infrastructure serving Credit Unions and Community Banks.

Current focus

  • SLO-driven reliability and incident management at 99.99% uptime
  • Kubernetes at scale across AWS EKS and GCP GKE
  • Cost, observability and disaster-recovery maturity

Recent experience

Where I've been most recently

Progressive promotions within a single U.S. fintech SaaS platform, alongside independent consulting and project work.

Engineering Manager, DevOps & Site Reliability

U.S. Fintech SaaS Platform · Credit Unions & Community Banks

Apr 2025 – Present · Remote (US / Canada / India)

  • Lead a globally distributed DevOps / SRE team spanning the US, Canada and India, with follow-the-sun on-call coverage for a mission-critical fintech platform.
  • Own end-to-end people management — hiring, goal-setting, performance reviews, career development and promotion planning — and partner with recruiting to continue scaling the team.
  • Designed and implemented a functional Disaster Recovery framework with defined SLOs and error budgets.
KubernetesAWSSLO/Error budgetsIncident managementObservabilityTeam leadership

Lead DevOps Engineer

U.S. Fintech SaaS Platform

Apr 2023 – Mar 2025 · Remote (India)

  • Led the reliability engineering team in achieving and sustaining 99.99% application uptime through SLO-driven practices, proactive monitoring and systematic incident management.
  • Attained 100% observability coverage across company infrastructure — logs, metrics, traces and events.
  • Deployed AI / ML workloads on EKS for real-time processing, orchestrating asynchronous tasks through Step Functions, Lambda and Batch.
AWS EKSStep FunctionsLambdaAWS BatchPrometheusGrafanaSLO/SLI

DevOps Engineer II

U.S. Fintech SaaS Platform

Apr 2022 – Apr 2023 · Remote (India)

  • Spearheaded security and compliance efforts contributing to SOC-2 certification.
  • Managed multi-tenant Kubernetes workflows across AWS EKS clusters.
  • Consolidated observability into a single platform, replacing a fragmented stack across CloudWatch, Elastic Cloud and NewRelic.
AWSKubernetesTerraformPythonGolangSOC-2

Core stack

What I build with

A condensed view — the full skill matrix is on the Skills page.

Cloud Platforms

Hands-on with AWS and GCP across production fintech and consulting SaaS engagements.

AWS EC2AWS LambdaAWS BatchAWS VPCAWS IAMAWS S3AWS RDSAWS ElastiCacheAWS Route 53AWS CloudFormationAWS GuardDutyAWS InspectorGCP GKEGCP ComputeGCP Cloud SQLGCP AlloyDBGCP Secret Manager

Orchestration & Runtime

Kubernetes at scale across EKS and GKE, plus task orchestration primitives.

KubernetesHelmKustomizeAWS EKSGCP GKEAWS ECSAWS Step FunctionsTemporal

Infrastructure as Code

Declarative provisioning and drift-free environments.

TerraformPulumiAWS CloudFormationAWS CDK

Observability & Monitoring

Unified metrics, logs, traces and alerting across multi-cloud estates.

PrometheusGrafanaSignozElastic CloudKibanaAWS CloudWatchNewRelicKloudfuseAlertmanager