~/zad

Experience

Work history

Progressive promotions within a single U.S. fintech SaaS platform, preceded by onsite and agency roles. Consulting and project work is covered on the Projects page.

Full-time

Career timeline

Engineering Manager, DevOps & Site Reliability

U.S. Fintech SaaS Platform · Credit Unions & Community Banks

Apr 2025 – Present · Remote (US / Canada / India)

  • Lead a globally distributed DevOps / SRE team spanning the US, Canada and India, with follow-the-sun on-call coverage for a mission-critical fintech platform.
  • Own end-to-end people management — hiring, goal-setting, performance reviews, career development and promotion planning — and partner with recruiting to continue scaling the team.
  • Designed and implemented a functional Disaster Recovery framework with defined SLOs and error budgets.
  • Drove cost-optimization initiatives across cloud infrastructure using metrics and observability to guide engineering trade-offs.
  • Overhauled observability tooling (metrics, logs, traces, alerts), refining alert strategy and cutting noise to improve incident response.
  • Integrated engineering teams and platforms inherited through M&A activity, onboarding new members and assuming ownership of acquired infrastructure.
KubernetesAWSSLO/Error budgetsIncident managementObservabilityTeam leadership

Lead DevOps Engineer

U.S. Fintech SaaS Platform

Apr 2023 – Mar 2025 · Remote (India)

  • Led the reliability engineering team in achieving and sustaining 99.99% application uptime through SLO-driven practices, proactive monitoring and systematic incident management.
  • Attained 100% observability coverage across company infrastructure — logs, metrics, traces and events.
  • Deployed AI / ML workloads on EKS for real-time processing, orchestrating asynchronous tasks through Step Functions, Lambda and Batch.
  • Evaluated and rolled out a unified observability solution, replacing a fragmented stack across CloudWatch, Elastic Cloud and NewRelic.
  • Migrated EC2-based microservices to Kubernetes with full observability, health checks and automated scaling.
  • Conducted comprehensive interviews for managerial and engineering positions.
AWS EKSStep FunctionsLambdaAWS BatchPrometheusGrafanaSLO/SLI

DevOps Engineer II

U.S. Fintech SaaS Platform

Apr 2022 – Apr 2023 · Remote (India)

  • Spearheaded security and compliance efforts contributing to SOC-2 certification.
  • Managed multi-tenant Kubernetes workflows across AWS EKS clusters.
  • Consolidated observability into a single platform, replacing a fragmented stack across CloudWatch, Elastic Cloud and NewRelic.
  • Managed a broad AWS estate — EC2, RDS, ElastiCache, S3, Route 53, VPC, Site-to-Site VPN, GuardDuty, Inspector and CloudFormation.
  • Automated repetitive operational tasks in Python and Bash.
  • Troubleshot production issues by analyzing logs, traces and metrics to find root causes in a Golang codebase.
AWSKubernetesTerraformPythonGolangSOC-2

DevOps Engineer

U.S. Fintech SaaS Platform

Jun 2021 – Mar 2022 · Remote (India)

  • Managed core AWS services — EC2, RDS, ElastiCache, S3, VPC, Site-to-Site VPN, CloudWatch, Transfer Family, GuardDuty, Inspector and CloudFormation.
  • Monitored infrastructure and application health using CloudWatch, NewRelic, Elastic Cloud and Kibana.
  • Provisioned AWS Site-to-Site VPN tunnels to customer on-premise networks using Terraform.
  • Participated in on-call rotations for production incident response.
AWSTerraformCloudWatchNewRelic

DevOps / Database Expert

Mercedes-Benz R&D India (via QBurst) · Onsite — Virtual Key Service (VKS)

Sep 2019 – Jun 2021 · Bangalore, India

  • Architected a high-availability, multi-master PostgreSQL cluster with automated failover and replication across geographically distributed tenants.
  • Automated infrastructure deployment with Terraform and led a lift-and-shift migration of services from on-premise to AWS.
  • Designed CI/CD pipelines using AWS CodeBuild and CodePipeline.
  • Migrated database services between PostgreSQL, DB2, MongoDB, AWS RDS and DynamoDB.
  • Conducted regular Disaster Recovery drills and automated periodic database backups.
PostgreSQLKubernetesTerraformAWS RDSCodeBuildCodePipeline

DevOps Engineer

QBurst · Internal Kubernetes Authentication Platform

May 2019 – Sep 2019 · Kerala, India

  • Configured a highly available on-premise Kubernetes cluster for internal development.
  • Built a Python / Django authentication and authorization portal integrated with Kubernetes RBAC.
  • Set up Prometheus and Grafana for cluster health monitoring and alerting.
KubernetesPythonDjangoPrometheusGrafana

Python Backend Developer

QBurst · Fintech asset-management iOS app

Jun 2018 – Apr 2019 · Kerala, India

  • Developed REST APIs with Django and deployed to AWS EC2 via a GitLab CI/CD pipeline.
  • Administered relational databases, optimized queries and managed AWS RDS.
  • Automated database sync between AWS RDS and on-premise databases using Python.
PythonDjangoAWS EC2AWS RDSGitLab CI