Full Timeline

The Journey

Month-by-month from student to Senior Software Engineer I. Every milestone, migration, and lesson learned.

← Back to About

2026

Senior Software Engineer I · Swym Corporation

Promoted to Senior Software Engineer I

Mar 2026

Recognized for leading AI transformation efforts, full infrastructure modernization, and engineering impact across the organization.

[Career Growth][Leadership]

Alfred — Internal AI Ops Assistant

Apr 2026

Built Swym's AI operations assistant from scratch in ~9 days. Gemini 2.5 Flash agentic loop with MCP access to Kubernetes, Prometheus, Jaeger, OpenSearch, ClickHouse, and GitHub. Thread-level memory, long-term memory on PVC, human approval gates, proactive idle research. GitHub Actions CI/CD for all three services.

[Gemini][MCP][Kubernetes][Slack][Redis Streams][Python]

MCP-Gateway for Observability

Mar 2026

Built a FastMCP-based unified gateway exposing Jaeger, OpenSearch, and Prometheus as a single AI tool endpoint. PII redaction middleware. Used as the backbone for Alfred and the AI auto-rightsizing agent.

[MCP][FastMCP][Observability][PII Redaction]

AG4C Migration — $120/day Saved

Feb 2026

Migrated swym-store (all tiers) to Azure Application Gateway for Containers using the Kubernetes Gateway API. Provisioned 4 AG4C instances with full Gateway API config, TLS policies, and certificates. Saved $120/day.

[Azure AG4C][Gateway API][Cost Optimization]

AI Auto-Rightsizing Agent

Mar 2026

Implemented an AI agent connected to MCP-Gateway and GitHub Copilot that queries Prometheus for CPU/memory metrics and automatically rightsizes Kubernetes container resources via the Helm charts repo.

[AI Agents][Prometheus][Kubernetes][Resource Optimization]

K8s Cluster Upgrade v1.33 → v1.34

Mar 2026

Upgraded production and operations AKS clusters, including ArgoRollouts. Preceded by 7 days of API access log collection and TB-scale log analysis on a self-hosted ClickHouse DB.

[Kubernetes][AKS][ClickHouse][Upgrades]

2025

Software Engineer I → Senior Software Engineer I · Swym Corporation

Promoted to Software Engineer I

Jan 2025

Promoted from Associate Software Engineer. Also created ilogu3000 — a benchmarking tool for Clojure logging frameworks that identified significant CPU overhead from pretty-printing in production.

[Career Growth][Clojure][JVM]
ilogu3000 on GitHub

swym-store Migration to Kubernetes

Feb 2025

Migrated our most critical application — swym-store — to Kubernetes across all 5 tiers. Set up new app gateway and ingress controllers, rewrote td-agent config to fluentd socket-based input, added Log4j2 changes, and built a Python genConfig init container.

[Kubernetes][Fluentd][Log4j2][App Gateway]

Zero-Downtime Deployments

Mar 2025

Eliminated deployment-related failed requests on swym-store. Root cause: readiness probe passing before all endpoints were ready. Fixed via readinessProbeDelay, preStop hook tuning, and AGIC lifecycle annotations.

[Kubernetes][Deployment Strategy][Zero Downtime][Lifecycle Hooks]

AI Transformation Lead — Operations Cohort

Jul 2025

Named lead for AI transformation efforts in the operations engineering cohort. Led LangGraph and LangChain explorations, built Slack deployment bots, and drove the AI ops strategy that would later become Alfred.

[AI][LangGraph][LangChain][Leadership]

Full Infrastructure Modernization — 3 New AKS Clusters

Jun–Sep 2025

Built 3 new AKS clusters (prod, staging, ops) with workload identities, VNets, DNS Zones, and Azure AD SSO on ArgoCD. Migrated all platform workloads via v2 Helm charts. New FluentBit agent-aggregator logging architecture.

[AKS][ArgoCD][Helm v2][FluentBit][Terraform][Workload Identity]

Self-Serve Monitoring Platform (Thanos + kube-prometheus-stack)

Oct 2025

Built a fully self-serve multi-team monitoring platform. Teams provision Prometheus instances and define alerts via ArgoCD. Thanos for long-term retention. Custom Redis queue exporter, Azure Monitor adapter, scalable Alertmanager routing. JVM metrics for Java apps.

[Thanos][Prometheus Operator][Alertmanager][kube-prometheus-stack]

BFCM 2025 — 1.01M Peak RPM

Nov 2025

Handled Black Friday/Cyber Monday peak traffic of 1.01 million RPM on Kubernetes with full observability. Resolved a live P0 incident caused by slow AGIC pod IP updates — migrated pods to a new app gateway and provisioned new subnets under load.

[Kubernetes][BFCM][1M+ RPM][Incident Response][Azure]

Penetration Testing Remediation

May 2025

Acted on vulnerability report from an external penetration testing team. Fixed multiple web application vulnerabilities — HTTP header misconfigurations and security misconfigurations — across production systems.

[Security][Penetration Testing][Vulnerability Remediation]

2024

Associate Software Engineer · Swym Corporation

Infra Cost Reduction ~30%

Jan–Mar 2024

Moved staging workloads to Azure Spot VMs, implemented OpenSearch hard-disk storage in place of SSDs, introduced ILM policies, and right-sized resources. ~30% infra cost reduction.

[Azure Spot VMs][Cost Optimization][OpenSearch ILM][Rightsizing]

OpenSearch Hot/Warm/Cold Architecture

Apr 2024

Re-architected OpenSearch from a flat master setup to a tiered hot/warm/cold architecture with ISM policies. Custom FluentBit Lua script auto-creates named indexes per application, eliminating manual reconfiguration for every newly onboarded service.

[OpenSearch][ISM][FluentBit][Lua][Hot/Warm/Cold]

Custom OTel Clojure SDK

May 2024

Built a custom Clojure library using Java interop to wrap the Java OTel SDK. Instrumented critical business flows across the entire organization.

[Clojure][OpenTelemetry][Java Interop]
Blog: Setup Jaeger Operator

Jaeger + Kafka (215M Spans / 15 Hours)

Jul 2024

Deployed Kafka via Strimzi Operator as an intermediary buffer between Jaeger Collector and OpenSearch. Solved span loss under OpenSearch backpressure. System handled 215,382,693 spans in 15 hours.

[Jaeger][Kafka][Strimzi][Observability]

Jenkins Build Times Cut by 75%

Aug 2024

Multi-stage Docker builds, layer caching, and workload parallelization. Reduced build times to 1/4th of original.

[Jenkins][Docker][Multi-stage Builds][CI/CD Optimization]

Devops-Tools Platform (Appsmith)

Oct 2024

Built a devops-tools API and Appsmith workspace allowing teams to self-serve Azure App Config changes without needing direct infra access.

[Appsmith][Azure App Config][Python][Self-Serve Tools]

K8s Production Cluster Upgrade v1.27 → v1.28

Nov 2024

Upgraded the production cluster and set up Pod Disruption Budgets (PDBs) for all apps and ingress gateways to prevent node eviction issues during future upgrades.

[Kubernetes][AKS][PDB][Upgrades]

2023

Intern → Associate Software Engineer (Full-Time) · Swym Corporation

Joined Swym as Backend Infrastructure Intern

Jan 2023

Onboarding, Clojure training, and first project — swym-changelog built with Gatsby and Shopify Polaris.

[Clojure][Gatsby][Onboarding]

Grafana Queue Monitor & Jenkins Automation

Apr 2023

Built a Grafana Queue Monitor dashboard using REST API and JSONPath. Wrote a Jenkins job to auto-update service thread allocation using Ansible and Azure CLI.

[Grafana][Jenkins][Ansible][Azure CLI]

Azure Key Vault Secret Management (SOC2)

May 2023

Migrated all service secrets from plaintext EDN config to Azure Key Vault with startup-time injection via Managed Identity. Key milestone toward SOC2 compliance.

[Azure Key Vault][Security][SOC2][Managed Identity]

Kubernetes Migration — Node.js Services to AKS

Jul 2023

Led the migration of production Node.js services from Azure App Services to AKS. Version-locked Dockerfiles, revamped Jenkins pipelines, Helm charts with Secret Provider Class, ArgoCD per-service apps. Coordinated staged production rollouts and wrote full documentation.

[Kubernetes][AKS][Helm][ArgoCD][Jenkins][Docker]

Promoted to Associate Software Engineer (Full-Time)

Jul 2023

Converted from intern to full-time ASE based on impact during the internship period.

[Career Growth]

Prometheus + Grafana OnCall Monitoring Stack

Aug 2023

Established Prometheus, AlertManager, and Grafana OnCall with Slack-based alert routing. Foundation for all future observability work at Swym.

[Prometheus][Grafana][AlertManager][Slack]

BFCM 2023 — First Scale Event

Nov 2023

Prepared BFCM 2023 infrastructure using Terraform. Provisioned and configured BFCM nodes, supported deployment operations during code freeze, monitored and handled events.

[Terraform][BFCM][Operations]

2022

Student · SASTRA Deemed University

Head of Daksh SoC (School of Computing Events)

2022–2023

Led the events team for Daksh and Utsav 2023 — SASTRA's flagship tech festivals. Mentored juniors, managed logistics, and coordinated with administration.

[Leadership][Event Management][Mentorship]

Content Writer at OWASP Foundation

2022

Authored accessible articles on cybersecurity, distributed systems, and AI for a global community.

[Cybersecurity][Technical Writing][Community]

Core Team — Team 1nf1n1ty (Cybersecurity)

2022

Represented SASTRA's official cybersecurity team in national CTFs. Top 5 rank in India. Specialized in forensics, cryptography, and network exploitation.

[CTF][Cybersecurity][Forensics][Cryptography]

2021

Student · SASTRA Deemed University

Joined Daksh SoC Events Team

2021

Started contributing to Daksh and Utsav tech festivals. Organized hybrid events during COVID-19 restrictions.

[Events][Teamwork][Adaptability]

Foundations in Technology

2021

Began exploring programming, Linux, and basic cybersecurity — the foundations for everything that followed.

[Linux][Networking][Foundations]