Reliability and infrastructure at scale.
CNCF Maintainer · Docker Captain · Creator of KubeSlice
I build systems that stay up when it matters.
For the last decade I've worked on one problem in different forms: making complex infrastructure dependable. I wrote KubeSlice from scratch and grew it into a CNCF project trusted for secure multi-cluster Kubernetes networking. I've owned the reliability of payment systems serving millions of businesses, and taken cloud products into locked-down, air-gapped enterprises that can't afford a single failure.
I work end to end. I'll architect a platform from a blank page, write the infrastructure that ships it, and lead the team that keeps it running at 3am.
What I'm focused on now is where reliability is going next: into how AI systems are operated, diagnosed, and trusted. As systems get more complex and more autonomous, dependability stops being the boring part and becomes the whole game.
KubeSlice · CNCF project for secure multi-cluster Kubernetes networking. Wrote the early codebase, grew the platform and the team around it.
K8sToolbox · A working toolkit for operating Kubernetes clusters.
k8s-pod-cpu-stressor · Lightweight tool for stress-testing pod CPU behavior under load.
Program Committee Member, KubeCon + CloudNativeCon (India, Japan, China). I write about Kubernetes, cloud-native, and reliability on Medium and Dev.to.




