Skip to content
View muntashir-islam's full-sized avatar

Block or report muntashir-islam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
muntashir-islam/README.md

πŸ‘‹ Hi, I'm Muntashir Islam

Senior Site Reliability Engineer β€’ Platform Engineer β€’ Cloud & Kubernetes Specialist


πŸš€ Professional Overview

I am a Platform-focused SRE dedicated to building resilient distributed systems. I specialize in bridging the gap between development and operations by creating Internal Developer Platforms (IDPs) that reduce cognitive load and accelerate delivery.

  • Reliability: Implementing SLIs/SLOs, Error Budgets, and automated incident response.
  • Scalability: Managing multi-region Kubernetes clusters (AKS/EKS/GKE) and high-traffic cloud networking.
  • Developer Experience: Building "Golden Paths" using Backstage, Terraform, and GitOps.

πŸ›  Tech Stack

Category Tools & Technologies
Cloud AWS, Azure, Google Cloud Platform
Orchestration Kubernetes (Managed Service and Selfmanaged One), Docker, Nomad
Infrastructure Terraform, Pulumi, Crossplane, Ansible, Helm, Kustomize
CI/CD / GitOps ArgoCD, FluxCD, GitHub Actions, GitLab CI, Buildkite
Observability Prometheus, Thanos, Grafana, Loki, Opensearch, ELK Stack, Datadog, OpenTelemetry
Languages Go, Python

πŸ— Key Projects

βœ… Completed Projects

  • Multi-Cluster Metrics Aggregation (Thanos/Prometheus): Engineered a centralized observability platform across 10+ global clusters using Thanos to provide long-term storage and a single pane of glass for Grafana dashboards.
  • Kubernetes Operator for Timebased Scalling (Go): Developed a kubernetes operator that can scale workloads during specific time in the day, optimized 35% cost for dev cluster by scaling down workload
  • Kubernetes Postgres Backup Operator (Go): Developed a custom Go-based operator using the Controller-Runtime to manage automated database snapshots and offsite S3/Azure Blob syncing via CRDs which increase reliability by 50%.
  • Enterprise Hub-Spoke AKS Architecture: Designed a private-link-first network topology for Azure, securing traffic with AGIC (Application Gateway) and ensuring zero-trust communication via Calico policies.
  • Automated FinOps Dashboard: Built a Python tool integrated with AWS/Azure Billing APIs to identify orphaned resources and idle clusters, reducing cloud spend by 22% annually.

πŸ— Ongoing & Active Development

  • Internal Developer Portal (Backstage): Architecting a self-service portal that allows developers to spin up ephemeral environments and RDS instances with a single click using Crossplane.
  • Chaos Engineering Framework: Implementing LitmusChaos experiments into CI/CD pipelines to validate service resilience against pod evictions and network latency.
  • Multi-Cloud GitOps Controller: Building a custom controller in Go to synchronize secrets and configurations across disparate EKS and AKS environments seamlessly.

🎯 Current Focus

  • πŸ¦€ Learning Rust for high-performance systems tooling.
  • ☸️ Deep diving into eBPF for advanced network observability.
  • ☁️ Scaling Platform Engineering as a product within organizations.

πŸ“« Connect with Me


"Automate everything, document the rest."

Pinned Loading

  1. aws-iac-pulumi aws-iac-pulumi Public

    Different AWS iac module developed using python and pulumi iac

    Python

  2. k8s-admission-controller k8s-admission-controller Public

    Kubernetes custom admission controller

    Go

  3. pgdb-backup-azureblob-operator pgdb-backup-azureblob-operator Public

    Go

  4. timebased-autoscaling-operator timebased-autoscaling-operator Public

    Go

  5. url-shortener url-shortener Public

    AWS based url shortener

    HCL

  6. aws-eventbridge-pattern aws-eventbridge-pattern Public

    How we can use aws eventbridge effectively

    HCL