Swill Cheng

Profile

Senior DevOps Engineer with 8+ years of experience in enterprise cloud infrastructure and DevOps transformation. Expertise in multi-cloud architecture (GCP, Alibaba Cloud, AWS), hybrid Kubernetes clusters (100+ nodes), and Istio service mesh. Built enterprise DevOps platforms supporting 100+ developers and comprehensive observability solutions. Team leader managing 4 engineers.

Work Experience
  1. Pepperstone

    Sr. DevOps Engineer

    Feb 2026 - Present

    Shanghai, China

    Istio & Traffic Management

    • Architected multi-version deployment strategies using Istio Service Mesh to support canary releases and A/B testing.
    • Implemented traffic coloring (request tagging) and "swimlane" isolation to enable end-to-end testing in shared clusters.
    • Configured VirtualServices and DestinationRules to automate traffic shifting and enhance environment resilience.

    GitOps Migration (FluxCD to ArgoCD)

    • Led the organizational migration from FluxCD to ArgoCD, standardizing GitOps workflows across multiple Kubernetes clusters.
    • Deployed the "App-of-Apps" pattern and ApplicationSets to manage microservices and CRD lifecycles at scale.
    • Integrated ArgoCD with CI/CD pipelines to ensure automated drift detection and self-healing for infrastructure manifests.
    • Built an ArgoCD file-download plugin (Go) to replace `kubectl cp`, letting developers self-serve pod artifact and log retrieval without kubectl access.

    AWS EU Infrastructure (0 to 1)

    • Bootstrapped AWS EU region with Terraform (VPC / EKS / RDS / IAM + observability) and rolled out ArgoCD-based GitOps from day one, aligning the new region with the org-wide delivery standard for regional expansion and data-residency compliance.
  2. Zeal Group

    Sr. DevOps Engineer

    Aug 2025 - Jan 2026

    Shanghai, China

    • Built GCP infrastructure and established observability platform (logging, metrics, distributed tracing) with Prometheus and Grafana for the new Traze environment.
    • Optimized ArgoCD/Kafka/GitLab CI deployment pipelines.
    • Took over DevSecOps responsibilities, integrating security practices into CI/CD pipelines.
  3. WOOTECH Limited (woox.io)

    Sr. DevOps Engineer

    Feb 2025 - Aug 2025

    Shanghai, China

    • Transformed GCP flow logs and HTTP load balancer logs into Prometheus metrics; configured Grafana dashboards for traffic pattern analysis.
    • Redesigned Prometheus alerting at the service level and integrated with PagerDuty for improved incident response.
    • Designed PVC migration strategy from multi-AZ to single-AZ in GKE, reducing cross-zone traffic costs.
    • Used GKE usage metering with BigQuery and Grafana to build service-level cost allocation dashboards.
    • Redesigned GitLab Runner architecture using Helm and Spot VM node pools to reduce infrastructure costs.
  4. Shanghai Mitsubishi Elevator Co., Ltd.

    Technician (DevOps/Cloud Engineer)

    Jun 2020 - Jan 2025

    Shanghai, China

    DevOps Platform Development (0 to 1)

    • Built Jenkins + ArgoCD enterprise DevOps platform for 100+ developers, with Artifactory and GitHub Flow covering the full code-to-artifact lifecycle.
    • Integrated SonarQube quality checks and JFrog Xray vulnerability scanning for DevSecOps; applied GitOps for IaC and Pipeline-as-Code.

    Alibaba Cloud Infrastructure Architecture (0 to 1)

    • Led Alibaba Cloud architecture managing multi-million annual cloud spend; Terraform IaC across ECS / ACK / RDS / SLB / SLS / Redis / OSS.
    • Built a cloud security framework (SSO + AK/RAM least privilege) and a hybrid-cloud architecture over a dedicated line, cutting costs via reserved instances and storage tiering.

    Observability & Microservice Governance (SRE)

    • Managed hybrid cloud Kubernetes clusters (100+ nodes) for cross-environment container orchestration.
    • Built observability platform integrating Prometheus/Grafana for metrics and SLS for log collection; established end-to-end tracing with OpenTelemetry.
    • Led microservice architecture upgrade (Eureka to Nacos), achieving 90% containerization rate.

    Platform Development & Engineering Efficiency

    • Developed a Kubernetes admission controller (Go) for automatic Java agent injection and a Jenkins Pipeline controller using Operator SDK.
    • Built alert bus system (Java) integrating Prometheus and Jenkins alerts.
  5. Shanghai DaoCloud Network Technology Co., Ltd

    Java Developer

    Dec 2017 - May 2020

    Shanghai, China

    • Led Kubernetes-native service discovery implementation replacing Eureka; developed service discovery server in Go with K8s API-server watch mechanism.
    • Led containerization transformation, creating Helm charts and standardizing deployment processes.

Skills

Cloud & IaC: GCP, Alibaba Cloud, AWS · Terraform, Ansible
Container & Service Mesh: Kubernetes, Docker, Helm, Istio
CI/CD & GitOps: Jenkins, GitLab CI, GitHub Actions, ArgoCD, FluxCD
Observability: Prometheus, Grafana, OpenTelemetry, ELK Stack, PagerDuty
Languages: Java (Expert), Golang (Proficient), Python, Shell

Education

Shanghai Maritime University

Bachelor's Degree in Communication Engineering

Sep 2012 - Jul 2017