Posted on: 05/09/2025
Architecting Test Systems :
- Architect test frameworks and infrastructure for validating microservices and infrastructure components in multi-cluster and hybrid-cloud environments.
- Oversee the design of complex test scenarios simulating production-like workloads, resource scaling, failure injection, and recovery across distributed clusters.
Automation & Scalability :
- Spearhead the development of scalable and maintainable test automation integrated with CI/CD (Jenkins, GitHub Actions, etc.).
- Leverage Kubernetes APIs, Helm, and service mesh tools to build comprehensive automation coverage, including system health, failover behavior, and network resilience.
- Promote test infrastructure-as-code and drive IaC forward on the team making sure the infrastructure code is repeatable, extensible and reliable.
Technical Proficiency :
- Deep understanding of Kubernetes internals, cluster lifecycle management, Helm, service meshes (e.g., Istio or Linkerd), and network policies.
- Strong scripting and automation capabilities (Python, Pytest, Bash, etc.).
- Familiarity with observability stacks (Prometheus, Grafana, Jaeger), Kubernetes security (RBAC, secrets management), and performance benchmarking tools (e.g., K6).
- Solid grounding in cloud architecture (AWS, Azure, GCP), infrastructure provisioning, and containerized CI/CD.
- Moderate to advanced linux knowledge and proficiency is required : Bash scripting and debugging, systemd/logs, networking/firewalling/routing, certificate/PKI management, containers (Docker/containerd), and Kubernetes tooling (kubectl/Helm with OCI registries, GitOps/Flux) to install, test, and troubleshoot multi-cluster environments.
Automation & Scalability :
- Spearhead the design and development of highly scalable, maintainable test automation systems, seamlessly integrated into CI/CD pipelines (Jenkins, GitHub Actions, GitLab, ArgoCD).
- Leverage Kubernetes APIs, Helm charts, and service mesh frameworks (Istio, Linkerd) to enable full automation coverage for system health monitoring, network resilience testing, failover validation, and scaling scenarios.
- Advocate and implement Test Infrastructure-as-Code (IaC), ensuring all test systems are repeatable, auditable, extensible, and reliable. Drive the adoption of GitOps practices for test environments.
Technical Proficiency :
- Kubernetes & Cloud Expertise : Deep understanding of Kubernetes internals, cluster lifecycle management, networking policies, Helm, and service meshes, combined with hands-on experience across major cloud platforms (AWS, Azure, GCP).
- Observability & Monitoring : Skilled in designing observability pipelines using Prometheus, Grafana, Jaeger, with strong focus on proactive monitoring, tracing, and alerting for distributed systems.
- Scripting & Automation : Proficient in Python (Pytest, automation frameworks), Bash scripting, and DevOps toolchains, ensuring streamlined automation and test orchestration.
- Security & Reliability : Familiar with RBAC, PKI management, secrets handling, and network security policies for production-grade Kubernetes clusters.
- System Benchmarking : Experienced in load and performance benchmarking using K6, Locust, and custom benchmarking harnesses to validate system throughput, latency, and resilience under stress.
- Linux & System Proficiency : Solid grounding in Linux administration, including systemd management, firewalling/routing, container runtimes (Docker, containerd), networking diagnostics, and log debugging.
Did you find something suspicious?
Posted By
Posted in
Quality Assurance
Functional Area
QA & Testing
Job Code
1541720
Interview Questions for you
View All