Posted on: 23/04/2026
Senior Platform Support Engineer
Experience : 7+ Years
Location : Remote
About the Role :
We are looking for a highly skilled Senior Engineer with deep expertise in Garden Linux, KVM virtualization, and Gardener Kubernetes to support and optimize enterprise-grade infrastructure environments.
This role involves advanced troubleshooting, system optimization, and contributing to high-impact incident resolution and platform stability.
Key Responsibilities :
- Provide advanced support for Garden Linux, KVM virtualization, and Gardener Kubernetes environments.
- Troubleshoot complex system, kernel, and infrastructure-level issues.
- Perform root cause analysis (RCA) and contribute to incident resolution for critical issues (P1/P2).
- Optimize system performance, reliability, and scalability across environments.
- Collaborate with cross-functional teams to ensure seamless deployment and operations.
- Develop and maintain technical documentation, best practices, and knowledge-sharing sessions.
- Automate processes and improve efficiency using scripting and CI/CD integrations.
Core Skills & Expertise :
Garden Linux Expertise :
- Strong knowledge of Debian-based Linux systems and kernel configuration.
- Experience with Garden Linux image customization and build pipelines.
- Proficiency in package management, systemd, and OS hardening.
- Expertise in debugging performance, boot, and kernel-level issues.
- Experience integrating CI/CD pipelines for OS image deployment and maintenance.
KVM / Virtualization Expertise :
- Advanced understanding of KVM, QEMU, and Libvirt architecture.
- Experience with Cloud Hypervisor environments setup and troubleshooting.
- Strong knowledge of virtualization networking (bridges, VLANs, SDN) and storage (NFS).
- Understanding of hardware virtualization and NUMA alignment.
- Proficiency in automation scripting (Golang, Python, Bash).
- Experience with host performance tuning and low-level debugging.
Gardener Kubernetes Expertise :
- Expert-level understanding of Kubernetes internals (control plane, networking, scheduling).
- Hands-on experience with Gardener architecture, including shoot and seed cluster management.
- Experience in cluster lifecycle management, upgrades, and node troubleshooting.
- Familiarity with observability tools like Prometheus and Perses.
- Strong problem-solving skills with ability to conduct deep root cause analysis.
Key Deliverables :
- Incident troubleshooting reports with detailed technical insights.
- Root Cause Analysis (RCA) reports for critical incidents (P1/P2).
- Configuration reviews and optimization recommendations.
- Documentation of best practices and operational guidelines.
- Conduct knowledge transfer sessions for internal teams.
Preferred Qualifications :
- Experience working in cloud-native environments.
- Exposure to DevOps practices and SRE principles.
- Strong collaboration and communication skills in distributed teams
Did you find something suspicious?