Posted on: 09/12/2025
Description :
The core responsibilities for the job include the following :
Kubernetes and Containerization :
- Architect, deploy, and manage Kubernetes (K8s) clusters on both bare-metal and cloud environments.
- Utilize Helm and similar tools for application packaging and deployment within K8s.
- Develop and maintain K8s deployment templates for consistent and repeatable service rollouts.
- Utilize installation tools and distributions like kubeadm, kubespray, k3s, Rancher, OpenShift, etc., for bare-metal K8s deployments.
Infrastructure and OS :
- Design and deploy new infrastructure solutions for bare-metal environments based on evolving requirements.
- Perform Linux sysadmin tasks and troubleshooting on physical servers.
- Utilize scripts and IaC for automated provisioning and management.
- Have a good understanding of database servers.
Networking and Security :
- Design, implement, and troubleshoot networking solutions, covering reachability, routing, DNS, load balancing, firewall management, etc.
- Deploy and maintain environments following security best practices, including system hardening, implementing least privileged access, and ensuring adherence to compliance standards.
CI/CD and Automation :
- Design, build, and maintain CI/CD pipelines to automate software build, testing, and deployment processes.
- Develop automation scripts (Python, Bash, Ansible, etc.) to streamline software packaging, deployment, and operational tasks.
Reliability and Operations :
- Implement and manage monitoring and alerting systems for proactive issue detection and resolution.
- Develop and execute backup and disaster recovery (DR) strategies to ensure data integrity and service availability.
- Conduct performance testing and capacity planning to optimize and scale infrastructure effectively.
- Collaborate with development teams to ensure new services are designed for reliability and seamless deployment.
Requirements :
- Bachelor's degree in Computer Science, Engineering, or a related field; or equivalent work experience.
- 4- 6 years of experience in DevOps/Site Reliability Engineering/Systems Administration, or a similar role with a strong focus on bare-metal environments.
- Expert-level knowledge of Linux system administration, scripting, troubleshooting, and performance tuning.
- Deep understanding of networking concepts, protocols, and firewalls.
- Strong experience in designing and implementing disaster recovery and backup strategies.
- Proficiency in managing and optimizing database technologies.
- Extensive experience with Kubernetes (both cloud and bare-metal deployments), including deployment, scaling, and management.
- Proficiency in scripting and automation (e.g., Python, Bash, Ansible, or similar).
- Experience with CI/CD pipelines and tools.
- Strong problem-solving skills and the ability to thrive in a fast-paced environment.
- Excellent communication and collaboration abilities.
Did you find something suspicious?
Posted by
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1586532
Interview Questions for you
View All