Posted on: 08/07/2025
About the Role :
You will be the Subject Matter Expert (SME) for the cloud platform and will play a critical role in architecting, upgrading, enhancing, and integrating key services and components. This includes working with containers, orchestration tools, and software-defined storage solutions.
Key Responsibilities :
- Manage and monitor the Red Hat OpenStack and OpenShift environments with a focus on performance, scalability, and availability.
- Lead cloud capacity planning, infrastructure automation, and high-availability strategies.
- Act as the technical lead for architecture discussions, system upgrades, and platform enhancements.
- Design and manage container-based applications using Docker, Kubernetes, OpenShift, LXC, and LXD.
- Automate container deployment and lifecycle management.
- Support microservices-based architecture deployments and upgrades.
- Build and maintain infrastructure-as-code using Ansible for repeatable deployments, configuration management, and automation.
- Participate in CI/CD pipeline creation and integration with cloud services.
- Ensure secure automation practices are implemented and documented.
- Implement and manage Ceph-based software-defined storage (block, object, and file).
- Optimize storage provisioning, monitoring, and troubleshooting performance or availability issues.
- Monitor the health of cloud services and proactively respond to infrastructure and application-level incidents.
- Conduct root cause analysis and performance tuning.
- Provide 24x7 support for production platforms as needed.
- Contribute to the cloud product roadmap by identifying new capabilities, third-party integrations, and
industry best practices.
- Evaluate and implement open-source or commercial solutions that enhance cloud platform features.
- Work closely with internal engineering, product, and operations teams for service evolution.
Required Skills & Qualifications :
- Strong expertise in Red Hat OpenStack Platform (RHOSP) and Red Hat OpenShift.
- Experience with Linux OS administration (RHEL/CentOS/Ubuntu).
- Proficiency with containerization technologies: Docker, Kubernetes, LXC/LXD.
- Experience in infrastructure automation using Ansible.
- Practical experience managing and scaling Ceph storage.
- Understanding of cloud network concepts (SDN, load balancers, firewalls, virtual networks).
- Knowledge of virtualization platforms such as KVM or QEMU.
- Solid experience in performance tuning, patch management, and upgrades.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1509727
Interview Questions for you
View All