HamburgerMenu
hirist
showcase-imageshowcase-imageshowcase-image

Job Description

Note : Women Candidates Preferred


Shift Timings : 5 PM - 2 AM


Primary skills : AWS, Terraform, DevOps


Secondary skills : Linux and Windows

Primary Responsibilities :


Incident response :


- Troubleshoot and resolve complex cloud operations issues related to Provided Cloud Services (e.g., EC2, S3, Lambda, VPC networking, Security Groups, and IAM permission errors) that have been escalated by Tier 1.

Collaboration and communication :

- Collaborate with internal teams (Data Engineering, Compute Engineering, Network Engineering) to provide development teams guidance on best practices and resolve technical issues.

- Assist in the development of Internal Customer communication channels for effective communication of change events and platform activities.

Continuous improvement :


- Participate in a feedback loop with Tier 1 - > Tier 3 SMEs to identify common issues in the environment that can be redesigned for efficiency leveraging industry-standard improvement frameworks like Kaizen, PDCA etc.

Documentation :


- Maintain comprehensive documentation of cloud infrastructure, configurations, processes, and procedures to ensure knowledge sharing and team collaboration.


- Contribute to Knowledge Base documentation for Tier 1 engineers and development teams.

Additional Responsibilities :


- Backup and disaster recovery : Platform DR drills and support application teams with Disaster Recovery planning and execution.

- Cloud migration assistance for workloads moving to our cloud environment.

- Security and compliance : Enforce security configurations, best practices, and compliance requirements to Cloud resources at scale.

- Configuration management : Remediate infrastructure configuration and manage changes to maintain

consistency across environments.

- Assist in the deployment of predetermined cloud resources, including :


1. Cloud Network resources, Security Groups, Security Group Rules, Resource Tags, S3 Buckets & Lifecycle policies, VPC, VPC endpoints, and policies.

Skills :


- Advanced understanding of AWS services (EC2, S3, Lambda, VPC, Networking, Elastic Load Balancers, IAM, RDS) e.g., AWS Certified DevOps Engineer

- Experience with infrastructure as code (IaC) tools and operational knowledge of scripting languages like Python.

- Strong analytical, problem-solving, and communication skills

- Previous system administration and support experience (Linux and Windows operating systems; experience with mixed environments a plus).

Key Performance Indicators :


- Mean Time to Resolve (MTTR) for complex incidents.

- First Call Resolution (FCR) rate, Percentage of issues resolved directly by Tier 2 engineers without escalation.

- Percentage of process improvements implemented through feedback loop with Tier 3 SMEs.

The job is for:

Women candidates preferred
info-icon

Did you find something suspicious?

Similar jobs that you might be interested in