Posted on: 02/11/2025
Job Duties and Responsibilities
- Perform effective Incident Management from incident start through resolution, partner with Development to determine root causes, and drive rigorous Problem Management to follow through on actions
- Proactive issue identification and resolution
- Own the production environment: monitor availability, ensure holistic system health, and handle application deployments
- Triage and remediate issues in production systems
- Resolve issues within the SLA
- Achieve 90% automation and reduce manual intervention
- Be the primary operational support engineer for multiple large, distributed, critical software applications
Skills, Experience, and Requirements
- Engineering degree with 5+ years of experience in Application Support and Automation
- Strong understanding of modern monitoring and logging technologies (Logz.io, CloudWatch, Dynatrace)
- Understand microservice architecture
- Experience in Unix, shell scripting/Python/Java, SQL, AWS, etc.
- Experience in troubleshooting complex applications as well as environmental issues