Posted on: 07/05/2026


Job Description :
Skills Required :
Strong hands-on experience in :
1. Core Java or .NET technologies
2. Linux/Unix command-line operations
Good knowledge of databases including :
1. PostgreSQL
2. MySQL
3. Oracle
Experience working on platforms such as :
1. IBM OpenShift
2. Windows
3. Azure
4. Linux
Hands-on experience with monitoring and observability tools including :
- Dynatrace or equivalent infrastructure monitoring tools
- Splunk or equivalent log monitoring tools
- Experience using ticketing and tracking tools such as:
- ServiceNow or equivalent ticketing platform
- Rally or equivalent tracking tool
Experience working with batch scheduling tools such as :
1. ESP
2. Equivalent enterprise schedulers
Job Description :
- Contribute to the production support and operational stability of assigned applications and business domains.
- Work closely with senior engineers and cross-functional teams to ensure timely incident resolution and smooth day-to-day production operations.
1. Provide technical insights and inferences around :
- System behavior
- Application performance
- Recurring production issues
- Operational stability
2. Support initiatives to maintain :
- High availability
- Reliability
- Resilience across customer-facing and business-critical applications
3. Ensure stable production operations through :
- Active monitoring
- Alert handling
- Proactive issue detection
- Incident tracking
4. Manage the incident lifecycle for P2-P5 issues including :
- Incident triage
- Analysis
- Coordination
- Resolution
- Support P1 incidents under the guidance of senior team members and participate in critical issue resolution activities.
5. Assist in :
- Root Cause Analysis
- Resolution of recurring production issues
- Implementation of preventive and corrective measures
6. Maintain and update :
- Runbooks
- SOPs
- Playbooks
- Knowledge base documentation
7. Support and enhance observability setups including :
- Dashboards
- Alerts
- Logs
- Metrics
8. Collaborate with DevOps and SRE teams on :
- Automation initiatives
- Self-healing scripts
- Reduction of manual operational tasks
9. Provide operational support across :
- L1 activities
- L2 support
- L3 investigations under guidance
- Participate in on-call rotations and adhere to support schedules and workload priorities.
10. Communicate effectively with business and engineering teams regarding :
- Incident updates
- Business impact
- Resolution status
- Operational escalations
Did you find something suspicious?