Posted on: 05/08/2025
About the Job :
You will monitor critical systems, respond to incidents, and maintain hardware to support continuous service availability.
You will work closely with IT teams and vendors to troubleshoot and resolve issues, maintain documentation, and ensure compliance with security and operational standards.
Key Responsibilities :
- Continuously monitor servers, storage arrays, network devices, and other infrastructure components using monitoring platforms such as Nagios, Zabbix, SolarWinds, or equivalent.
- Respond promptly to alerts and system outages, performing initial diagnosis and troubleshooting.
- Escalate complex issues to senior engineers or appropriate teams with detailed logs and diagnostics for timely resolution.
- Perform installation, configuration, and deployment of operating systems (Linux, Windows Server, VMware ESXi, etc.) on new and existing servers.
- Set up and configure hardware including servers, switches, firewalls, storage devices, and other network components in racks.
- Ensure that proper physical installation practices, such as racking, cabling, labeling, and grounding, are strictly followed.
- Diagnose hardware issues and coordinate with Original Equipment Manufacturers (OEMs) and support vendors to resolve faults under warranty or maintenance contracts.
- Manage call logs and communication with OEM support teams until issues are resolved.
- Perform regular preventive maintenance and health checks to minimize downtime.
- Maintain strict cable management standards and cleanliness within the data center to optimize airflow and reduce the risk of hardware damage.
- Monitor environmental factors like temperature, humidity, and power supply to ensure they remain within recommended thresholds.
- Adhere to data center access policies, ensuring the security and safety of all personnel and assets.
- Maintain accurate records of all hardware assets including serial numbers, warranty status, physical location, and configuration details.
- Manage inventory of spare parts, tools, and consumables required for data center operations.
- Track and report on asset lifecycle and coordinate procurement or replacement as needed.
- Create and update detailed documentation including rack layouts, cable maps, network diagrams, and operational procedures.
- Produce regular status reports on data center performance, incidents, and maintenance activities for IT management and stakeholders.
Qualifications & Skills :
- Bachelor's degree in Computer Science, Information Technology, or a related field preferred.
- Proven experience in data center operations, server administration, or network infrastructure management.
- Hands-on experience with monitoring tools such as Nagios, Zabbix, SolarWinds, or similar.
- Knowledge of server operating systems (Linux distributions, Windows Server, VMware ESXi).
- Strong troubleshooting skills for hardware and network issues.
- Familiarity with OEM hardware support processes and warranty management.
- Good organizational skills with attention to detail for documentation and asset tracking.
- Ability to follow and enforce strict security and access policies.
- Excellent communication skills for coordinating with teams, vendors, and stakeholders.