HamburgerMenu
hirist

Job Description

Job description :

We are looking for a skilled and experienced Hadoop Administrator to join our technology team. The ideal candidate will be responsible for the installation, configuration, monitoring, and maintenance of Hadoop clusters. You will work closely with data engineers, software developers, and system administrators to ensure the performance, availability, and security of our big data infrastructure. As a Hadoop Administrator, you will manage the day-to-day operations of Hadoop ecosystems including HDFS, YARN, Hive, HBase, and other related technologies. You will be expected to troubleshoot issues, optimize performance, and implement best practices for data storage and processing. Your role will also involve capacity planning, backup and recovery strategies, and ensuring compliance with data governance policies. You should have a strong understanding of distributed computing principles and be familiar with Linux system administration. Experience with automation tools, scripting languages, and cloud platforms is highly desirable. You will also be responsible for implementing security measures such as Kerberos authentication and encryption protocols to protect sensitive data. This role requires excellent problem-solving skills, attention to detail, and the ability to work independently as well as part of a team. You will be expected to stay up-to-date with the latest developments in big data technologies and contribute to continuous improvement initiatives within the organization. If you are passionate about big data and have a strong background in system administration and Hadoop technologies, we encourage you to apply and become a key player in our data-driven environment.

Role : Big Data Administrator

Experience : 5+ years

Location : Pune

NOTE : Immediate joiners

Key Skills : Hadoop ecosystem (HDFS, YARN, Hive, Impala), Cloudera/HDP administration, Linux, security, monitoring

Responsibilities :


- Administer and monitor big data infrastructure & jobs, manage user access, data storage, and cluster health; implement security controls and performance tuning; work with data engineers to support data lake operations.


- Administration of Bigdata ecosystem.


- Hadoop Cluster configuration and security.


- Active member or contributor to open-source Apache Hadoop projects.


- Capacity planning, performance tuning.


- Scripting background in automation, configuration management.


- HDFS maintenance and support.


- Onboarding users onto the platform.


- Supporting deployments of parcels.


- Analysis and optimization of workloads, performance monitoring, tuning, and automation.

- Ability to support large-scale Production Hadoop environments in any of the Hadoop distributions.

- Proficiency in Designing, Capacity planning, and cluster setup for Hadoop.

- Experience with RDBMS technologies and SQL language; Oracle, SQL Server, and MySQL.

- Good understanding of the workings of Apache Sentry, SSSD or Centrify or Quest QA, and AD linkage to HDFS.

- Experience in any of the Scripting Language (Perl, Shell, Python)

- Product knowledge of Hadoop distributions such as Cloudera, Hortonwork. or MapR.

- Administration, maintenance, control, and optimization of Hadoop capacity, security, configuration, process scheduling, and errors.

- Extensive Linux skills.

- Knowledge of NoSQL platforms.

- Addressing challenges of query execution across a distributed database platform on modern.

- Proficiency with at least one of the following : Java, Python, or Perl.

- Experience in tool Integration, automation, configuration management in GIT, Jira platforms.

- Excellent oral and written communication, presentation skills, analytical and problem-solving skills.

- Self-driven, ability to work independently and as part of a team.

- Heavily process-oriented and good communication skills.

- Development or administration of any NoSQL technologies.

- Development/scripting experience on Configuration management and provisioning tools e.g., Puppet, Chef.

- Development, Implementation, or deployment experience on the Hadoop ecosystem (HDFS, MapReduce, Hive, Hbase)

- Cloudera Hadoop Administrator Certification is preferred.


info-icon

Did you find something suspicious?