Posted on: 29/08/2025
We are seeking a Cloud Databricks Administrator to join our Cloud Engineering team.
In this role, you will monitor and maintain Databricks jobs, troubleshoot issues, ensure platform reliability, and manage access controls.
You will play a key part in operational support, defect management, and proactive issue resolution to keep our data pipelines running smoothly.
Key Responsibilities :
- Identify, log, and track defects/bugs through ticketing systems, ensuring timely resolution.
- Manage Databricks access via Azure AD groups with Admin, Edit, and Read permissions.
- Provide production support for Databricks environments, including cluster operations, job failures, and notebook troubleshooting.
- Collaborate with data engineers and platform teams to resolve platform-related incidents and performance bottlenecks.
- Proactively monitor system health, resource utilization, and performance metrics.
- Implement and enforce archival/retention policies for Databricks storage to optimize costs and performance.
- Support CI/CD pipelines (Jenkins, Azure Automation) and automate repetitive operational tasks.
- Maintain technical documentation, SOPs, and runbooks for Databricks operations.
- Ensure security compliance with RBAC, MFA, and encryption best practices.
Preferred Qualifications :
- Strong knowledge of Azure (AWS/GCP acceptable) and cloud-native services.
- Experience in SRE or production support environments with SLAs and ticketing systems
(ServiceNow, Jira).
- Proficiency in Python or Scala for data processing and automation.
- Familiarity with Power BI/Tableau for building monitoring and cost dashboards.
- Knowledge of CI/CD tools, version control (Git), and scripting languages (PowerShell, Bash).
- Understanding of cloud cost optimization and usage tracking.
- Excellent problem-solving skills, communication, and cross-team collaboration abilities.
Nice to Have :
- Background in monitoring, logging, and incident response for data platforms.
- Exposure to Kubernetes, Docker, and Terraform.
Required Skills :
- 5+ years in data engineering, cloud operations, or database administration.
- Proven ability to troubleshoot, communicate effectively, and collaborate across teams in a
fast-paced environment.
Did you find something suspicious?
Posted By
Posted in
Data Engineering
Functional Area
Systems Administration
Job Code
1537811
Interview Questions for you
View All