HamburgerMenu
hirist

Solytics Partners - Platform Engineer - Cloud Infrastructure

Posted on: 23/01/2026

Job Description

Role Summary :

Solytics Partners is seeking a high-caliber Senior Cloud / Platform Engineer to lead the management and operations of our award-winning, AWS-based analytics infrastructure.


In this role, you will act as the "Infrastructure Architect" for our multi-account AWS environment, ensuring high availability and security for platforms driven by AI, Machine Learning, and LLMs.


You will take full ownership of our Amazon EKS clusters, driving deployment excellence through ArgoCD (GitOps) and GitLab CI/CD. The ideal candidate is a technical expert in cloud-native networking, Kubernetes orchestration, and observability, with a "Security-First" mindset to satisfy global regulatory and compliance standards.

Responsibilities :

- Multi-Account AWS Governance: Manage and operate a complex AWS ecosystem (Product, Management, Internal accounts), designing robust VPC architectures, subnets, and secure VPC Peering frameworks.

- Kubernetes (EKS) Orchestration: Administer Amazon EKS clusters, overseeing the health of Nginx Ingress controllers, web/worker pods, and core platform services.

- GitOps & CI/CD Automation: Maintain and optimize deployment pipelines using GitLab CI/CD and ArgoCD, ensuring environment consistency and enabling rapid, reliable release rollbacks.

- Cloud Networking & Connectivity: Configure AWS ELB/ALB for traffic distribution and manage secure connectivity via VPN solutions (Arachnys) for cross-account and partner integrations.

- Data & Storage Management: Monitor and scale Amazon RDS (MySQL), ElastiCache (Redis), and Amazon S3, ensuring high performance and data durability for analytics workloads.

- Observability & Incident Response: Implement comprehensive monitoring using Prometheus for metrics and Splunk for centralized logging; lead root cause analysis (RCA) for production incidents.

- Security & IAM Enforcement: Drive cloud security through IAM least-privilege policies, secure secrets management, and strict network isolation to protect production data.

- Scalability & Cost Optimization: Continually audit infrastructure to improve resilience and optimize AWS spend without compromising on performance or stability.

- Cross-Functional Engineering Support: Collaborate with Data and DevOps teams to provide controlled, secure access and support the lifecycle of containerized AI/ML applications.

- Compliance & Regulatory Support: Ensure all infrastructure meets global audit requirements, safeguarding financial systems through rigorous access controls and encryption.

Technical Requirements :

- AWS Mastery: 58 years of experience managing multi-account AWS environments, with expert knowledge of VPC, IAM, RDS, and S3.

- Kubernetes Expertise: Hands-on experience administering Amazon EKS, including ingress management, pod autoscaling, and cluster resilience.

- Automation Toolkit: Proficiency in ArgoCD (GitOps) and GitLab CI/CD pipelines; experience with Infrastructure as Code (Terraform/CloudFormation) is highly preferred.

- Networking & Security: Strong understanding of ELB/ALB, VPC peering, VPN configurations, and network ACLs.

- Observability Stack: Professional experience with Prometheus and Splunk for monitoring and logging in a production environment.

Preferred Skills:

- AI/ML Infrastructure: Experience supporting platforms that deploy Generative AI or Large Language Models (LLMs).

- Database Tuning: Ability to optimize RDS MySQL and Redis for high-velocity data analytics.

- Scripting: Proficiency in Python or Bash for automating routine operational tasks and infrastructure health checks.

Core Competencies:

- Analytical Troubleshooting: A methodical approach to resolving complex infrastructure bottlenecks and system failures.

- Ownership & Reliability: A "Production-First" mindset, taking responsibility for the uptime and security of critical financial systems.

- Collaborative Leadership: Ability to work effectively in a hybrid environment, aligning with Engineering and Data teams to drive platform innovation.

- Structured Thinking: Capability to deconstruct complex regulatory requirements into technical security controls.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in