Job Description

About Smarsh :

Smarsh empowers over 6,500 regulated organizations worldwide to manage digital risk and unlock business intelligence through comprehensive capture, archiving, supervision, and e-discovery of digital communications. With integrations across 80+ communication platforms and deep domain expertise, Smarsh enables proactive compliance, legal readiness, and operational insight.

Recognized by Gartner and Forrester as a consistent leader in the space, Smarsh is a dynamic, fast-growing company featured on the Inc. 5000 list of fastest-growing private companies every year since 2008.


Role Overview :

We are seeking a highly skilled Cloud Platform Engineer (Networks) to lead the design, implementation, and optimization of our global cloud and hybrid network infrastructure. This role demands expertise in cloud networking, infrastructure automation, security architecture, observability practices, and hybrid connectivity strategies.

The ideal candidate will have hands-on experience with large-scale, production-grade network systems in AWS, with a focus on reliability, scalability, and performance optimization. This position will play a pivotal role in shaping the platform foundation that supports Smarsh's cloud-native services and mission-critical workloads.


Key Responsibilities :

1. Cloud & Hybrid Network Architecture :


- Architect and implement resilient, high-performance network topologies across AWS and hybrid environments.

- Design VPC architectures including subnets, routing tables, NAT gateways, and Transit Gateways.

- Configure and manage critical AWS networking services: Route 53, Direct Connect, Site-to-Site VPN, PrivateLink, and Global Accelerator.

- Ensure multi-tenant network segmentation, service mesh integration, and resource isolation.

- Apply advanced security controls using Security Groups, NACLs, and IAM policies aligned with compliance frameworks (SOC 2, ISO 27001, etc.).

- Lead efforts in DNS/IP address planning (IPAM), naming conventions, and network standardization.

- Develop high-availability and disaster recovery designs for critical cloud networking components.

- Participate in incident response for security and availability incidents related to network infrastructure.

2. Infrastructure Automation & DevOps :


- Build and maintain Infrastructure as Code (IaC) with Terraform and Terragrunt.

- Establish and manage GitOps pipelines integrating Git, Terraform Cloud, Concourse CI, and AWS CodePipeline.

- Automate provisioning, scaling, and configuration of network resources across environments.

- Integrate security and compliance validation into CI/CD workflows.

- Develop scripts and tooling using Python, Bash, or Go to automate network maintenance, monitoring, and diagnostics.

- Implement backup and disaster recovery strategies for network configuration state and critical routing policies.

3. Observability & Operations :


- Design and implement network monitoring and alerting with Prometheus, Datadog, and CloudWatch.

- Create dashboards and metrics for traffic patterns, error rates, performance bottlenecks, and anomalous behaviors.

- Develop and maintain runbooks for common operational scenarios, failovers, and network troubleshooting.

- Participate in the on-call rotation for critical platform services.

- Lead root cause analysis (RCA) and postmortems for high-severity platform/network incidents.


Required Skills and Experience :


- Deep expertise in AWS networking (VPC, TGW, Direct Connect, Global Accelerator, Route 53, NAT Gateways).


- Strong hands-on experience with hybrid cloud connectivity, including VPNs, SD-WAN, and cross-region networking.


- Expertise in DNS, DHCP, IPAM, load balancing, firewalls, and traffic shaping.

- Proficient in Terraform, GitOps, and Infrastructure as Code methodologies.

- Understanding of microservices integration, service discovery, and API gateways in a Kubernetes-native ecosystem.

- Knowledge of Kubernetes, Docker, Service Mesh (Istio, Linkerd), and related cloud-native technologies.

- Proficiency in Python, Shell scripting, or Go for automation.

- Experience with OAuth 2.0, OpenID Connect, and Keycloak for authentication and access control.

- Familiarity with rate limiting, traffic control systems, and content delivery optimization.

Tools & Platforms :

- Git, Terraform, Concourse CI, Jenkins

- Prometheus, Grafana, Datadog, CloudWatch

- AWS (VPC, EC2, EKS, Lambda, IAM, Route 53, etc.)

- Keycloak or similar identity providers

- Kubernetes, Docker

Development & Operational Practices :

- Test-driven infrastructure development

- Continuous Integration/Deployment (CI/CD)

- Version control and branching strategies

- Infrastructure performance optimization

- Technical documentation and architecture diagrams

Education & Certifications :

- Bachelor's or Master's degree in Computer Science, Information Systems, or a related technical field.

- Minimum of 6 years of experience in platform engineering, cloud network engineering, or SRE roles.

- Preferred : AWS Certified Advanced Networking - Specialty.

