Posted on: 12/02/2026
Description :
Company Profile :
LSEG (London Stock Exchange Group) is a world-leading financial markets infrastructure and data business.
We are dedicated, open-access partners with a commitment to excellence in delivering services across Data & Analytics, Capital Markets, and Post Trade.
Backed by three hundred years of experience, innovative technologies, and a team of over 23,000 people in 70 countries, our purpose is driving financial stability, empowering economies, and enabling customers to create sustainable growth.
Role Profile :
We are looking for a highly skilled Team Lead, Site Reliability Engineering within our Corporate Technology (CT) division.
This key role leads our External Digital team responsible for the administration and management of various mission-critical applications.
As a Team Lead, you will guide a team of engineers in ensuring the high availability, performance, and security of our application infrastructure.
The ideal candidate will bring strong technical expertise, excellent leadership skills, and a passion for reliability engineering.
Role : Lead, Site Reliability Engineering Corporate Technology
Key Responsibilities :
- Collaborate with development and infrastructure teams to design, implement, and maintain scalable, secure environments.
- Monitor, troubleshoot, and resolve system performance issues, providing root cause analysis for incidents and outages.
- Automate tasks and optimize processes using scripting languages such as Shell, Python, and PowerShell to improve system efficiency.
- Develop and maintain disaster recovery plans, ensuring regular testing and drills.
- Oversee application deployments, configuration changes, and system maintenance (patches, upgrades).
- Provide technical mentorship and guidance to team members, fostering professional development.
- Maintain documentation for system configurations, processes, and procedures.
- Collaborate with vendors, partners, and Adobe support to resolve platform issues and ensure smooth integrations.
- Participate in incident, change, and problem management processes, ensuring adherence to established protocols.
- Stay up to date with industry trends and technologies, ensuring that LSEG's infrastructure remains competitive and secure.
- Be available for escalations raised via on-call support members, when necessary, as part of a weekly 24x7 on-call rotation.
Technical Skills :
- 6+ years of hands-on experience designing, deploying, and managing cloud infrastructures, primarily in AWS, including VPC design, IAM policies, EC2/ECS/EKS management, CloudFormation, and cost optimization.
- 6+ years of experience in Linux systems engineering, with deep expertise in performance tuning, system hardening, high availability, and security best practices across large-scale distributed environments.
- Strong leadership in implementing and maintaining highly reliable, scalable, and observable cloud architectures, with deep understanding of SRE principles: SLIs, SLOs, error budgets, and incident response protocols.
- Extensive experience in automation and Infrastructure-as-Code (IaC) using tools like Terraform, Ansible, and CloudFormation, enabling consistent and repeatable infrastructure provisioning.
- Advanced scripting and automation skills with Python, Bash/Shell, and PowerShell, driving efficiency in operations, system automation, and custom tooling.
- In-depth knowledge of DevOps practices and CI/CD pipelines, including version control (Git), CI tools (Jenkins, GitLab CI), artifact management (Nexus), and continuous testing and deployment strategies.
- Experience in monitoring, observability, and alerting using platforms such as Prometheus, Grafana, Datadog, New Relic, and Splunk, with a focus on proactive issue detection and root cause analysis (RCA).
- Strong proficiency in web technologies and middleware, including Java-based applications, Apache/Nginx, HTTP protocols, caching strategies, and performance tuning.
- Hands-on experience with DNS management, SSL/TLS/PKI infrastructure, and ITSM and collaboration tools such as Jira, Confluence, and ServiceNow to manage changes, incidents, and knowledge.
- Proven track record of leading cross-functional SRE and DevOps teams, driving operational excellence, implementing reliability best practices, and mentoring engineers.
- Good to have: Experience with Adobe Experience Manager (AEM/AEM-admin), supporting content management system administration, deployment, and troubleshooting in enterprise environments.
Preferred Qualifications :
- Strong organizational, time management, and problem-solving skills.
- Ability to work independently and collaboratively in a fast-paced environment.
- Strong communication skills in English, both written and spoken.
- Knowledge of financial markets and industry trends is a plus.
- Relevant AWS or AEM certifications are a plus.
Education :
- BS/MS in Computer Science, Software Engineering, or a related STEM field (preferred)
Posted in: DevOps / SRE
Functional Area: Site Reliability Engineering
Job Code: 1612223