Senior Site Reliability Engineer

Coventry, ENG, GB, United Kingdom

Job Description

Welcome to Moneycorp



We're delighted you're interested in being a part of Moneycorp.
In the last decade, Moneycorp has transformed from a largely domestic, consumer-focused provider of foreign exchange to an end-to-end global payments' ecosystem.
With two banking licenses and operations across the entire value chain of the international payments and foreign exchange sectors, we enable businesses, institutions, and individuals to thrive beyond borders.
We help our clients realise their growth ambitions by providing them with worldwide reach, relentless regulatory excellence, and tailored, relevant solutions that resiliently optimise their financial operations.
We're fervent about pursuing our goals, making substantial contributions to the payments industry, and consistently offering unwavering support to our clients at every stage of their journey.
Moneycorp is a place where energy, commitment to our shared success and collaboration are core to our DNA. We're restless in our drive to surpass the expectations of our clients and unlock opportunities to support them at every stage of their journey.
The foundation of our success is our people, and nurturing a culture of belonging for all of our colleagues is central to our journey as a global business.

Find out more about Moneycorp's offering, global footprint and capabilities here:
About Us | moneycorp

Your Next Challenge



Our Technology Journey:
We're at an exciting stage in our evolution. Having built strong foundations in traditional infrastructure and networking, we're now moving towards a cloud-native future -- re-imagining how we design, build, and run platforms that scale with the business. This is more than a technology shift -- it's a strategic transformation. We're modernising core services, adopting automation and DevOps practices, and building resilient, secure platforms ready for the future.

Why This Matters For You:
Joining us now means you'll help shape our direction, not just maintain it. You'll influence how we evolve from IaaS to cloud-native, work with modern technologies, and contribute to a collaborative team driving change. This journey will bring challenges, but with challenge comes opportunity -- for innovation, for growth, and for making a lasting impact.

As a Senior Site Reliability Engineer, you'll play a key role in shaping the future of our payments and FX platforms. You'll lead the transformation of legacy systems into modern, cloud-native architectures, tackling complex challenges around scale and resilience. This is your opportunity to work with cutting-edge cloud technologies, influence strategic reliability initiatives, and make a real impact on how we deliver secure, high-performing services. You'll have ownership of reliability standards, including SLO governance, resilience testing, and platform patterns, ensuring our systems meet the highest levels of operational resilience and regulatory compliance

Key Responsibilities:




Reliability Engineering & ObservabilityDefine and maintain SLOs/SLIs and error budgets for critical services Build and improve observability pipelines (metrics, logs, traces) Maintain dashboards for golden signals Develop incident runbooks and lead post-incident reviews Approve SLO/SLI targets for Tier-1 services

Proactive Monitoring, Capacity & PerformanceImplement anomaly detection and predictive monitoring Forecast capacity for cloud and IaaS workloads Optimize systems for throughput and latency Resolve performance issues using telemetry

Automation, DR & Resilience TestingAutomate backup, restore, and failover processes Validate RTO/RPO through regular DR testing Design and run chaos engineering experiments Enhance self-healing and rollback automation

Operational Excellence & Incident LeadershipLead SEV-1/SEV-2 incidents and authorize critical decisions Drive root cause analysis and permanent fixes Eliminate toil through automation Standardize reliability practices across teams

Risk, Compliance & Service MappingMap dependencies for key business services Conduct scenario-based resilience testing Sign off on resilience results and compliance evidence Support third-party resilience assessments

Refactoring & ModernisationIdentify and address platform reliability issues Prioritize and approve reliability-driven refactors Engineer modern replacements (e.g., containerisation, service mesh) Lead migrations with measurable reliability outcomes

Skills, Qualifications and Experience Required:

Site Reliability Engineering: 7+ years in SRE, platform, or systems roles with production ownership of high-availability, low-latency platforms. Cloud Platforms (Azure): Deep experience with Azure services including IaaS, ASE, AKS/ARO, VNets, App Gateway, Azure SQL/SQL Managed Instance/On-Prem SQL, Service Bus, Event Hubs, Kafka and Key Vault. Secure-by-Design: Strong background in architecture governance, design reviews, and change management. Demonstrated expertise in security-by-design, Zero Trust principles, and compliance with regulatory frameworks. Infrastructure as Code (IaC): Proven use of Terraform for modular infrastructure design, policy enforcement, and environment provisioning. CI/CD Pipelines: Experience with Azure DevOps and GitHub Actions for automated build, test, and deployment workflows. Hands-on experience with infrastructure as code (Terraform/Bicep), CI/CD pipelines, and automation. Observability & Monitoring: Hands-on with Prometheus, Grafana, OpenTelemetry, and log aggregation tools; building dashboards and alerting policies. Knowledge of observability and reliability engineering (SLOs, error budgets, monitoring, AIOps). Experience with FinOps practices, cost optimization, and cloud commercials (EA, reservations, savings plans). Incident Management: Leading SEV-1/SEV-2 incidents, conducting post-mortems, and driving root-cause elimination. Disaster Recovery & Resilience Testing: Designing and validating RTO/RPO targets, executing chaos engineering experiments, and automating recovery. IaaS & OS Engineering: Strong background in Windows Server (2019/2022/2025) and Linux (RHEL/Ubuntu) across Azure IaaS. Payments & FX Platforms: Familiarity with payments orchestration, FX workflows, and platform refactoring to improve scale and resilience. Operational Resilience: Understanding of UK regulatory expectations (FCA/PRA) including impact tolerances, service mapping, and scenario testing. Track record in incident management, DR/BCP testing, and resilience planning.

Education:

Bachelor's degree in Computer Science, Engineering, or a related technical discipline, or equivalent hands-on experience in platform engineering and reliability roles. Any of the following certifications would be advantageous (not mandatory): + Microsoft Azure: AZ-104 (Administrator), AZ-400 (DevOps), AZ-700 (Networking)
+ Kubernetes: Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)
+ HashiCorp: Terraform Associate

Interested?


If the role sounds like you, we invite you to upload a copy of your CV and can do this by clicking on the Apply Now button

Fostering a culture of belonging and inclusivity


We're committed to creating a workplace where every individual feels valued, respected, and included. As an Equal Opportunity Employer, we actively cultivate an inclusive culture where diversity thrives, and we empower our colleagues to drive meaningful change within our organisation through initiatives like our DE&I focus groups and value champion network.
Like many of our peers, we recognise that fostering inclusivity is an ongoing journey, and we remain steadfast in our commitment to progress. By measuring our efforts through regular assessments and listening to the feedback of our employees, we strive to ensure that our initiatives are impactful and responsive to the evolving needs of our workforce.
Together, we want to build a workplace where everyone can bring their authentic selves to work, as we believe this is the foundation of innovation, creativity, and collective success.

Connect with us


For company news, announcements and market insights, visit our News Hub.
You can also find Moneycorp on Facebook, Twitter UK, Twitter Americas, Instagram, LinkedIn, where you can discover how we are leading the way in global payments and currency risk management.

Beware of fraud agents! do not pay money to get a job

MNCJobs.co.uk will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3974252
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Full Time
  • Job Location
    Coventry, ENG, GB, United Kingdom
  • Education
    Not mentioned