Senior Site Reliability Engineer (Disaster Recovery) (#4775)

REFERRAL BONUS
$1000
Europe, South America
Work type:
Office/Remote
Technical Level:
Senior
Job Category:
Software Development
Project:
Leading platform for electronic agreements

Our client is committed to building trust and making the world more agreeable for our employees, customers, and communities. Here, you have the opportunity to be heard, exchange ideas openly, contribute meaningfully, and be proud of your work as part of a team making a global impact.

Be part of Cloud Engineering, a team of specialists delivering resilient, scalable cloud solutions utilizing Microsoft Solutions Framework and cutting-edge automation.

Project Overview:

Seeking an engineer for Disaster Recovery and resource optimization of global clusters in Azure. You’ll focus on automation, rightsizing, and disk management using Terraform and MSF methodologies to support critical workloads.

Key Responsibilities:

  • Lead disaster recovery efforts for Azure clusters, applying MSF principles.
  • Analyze infrastructure utilization and proactively recommend rightsizing and tier-down of resources.
  • Implement and automate deployments with Terraform, focusing on managing persistent and cached disks.
  • Coordinate cross-region DR strategies that are aligned with best practices for reliability and cost management.
  • Collaborate with cross-functional partners and maintain strong documentation.

Requirements:

  • 5+ years of professional development experience using C#, Java, or C++ (or similar high-level languages) to build scalable, production-grade applications.
  • Proven expertise in Terraform and the Microsoft Solutions Framework (MSF) to architect, deploy, and govern complex cloud environments.
  • Extensive experience operating and maintaining large-scale clusters distributed across multiple Azure regions, ensuring high availability and low latency.
  • Demonstrated track record of storage optimization, including the ability to manage disks cached in various ways.
  • Expert-level ability to implement right-sizing strategies and tier down resources to maximize cost-efficiency without compromising performance.
  • Proven success leading Disaster Recovery (DR) projects, from designing failover architectures to executing recovery drills in complex, multi-region environments.


We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

×

Easy apply

    or
    Refer a friend