Manila · Hybrid

Senior Site Reliability Engineer (Hybrid-Flexible Options)

Role Overview

We are seeking a dynamic Senior Site Reliability Engineer (SRE) to lead the design, implementation, and operational support of our hybrid environments, spanning on-premises, private cloud, and public cloud platforms. This role will be pivotal in setting the foundation and strategy for our SRE practices while driving their implementation across the organization. The ideal candidate will combine technical expertise with leadership skills to guide our team on the SRE journey and ensure our environments are scalable, reliable, and secure.

Responsibilities

SRE Best Practices Implementation: Lead the rollout of SRE best practices, including error budgeting, service level objectives (SLOs), service level indicators (SLIs), and monitoring and alerting systems.
Automation and Efficiency: Develop and implement automation tools and processes to improve the reliability, scalability, and efficiency of our systems and services.
Incident Management: Respond to system outages and emergencies, participate in incident calls, and provide root cause analysis to prevent future occurrences.
Capacity Planning and System Design: Ensure infrastructure can handle increasing traffic and workloads through proactive capacity planning and system design.
Collaboration: Work with cross-functional teams, including application development, architecture, DevOps, quality engineering, and vendor teams, to align on solutions and operational standards.
Observability and Monitoring: Implement and optimize observability tools, such as Datadog, Splunk, and CloudWatch, to provide actionable insights into system performance and health.
Technical Leadership: Lead technical design sessions, set expectations for onshore and offshore SRE team members, and mentor junior associates.
Operational Governance: Manage vulnerabilities, end-of-life issues, and non-functional requirements (NFRs) within products and platforms.
Strategic Contributions: Define technical standards for infrastructure, automation, operational processes, and tooling to align with the organization’s long-term vision.

Your profile

Technical Expertise:
- Advanced knowledge of cloud platforms (AWS, Azure, private cloud) and on-premises environments.
- Hands-on experience with automation tools like Terraform, Ansible, Chef, Puppet, and Jenkins.
- Proficiency in scripting languages such as Python, Shell, and PowerShell.
- Experience with containerization technologies (Docker, Kubernetes) and middleware (databases, web servers, MQ, Kafka).
- Strong background in Linux and/or Windows systems administration and networking fundamentals.
SRE Practices:
- Demonstrated experience implementing SLOs, SLIs, and observability tools.
- Knowledge of error budgeting and incident management processes.
- Proven ability to troubleshoot complex technical issues and perform root cause analysis.
Soft Skills:
- Ability to work independently and proactively.
- Capable of engaging with global teams across different time zones.
- Strong leadership skills to empower and inspire others.
- Excellent written and verbal communication skills.
- Collaborative mindset with the ability to mentor and inspire team members.
- Ability to prioritize and adapt in a fast-paced environment

Preferred Qualifications:

Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
7 to 10 + years of experience in site reliability engineering or related roles.
Familiarity with microservices architecture and modern design patterns.
Practical experience with tools like Datadog, CloudWatch, CloudTrail, and Splunk.
Proven track record in leading teams and driving organizational change related to SRE

Locations: Manila
Remote status: Hybrid

Apply for this job

Contact Khryztyn Joyce Areta Talent Aquisition Specialist – Human Resources

Manila

Diversity, Equity and Inclusion

We are dedicated to fostering a diverse, equitable, inclusive, and healthy environment. As a leading provider of technology, communications, and data and analytics solutions to businesses around the world, it is critical that we understand, embrace, and operate in a multicultural environment. Every associate has unique strengths, which, when fully appreciated and embraced, allow individuals to perform at their best, leading to our success.

Our goal is to ensure our associates at every level of the organization represent the diversity of the clients we serve and the communities in which we work. We pursue both top-down and bottom-up approaches to advancing diversity, equity, and inclusion initiatives and values into our culture. This is reflected in the varying backgrounds of our over 13,000 associates working in 21 countries around the globe.

Learn about our DEI Program here.

About Broadridge

Broadridge Financial Solutions (NYSE: BR), a global Fintech leader with $5 billion in revenues, provides the critical infrastructure that powers investing, corporate governance, and communications to enable better financial lives.

Itiviti was acquired by Broadridge in May 2021 and is now Broadridge Trading and Connectivity Solutions. Our combined offering enables simplification and streamlining of all front office, middle office, and post-trade functions — powering connectivity and multi-asset trading across global markets.

Already working at Broadridge?

Let’s recruit together and find your next colleague.

Senior Site Reliability Engineer (Hybrid-Flexible Options)

Open positions

Manila

Diversity, Equity and Inclusion

About Broadridge

Already working at Broadridge?