Staff Site Reliability Engineer

Technology Operations
Toronto, Ontario, Canada

We shaped the earliest forms of ad tech, and we’re looking for the technical expertise to help shape its future. Our customers have unique problems that can only be solved at internet scale, and that’s where the technical skills of our team make a real difference. 

Our exchange handles over 500 billion requests every day (for comparison, Google serves an estimated 9 billion searches a day), all running in our own global data centers. Every member of our technology team has an enormous amount of autonomy in building and managing our systems to support and enable our growing scale. Through the transparency of our technology, dedication to innovation and integrity, and long-standing customer relationships, we lead through change.

What’s it like to work at Index? 

We have more than 550 Indexers around the globe dedicated to building a safe and transparent marketplace that provides a trusted experience for consumers. 

Index is an exciting and fast-paced place to work. We’re built on our values of change, support, learning and teaching, trust, and intention. We pride ourselves on our independence and openness, not only in our technology, but in our teams, too. Our diverse and inclusive culture celebrates how we can leverage our unique differences to help drive Index forward. 

Our culture of success is truly supportive and collaborative. In working together across our teams, we’re continually investing in the people and technology to solve the industry’s most complex problems. As we extend the promise of ad tech to every channel, we’re looking for talented engineers to help move Index, and the industry, forward.

Are you ready to join the programmatic evolution? 

Index Exchange funds the open web. Content and journalism across the internet are funded through advertising, and we are the engine that helps to make that happen transparently, safely and efficiently. Handling hundreds of billions of auctions per day within milliseconds requires an intense understanding of the exchange and the ecosystem that we live in. 

Our business is growing significantly every year and is poised to grow even faster. Our people and our platforms are the foundation and enabler of that growth. We are significantly expanding our technology teams and are looking for technologists with a passion for high-performance software development and a drive to deliver software products and platforms that enable and empower industries at a global scale.

About The Role:

We are seeking an experienced Staff Engineer with a strong background in Site Reliability Engineering (SRE) to own and develop on-premise and hybrid cloud environments, with a focus on optimizing low-latency performance on Kubernetes platforms that support a robust developer experience framework. The ideal candidate will have a deep technical understanding of on-premise and hybrid cloud environments and a proven track record of managing SRE teams in a global setting.

Index’s scale spans the globe: our transactions happen 24x7 in our global data centers, and every second, millions of requests are evaluated across our exchange. To achieve our mission, global efficiency and reliability are absolutely key, as every millisecond quite literally counts in our business.

Here’s What You’ll be Doing

  • Vision: Have a deep understanding of Index and its products and processes and stay informed on the latest events in the industry, whether product or technology changes. Drive initiatives that produce positive outcomes across divisions.
  • Project Management: Act as a technical leader on projects, designing architectures that meet the needs of the business and align with the existing architectural vision. Collaborate with subject matter experts and with a network of peers to ensure on-time, quality delivery.
  • Technical Leadership: Using a deep understanding of on-premise and hybrid cloud environments, collaborate with engineering teams and lead initiatives cross-functionally to architect innovative solutions that enhance our observability capabilities.
  • Operational Excellence: Drive operational excellence through proactive monitoring, automation, and the development of robust incident management processes.
  • Software Engineering Skills: Collaborate with software engineering teams to implement SRE best practices in the software development life cycle, including designing scalable and resilient systems.
  • Incident Management: Lead incident response efforts, ensuring rapid resolution and post-incident analysis to prevent recurrence. Maintain incident reports and contribute to continuous improvement.
  • Reporting and Metrics: Develop and maintain meaningful performance metrics and reporting mechanisms to track the health and reliability of our systems. Use data-driven insights to guide decision-making and triaging.
  • Global Scale: Manage SRE operations at global scale, considering regional nuances and ensuring consistent, reliable service delivery across geographies.

Here’s What You Need

  • Proven experience (6+ years) in SRE roles, with a focus on low-latency, global-scale environments built on upstream Kubernetes.
  • Strong software engineering skills, including proficiency in programming languages such as Golang, Python, and Perl.
  • Excellent understanding of on-premise and hybrid cloud architectures.
  • Exceptional leadership and team-building skills, with at least 3 years of experience developing high-performing teams.
  • Expertise in incident management, root cause analysis, and post-incident reviews.
  • Strong analytical and problem-solving abilities.
  • Extensive experience with industry-standard SRE tools and technologies within the CNCF portfolio, such as ArgoCD, Cilium, Rook, OPA, and Jaeger.
  • Significant experience with configuration management tools such as Ansible, Puppet or Salt.
  • Strong background in working with observability stack components such as ELK, Prometheus, Mimir, and OpenTelemetry.
  • Excellent communication skills, with the ability to collaborate effectively with cross-functional teams.

Why You’ll Love Working Here: 

  • Comprehensive health, dental, and vision plans for you and your dependents 
  • Paid time off, health days, and personal obligation days plus flexible work schedules  
  • Competitive retirement matching plans  
  • Equity packages  
  • Generous parental leave available to birthing, non-birthing, and adoptive parents  
  • Annual well-being allowance plus fitness discounts and group wellness activities     
  • Commuter benefits and discounts, where available  
  • Employee assistance program  
  • Mental health first aid program that provides an in-the-moment point of contact and reassurance  
  • One day of volunteer time off per year and a donation-matching program  
  • Bi-weekly town halls and regular community-led team events  
  • Multiple resources and programming to support continuous learning
  • A workplace that supports a diverse, equitable, and inclusive environment

Notification 

Index Exchange is aware that there have been recent scams directed toward candidates regarding job interviews and offers. 

Please be vigilant and do not accept interview requests, job offers, or other hiring-related documents from anyone other than our dedicated recruitment team, who will contact you from the @indexexchange.com domain. Our interview process consists of several steps, including phone screens and video interviews. We do not conduct interviews via an email questionnaire or request money at any point in the process.

We remain dedicated to resolving this matter and we appreciate your support. 

Equal employment opportunity 

At Index Exchange, we believe that successful products are built by teams just as diverse as the audience who uses them. As such, we are committed to equal employment opportunities. We celebrate diversity of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or expression, and veteran status. Additionally, we realize that diversity is deeper than any status or classification; diversity is the human experience. For those who show grit, passion, and humility, Index will welcome you.

Accessibility for applicants with disabilities  

Index Exchange welcomes and encourages individuals with disabilities to apply to work with us.  

If you require an accommodation, please share the details of your request, and any information on how we can assist you, with the hiring recruiter when they contact you. Index Exchange will make reasonable efforts to ensure accommodation requests are met throughout the recruitment process.

Index Everywhere, Index Anywhere 

Our corporate headquarters are in Toronto, with major offices in New York, Montreal, Kitchener, London, San Francisco, and many other global cities. As a major global advertising exchange, we are committed to operating as a tightly knit global team and embracing and empowering talent wherever our colleagues may be. 

 
