Team Lead Site Reliability Engineer (all genders)

City:  Mohali
Job Function:  Tech
Job Area:  Product & IT
Seniority Level:  Mid-Senior level
Date:  Feb 21, 2025

HRS AS A COMPANY

HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.

ProcureTech digitally revolutionizes lodging procurement, connecting corporations and suppliers in a cutting-edge ecosystem. This enables seamless efficiency and automation, surpassing travelers' expectations.

TravelTech redefines the online lodging experience, offering personalized content from selection to check-in, ensuring an unparalleled journey for corporate travelers.

In FinTech, HRS introduces advancements like mobile banking and digital payments, turning corporate back offices into touchless lodging enablers, eliminating legacy cost barriers. The innovative 2-click book-to-pay feature streamlines interactions for travelers and hoteliers.

Combining these technology propositions, HRS unlocks exponential catalyst effects. Their data-driven focus delivers value-added services and high-return network effects, creating substantial customer value.

HRS has grown exponentially since 1972 and now serves over 35% of the global Fortune 500 as well as leading hotel chains.

Join HRS to shape the future of business travel, empowered by a culture of growth and setting new industry standards worldwide.

BUSINESS UNIT

The Site Reliability Engineering (SRE) department at HRS is fundamental to ensuring the reliability, scalability, and performance of our Lodging-as-a-Service (LaaS) platform. Our team collaborates across engineering, operations, and development teams to define and implement reliability standards, infrastructure architecture, and operational excellence while maintaining our service level objectives (SLOs) and reducing toil.

SRE Team Leads at HRS own the reliability roadmap, manage incident response protocols, and are responsible for platform observability, automation initiatives, and system resilience. We maintain critical metrics including error budgets, mean time to recovery (MTTR), and service level indicators (SLIs) to ensure optimal platform performance and availability. Our role requires deep technical expertise in cloud infrastructure, distributed systems, and automation, combined with strong leadership and incident management capabilities.

The department operates according to HRS's leadership principles, prioritizing system reliability and customer experience above all. We embrace a culture of blameless post-mortems, continuous improvement, and proactive problem-solving. As technical leaders, we recruit top engineering talent and foster an environment of learning and growth through mentorship and knowledge sharing.

SRE Team Leads at HRS are innovation drivers, constantly exploring new technologies and methodologies to improve system reliability and operational efficiency. We implement infrastructure as code, maintain robust monitoring and alerting systems, and develop automation solutions to reduce manual intervention. Our team takes full ownership of production systems, from capacity planning to disaster recovery, ensuring resilient and scalable infrastructure.

POSITION

We are looking for an experienced Team Lead Site Reliability Engineer to lead our SRE team at HRS. The ideal candidate will have a strong background in enhancing the reliability and scalability of services, leading technical teams, and driving strategic initiatives to improve our Lodging-as-a-Service platform.

CHALLENGE

  • Leadership & Mentorship: Lead, mentor, and develop a team of SREs, fostering a culture of reliability, collaboration, and continuous improvement.
  • Strategic Planning: Drive the design and implementation of scalable, sustainable solutions, and lead the transition towards a cloud-native, serverless, and NoOps environment.
  • Service Excellence: Oversee service availability, system performance, and capacity planning for critical systems.
  • Cross-Functional Collaboration: Work closely with stakeholders across the organization to solve complex technical challenges and enhance user experiences.
  • Incident Management: Lead incident response efforts, perform root cause analysis, and implement preventative measures.
  • Process Optimization: Champion the adoption of best practices in monitoring, automation, and observability.
  • SLO Management: Define and manage Service Level Objectives (SLOs) to guide prioritization and ensure reliability.

FOR THIS EXCITING MISSION YOU ARE EQUIPPED WITH...

  • Experience: 7+ years in site reliability engineering or related fields, with at least 2 years in a leadership role.
  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • Technical Expertise:
    • Extensive experience with AWS cloud services and cloud engineering best practices.
    • Proficiency in programming languages such as Java and Python, and familiarity with React.
    • Deep understanding of software engineering methodologies and development cycles.
    • Expertise in monitoring and observability tools (New Relic, Kibana, Prometheus, Grafana, Elasticsearch).
  • Leadership Skills: Proven ability to lead technical teams, manage projects, and communicate effectively with stakeholders.
  • Problem-Solving Skills: Exceptional analytical abilities to perform root cause analysis and develop effective solutions.
  • Automation & Efficiency: Strong background in automating processes and driving operational efficiency.

PERSPECTIVE

Access to a globally united and mutually responsible network, a “Tribe of Intrapreneurs” passionately dedicated to renewing the travel industry and, in doing so, reinventing how businesses stay, work and pay.

Our entrepreneurial environment of full ownership and execution focus offers you a playground to contribute to a greater mission while growing personally and professionally throughout this unique journey. You will continuously learn from a radical culture of retrospectives and continuous improvement and actively contribute to making business life better, smarter and more sustainable.

LOCATION, MOBILITY, INCENTIVE

The attractive remuneration is in line with the market. In addition to a fixed monthly salary, all necessary work equipment, and mobility, it will also include an annual or multi-year bonus.

Req ID:  18128

