Senior Data Engineer

City: HCM City

Job Function: Tech

Job Area: Product & IT

Seniority Level: Mid-Senior level

Date: Jul 1, 2026

HRS AS A COMPANY

HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.

ProcureTech digitally revolutionizes lodging procurement, connecting corporations and suppliers in a cutting-edge ecosystem. This enables seamless efficiency and automation, surpassing travelers' expectations.

TravelTech redefines the online lodging experience, offering personalized content from selection to check-in, ensuring an unparalleled journey for corporate travelers.

In FinTech, HRS introduces advancements like mobile banking and digital payments, turning corporate back offices into touchless lodging enablers, eliminating legacy cost barriers. The innovative 2-click book-to-pay feature streamlines interactions for travelers and hoteliers.

Combining these technology propositions, HRS unlocks exponential catalyst effects. Their data-driven focus delivers value-added services and high-return network effects, creating substantial customer value.

HRS's exponential growth since 1972 serves over 35% of the global Fortune 500 and leading hotel chains.

Join HRS to shape the future of business travel, empowered by a culture of growth and setting new industry standards worldwide.

BUSINESS UNIT

We are building a next-generation, data-driven Insurtech Claims Accommodations Platform designed to unify fragmented data ecosystems across products, geographies, and operational systems into a single scalable, intelligent data core. Today, the customers’ data landscape is distributed across operational databases, third-party accommodation APIs, claims management systems, and analytics pipelines — limiting our ability to generate real-time insights, deploy predictive models, and operationalize AI at scale.

This is a cornerstone hire that will define and execute our long-term data-reporting capability and AI strategy, working directly with the core Insurtech team.

POSITION

We are currently looking for a Senior Data Engineer to join our team of dedicated professionals. You will build AI Agents to build and run the data pipelines, models, and AI Automation Agents. Architect and build the data infrastructure that powers a claims accommodations.

CHALLENGE

1. Data Warehouse & Lakehouse Architecture

Design and implement a cloud-native data warehouse on infrastructure stack primarily on Azure / Fabric and a structured data lake on Onelake . Implement dimensional data models optimized for claims analytics, cost reporting, and executive dashboards, both for internal and external customer facing.\
Establish data governance policies — data lineage, cataloguing (Purview / Glue), access controls, and PII masking for compliance

2. Data Pipelines & Real-Time Ingestion

Architect and build end-to-end ELT/ETL pipelines ingesting from claims management systems legacy sources (ODBC/SRSS), accommodation APIs (booking platforms, hotel aggregators), policy engines, and hosted operational CRMs
Implement streaming data pipelines to capture real-time claims events, accommodation availability updates, and pricing signals
Build batch ingestion pipelines and orchestrate using Azure Data Factory. Deploy dbt (data build tool) for modular, version-controlled data transformation layers with automated data quality testing
Develop and optimise CDC (Change Data Capture) pipelines from operational PostgreSQL / SQL Server databases

3. Predictive Models & Machine Learning Infrastructure

Build ML-ready feature stores (AWS SageMaker Feature Store) to serve real-time and batch model features.
Develop predictive models for: claims accommodation cost forecasting, length-of-stay prediction, accommodation supplier risk scoring, and demand surge detection
Implement MLOps pipelines for model training, versioning, deployment, and monitoring using MLflow and CI/CD frameworks
Operationalize model outputs back into the claims platform via REST APIs and feature pipelines for real-time decisioning

4. AI / LLM Enablement & Agentic Workflows

Architect vector databases (in Azure)) and build RAG (Retrieval-Augmented Generation) pipelines over claims and policy documents
Enable LLM-powered workflows for automated claims summarization, accommodation recommendation, and SLA breach prediction
Build agentic AI systems that autonomously monitor claims pipelines, trigger re-routing logic, and surface anomalies to adjusters
Integrate structured and unstructured data into unified AI-ready datasets for continuous model improvement

FOR THIS EXCITING MISSION YOU ARE EQUIPPED WITH...

Technical Requirements:

Data Warehouse & Lakehouse Design: AWS Redshift, Onelake, Azure Synapse, Expert in dimensional modelling, Star schemas, and data vault design for insurance data domains.
Data Pipeline Engineering: Azure Data Factory, AWS Glue . Experience building production-grade ELT/ETL and CDC pipelines from complex, heterogeneous source systems.
Cloud Data Platforms: Deep hands-on expertise across Azure (ADLS Gen2, Synapse, ADF, Purview) and AWS (S3, Glue, Redshift, SageMaker). Infrastructure-as-Code via Terraform or Bicep.
ML & Predictive Modelling: Feature engineering, Feature Stores, MLflow, scikit-learn, XGBoost. Experience deploying predictive models to production with monitoring, drift detection, and retraining pipelines.
Streaming & Real-Time Data: Apache Kafka, Azure Event Hubs, Spark Structured Streaming, Flink. Building low-latency pipelines for real-time claims event processing and accommodation availability feeds.
AI Enabled Workflows & RAG Infrastructures: Vector databases (Pinecone, pgvector), RAG pipeline architecture, LangChain/LlamaIndex, prompt engineering, and embedding pipelines over structured and unstructured data. AI enabled workflows & Agentic Workflow Automations.
Data Governance & Quality: Data cataloguing, lineage tracking (Purview / OpenLineage), PII masking, RBAC, Great Expectations / dbt tests. Compliance-aware design for insurance regulatory requirements.
Languages & Tooling: Python (primary), SQL (advanced), Scala or Java (preferred), Git, Docker, Kubernetes. CI/CD for data pipelines via GitHub Actions or Azure DevOps.

Preferred Background:

Insurtech, Fintech, or Financial Services — handling policy, claims, underwriting, and compliance data with strict lineage and auditability requirements
Experience with accommodation, travel, or claims management platform data — understanding supplier networks, booking APIs, cost structures, and SLA frameworks
Familiarity with regulatory data requirements in insurance (e.g. APRA, Lloyd's, FCA) and data sovereignty considerations across geographies
Prior greenfield data platform builds — architecting from scratch with an ownership and delivery mindset
Fluency in English

PERSPECTIVE

Access to a global network of a globally united and mutually responsible “Tribe of Intrapreneurs” that is passionately dedicated to renew the travel industry and while doing so reinvent the ways how businesses stay, work and pay.

Our entrepreneurial driven environment of full ownership and execution focus offers you the playground to contribute to a greater mission, while growing personally and professionally throughout this unique journey. You will continuously learn from a radical culture of retrospectives and continuous improvement and actively contribute to making business life better, smarter and more sustainable.

LOCATION, MOBILITY, INCENTIVE

HCM City. The attractive remuneration is in line with the market and, in addition to a fixed monthly salary, all necessary work equipment and mobility.

Req ID: 18780