Build and maintain ETL pipelines that keep the knowledge graph current as source data changes. Solid data engineering foundations — you have designed and built……
Experience with developing data warehousing, data lakes, batch/real-time event processing, streaming, data processing (ETL/ELT), data migrations, data……
The ideal candidate is resourceful, a fast learner, and excited to work across machine learning, large language models (LLMs), and computer vision to enhance……
Familiarity with data engineering concepts such as ETL/ELT processes, data pipelines, data warehousing, and data modelling. 10 Paya Lebar Rd, Singapore 409057.…
10 years of experience with cloud native architecture in a customer-facing pre-sales role. Experience with enterprise architectures and migration strategies for……
Experience supporting production cloud environments. FinOps or cloud governance exposure. Comfortable working in enterprise cloud environments.…
Experience supporting production cloud environments. FinOps or cloud governance exposure. Comfortable working in enterprise cloud environments.…
Possess a bachelor’s degree in Computer Science, Computer Engineering, or a related field. Responsibilities include developing robust data pipelines, cloud-……
Familiarity with cloud infrastructure and hybrid cloud environments. Ensure the data centre and cloud operations meets industry standards and complies with……
Bachelor’s degree in Engineering, Computer Science, a related field, or equivalent practical experience. Master’s degree or PhD in AI, Computer Science, or a……
Experience supporting production cloud environments. FinOps or cloud governance exposure. Comfortable working in enterprise cloud environments.…
Experience supporting production cloud environments. FinOps or cloud governance exposure. Comfortable working in enterprise cloud environments.…
Familiarity with cloud data platforms such as: Experience supporting or maintaining production data pipelines. Experience: Fresher / Intern with Data Engineer*.…
The business provides clients with unbiased OTC content and proprietary data, in-depth insights across price discovery, risk management, benchmark and indices,……
Proven experience in operating AWS cloud environments at scale. Excellent problem-solving skills and capability to manage complex cloud projects independently……
Bachelor’s degree in Engineering, Computer Science, a related field, or equivalent practical experience. Master’s degree or PhD in AI, Computer Science, or a……
Ability to work independently under pressure to meet system performance service levels. 2+ years of experience with implementing and managing cloud environments……
Familiarity with cloud infrastructure and hybrid cloud environments. Ensure the data centre and cloud operations meets industry standards and complies with……
1 year required, 3 years preferred of working with cloud-based services. Ability to understand carrier networks and cloud based service offerings.…
Good experience designing data solutions including data modeling. Collaborate with infrastructure leaders to advance cloud-based data platforms.…
Bachelor’s degree in Engineering, Computer Science, a related field, or equivalent practical experience. Master’s degree or PhD in AI, Computer Science, or a……
Bachelor’s degree in Engineering, Computer Science, a related field, or equivalent practical experience. Master’s degree or PhD in AI, Computer Science, or a……
Supporting OS instances on-premises cloud and on public cloud providers like AWS, Oracle Cloud etc. You stay current on technical trends to suggest innovative……
Services include engineering works to design, build and provide facility management service for mission critical environments such data centers, disaster……
Posting Start Date: 29/06/2026
Country/Region: Singapore
Work Location: Singapore Great World City
Business/Function: IT
Job Summary
We are seeking a passionate AI Data Architect to design and build the data foundation that makes AI work well across Kuok Group — specifically, the hybrid vector and knowledge graph layer (the enterprise semantic memory) that underpins RAG-based use cases across every business unit, as well as the embedding pipelines, ingestion workflows, and data schemas that keep it accurate and current.This role will be reporting to the Principal AI Architect.
This is an architecture-first role. The successful candidate will make structural decisions about how the Group’s unstructured data is organised, retrieved, and made useful to AI systems — working closely with the Principal AI Architect on technical direction and with the Applied AI Engineers who depend on the data layer to build reliable, high-quality solutions.
The role sits at an exciting intersection of data engineering and AI infrastructure. Those who bring a strong data engineering background and a genuine curiosity about vector databases, knowledge graphs, and retrieval design will find a lot to build here.
Key Responsibilities
AI Data Foundation Architecture
Design and own the hybrid vector and knowledge graph layer that underpins RAG across all Kuok Group BUs — the enterprise semantic memory that AI use cases draw on
Make structural decisions on how unstructured data is organised for retrieval: chunking strategies, embedding approaches, metadata schemas, and knowledge graph ontologies
Work with the Principal AI Architect to align data foundation design with the broader AI platform architecture and the requirements of active use cases
Document architectural decisions clearly — capturing both the reasoning and the outcomes — so the wider team can work with confidence
Embedding Pipelines & Vector Infrastructure
Build and maintain embedding pipelines: document ingestion, chunking, embedding model selection, and vector DB write workflows
Own the vector database layer (Pinecone, Weaviate, or equivalent) — index management, refresh cadence, performance tuning, and cost management
Design retrieval patterns that serve the needs of applied use cases: similarity search, hybrid search, re-ranking, and metadata filtering
Ensure embedding pipelines are monitored, versioned, and recoverable — data foundation reliability is as important as application reliability
Knowledge Graph Design & Ingestion
Design the knowledge graph layer (Neo4j or equivalent) — ontology modelling, entity and relationship schema, and ingestion workflows from source systems
Work with domain experts across BUs to ensure the knowledge graph accurately reflects the entities, relationships, and terminology that matter in each business context
Build and maintain ETL pipelines that keep the knowledge graph current as source data changes
Knowledge graph capability is being built from the ground up at the Group — this role has a real opportunity to shape how it develops and set the direction for how it scales
Data Quality & Governance
Establish data quality standards for AI-ingested content — source freshness, deduplication, completeness checks, and validation pipelines
Work with BU Domain Data Stewards to validate that domain-specific data is accurate before it enters the AI data layer
Maintain clear data lineage across the AI data foundation — what source data feeds which index or graph, and when it was last refreshed
Partner with the AI Governance & Compliance Lead on data privacy requirements for AI-ingested content, particularly across BUs with sensitive operational data
Collaboration & Standards
Partner with Applied AI Engineers to understand the retrieval requirements of each use case and ensure the data foundation is designed to support them well
Work with the Lead Data Engineer (supporting functions) on the handoff boundary between structured data / BI pipelines and the AI data layer
Maintain documentation of the AI data foundation — schemas, pipeline specs, refresh schedules, and known limitations — so the team can work with the data layer confidently
Contribute to the broader AI Platform cluster's engineering standards and participate in code and design reviews
Key Requirements
Solid data engineering foundations — you have designed and built ETL / ELT pipelines at production scale, managed data quality, and worked with structured and semi-structured data in cloud environments
Hands-on experience with vector databases — you have built embedding pipelines, managed indexes, and designed retrieval patterns for RAG or semantic search applications
Understanding of RAG architecture from the data side: chunking strategies, embedding model selection, retrieval optimisation, and the effect of data quality on AI output quality
Experience designing schemas and data models for AI systems — with a strong appreciation for how data structure shapes retrieval quality and downstream AI output
Strong Python skills and comfort with the data engineering tooling ecosystem: pipeline orchestration, data validation, and working with cloud storage and databases
Clear, structured communication skills — you can explain data architecture decisions to both technical peers and non-technical stakeholders
Education
Bachelors in Computer Engineering or Computer Science
#body.unify div.unify-button-container .unify-apply-now: focus, #body.unify div.unify-button-container .unify-apply-#body.unify div.unify-button-container .unify-apply-now: focus, #body.unify div.unify-button-container .unify-apply-
The minimum salary is $114K and the max salary is $170K.
$114K – $170K/yr (Glassdoor Est.)
$139K
/yr Median
Singapore
If an employer includes a salary or salary range on their job, we display it as "Employer provided". If a job has no salary data, Glassdoor displays a "Glassdoor estimate" if available. To learn more about "Glassdoor estimates", see our FAQ page.
Working here doesn’t have to be a secret
Sign in to browse authentic reviews, anonymous ratings and salary data before you apply.