Job Description
Celsior is seeking an experienced Data Architect to design, modernize, and govern scalable enterprise data platforms that support advanced analytics, GenAI, and intelligent automation initiatives. This role will be responsible for defining data architecture patterns, ingestion frameworks, storage strategies, and governance models aligned with Celsiors CAFE, HALO, and PACE frameworks.
The Data Architect will work closely with Data Engineers, AI/ML teams, Cloud Architects, and Delivery leaders to ensure data platforms are secure, scalable, cloud-native, and production-ready across BFSI and Healthcare clients.
Key Responsibilities
Data Architecture & Platform Design
- Design end-to-end enterprise data architectures supporting analytics, GenAI, ML, and AI agents, including batch, streaming, and event-driven pipelines.
- Define reference architectures for data ingestion, transformation, storage, and consumption aligned with CAFE (data ingestion & knowledge layers) and PACE (AI/AgentOps enablement).
- Architect scalable lakehouse and warehouse solutions using Delta/Iceberg/Hudi and platforms such as BigQuery, Snowflake, Synapse, or Fabric.
- Define and standardize data models, schemas, and canonical data structures for enterprise-wide reuse.
Data Engineering & Integration
- Guide implementation of ETL/ELT pipelines, streaming architectures, and API-based data integrations.
- Support ingestion from structured and unstructured sources including databases, SaaS platforms, documents, logs, and APIs.
- Define data orchestration standards using Airflow, Prefect, Step Functions, or cloud-native schedulers.
- Collaborate with integration platforms and APIs to ensure seamless data flow across enterprise systems.
Cloud & Platform Collaboration
- Design cloud-native data solutions across AWS, Azure, and GCP, leveraging managed services for scalability and cost efficiency.
- Define cloud-specific patterns:
- AWS: S3, Glue, Redshift, Kinesis, Bedrock (data readiness)
- Azure: Data Factory, Synapse, Fabric, Databricks
- GCP: BigQuery, Dataflow, Pub/Sub, Vertex AI (data enablement)
- Ensure data platforms integrate cleanly with AI/ML environments, vector stores, and model pipelines.
Data Governance, Security & Quality
- Define and implement data governance frameworks covering lineage, metadata, cataloging, data quality, and access controls.
- Ensure compliance with regulatory and enterprise requirements (PII protection, data residency, auditability).
- Establish data quality metrics, validation rules, and monitoring standards.
- Support Responsible AI initiatives through data traceability, explainability, and audit-ready architectures.
AI & Analytics Enablement
- Design data foundations to support GenAI use cases such as RAG, knowledge assistants, and AI agents.
- Enable feature stores, vector databases, and analytical data products for ML and AI teams.
- Collaborate with MLOps/LLMOps teams to ensure smooth data-to-model pipelines and observability.
Delivery & Stakeholder Collaboration
- Work with Business Analysts, Product Managers, and Engineering teams to translate requirements into robust data architectures.
- Participate in solution reviews, design and architecture walkthroughs with clients and internal stakeholders.
- Provide architectural guidance during implementation, performance tuning, and production rollout
Unique Knowledge & Skill Requirement
- 10+ years of experience in data architecture, data engineering, or analytics platforms.
- Strong expertise in modern data stacks:
- Streaming: Kafka, Kinesis, Pub/Sub
- Warehousing: BigQuery, Snowflake, Synapse, Fabric
- Lakehouse: Delta, Iceberg, Hudi
- Hands-on experience with cloud platforms: AWS, Azure, and/or GCP.
- Strong understanding of ETL/ELT patterns, data modelling, and performance optimization.
- Experience designing data platforms that support AI/ML and GenAI workloads.
- Knowledge of data governance, metadata management, lineage, and security best practices.
- Proficiency in SQL and experience working with semi-structured and unstructured data.
- Strong communication skills with the ability to explain complex architectures to technical and non-technical stakeholders.
Preferred Qualifications
- Experience supporting BFSI or Healthcare data platforms.
- Exposure to AI/ML ecosystems, feature stores, vector databases, and analytics enablement.
- Familiarity with Responsible AI, data risk management, and regulatory compliance.
- Experience contributing to reference architectures, accelerators, or reusable frameworks.
- Bachelors or masters degree in computer science, Data Engineering, or a related field.
Quick Joiners Preferred !!
Please share CVs at an********r@ce********h.com
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Technical Architect
Employement Type: Full time
Contact Details:
Company: Celsior Technologies
Location(s): Noida, Gurugram
Keyskills:
Data Architect
GCP
Center Of Excellence
Coe
Microsoft Azure
Data Modeling
Data Architecture
ETL
ELT
Solutioning
AWS