AI/ RAG Architect @ Icertis

Home > Software Development

AI/ RAG Architect

Icertis
12 - 16 years
Pune
2 months ago
Email to a friend
Report this job

Job Description

Job Description

We are looking for a Senior Architect, Machine Learning to define and lead the architecture for enterprise-grade Generative AI and Agentic AI systems. This is a senior, hands-on architecture role focused on building reliable, scalable, secure, and cost-efficient AI platforms - covering RAG, agent orchestration, inference infrastructure, evaluation/guardrails, and production operations across multiple tenants.

You will work at the intersection of research innovation and engineering reliability: enabling rapid experimentation while ensuring the system runs 24/7 with strong SLOs, governance, and predictable cost.

Responsibilities

Architecture & Technical Leadership

Own the end-to-end architecture for RAG + agentic workflows (Plan Execute Verify) across enterprise use cases (contracts, PDFs, knowledge bases).
Define architecture standards for multi-tenant isolation, API design, service boundaries, and integration patterns.
Lead technical decision-making: build vs buy, model strategy (hosted vs open-weights), tooling selection, and performance/cost tradeoffs.
Drive architecture reviews, mentor engineers/researchers, and raise the overall bar for engineering quality and research rigor.

RAG & Retrieval Systems (Enterprise-grade)

Design retrieval pipelines that optimize grounded accuracy: chunking strategy, hybrid retrieval, reranking, query rewriting, and context construction.
Define document ingestion patterns (PDF parsing, OCR, structured extraction, metadata enrichment) and index lifecycle strategies.
Establish retrieval evaluation and regression frameworks (ground truth, offline/online evaluation, drift tracking).

Enable async and event-driven architectures for long-running tasks using queues/streams (Kafka/RabbitMQ/Redis Streams) and/or durable workflow engines (Temporal).
Inference & Platform Engineering

Architect model serving for high throughput and low latency using engines like vLLM / TGI / Triton / TorchServe (as applicable).
Define GPU orchestration and capacity strategy on Kubernetes (AKS/EKS/GKE), including scale-to-zero, scheduling, and quota-based governance.
Design platform-level controls for rate limiting, caching, backpressure, and cost containment (tenant quotas, token budgets, throttling).

Safety, Guardrails, Security & Compliance

Own guardrail architecture for prompt injection defense, tool safety, policy enforcement, and PII handling (redaction patterns).
Define secure-by-default patterns: secrets management, data protection, audit logs, and safe prompt/tool execution boundaries.
Partner with security/compliance teams to meet enterprise standards (e.g., SOC2/GDPR expectations where relevant).

Observability, Reliability & Operational Excellence

Establish SLOs and production readiness standards: error budgets, runbooks, incident response patterns.
Define observability strategy across LLM calls and agent tools: tracing, metrics, logs, cost dashboards, and token usage reporting.
Build reliability patterns for dependency failure (model provider downtime, throttling): circuit breakers, fallbacks, degradation strategies.

Qualifications

Required Qualifications

13+ years of experience in ML systems / platform engineering / architecture roles, with ownership of production-grade systems.
Strong software engineering fundamentals: APIs, distributed systems patterns, testing, versioning, CI/CD, and operational readiness.
Hands-on experience with Kubernetes and Docker and cloud-native design (Azure/AWS/GCP).
Strong experience designing event-driven and async architectures with durable execution patterns (queues/workflows).
Proven ability to lead architecture for complex systems involving ML/LLMs, data pipelines, and multi-service integration.
Strong Python proficiency; comfortable with async patterns and structured validation (e.g., Pydantic-style design).

Preferred Qualifications

Deep experience with RAG (retrieval + grounding + reranking) and evaluation techniques for hallucinations and answer quality.
Experience with agent frameworks and multi-step tool execution patterns (plan/execute/verify, tool routing, loop prevention).
Experience with open-weight models and adaptation methods (e.g., PEFT/LoRA), plus evaluation-driven iteration.
Experience with model inference optimization (throughput, batching, caching) and GPU efficiency management.
Experience operating observability stacks (OpenTelemetry, Prometheus/Grafana, Datadog) and LLM tracing tools.

About Us

Icertis is the global leader in AI-powered contract intelligence. The Icertis platform revolutionizes contract management, equipping customers with powerful insights and automation to grow revenue, control costs, mitigate risk, and ensure compliance - the pillars of business success. Today, more than one third of the Fortune 100 trust Icertis to realize the full intent of millions of commercial agreements in 90+ countries.

Job Classification

Industry: Software Product
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Software Development - Other
Employement Type: Full time

Contact Details:

Company: Icertis
Location(s): Pune

+ View Contact

Login

Candidates can login here to view contacts and apply.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach Resume Max 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Candidates are expected to provide most recent and accurate profile information, inappropriate content is strictly prohibited!

Keyskills: Agentic Ai Aiml rag Machine Learning

Fraud Alert to job seekers!

₹ Not Disclosed

Job application

We will notify the employer with your details. You can also attach a resume or a cover letter.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach ResumeMax 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Similar positions

Data Architect

Accenture

5 - 10 years

Bengaluru

2 days ago

₹ Not Disclosed

ServiceNow Architect

Cognizant

8 - 13 years

Chennai

2 days ago

₹ Not Disclosed

OSB Architect (Gurgaon)

Cognizant

10 - 16 years

Noida, Gurugram

3 days ago

₹ Not Disclosed

Solution Architect

Air India

8 - 16 years

Noida, Gurugram

2 days ago

₹ Not Disclosed

Icertis

This is a manufacturing, Trading and Retail Sales Company.\r\nProducts: Door, Chokhat, Plywood, Board etc.

AI/ RAG Architect @ Icertis

Home > Software Development

AI/ RAG Architect

Job Description

Job Classification

Contact Details:

Create password

Create password

Similar positions

Data Architect

ServiceNow Architect

OSB Architect (Gurgaon)

Solution Architect

Icertis

Job Listings

Job type

Location

Category