Data Science - Lead Data Scientist (Voice+Text) @ Paytm

Data Science - Lead Data Scientist (Voice+Text)

Paytm
6 - 8 years
Noida, Gurugram
1 day ago

Job Description

About the Role:

We are looking for an experienced AI Conversational Engineer with strong expertise in real-time voice systems, Reinforcement Learning (RL)-based model alignment, and custom LLM orchestration using PipeCat.

You will architect and optimize end-to-end conversational pipelines, from Speech-to-Text and Text-to-Speech systems to SLM reinforcement and multi-agent orchestration, ensuring low-latency, high-accuracy interactions at scale.

Core Engineering Responsibilities

1. Real-Time Conversational Systems

A. Design and build low-latency, high-concurrency conversational systems for both voice and text.

B. Integrate STT, TTS, and LLM/SLM components into unified, real-time architectures.

C. Develop and maintain PipeCat-based orchestration pipelines for multi-agent conversational flows.

D. Engineer robust streaming APIs and telephony integrations (VoIP/SIP).

2. Reinforcement Learning and Model Fine-Tuning

A. Fine-tune Small Language Models (SLMs) using Supervised Fine-Tuning (SFT) and RL (DPO, PPO) for alignment and personality control.

B. Design reward models to guide tone, factual accuracy, and conversational flow.

C. Build RL feedback loops for continuous model refinement based on user interactions.

3. Voice Synthesis and Adaptation

A. Develop high-quality ASR and TTS models for expressive, natural-sounding speech generation.

B. Apply speaker adaptation and voice cloning techniques for personalization.

C. Utilize diffusion- or HiFi-GAN-based vocoders for high-fidelity audio generation.

D. Engineer robust handling of sampling frequency, audio fidelity, and streaming performance.

4. Infrastructure, Serving, and Deployment

A. Build containerized inference microservices using Docker and Kubernetes.

B. Deploy Ray Serve-based endpoints for distributed, dynamically batched inference.

C. Implement autoscaling, monitoring, and observability for production-grade systems.

D. Optimize serving for latency, throughput, and fault tolerance.

5. Guardrails, Security, and Reliability

A. Implement guardrail frameworks to protect against prompt injection, jailbreaks, and unsafe outputs.

B. Develop input sanitizers, content filters, and boundary-check mechanisms.

C. Maintain secure integrations with authenticated APIs and external toolchains.

D. Enable traceability through conversation logging, replay, and audit pipelines.

What We're Looking For:

3+ years in AI conversational systems or RL-driven model architectures.

Languages & Frameworks: Python, PyTorch, TensorFlow.

Core Expertise:

1) RL-based model alignment (SFT, PPO, DPO)

2) ASR/TTS pipeline design and optimization

3) Transformer architecture and optimization

4) Ray Serve + Kubernetes deployment

5) Secure orchestration using PipeCat

6) Programming & Engineering

Foundational Knowledge: Optimization, Statistics, and Linear Algebra.

Preferred Qualifications:

Bachelor's/Master's Degree in Computer Science or equivalent

Job Classification

Industry: Banking
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Scientist
Employment Type: Full time

Contact Details:

Company: Paytm
Location(s): Noida, Gurugram

Keyskills: data scientist kubernetes python tensorflow data science ai pytorch llm microservices docker reinforcement learning

₹ Not Disclosed
