Job Description
AI Developer
Req number:
R5796
Employment type:
Full time
Worksite flexibility:
Hybrid
Who we areCAI is a global technology services firm with over 8,500 associates worldwide and a yearly revenue of $1 billion+. We have over 40 years of excellence in uniting talent and technology to power the possible for our clients, colleagues, and communities. As a privately held company, we have the freedom and focus to do what is rightwhatever it takes. Our tailor-made solutions create lasting results across the public and commercial sectors, and we are trailblazers in bringing neurodiversity to the enterprise.
Job Summary
Were seeking an AI Developer who specializes in agentic AI frameworksLangchain, LangGraph, CrewAI, or equivalentsand who can take both vision and language models from prototype to production. You will lead the design of multi agent systems that coordinate perception (image classification & extraction), reasoning, and action, while owning the end-to-end deep learning life cycle (training, scaling, deployment, and monitoring). This is a Full-time and Hybrid position.
Job Description
What Youll Do
- Agentic AI Frameworks (Primary Focus): Architect and implement multiagent workflows using Langchain, LangGraph, CrewAI, or similar.
Design role hierarchies, state graphs, and tool integrations that enable autonomous data processing, decisionmaking, and orchestration.
Benchmark and optimize agent performance (cost, latency, reliability). - Image Classification & Extraction: Build and finetune CNN/ViT models for classification, detection, OCR, and structured data extraction.
Create scalable dataingestion, labelling, and augmentation pipelines. - LLM FineTuning & RetrievalAugmented Generation (RAG): Finetune openweight LLMs with LoRA/QLoRA, PEFT; perform SFT, DPO, or RLHF as needed.
Implement RAG pipelines using vector databases (FAISS, Weaviate, pgvector) and domainspecific adapters. - Deep Learning at Scale: Develop reproducible training workflows in PyTorch/TensorFlow with experiment tracking (MLflow, W&B).
Serve models via TorchServe/Triton/KServe on Kubernetes, SageMaker, or GCP Vertex AI. - MLOps & Production Excellence: Build robust APIs/microservices (FastAPI, gRPC).
Establish CI/CD, monitoring (Prometheus, Grafana), and automated retraining triggers.
Optimize inference on CPU/GPU/Edge with ONNX/TensorRT, quantization, and pruning. - Collaboration & Mentorship: Translate product requirements into scalable AI services.
Mentor junior engineers, conduct code and experiment reviews, and evangelize best practices.
What You'll Need
- B.S./M.S. in Computer Science, Electrical Engineering, Applied Math, or related discipline.
- 5+ years building production ML/DL systems with strong Python & Git.
- Demonstrable expertise in at least one agentic AI framework (Langchain, LangGraph, CrewAI, or comparable).
- Proven delivery of computervision models for image classification/extraction.
- Handson experience finetuning LLMs and deploying RAG solutions.
- Solid understanding of containerization (Docker) and cloud AI stacks (AWS/Azure).
- Knowledge of distributed training, GPU acceleration, and performance optimization.
Physical Demands
- This role involves mostly sedentary work, with occasional movement around the office to attend meetings, etc.
- Ability to perform repetitive tasks on a computer, using a mouse, keyboard, and monitor.
Reasonable accommodation statement
If you require a reasonable accommodation in completing this application, interviewing, completing any pre-employment testing, or otherwise participating in the employment selection process, please direct your inquiries to ap***********************s@***.io or (888) 824 8111.
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employement Type: Full time
Contact Details:
Company: CAI
Location(s): Bengaluru
Keyskills:
kubernetes
stack
production
dl
artificial intelligence
docker
cloud
containerization
deep learning
tensorflow
git
computer science
pytorch
ml
ocr
architecture
python
cnn
natural language processing
microsoft azure
machine learning
framework
system
computer vision
aws