Generative AI EngineerJOB DESCRIPTION:
Must-Have**
(Ideally should not be
more than 3-5)
LLM Ops & Agent Orchestration: Experience with prompt engineering, prompt/version management, model routing, and evaluation using agentic and orchestration frameworks such as LangChain, LangGraph, Google Agent Development Kit (ADK), and other agent-based LLM frameworks.
RAG Expertise: Practical experience building and tuning RAG pipelines (chunking strategies, embeddings, retrievers, reranking).
Vector DBs: Hands-on with at least one: FAISS, Pinecone, Milvus, Weaviate (index types, parameters, scaling).
Python Engineering: Strong Python with FastAPI/Flask, async IO, typing, packaging; clean code & SOLID principles.
MLOps/Platform: Docker, Kubernetes, Git, CI/CD (Jenkins/GitHub Actions/Azure DevOps), observability (logs/metrics/traces).
APIs: Design/consume REST APIs, OpenAPI/Swagger, pagination, error models, rate limiting.
Data Handling: Experience with text preprocessing, embeddings, metadata schemas, and storage (object stores/RDBMS).
Build production-grade RAG pipelines, manage vector databases, and own LLM Opsincluding observability, evaluation, and cost/performance optimizationon secure, compliant platforms.
Good-to-Have
Agentic Architectures: Practical experience comparing and implementing LangChain/LangGraph vs Google ADK and other agentic solutions, understanding trade-offs across control flow,
memory, observability, tool-calling, scalability, and production readiness. Retrieval & Ranking: BM25, hybrid search, approximate nearest neighbor configs, rerankers (e.g., cross-encoders).
Model Ecosystem: Experience with OpenAI, Azure OpenAI, Anthropic, Cohere, local models (Llama, Mistral) and serving frameworks (vLLM/Triton).
Guardrails & Safety: Tools like Guardrails.ai, LlamaGuard, content moderation APIs; jailbreak detection.
Evaluation & Tracing: LLM traces (LangSmith, Phoenix), synthetic eval sets, human-in-the-loop feedback.
Infra as Code: Terraform/Helm; secrets management (Vault/KMS). Streaming & Queues: Kafka/RabbitMQ; event-driven RAG updates. Search Platforms: Elastic/OpenSearch integrations; hybrid retrieval pipelines. Caching & Cost Controls: Redis, response caching, token usage optimization

Keyskills: Pytorch Vector DB Artificial Intelligence LLM Ops Machine Learning Tensorflow MLOps Natural Language Processing Retrieval Augmented Generation Python
\n\nMavlers is a full-service digital marketing agency that has propelled growth for over 7,000 brands and agencies worldwide. As Google, Mailchimp, WP VIP, Microsoft, Salesforce, and HubSpot partners, we possess the expertise to deliver high-impact projects and campaigns tailored to our clients uni...