Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Lead II - Software Engineering @ UST

Home > Devops

 Lead II - Software Engineering

Job Description

Job Title: Machine Learning Operations Lead Engineer

Job Summary: We are seeking an experienced MLOps Lead Engineer to design, implement, and manage scalable machine learning infrastructure and deployment pipelines. You will collaborate with data scientists, ML engineers, and DevOps teams to ensure models are production-ready, resilient, and continuously improving. The ideal candidate has a strong background in both software engineering and machine learning operations, with a passion for automation, reliability, and performance.

Key Responsibilities:

  • Lead the design and implementation of MLOps frameworks and CI/CD pipelines for ML models.
  • Manage the end-to-end ML lifecycle: data ingestion, training, validation, deployment, monitoring, and retraining.
  • Collaborate with data science and engineering teams to operationalize ML models in production.
  • Ensure scalability, security, and compliance in ML workflows.
  • Build and maintain automated systems for model versioning, model registry, and experiment tracking.
  • Monitor ML model performance in production and establish ing and rollback strategies.
  • Mentor and guide junior MLOps and ML engineers.

Required Skills and Qualifications:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 8+ years of experience in software engineering, DevOps, or MLOps roles (minimum 3+ years of experience in MLOps).
  • Proficiency in ML frameworks (TensorFlow, PyTorch) and MLOps tools (MLflow, Kubeflow, Airflow).
  • Experience with Docker, Kubernetes, and cloud platforms (AWS, GCP, or Azure).
  • Strong coding skills in Python and scripting languages like Bash or YAML.
  • Knowledge of model monitoring, data drift detection, and A/B testing.
  • Familiarity with data versioning tools (e.g.,Github, DVC) and model registry tools.
  • Experience in monitoring productized ML models using tools such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), AWS SageMaker Model Monitor.
  • Experience with Jenkins, Terraform, and GitHub Actions.

Required Skills
  • 8+ years of experience in software engineering, DevOps, or MLOps roles
  • Minimum 3+ years of experience in MLOps
  • Proficiency in ML frameworks (TensorFlow, PyTorch) and
  • MLOps tools (MLflow, Kubeflow, Airflow).

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: UST
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   kubernetes continuous integration docker coding logstash elastic search tensorflow operations gcp devops jenkins pytorch software engineering prometheus kibana scripting languages ml yaml cd python github elk microsoft azure machine learning framework grafana terraform bash aws

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Site Reliability Engineer Lead

  • Kiya.ai
  • 9 - 14 years
  • Chennai
  • 1 day ago
₹ Not Disclosed

Senior Software Engineer

  • Sunquest Information
  • 8 - 10 years
  • Bengaluru
  • 2 days ago
₹ Not Disclosed

Senior Software Engineer

  • Sunquest Information
  • 8 - 13 years
  • Bengaluru
  • 2 days ago
₹ Not Disclosed

Senior Software Engineer

  • Sunquest Information
  • 8 - 13 years
  • Bengaluru
  • 2 days ago
₹ Not Disclosed

UST

INDUSTRIAL ROBOTICS INSTITUTE