Oversee the AI/ML service lifecycle in production, ensuring seamless management of model updates and versions
Coordinate with relevant teams to facilitate the deployment and integration of new models and updates into the production environment
Ensure incident, problem, and service request management complies with enterprise ITSM standards and processes
Respond promptly to incidents related to AI model and data quality, conduct root cause analysis, and implement corrective actions
Document incident retrospectives and maintain a comprehensive knowledge base for future reference
Provide timely updates on high-priority issues to leadership and stakeholders, ensuring transparency and effective communication
Regularly collaborate with vendors (e.g., Google Cloud TAM) and development teams (data scientists, AI engineers, etc.) to understand new requirements and AI solutions
Work closely with these teams to resolve issues with existing implementations and ensure smooth operation of AI services
Serve as the primary liaison between technical teams (data scientists, AI engineers, etc.) and business stakeholders, ensuring clear and effective communication
Set up system health monitoring for performance, availability, and business functions
Configure appropriate alerts to ensure timely detection of issues and document response actions for quick resolution
Stay updated with advancements in AI and machine learning technologies to ensure the organization remains at the forefront of innovation
Experiment with new tools and techniques to improve AI operations and contribute to developing best practices and standards for AI Ops
Implement lessons learned and best practices to continuously improve the AI/ML service
Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies regarding flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary, or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so
Qualifications - External
Required Qualifications:
Graduate degree or equivalent experience
4-year bachelor's degree in Computer Science / Information Technology / Computer Engineering from a recognized university or equivalent experience
3+ years of experience in Java
2+ years of experience in Spring Boot, Microservices
1+ years of hands-on experience in AI/ML
Experience working with data analysis tools (e.g., Splunk, Dynatrace, New Relic)
Experience working with AI technologies and cloud platforms such as Google Cloud Platform (GCP)
Proven experience as a Machine Learning Engineer or similar role in the industry
Experience in AI Ops, including AI/ML model quality monitoring and alerting, AI/ML model development and continuous improvement, incident triage, troubleshooting, and continuous innovation
Solid knowledge of GenAI models such as Gemini and GPT-4, and of traditional machine learning models from the NLP/Deep Learning discipline
Solid incident management skills, with a data-driven and analytical approach to diagnosing complex issues
Excellent problem-solving and troubleshooting skills
Ability to work collaboratively with a diverse team of engineers, architects, and developers
Ability to work independently and in a team environment
Preferred Qualifications:
Machine Learning certifications
Experience in a Healthcare Payer enterprise
Experience in training team members on machine learning technologies and applications
Proficient in CRM platforms, such as Salesforce
Knowledge of data privacy regulations and practices
Solid communication and project management skills
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Back End Developer
Employment Type: Full time