Your browser does not support javascript! Please enable it, otherwise web will not work for you.

AI Data Platform Reliability & Validation Engineer @ Oracle

Home > Software Development

 AI Data Platform Reliability & Validation Engineer

Job Description

Job Summary:
Oracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking anexperience engineer to help drive AI platform reliability. This role is vital to ensuring our enterprise-scale, AI-powered data platform is robust, performant, and reliable. You will develop and execute end-to-end scenario tests across distributed systems, You will design and execute end-to-end scenario tests across distributed systems, and partner with engineering and architecture teams to develop tooling that improves and maintains the platform. You will also embed operational excellence by applying modern SRE practices.

Responsibilities

  • Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.).
  • Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies.
  • Develop and maintain automated test frameworks supporting E2E, integration, performance, and regression testing for distributed data/AI services
  • Monitor system health across the stack (infrastructure, data pipelines, AI/ML workloads), proactively detect failures or SLA breaches.
  • Champion SRE best practices including observability, incident management, blameless postmortems, and runbook automation.
  • Analyze logs, traces, and metrics to identify reliability, latency, and scalability issues; drive root cause analysis and corrective actions.
  • Partner with engineering to drive high-availability, fault tolerance, and continuous delivery (CI/CD) improvements.
  • Participate in on-call rotation to support critical services, ensuring rapid resolution and minimizing customer impact.
Desired Qualifications:
  • Bachelors or masters degree in computer science, Engineering, or related field (or demonstrated equivalent experience)
  • 3+ years experience in software QA/validation, SRE, or DevOps roles, ideally in data platforms, cloud, or AI/ML environments.
  • Proficient with DevOps automation and tools for continuous integration, deployment, and monitoring (e.g., Terraform, Jenkins, GitLab CI/CD, Prometheus).
  • Working knowledge of distributed systems, data engineering pipelines, and cloud-native architectures (OCI, AWS, Azure, GCP, etc.).
  • Strong proficiency in Java, Python and related technologies
  • Hands-on experience with test automation frameworks (e.g., Selenium, pytest, JUnit) and scripting (Python, Bash, etc.).
  • Familiarity with SRE practices: service-level objectives (SLO/SLA), incident response, observability (Prometheus, Grafana, ELK, etc.).
  • Strong troubleshooting and analytical skills with a passion for reliability engineering and process automation.
  • Excellent communication and cross-team collaboration abilities.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employement Type: Full time

Contact Details:

Company: Oracle
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   AI Data Platform python regression testing java selenium gcp grafana devops jenkins gitlab terraform prometheus aws azure

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Hiring For AXIOM developer resources in Mumbai

  • Clover Infotech
  • 4 - 7 years
  • Mumbai
  • 3 days ago
₹ 5-15 Lacs P.A.

Principal Applied AI Engineer

  • Zycus Infotech
  • 6 - 11 years
  • Pune
  • 3 days ago
₹ Not Disclosed

Python + DevOps Engineer

  • TekPillar
  • 4 - 8 years
  • Pune
  • 4 days ago
₹ -15 Lacs P.A.

Data Engineer

  • Tata Consultancy
  • 5 - 10 years
  • Bengaluru
  • 4 days ago
₹ Not Disclosed

Oracle

About Accenture\\r\\n\\r\\n \\r\\n\\r\\nAccenture is a global professional services company with leading capabilities in digital, cloud and security. Combining unmatched experience and specialized skills across more than 40 industries, we offer Strategy and Consulting, Interactive, Technology and Op...