Azure Data Engineer @ Infogain

Job Description

  • Lead the design and execution of the Dataproc-to-Databricks PySpark migration roadmap.
  • Define the modernization strategy, including data ingestion, transformation, orchestration, and governance.
  • Architect scalable Delta Lake and Unity Catalog-based solutions.
  • Manage and guide teams on code conversion, dependency mapping, and data validation.
  • Collaborate with platform, infra, and DevOps teams to optimize compute costs and performance.
  • Own the automation and GenAI acceleration layer, integrating code parsers, lineage tools, and validation utilities.
  • Conduct performance benchmarking, cost optimization, and platform tuning (Photon, Auto-scaling, Delta Caching).
  • Mentor senior and mid-level developers, ensuring quality standards, documentation, and delivery timelines.
Technical Skills
  • Languages: Python, PySpark, SQL
  • Platforms: Databricks (Jobs, Workflows, Delta Live Tables, Unity Catalog), GCP Dataproc
  • Data Tools: Hadoop, Hive, Pig, Spark (RDD & DataFrame APIs), Delta Lake
  • Cloud & Integration: GCS, BigQuery, Pub/Sub, Cloud Composer, Airflow
  • Automation: GenAI-powered migration tools, custom Python utilities for code conversion
  • Version Control & DevOps: Git, Terraform, Jenkins, CI/CD pipelines
  • Other: Performance tuning, cost optimization, and lineage tracking with Unity Catalog
Preferred Experience
  • 10-14 years of data engineering experience, with at least 3 years leading Databricks or Spark modernization programs.
  • Proven success in migration or replatforming projects from Hadoop or Dataproc to Databricks.
  • Exposure to AI/GenAI in code transformation or data engineering automation.
  • Strong stakeholder management and technical leadership skills.
Experience
  • 11-12 years

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employment Type: Full time

Contact Details:

Company: Infogain
Location(s): Noida, Gurugram

Key skills: Hive, Python, technical leadership, data validation, performance tuning, Airflow, PySpark, Apache Pig, data engineering, artificial intelligence, SQL, Dataproc, Databricks, automation, Apache, Git, stakeholder management, Spark, GCP, data ingestion, Hadoop, BigQuery


Salary: ₹ Not Disclosed

Infogain

A global digital engineering company delivering technology solutions that accelerate business outcomes. It specializes in cloud, data, AI, and experience-led transformation for enterprises across industries.