Your browser does not support javascript! Please enable it, otherwise web will not work for you.

PySpark Professional @ Cirruslabs

Home > DBA / Data warehousing

 PySpark Professional

Job Description

CirrusLabs Private Limited is looking for PySpark Professional to join our dynamic team and embark on a rewarding career journey
  • Apache Spark Fundamentals: You have a solid understanding of the Apache Spark architecture, its components like Spark Core, Spark SQL, Spark Streaming, MLlib, and Spark GraphX
  • Python Programming: You are proficient in Python programming language as PySpark heavily relies on Python APIs for data manipulation, analysis, and processing
  • Data Manipulation and Analysis: You are experienced in performing data manipulation tasks such as filtering, transforming, aggregating, and joining large datasets using PySpark DataFrame API or RDDs (Resilient Distributed Datasets)
  • Spark SQL: You can write SQL queries using Spark SQL for querying structured data and performing analytics operations on DataFrames and tables
  • Data Processing Pipelines: You are capable of designing and building end-to-end data processing pipelines using PySpark that can handle various stages of data ingestion, cleaning, transformation, and analysis
  • Performance Optimization: You have knowledge of techniques for optimizing PySpark jobs and improving the performance of Spark applications, including partitioning, caching, and tuning the execution settings
  • Integration with External Systems: You can integrate PySpark with various data sources and file formats such as HDFS, S3, Hive, Parquet, Avro, JSON, CSV, etc
Disclaimer: This job description has been sourced from a public domain and may have been modified by Naukri.com to improve clarity for our users. We encourage job seekers to verify all details directly with the employer via their official channels before applying.

Job Classification

Industry: Software Product
Functional Area / Department: Engineering - Software & QA
Role Category: DBA / Data warehousing
Role: Data warehouse Architect / Consultant
Employement Type: Full time

Contact Details:

Company: Cirruslabs
Location(s): Hyderabad

+ View Contactajax loader


Keyskills:   HTTP ETL Lead Data warehousing digital transformation Analytics Python

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

ERP Cloud Techno-fucntional Professional

  • NCR Corporation
  • 9 - 14 years
  • Noida, Gurugram
  • 23 days ago
₹ Not Disclosed

Kinaxis Professional

  • Birlasoft
  • 8 - 10 years
  • Mumbai
  • 28 days ago
₹ Not Disclosed

Billing Support Professional

  • Infobeans
  • 4 - 7 years
  • Indore
  • 1 month ago
₹ Not Disclosed

AWS Glue + PySpark

  • Cognizant
  • 8 - 13 years
  • Hyderabad
  • 1 month ago
₹ Not Disclosed

Cirruslabs

We are CirrusLabs. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make...