Minimum 3 years of experience in designing and implementing scalable big data infrastructure in an agile environment.
Strong hands-on experience in Spark, Spark Streaming, Hive, Spark SQL and Data Frames with Python.
Thorough understanding of Python with Test Driven Development.
Good understanding of Object Oriented Analysis and Design.
Experience with columnar databases like HBase and MongoDB.
Experienced in engineering systems from the ground up: familiar with OS-level, distributed databases, big data clusters.
Hands-on functional programming
NoSQL databaseslike Mongo DB
Unit Test and Coverage in Python
Knowledge on shell scripting
Knowledge of Cloud Platforms - AWS

Keyskills: c java script python bigdata django analytics hadoop ph