Python Spark AWS - Columbus, OH - non-local candidates accepted |
Email: [email protected] |
Job Title: Python Spark AWS
Location: Columbus, OH (non-local candidates accepted)

Job Responsibilities:
- Develop and maintain data platforms using Python, Spark, and PySpark.
- Handle migration to PySpark on AWS.
- Design and implement data pipelines.
- Work with AWS and Big Data.
- Produce unit tests for Spark transformations and helper methods.
- Create Scala/Spark jobs for data transformation and aggregation.
- Write Scaladoc-style documentation for code.
- Optimize Spark queries for performance.
- Integrate with SQL databases (e.g., Microsoft, Oracle, Postgres, MySQL).
- Understand distributed systems concepts (CAP theorem, partitioning, replication, consistency, and consensus).

Skills:
- Proficiency in Python, Scala (with a focus on functional programming), and Spark.
- Familiarity with Spark APIs, including RDD, DataFrame, MLlib, GraphX, and Streaming.
- Experience working with HDFS, S3, Cassandra, and/or DynamoDB.
- Deep understanding of distributed systems.
- Experience with building or maintaining cloud-native applications.
- Familiarity with serverless approaches using AWS Lambda is a plus.

Contact:
Ajay Jha
[email protected]
linkedin.com/in/ajay-jha-4717b294
Diverse Lynx LLC, 300 Alexander Park, Suite 200, Princeton, NJ 08520

Keywords: sthree New Jersey Ohio
Tue Aug 13 22:09:00 UTC 2024 |