Home

Looking for python/pyspark developer at Remote, Remote, USA
Email: [email protected]
Title: python/pyspark developer

Location: Whippany, NJ

 Key Responsibilities:

Develop,
optimize, and maintain ETL pipelines using PySpark to process large-scale
datasets across distributed environments.

Design
and implement complex data transformation logic using PySpark and other
Big Data tools.

Work
with various Big Data technologies such as Hadoop, Hive, HBase, Kafka, and
Spark to build robust, scalable data systems.

Collaborate
with data engineers and data scientists to integrate data from multiple
sources and create unified datasets.

Write
efficient, reusable, and scalable Python code to handle both batch and
real-time data processing tasks.

Ensure
data quality, consistency, and reliability by implementing data
validation, monitoring, and error handling.

Fine-tune
and optimize PySpark jobs to improve performance in distributed
environments.

Manage
and maintain data flows in HDFS, ensuring scalability and fault tolerance.

Perform
data extraction, aggregation, and reporting using SQL and NoSQL databases.

Participate
in system design discussions and provide recommendations for architecture
and performance improvements

--

Keywords: information technology New Jersey
Looking for python/pyspark developer
[email protected]
[email protected]
View all
Tue Oct 08 19:17:00 UTC 2024

To remove this job post send "job_kill 1821892" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 21

Location: , New Jersey