Job Details

Home

Big Data Engineer with extensive experience in PySpark Onsite at Dallas TX and Richmond VA at Richmond, Virginia, USA

Email: [email protected]

From:

Debasish Pattnaik,

MRTECHNOSOFT

[email protected]

Reply to: [email protected]

Big Data Engineer with extensive experience in PySpark

Onsite at Dallas TX/Richmond VA

We are seeking a highly skilled Big Data Engineer with extensive experience in PySpark to join our dynamic team. The ideal candidate will be responsible for designing, developing, and maintaining big data solutions to support our data-driven initiatives. This role requires a deep understanding of big data technologies, distributed systems, and the ability to work collaboratively with cross-functional teams.Key Responsibilities:
Design and implement scalable data pipelines using PySpark and other big data technologies.
Develop and maintain data processing workflows to ingest, process, and analyze large datasets.
Optimize and tune PySpark applications for performance and scalability.
Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver robust solutions.
Ensure data quality, integrity, and security across all data processing activities.
Monitor and troubleshoot data pipeline issues, providing timely resolutions.
Stay up-to-date with the latest industry trends and advancements in big data technologies.
Contribute to the development of best practices and standards for data engineering.Qualifications:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
3+ years of experience in big data engineering, with a focus on PySpark.
Strong proficiency in PySpark, including data frame operations, RDDs, and Spark SQL.
Experience with big data technologies such as Hadoop, HDFS, Hive, and Kafka.
Proficient in programming languages such as Python, Scala, or Java.
Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and their big data services.
Strong understanding of data warehousing concepts and ETL processes.
Experience with version control systems (e.g., Git) and CI/CD pipelines.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.Preferred Qualifications:
Experience with real-time data processing frameworks like Apache Flink or Apache Storm.
Knowledge of machine learning frameworks and libraries.
Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
Certifications in big data technologies or cloud platforms.

Thanks

Debasish Pattnaik

[email protected]

www.mrtechnosoft.com

Keywords: continuous integration continuous deployment Texas Virginia
Big Data Engineer with extensive experience in PySpark Onsite at Dallas TX and Richmond VA
[email protected]

[email protected]
View all

Fri Jun 28 20:19:00 UTC 2024

To remove this job post send "job_kill 1520565" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

d.pattanaik@mrtechnosoft.com wrote:
From:

Debasish Pattnaik,

MRTECHNOSOFT

d.pattanaik@mrtechnosoft.com

Reply to:   d.pattanaik@mrtechnosoft.com

Big Data Engineer with extensive experience in PySpark

Onsite at Dallas TX/Richmond VA

We are seeking a highly skilled Big Data Engineer with extensive experience in PySpark to join our dynamic team. The ideal candidate will be responsible for designing, developing, and maintaining big data solutions to support our data-driven initiatives. This role requires a deep understanding of big data technologies, distributed systems, and the ability to work collaboratively with cross-functional teams.Key Responsibilities:
Design and implement scalable data pipelines using PySpark and other big data technologies.
Develop and maintain data processing workflows to ingest, process, and analyze large datasets.
Optimize and tune PySpark applications for performance and scalability.
Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver robust solutions.
Ensure data quality, integrity, and security across all data processing activities.
Monitor and troubleshoot data pipeline issues, providing timely resolutions.
Stay up-to-date with the latest industry trends and advancements in big data technologies.
Contribute to the development of best practices and standards for data engineering.Qualifications:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
3+ years of experience in big data engineering, with a focus on PySpark.
Strong proficiency in PySpark, including data frame operations, RDDs, and Spark SQL.
Experience with big data technologies such as Hadoop, HDFS, Hive, and Kafka.
Proficient in programming languages such as Python, Scala, or Java.
Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and their big data services.
Strong understanding of data warehousing concepts and ETL processes.
Experience with version control systems (e.g., Git) and CI/CD pipelines.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.Preferred Qualifications:
Experience with real-time data processing frameworks like Apache Flink or Apache Storm.
Knowledge of machine learning frameworks and libraries.
Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
Certifications in big data technologies or cloud platforms.

Thanks

Debasish Pattnaik

d.pattanaik@mrtechnosoft.com

www.mrtechnosoft.com

Keywords: continuous integration continuous deployment Texas Virginia 
Big Data Engineer with extensive experience in PySpark  Onsite at Dallas TX and Richmond VA
d.pattanaik@mrtechnosoft.com

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 111

Location: , Indiana