Databricks Data Engineer with PySpark (Azure or AWS) - Atlanta, GA (Day 1 Onsite) at Atlanta, Georgia, USA
Email: [email protected]
Hi Professional,

I am writing to let you know about a job opportunity as a Databricks Data Engineer with PySpark (Azure or AWS). The job description is below for your review.

Job Title: Databricks Data Engineer with PySpark (Azure or AWS)

Location: Atlanta, GA (Day 1 Onsite) 

Job Type: Contract

Must have 11+ years of experience.

Job Description:

As a Senior Data Engineer, you will be responsible for designing, developing, and maintaining data pipelines using PySpark.

You will work closely with data scientists, analysts, and other stakeholders to ensure the efficient processing and analysis of large datasets, while handling complex transformations and aggregations.

Required Skills and Experience:

Bachelor's degree in Computer Science, Engineering, or a related field.

10 years of experience in data engineering, with a focus on PySpark and a graph database such as Neo4j, Neptune DB, or any other graph DB.

Strong understanding of data modeling, database architecture and schema design.

Proficiency in Python and Spark, with strong coding and debugging skills.

Experience with big data technologies such as Hadoop, Hive, and Kafka.

Hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP).

Strong knowledge of SQL and experience with relational databases (e.g., PostgreSQL, MySQL, SQL Server).

Experience with data warehousing solutions like Redshift, Snowflake, Databricks or Google BigQuery.

Familiarity with data lake architectures and data storage solutions.

Knowledge of CI/CD pipelines and version control systems (e.g., Git).

Excellent problem-solving skills and the ability to troubleshoot complex issues.

Strong communication and collaboration skills, with the ability to work effectively in a team environment.

Preferred Skills:

Knowledge of machine learning workflows and experience working with data scientists.

Understanding of data security and governance best practices.

Experience with containerization technologies such as Docker and Kubernetes.

Experience with orchestration tools like Apache Airflow or AWS Step Functions.

Familiarity with streaming data platforms and real-time data processing.

Key Responsibilities:

Design, develop, and maintain scalable and efficient ETL pipelines using PySpark.

Collaborate across functional areas to translate business processes and problems into optimal data models and analytical solutions that drive business value.

Design data models by interacting with several business teams.

Manage the data collection process, providing interpretation and recommendations to management.

Build and optimize graph database solutions to support data-driven decision making and advanced analytics, and integrate them into data pipelines.

Optimize and tune PySpark applications for performance and scalability.

Collaborate with data scientists and analysts to understand data requirements, review business requirement documents, and deliver high-quality datasets.

Implement data quality checks and ensure data integrity.

Monitor and troubleshoot data pipeline issues and ensure timely resolution.

Stay up to date with the latest trends and technologies in big data and distributed computing.

Thanks & Regards

Maneesh Sanghi | US IT Recruiter

Centraprise Corp

Desk/Direct: 848-271-1949 Ext 1039

33 Wood Avenue South, Suite 600, Iselin, NJ 08830

[email protected]

Fri Nov 08 04:19:00 UTC 2024
