Job Details

Home

Data Engineer at Remote, Remote, USA

Email: [email protected]

From:

Santhoshi,

HAN IT Staffing

[email protected]

Reply to: [email protected]

Role : Data Engineer - Onsite (Need Local)

Work location: HARRISBURG (US:17105), PA

Client : Capgemini

Job Description :

.As an AWS Redshift Data Engineer, the primary responsibility is to design, implement, and maintain data solutions using Amazon Redshift. Your role involves working with large-scale data sets, optimizing performance, and ensuring the reliability and availability of the Redshift environment. Here is a general job description for an AWS Redshift Data Engineer:

1. Data Modeling and Design: Develop and maintain data models for Redshift databases, including schema design, table structures, and optimization techniques. Collaborate with data architects and stakeholders to understand requirements and translate them into efficient data structures.

2. ETL Development: Design and implement Extract, Transform, Load (ETL) processes to extract data from various sources, transform it as per business requirements, and load it into Redshift. Develop efficient and scalable ETL workflows, considering data quality, performance, and data governance.

3. Performance Optimization: Optimize query performance by creating appropriate data distribution keys, sort keys, and compression techniques. Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance.

4. Data Integration: Integrate Redshift with other AWS services, such as AWS Glue, AWS Lambda, Amazon S3, and more, to build end-to-end data pipelines. Ensure seamless data flow between different systems and platforms, maintaining data integrity and consistency.

5. Monitoring and Troubleshooting: Implement monitoring and alerting systems to proactively identify issues and ensure the stability and availability of the Redshift cluster. Perform troubleshooting, diagnose and resolve data-related issues, and work closely with support teams to resolve any performance or operational problems.

6. Security and Compliance: Implement security best practices to protect data stored in Redshift. Ensure compliance with data privacy regulations and industry standards, such as GDPR and HIPAA. Implement encryption, access controls, and data masking techniques to secure sensitive data.

7. Documentation and Collaboration: Maintain documentation of data models, ETL processes, and system configurations. Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand data requirements and provide data solutions that meet their needs.

8. Scalability and Capacity Planning: Plan and execute strategies for scaling Redshift clusters to handle increasing data volumes and user demands. Monitor resource utilization, track data growth, and make recommendations for capacity planning and infrastructure scaling.

9. Knowledge or previous experience in oracle PLSQL will be added advantage.

Keywords: sthree information technology Pennsylvania
Data Engineer
[email protected]

[email protected]
View all

Tue Jul 23 18:56:00 UTC 2024

To remove this job post send "job_kill 1586879" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

santhoshi@hanstaffing.com wrote:
From:

Santhoshi,

HAN IT Staffing

santhoshi@hanstaffing.com

Reply to:   santhoshi@hanstaffing.com

Role : Data Engineer - Onsite (Need Local)

Work location: HARRISBURG (US:17105), PA

Client : Capgemini

Job Description :

.As an AWS Redshift Data Engineer, the primary responsibility is to design, implement, and maintain data solutions using Amazon Redshift. Your role involves working with large-scale data sets, optimizing performance, and ensuring the reliability and availability of the Redshift environment. Here is a general job description for an AWS Redshift Data Engineer:

1.   Data Modeling and Design: Develop and maintain data models for Redshift databases, including schema design, table structures, and optimization techniques. Collaborate with data architects and stakeholders to understand requirements and translate them into efficient data structures.

2.    ETL Development: Design and implement Extract, Transform, Load (ETL) processes to extract data from various sources, transform it as per business requirements, and load it into Redshift. Develop efficient and scalable ETL workflows, considering data quality, performance, and data governance.

3.  Performance Optimization: Optimize query performance by creating appropriate data distribution keys, sort keys, and compression techniques. Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance.

4.      Data Integration: Integrate Redshift with other AWS services, such as AWS Glue, AWS Lambda, Amazon S3, and more, to build end-to-end data pipelines. Ensure seamless data flow between different systems and platforms, maintaining data integrity and consistency.

5.   Monitoring and Troubleshooting: Implement monitoring and alerting systems to proactively identify issues and ensure the stability and availability of the Redshift cluster. Perform troubleshooting, diagnose and resolve data-related issues, and work closely with support teams to resolve any performance or operational problems.

6.        Security and Compliance: Implement security best practices to protect data stored in Redshift. Ensure compliance with data privacy regulations and industry standards, such as GDPR and HIPAA. Implement encryption, access controls, and data masking techniques to secure sensitive data.

7.   Documentation and Collaboration: Maintain documentation of data models, ETL processes, and system configurations. Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand data requirements and provide data solutions that meet their needs.

8.     Scalability and Capacity Planning: Plan and execute strategies for scaling Redshift clusters to handle increasing data volumes and user demands. Monitor resource utilization, track data growth, and make recommendations for capacity planning and infrastructure scaling.

9. Knowledge or previous experience in oracle PLSQL will be added advantage.

Keywords: sthree information technology Pennsylvania 
Data Engineer
santhoshi@hanstaffing.com

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,