AWS Data Architect (EMR and Iceberg experience needed) - Remote, USA
Email: [email protected]
Location:
Woodland Hills, CA

Duration:
12+ Months

Responsibilities:

Designing and Implementing Data Solutions: Create
blueprints for data storage, processing, and access using AWS services
like S3, Glue, EMR, and Kinesis, considering factors like performance,
scalability, security, and cost-effectiveness.

Lakehouse and ETL: Design and build components of a
Lakehouse solution on AWS, using Lambda, EMR, and EKS as compute resources
and Iceberg and S3 as storage resources. Design and implement ETL (Extract,
Transform, Load) pipelines to move and transform data from various sources
into the Lakehouse across Bronze, Silver, and Gold layers.

Data Delivery for Consumption: Design data access
for querying with Athena, and integrate Iceberg tables in the Glue Catalog
with Snowflake for analytics and reporting.

Big Data Solutions: Working with large datasets and
distributed computing using services like EMR (with Spark) for processing,
and building streaming solutions with Kinesis and Kafka.

Data Governance and Security: Ensuring data quality,
compliance with regulations (like GDPR), and implementing security
measures to protect sensitive information.

Collaboration and Communication: Working with
stakeholders (business analysts, data scientists, developers) to
understand their needs and translate them into technical solutions. You'll
need to explain complex concepts clearly.

Skills:

AWS Cloud Expertise: Deep understanding of core AWS
services (S3, EC2, EKS, EMR, VPC, IAM, Glue, Athena).

AWS certifications (e.g., Solutions Architect, Big Data
Specialty) are highly valued.

Data Warehousing and Modeling: Strong knowledge of
dimensional modeling, schema design, and data warehousing principles.

ETL and Data Pipelines: Experience with tools and
techniques for data extraction, transformation, and loading.

Big Data Technologies: Familiarity with Hadoop, Spark,
Hive, and other big data frameworks.

Databases: Proficiency in SQL and experience with
relational databases are a must; experience with NoSQL databases (like
DynamoDB) is nice to have.

Programming: Hands-on coding in Python with PySpark for
automation and data processing tasks.

Data Governance and Security: Understanding of data
security best practices, access control, and compliance requirements.

--

Sat Nov 09 00:59:00 UTC 2024


Location: Woodland, California