Home

Lead Data Engineer-NY-local needed at Tarrytown, New York, USA
Email: [email protected]
From:

sravani,

NityaINC

[email protected]

Reply to:   [email protected]

Hi All,

Please share profiles for below role on priority. We are looking for Senior candidates with atleast 12+ years experience. Looking for Tech experts who can lead the project as well with good communication and stakeholder management skills. Must have onshore-offshore coordination experience.

Here are the key expectations from the tech perspective: Primary skills are PySpark, RedShift, Airflow, AWS
Must have in-depth understanding of how data pipelines are built
Typical challenges with fetching data from various sources. How incremental/CDC data flows are handled.
How do you ensure data quality
How do you do Data profiling
Hands-on experience with Pyspark, Redshift (SQL) and Airflow at minimum
Should be able to design and document data model at various levels
Must have onshore-offshore coordination experience.

I

Role: Lead Data Engineer

Location: 777 Old Saw Mill River Rd, Tarrytown, NY 10591 (100% onsite role) Ideal to look for local profiles.

Job Description:
Candidate should have 12+ years of experience in Data Engineering. Must have strong work experience with onshore-offshore model
Designing, creating, testing and maintaining the complete data management & processing systems.
Candidate need to have in depth understanding of how data pipelines are built
Typical challenges with fetching data from various sources. How incremental/CDC data flows are handled.
How do you ensure data quality
How do you do Data profiling
Hands-on experience with PySpark, Redshift (SQL) and Airflow at minimum
Strong hands-on with required tech skills, flexible, right attitude to play the lead role
Should be able to design and document data model at various levels
Working closely with the stakeholders.
Building highly scalable, robust & fault-tolerant systems.
Knowledge of Hadoop ecosystem and different frameworks inside it HDFS, YARN, MapReduce, Apache Pig, Hive, Flume, Sqoop, ZooKeeper, Oozie, Impala and Kafka
Must have experience on SQL-based technologies (e.g. MySQL/ Oracle DB) and NoSQL technologies (e.g. Cassandra and MongoDB)
Should have Python/Scala/Java Programming skills
Discovering data acquisitions opportunities
Finding ways & methods to find value out of existing data.
Improving data quality, reliability & efficiency of the individual components & the complete system.
Problem solving mindset working in agile environment

Keywords: database information technology New York
[email protected]
View all
Tue Nov 21 21:54:00 UTC 2023

To remove this job post send "job_kill 876283" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 20

Location: Tarrytown, New York