Data Engineer (Local to NY) at Tarrytown, New York, USA |
Email: [email protected] |
From: supriya, Nitya Software Solutions [email protected] Reply to: [email protected] Hi, Hope you are doing great.! This is Supriya from Nitya Software Solutions Inc; This mail is regarding the job opportunity. Please let me know if you are interested and available for the below position. If you are interested, Please share your updated resume to [email protected] or you can reach me at 8017834423. I would appreciate if you can refer someone for this position. Please share profiles for below role on priority. We are looking for Senior candidates with atleast 12+ years experience. Looking for Tech experts who can lead the project as well with good communication and stakeholder management skills. Must have onshore-offshore coordination experience. Here are the key expectations from the tech perspective: Primary skills are PySpark, RedShift, Airflow, AWS Must have in-depth understanding of how data pipelines are built Typical challenges with fetching data from various sources. How incremental/CDC data flows are handled. How do you ensure data quality How do you do Data profiling Hands-on experience with Pyspark, Redshift (SQL) and Airflow at minimum Should be able to design and document data model at various levels Must have onshore-offshore coordination experience. Indent: PSI204070_2-7-1 Role: Lead Data Engineer Location: 777 Old Saw Mill River Rd, Tarrytown, NY 10591 Job Description: Candidate should have 12+ years of experience in Data Engineering. Must have strong work experience with onshore-offshore model Designing, creating, testing and maintaining the complete data management & processing systems. Candidate need to have in depth understanding of how data pipelines are built Typical challenges with fetching data from various sources. How incremental/CDC data flows are handled. How do you ensure data quality How do you do Data profiling Hands-on experience with PySpark, Redshift (SQL) and Airflow at minimum Strong hands-on with required tech skills, flexible, right attitude to play the lead role Should be able to design and document data model at various levels Working closely with the stakeholders. Building highly scalable, robust & fault-tolerant systems. Knowledge of Hadoop ecosystem and different frameworks inside it HDFS, YARN, MapReduce, Apache Pig, Hive, Flume, Sqoop, ZooKeeper, Oozie, Impala and Kafka Must have experience on SQL-based technologies (e.g. MySQL/ Oracle DB) and NoSQL technologies (e.g. Cassandra and MongoDB) Should have Python/Scala/Java Programming skills Discovering data acquisitions opportunities Finding ways & methods to find value out of existing data. Improving data quality, reliability & efficiency of the individual components & the complete system. Problem solving mindset working in agile environment Keywords: database information technology New York |
[email protected] View all |
Tue Nov 21 23:14:00 UTC 2023 |