Lead Data Scientist onsite GC GC EAD USC only at Remote, Remote, USA |
Email: [email protected] |
From: shaik, Convextech [email protected] Reply to: [email protected] Job Title: IT - Data Scientist City & State: PITTSBURGH, Pennsylvania Work Location: P2PTPP - Two PNC Plaza Work Permit: USC, GC, GC-EAD Position Location: ONSITE Pittsburgh preferred, Cleveland, New Jersey, DC, Atlanta, Raleigh NC, Columbus, Dallas, Philadelphia 1-2 days a week onsite, if further from an office location (1.5 hours, 4 days a month in office) Remote: No Potential for Contract Extension: Yes If yes, details (intended length of extension): 12 month budget and can be extended based on project need Project/ initiative Financial Crime Modeling team (Anti Money Laundering/ Sanctions) Candidate Technical and skills profile: Education/Experience: Master of Science degree in computer science or in a relevant field or equivalent work experience required 8+ years of relevant experience required. Key Experience : 6+ years of financial solutions architecture, software development, data engineering, data science or business intelligence engineering experience with minimum 3 Years recent hands-on experience in PySpark 3+ year of experience with Machine Learning code development Deep knowledge of Hadoop ecosystem and Big Data technologies such as Spark, Hive, Hbase, Oozie, Kafka, YARN, SLURM Spark query tuning and performance optimization Experience and good understanding of Apache Spark Data sources API Advanced experience in Python and common python libraries/ Scala/ Java Strong analytical experience with database in writing complex queries, query optimization, debugging, user-defined functions, views, indexes, etc. Strong working experience with source control systems such as Git, Bitbucket, and Jenkins build and continuous integration tools. Experience working with Microservices, Rest API and Oauth Experience working with one or more Agile development methods proven consulting and delivery leadership in data transformation, data modeling, data analytics, data visualization and/or data science Must have technical skills/experience (ask for alternative/tool/version): Hadoop Cloud Native Architecture/ Application Design + Build Ci-CD pipeline/ tools Pyspark/ Python Databricks (1 Azure, 2 AWS, 3 Google) Flex Skills: 2+ years of experience with a public cloud (AWS, Microsoft Azure) 4+ years of experience with NoSQL implementation (ELK, Mongo, Cassandra) 1+ year of experience with process orchestration including AirFlow, KubeFlow Data lake and Delta lake experience Familiarity with Metadata Management, Data Quality frameworks and Data as a Service concepts a big plus. Alation experience preferred Banking or financial services experience is a big plus Keywords: continuous integration continuous deployment information technology green card North Carolina |
[email protected] View all |
Mon Jul 31 19:59:00 UTC 2023 |