data engineer python scala hadoop : Sunnyvale: 58 per HR at Remote, Remote, USA |
Email: [email protected] |
From: Jay, Brillius [email protected] Reply to: [email protected] RGS ID (Please put the requirement ID) 9054145 # of requirements 1 Job Title Data Engineer Relevant Experience (in Yrs) 8-12yr Technical/Functional Skills Python, ETL pipelines,Hadoop, Teradata Experience Required 8-15yrs Roles & Responsibilities Prior Apple experience is a plus Responsibilities Experience with design, build and optimize the data architecture and Extract, Transform & Load (ETL) pipelines to make it accessible for Business Data Analysts, Data Scientists and Business Users. Work with Analysts to productionize & scale value creating capabilities including data integration, transformations, model features, and statistical and machine learning models. Handle data administration tasks such as scheduling jobs and fix job errors. Write and review end-user and technical documents, including requirements and design documents for existing and future data systems, as well as data standards and policies. Translate complex data and methodology into strategic, operationally practical insights. Explore complex data sources, becoming a SME in customer behavior across Apples online retail platforms. Key Qualifications 3-5 years of software development experience with very high proficiency in Python or Scala or Java. Must have detailed knowledge of data structures and algorithms. Solid technical database knowledge (Hadoop, Teradata, Snowflake data modeling) and experience optimizing SQL queries on large data. Experience working with large-scale data warehouse solutions such as Teradata, Snowflake, or Redshift. Hands on experience in a Unix/Linux environment. Experience with Continuous Integration & Development and automation tools such as Jenkins, Artifactory, Git etc. Experience in building data engineering monitoring tools. Familiarity with Tableau & Dataiku. Experience with Agile and Test-Driven Development methodology. Ability to present complex ideas in a clear, concise way. Generic Managerial Skills Education BS in computer science, engineering, mathematics, statistics, econometrics or other quantitative field. Preferred MS is computer science, engineering, statistical methods, machine learning. Start date (dd-mmm-yy) Jul-28-2023 Duration of assignment (in Months) 12 months Work Location (Remote, if they can work from anywhere or specific location from where associate need to work. If you put specific location then number of profiles received may be less) Remote (but need to work SCV timezone), preference to candidates willing to work from office, 3 days/week Salary range Key words to search in resume Python, ETL pipelines,Hadoop, Teradata Prescreening /Sample Questionnaire with Answers Please ask the vendors to share candidate answers to the following question with each submissions Python What is the difference between Python Arrays, lists, tuples, sets What are lambda functions in Python How can we delete a column or row from a dataframe. What parameter we can use to permanently drop a column or row in a dataframe. How can we drop one or more rows with NaN/Null values from a dataframe. What is a List comprehensions in Python. Give an example. Lets say there is a string s = 'orangeapple' How can we extract 'apple' from the above string value. DB/DW What are different types of table joins you know explain with example. What are indexes in DB What is the use of Coalesce function Difference between Delete and Truncate What is the difference between Facts and Dimensions Give examples. Difference between row_number, rank and dense_rank. What are the Key Differences Between Normalization and Denormalization In Teradata what is the difference between PI and SI Rating Matrix (0-5) 0 - no knowledge and 5-expert Python Haddop Teradata Keywords: database information technology microsoft Idaho |
[email protected] View all |
Tue Jul 25 03:53:00 UTC 2023 |