Job Details

Home

data engineer python scala hadoop : Sunnyvale: 58 per HR at Remote, Remote, USA

Email: [email protected]

From:

Jay,

Brillius

[email protected]

Reply to: [email protected]

RGS ID
(Please put the requirement ID)

9054145

# of requirements

1

Job Title

Data Engineer

Relevant Experience

(in Yrs)

8-12yr

Technical/Functional Skills

Python, ETL pipelines,Hadoop, Teradata

Experience Required

8-15yrs

Roles & Responsibilities

Prior Apple experience is a plus

Responsibilities

Experience with design, build and optimize the data architecture and Extract, Transform & Load (ETL) pipelines to make it accessible for Business Data Analysts, Data Scientists and Business Users.

Work with Analysts to productionize & scale value creating capabilities including data integration, transformations, model features, and statistical and machine learning models.

Handle data administration tasks such as scheduling jobs and fix job errors.

Write and review end-user and technical documents, including requirements and

design documents for existing and future data systems, as well as data standards and

policies.

Translate complex data and methodology into strategic, operationally practical

insights.

Explore complex data sources, becoming a SME in customer behavior across Apples

online retail platforms.

Key Qualifications

3-5 years of software development experience with very high proficiency in Python or Scala or Java.

Must have detailed knowledge of data structures and algorithms.

Solid technical database knowledge (Hadoop, Teradata, Snowflake data modeling)

and experience optimizing SQL queries on large data.

Experience working with large-scale data warehouse solutions such as Teradata,

Snowflake, or Redshift.

Hands on experience in a Unix/Linux environment.

Experience with Continuous Integration & Development and automation tools such as

Jenkins, Artifactory, Git etc.

Experience in building data engineering monitoring tools.

Familiarity with Tableau & Dataiku.

Experience with Agile and Test-Driven Development methodology.

Ability to present complex ideas in a clear, concise way.

Generic Managerial Skills

Education

BS in computer science, engineering, mathematics, statistics, econometrics or other quantitative field.

Preferred MS is computer science, engineering, statistical methods, machine learning.

Start

date

(dd-mmm-yy)

Jul-28-2023

Duration of assignment

(in Months)

12 months

Work Location

(Remote, if they can work from anywhere or specific location from where associate need to work. If you put specific location then number of profiles received may be less)

Remote (but need to work SCV timezone), preference to candidates willing to work from office, 3 days/week

Salary range

Key words to search in resume

Python, ETL pipelines,Hadoop, Teradata

Prescreening /Sample Questionnaire with Answers

Please ask the vendors to share candidate answers to the following question with each submissions

Python

What is the difference between Python Arrays, lists, tuples, sets

What are lambda functions in Python

How can we delete a column or row from a dataframe. What parameter we can use to permanently drop a column or row in a dataframe.

How can we drop one or more rows with NaN/Null values from a dataframe.

What is a List comprehensions in Python. Give an example.

Lets say there is a string s = 'orangeapple'

How can we extract 'apple' from the above string value.

DB/DW

What are different types of table joins you know explain with example.

What are indexes in DB

What is the use of Coalesce function

Difference between Delete and Truncate

What is the difference between Facts and Dimensions Give examples.

Difference between row_number, rank and dense_rank.

What are the Key Differences Between Normalization and Denormalization

In Teradata what is the difference between PI and SI

Rating Matrix

(0-5) 0 - no knowledge and 5-expert

Python

Haddop

Teradata

Keywords: database information technology microsoft Idaho

[email protected]
View all

Tue Jul 25 03:53:00 UTC 2023

To remove this job post send "job_kill 442932" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

jay.s@brillius.com wrote:
From:

Jay,

Brillius

jay.s@brillius.com

Reply to:   jay.s@brillius.com

RGS ID
 (Please put the requirement ID)

9054145

# of requirements

Job Title

Data Engineer

Relevant Experience

(in Yrs)

8-12yr

Technical/Functional Skills

Python, ETL pipelines,Hadoop, Teradata

Experience Required

8-15yrs

Roles & Responsibilities

Prior Apple experience is a plus

Responsibilities

Experience with design, build and optimize the data architecture and Extract, Transform & Load (ETL) pipelines to make it accessible for Business Data Analysts, Data Scientists and Business Users.

Work with Analysts to productionize & scale value creating capabilities including data integration, transformations, model features, and statistical and machine learning models.

Handle data administration tasks such as scheduling jobs and fix job errors.

Write and review end-user and technical documents, including requirements and

design documents for existing and future data systems, as well as data standards and

policies.

Translate complex data and methodology into strategic, operationally practical

insights.

Explore complex data sources, becoming a SME in customer behavior across Apples

online retail platforms.

Key Qualifications

3-5 years of software development experience with very high proficiency in Python or Scala or Java.

Must have detailed knowledge of data structures and algorithms.

Solid technical database knowledge (Hadoop, Teradata, Snowflake data modeling)

and experience optimizing SQL queries on large data.

Experience working with large-scale data warehouse solutions such as Teradata,

Snowflake, or Redshift.

Hands on experience in a Unix/Linux environment.

Experience with Continuous Integration & Development and automation tools such as

Jenkins, Artifactory, Git etc.

Experience in building data engineering monitoring tools.

Familiarity with Tableau & Dataiku.

Experience with Agile and Test-Driven Development methodology.

Ability to present complex ideas in a clear, concise way.

Generic Managerial Skills

Education

BS in computer science, engineering, mathematics, statistics, econometrics or other quantitative field.

Preferred MS is computer science, engineering, statistical methods, machine learning.

Start

date

(dd-mmm-yy)

Jul-28-2023

Duration of assignment

(in Months)

12 months

Work Location

(Remote, if they can work from anywhere or specific location from where associate need to work. If you put specific location then number of profiles received may be less)

Remote (but need to work SCV timezone), preference to candidates willing to work from office, 3 days/week

Salary range

Key words to search in resume

Python, ETL pipelines,Hadoop, Teradata

Prescreening /Sample Questionnaire with Answers

Please ask the vendors to share candidate answers to the following question with each submissions

Python

What is the difference between Python Arrays, lists, tuples, sets

What are lambda functions in Python

How can we delete a column or row from a dataframe. What parameter we can use to permanently drop a column or row in a dataframe.

How can we drop one or more rows with NaN/Null values from a dataframe.

What is a List comprehensions in Python. Give an example.

Lets say there is a string s = 'orangeapple'

How can we extract 'apple' from the above string value.

DB/DW

What are different types of table joins you know explain with example.

What are indexes in DB

What is the use of Coalesce function

Difference between Delete and Truncate

What is the difference between Facts and Dimensions Give examples.

Difference between row_number, rank and dense_rank.

What are the Key Differences Between Normalization and Denormalization

In Teradata what is the difference between PI and SI

Rating Matrix

(0-5) 0 - no knowledge and 5-expert

Python

Haddop

Teradata

Keywords: database information technology microsoft Idaho

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,