Job Details

Home

Machine Learning Engineer or Data Engineer,No H1B at Remote, Remote, USA

Email: [email protected]

From:

Shikha,

KPG99

[email protected]

Reply to:   [email protected]

Hi,

Hope you are doing well.

Please find the job description below and let me know your interest.

Position

: Machine Learning Engineer or Data Engineer,

No H1B

Location: Hybrid in Plano, TX or Pleasanton, CA

Duration: 6+ Month

MOI: Phone and Video

Interview Process

2 interviews. 1st with engineering manager or Aravind & 2nd will be technical from Tech lead or architect.

30 min for 1st round

45 min for 2nd round

Reason for position

Looking to expend the team by 20%

Team Overview

The team is spread between 4 PODS that are mix skill set between ML and Data Engineer

Project Description

2 main projects:

Personalization
focused on customer how do we solve customer related problems. Products/content, putting the right product/content in front of the customer. The team call Customer Team

Business growth
anything that is related to Supply Chain, inventory, price, merchandising

In both of these fields/projects, Aravind works closely with Data Science team to put the Data Science model into production, which means Aravinds team, build the pipelines end to end which includes data preparation and data cleaning, training, model evaluation, model metrics.

Duties/Day to Day Overview

This Engineer will be responsible for sending the model to production which means creating the pipeline which can run the task, training and put it into production. Someone else would have to consume it and it would have to be expose to some serving API layer.

Top Requirements

(Must haves)

Having ML Engineering background and understanding the ML Lifecycle experience is critical

Candidate needs to understand at what stage what is happening and what needs to be done


Need someone who is strong technically, using spark as a way of using distributed framework and having deep knowledge in that. If using data from snowflake or maybe using data from data lake, the candidate will need to understand how the data is organized how you structure the data in order to have an optimal read. How do you write queries to make sure the performance of the data that you are reading on the read queries is optimal.

The 2nd aspect we look in the candidate, once these models are done it's some other consuming application need to consume it.

Example, customer if you now producing recommendations for the customer it needs to be shown to different end, on an app or mobile application so we are looking at integrations working closely with those teams around building the integration points so

that's becomes like a software engineering skill set.

This is NOT a Data Science role

, they
will not be looking at what model and what algorithm that they need to solve the business part. Not looking for data science who understand the model itself



Additional Qualifications

(Nice to Haves)

If a candidate has a data science experience but willing to become a ML Engineer,: The sample candidate of a resume that we reviewed had the following skill set:

Has data analysis experience

Data imputation using Scikit-learning package in Python with Spark with 2.0

Used predictive models using machine learning algorithms

Used Random Forest, K-mean Clustering

Used XGboost to predict customer life time value

AWS with primary with Spark

Although the title shows as Data Scientist but it does not look like he's data scientist. Normally data scientists are really exploring doing an exploratory study and figuring out what model to use and they are always stick to one specific model. Unlike this profile he uses many models in one project

Additional Notes

Will be nice if candidate comes from:
Retail, supply chain, merchandising, pricing optimization and personalization.

Platform that is currently being used by this team

Spark

Python

Data lake snowflake

Azure (cloud)

Keywords: machine learning information technology California Texas

[email protected]
View all

Fri Jul 28 00:52:00 UTC 2023

To remove this job post send "job_kill 457603" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

shikha@kpgtech.com wrote:
From:

Shikha,

KPG99

shikha@kpgtech.com

Reply to:   shikha@kpgtech.com

Hi,

Hope you are doing well.

Please find the job description below and let me know your interest.

Position

: Machine Learning Engineer or Data Engineer,

No H1B

Location: Hybrid in Plano, TX or Pleasanton, CA

Duration: 6+ Month

MOI:  Phone and Video

Interview Process

2 interviews. 1st with engineering manager or Aravind & 2nd will be technical from Tech lead or architect.

30 min for 1st round

45 min for 2nd round

Reason for position

Looking to expend the team by 20%

Team Overview

The team is spread between 4 PODS that are mix skill set between ML and Data Engineer

Project Description

2 main projects:

Personalization 
 focused on customer  how do we solve customer related problems. Products/content, putting the right product/content in front of the customer. The team call Customer Team

Business growth  
anything that is related to Supply Chain, inventory, price, merchandising

In both of these fields/projects, Aravind works closely with Data Science team to put the Data Science model into production, which means Aravinds team, build the pipelines end to end which includes data preparation and data cleaning, training, model evaluation, model metrics.

Duties/Day to Day Overview

This Engineer will be responsible for sending the model to production which means creating the pipeline which can run the task, training and put it into production. Someone else would have to consume it and it would have to be expose to some serving API layer.

Top Requirements

(Must haves)

Having ML Engineering background and understanding the ML Lifecycle experience is critical

Candidate needs to understand at what stage what is happening and what needs to be done

Need someone who is strong technically, using spark as a way of using distributed framework and having deep knowledge in that. If using data from snowflake or maybe using data from data lake, the candidate will need to understand how the data is organized how you structure the data in order to have an optimal read. How do you write queries to make sure the performance of the data that you are reading on the read queries is optimal.

The 2nd aspect we look in the candidate, once these models are done it's some other consuming application need to consume it.

Example, customer if you now producing recommendations for the customer it needs to be shown to different end, on an app or mobile application so we are looking at integrations working closely with those teams around building the integration points so

that's becomes like a software engineering skill set.

This is NOT a Data Science role

, they
 will not be looking at what model and what algorithm that they need to solve the business part. Not looking for data science who understand the model itself

Additional Qualifications

(Nice to Haves)

If a candidate has a data science experience but willing to become a ML Engineer,: The sample candidate of a resume that we reviewed had the following skill set:

Has data analysis experience

Data imputation using Scikit-learning package in Python with Spark with 2.0

Used predictive models using machine learning algorithms

Used Random Forest, K-mean Clustering

Used XGboost to predict customer life time value

AWS with primary with Spark

Although the title shows as Data Scientist but it does not look like he's data scientist. Normally data scientists are really exploring doing an exploratory study and figuring out what model to use and they are always stick to one specific model. Unlike this profile he uses many models in one project

Additional Notes

Will be nice if candidate comes from: 
Retail, supply chain, merchandising, pricing optimization and personalization.

Platform that is currently being used by this team

Spark

Python

Data lake snowflake

Azure (cloud)

Keywords: machine learning information technology California Texas

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 22

Location: , Indiana