Home

Tomorrow Interview || Machine Learning Engineer || Full Remote at Remote, Remote, USA
Email: [email protected]
From:

Rahul Kumar,

SPAR Information Systems

[email protected]

Reply to:   [email protected]

Hello Folks,

Hope you all are doing good.

Please go through the Job description and let me know your interest. 

Title: Machine Learning Engineer

Work Location: Full Remote

Duration: Long Term Contract

The ideal candidate should have experience with Seldon core, MLFlow, Istio, Jaeger, Ambassador, Triton, PyTorch, Tensorflow/TFserving (is a plus) and Experience with distributed computing and deep learning technologies such as Apache MXNet, CUDA, cuDNN, TensorRT.

Design and implement different flavors of architecture to deliver better system performance and resiliency. 

Develop capability requirements and transition plan for the next generation of AI/ML enablement technology, tools, and processes to enable Walmart to efficiently improve performance with scale. 

Tools/Skills (hands-on experience is must): 

Administering Kubernetes. Ability to create, maintain, scale, and debug production Kubernetes clusters as a Kubernetes administrator and In-depth knowledge of Docker. 

Ability to transform designs ground up and lead innovation in system design 

Deep understanding of data center architectures, networking, storage solutions, and scale system performance 

Have worked on at least one Kubernetes cloud offering (EKS/GKE/AKS) or on-prem Kubernetes (native Kubernetes, Gravity, MetalK8s) 

Programming experience in Python, Node, Golang, or bash Ability to use observability tools (Splunk, Prometheus, and Grafana ) to look at logs and metrics to diagnose issues within the system. 

Experience with Seldon core, MLFlow, Istio, Jaeger, Ambassador, Triton, PyTorch, Tensorflow/TFserving is a plus. 

Experience with distributed computing and deep learning technologies such as Apache MXNet, CUDA, cuDNN, TensorRT 

Experience hardening a production-level Kubernetes environment (memory/CPU/GPU limits, node taints, annotations/labels, etc.) 

Experience with Kubernetes cluster networking and Linux host networking 

Experience scaling infrastructure to support high-throughput data-intensive applications 

Background with automation and monitoring platforms, MLOps ,and configuration management platforms 

Education & Experience: - 

5+ years relevant experience in roles with responsibility over data platforms and data operations dealing with large volumes of data in cloud based distributed computing environments.

Contact:

Rahul Kumar

Sr Technical Recruiter

Direct No: 469-829-4899 / Desk No: 469-409-0307 Ext: 338 

Cell No: 936-304-8292

Email: [email protected]

Keywords: artificial intelligence machine learning golang
[email protected]
View all
Wed Feb 22 00:43:00 UTC 2023

To remove this job post send "job_kill 377959" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 24

Location: , Remote