Home

Urgent opening on ML Ops Engineer (Bentonville, AR Complete onsite) at Bentonville, Arkansas, USA
Email: [email protected]
Hi Everybody,

Greetings!!

We have Urgent opening on ML
Ops Engineer (Bentonville, AR Complete onsite)

Role: ML Ops Engineer

Location: Bentonville, AR Complete onsite

Exp- 12+Years

Domain Experience (If any)
Retail

Must have skills.

Python, Node, Golang, or
bash

Experience with Seldon core,
MLFlow, Istio, Jaeger, Ambassador, Triton, PyTorch, Tensorflow/TFserving is a
plus.

Experience with distributed
computing and Apache MXNet, CUDA, cuDNN, TensorRT

Experience production-level
Kubernetes environment (memory/CPU/GPU limits, node taints, annotations/labels,
etc.)

Key Responsibilities:

Work with client's AI/ML
Platform Enablement team within the eCommerce Analytics team. The broader team
is currently on a transformation path, and this role will be instrumental in
enabling the broader team's vision.

Work closely with data
scientists to help with production models and maintain them in production.

Deploy and configure
Kubernetes components for production cluster, including API Gateway, Ingress,
Model Serving, Logging, Monitoring, Cron Jobs, etc. Improve the model
deployment process for MLE for faster builds and simplified workflows

Be a technical leader on
various projects across platforms and a hands-on contributor of the entire
platform's architecture

System administration,
security compliance, and internal tech audits

Responsible for leading
operational excellence initiatives in the AI/ML space which includes efficient
use of resources, identifying optimization opportunities, forecasting capacity,
etc.

Design and implement
different flavors of architecture to deliver better system performance and
resiliency.

Develop capability
requirements and transition plan for the next generation of AI/ML enablement
technology, tools, and processes to enable client to efficiently improve
performance with scale.

Tools/Skills (hands-on
experience is must):

Administering Kubernetes.
Ability to create, maintain, scale, and debug production Kubernetes clusters as
a Kubernetes administrator and In-depth knowledge of Docker.

Ability to transform designs
ground up and lead innovation in system design

Deep understanding of data
center architectures, networking, storage solutions, and scale system
performance

Have worked on at least one
Kubernetes cloud offering (EKS/GKE/AKS) or on-prem Kubernetes (native
Kubernetes, Gravity, MetalK8s)

Programming experience in
Python, Node, Golang, or bash

Ability to use observability
tools (Splunk, Prometheus, and Grafana ) to look at logs and metrics to
diagnose issues within the system.

Experience with Seldon core,
MLFlow, Istio, Jaeger, Ambassador, Triton, PyTorch, Tensorflow/TFserving is a
plus.

Experience with distributed
computing and deep learning technologies such as Apache MXNet, CUDA, cuDNN,
TensorRT

Experience hardening a
production-level Kubernetes environment (memory/CPU/GPU limits, node taints,
annotations/labels, etc.)

Experience with Kubernetes
cluster networking and Linux host networking

Experience scaling
infrastructure to support high-throughput data-intensive applications

Background with automation
and monitoring platforms, MLOps ,and configuration management platforms

Education & Experience:
-

5+ years relevant experience
in roles with responsibility over data platforms and data operations dealing
with large volumes of data in cloud based distributed computing environments.

Graduate degree preferred in
a quantitative discipline (e.g., computer engineering, computer science,
economics, math, operations research).

Proven ability to solve
enterprise level data operations problems at scale which require
cross-functional collaboration for solution development, implementation, and
adoption

Thanks,

Ruchi
Verma

HCL Global Systems, Inc

24543 Indoplex Circle,Suite 220,

Farmington Hills,MI 48335

Direct Phone# 248-473-3018

Phone 248-473-0720
  Ext: 176

Email:
[email protected]

LinkedIn:
https://www.linkedin.com/in/ruchi-verma-a28b4921a/

--

Keywords: artificial intelligence machine learning information technology Arkansas Michigan
Urgent opening on ML Ops Engineer (Bentonville, AR Complete onsite)
[email protected]
[email protected]
View all
Wed Oct 09 18:57:00 UTC 2024

To remove this job post send "job_kill 1826260" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 20

Location: Bentonville, Arkansas