
Urgent opening for AI/ML Ops Engineer (fully onsite) in Bentonville, Arkansas, USA
Email: [email protected]
Hi everybody,

Greetings!!

We have an urgent opening for an AI/ML Ops Engineer (Bentonville, AR; fully onsite).

Role: AI/ML Ops Engineer

Location: Bentonville, AR (fully onsite)

Experience: 12+ years

Domain experience (if any): Retail

Must-have skills:

Python, Node, Golang, or bash

Experience with Seldon Core, MLflow, Istio, Jaeger, Ambassador, Triton,
PyTorch, TensorFlow/TensorFlow Serving is a plus.

Experience with distributed computing and deep learning technologies such as
Apache MXNet, CUDA, cuDNN, TensorRT

Experience hardening a production-level Kubernetes environment (memory/CPU/GPU
limits, node taints, annotations/labels, etc.)

Key Responsibilities:

Work with client's AI/ML Platform Enablement team within the eCommerce
Analytics team. The broader team is currently on a transformation path, and
this role will be instrumental in enabling the broader team's vision.

Work closely with data scientists to help with production models and maintain
them in production.

Deploy and configure Kubernetes components for the production cluster, including
API Gateway, Ingress, Model Serving, Logging, Monitoring, Cron Jobs, etc.

Improve the model deployment process for MLE for faster builds and simplified
workflows

Be a technical leader on various projects across platforms and a hands-on
contributor to the entire platform's architecture

System administration, security compliance, and internal tech audits

Lead operational excellence initiatives in the AI/ML space, including
efficient use of resources, identifying optimization opportunities,
forecasting capacity, etc.

Design and implement different flavors of architecture to deliver better system
performance and resiliency.

Develop capability requirements and a transition plan for the next generation of
AI/ML enablement technology, tools, and processes to enable the client to
efficiently improve performance at scale.

Tools/Skills (hands-on experience is a must):

Administering Kubernetes: ability to create, maintain, scale, and debug
production Kubernetes clusters as a Kubernetes administrator, plus in-depth
knowledge of Docker.

Ability to transform designs from the ground up and lead innovation in system design

Deep understanding of data center architectures, networking, storage solutions,
and system performance at scale

Have worked on at least one Kubernetes cloud offering (EKS/GKE/AKS) or on-prem
Kubernetes (native Kubernetes, Gravity, MetalK8s)

Programming experience in Python, Node, Golang, or bash

Ability to use observability tools (Splunk, Prometheus, and Grafana) to look
at logs and metrics and diagnose issues within the system.

Experience with Seldon Core, MLflow, Istio, Jaeger, Ambassador, Triton,
PyTorch, TensorFlow/TensorFlow Serving is a plus.

Experience with distributed computing and deep learning technologies such as
Apache MXNet, CUDA, cuDNN, TensorRT

Experience hardening a production-level Kubernetes environment (memory/CPU/GPU
limits, node taints, annotations/labels, etc.)

Experience with Kubernetes cluster networking and Linux host networking

Experience scaling infrastructure to support high-throughput data-intensive
applications

Background with automation and monitoring platforms, MLOps, and configuration
management platforms

Education & Experience:

5+ years of relevant experience in roles with responsibility for data platforms
and data operations dealing with large volumes of data in cloud-based
distributed computing environments.

Graduate degree preferred in a quantitative discipline (e.g., computer
engineering, computer science, economics, math, operations research).

Proven ability to solve enterprise-level data operations problems at scale
that require cross-functional collaboration for solution development,
implementation, and adoption

Thanks,

Ruchi Verma

HCL Global Systems, Inc

24543 Indoplex Circle, Suite 220,

Farmington Hills, MI 48335

Direct Phone: 248-473-3018

Phone: 248-473-0720, Ext. 176

Email: [email protected]

LinkedIn: https://www.linkedin.com/in/ruchi-verma-a28b4921a/

--

Tue Oct 08 19:26:00 UTC 2024
