Home

Looking for Sr. SRE ( Site Reliability Engineer) at Remote, Remote, USA
Email: [email protected]
Location
Seattle WA- - needs to come to the office 3 days a week. 

Visa any

Duration
-10-12 month

Imp Note

This is a  Sr. SRE role and not devops
role

Azure Cloud -AKS -
must have this experience

Databricks Notebooks
must have  this experience

NO-SQL Database - Cassandra, Mongo, PostGres-
must have this
 experience

Kubernetes
must have this  experience

Kafka-
skill level expert is required for
this role

Terraform-
skill level expert is required for
this role

Core
skills needed -

Azure
Clous, AKS Scalability, monitoring, deployment, check logs, ensure node and
pod health.

Databases
include - Cassandra, Mongo, PostGres

Databricks
Notebooks There are a lot of jobs on Databricks experience with Databricks
to know how a notebook is created and run  - run queries against the
database and find discrepancies and perform fixes.

Based
microservices, responsible for deployment, scripting language is python. 

Should have
an understanding around terraform. 

Emphasis on
Logs and Monitoring (datadog and splunk)

Summary of Experience

Requires 10-12 years
experience in the IT industry

Requires 9+ years of
software and DevOps development engineering

Experience in working
with cloud environment Azure preferred.

Experience with
Kubernetes, Azure Kubernetes (AKS) preferred.

Experience with using
Kafka, Event Hub, NATS or any messaging broker.

Experience with
Cassandra, PostgresSQL, Mongo, Elastic Search, Cosmos DB

Experience on Azure
DevOps, Jenkins/ Python / Terraform / Ansible

Experience with
Databricks

Experience with
DataDog, Splunk or other logging and APM tools.

Experience in working
with Linux environment.

Summary of Key Responsibilities

Responsibilities and essential job functions include but
are not limited to the following:

           
Responsible for health of production system

           
Develop monitoring dashboards

           
Configure alerts and automate process for system recovery

           
Monitor alerts and take proactive steps to resolve system issues

           
Troubleshoot production issues

           
Lead production troubleshooting calls

           
Responsible for patches and updates on production systems.

           
Design and build cutting-edge, multi-micro service solutions to support
Starbuckss growth worldwide.

           
Helping CI/CD team during rolling out applications and infrastructure
globally.

           
Collaborates with the development team, other Information Technology (IT) teams
developer leads.  Initiates process improvements for new and existing
systems.

           
Participates in a production support rotation that includes pager
responsibilities.

           
Ability to accurately break down complex application designs into component
deliverables and estimate design and development timelines

09/26/24, 11:48:29 AM

--

Keywords: continuous integration continuous deployment access management database information technology Washington
Looking for Sr. SRE ( Site Reliability Engineer)
[email protected]
[email protected]
View all
Thu Sep 26 21:22:00 UTC 2024

To remove this job post send "job_kill 1788187" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 12

Location: Seattle, Washington