Home

Site Reliability Engineer (SRE) - Mountain View, California(Hybrid)- need locals at Mountain, Wisconsin, USA
Email: [email protected]
From:

John,

VySystems

[email protected]

Reply to:   [email protected]

Minimum 10 years of experience

Very DevOps SRE experience and development skills

Responsibilities: 

Design, implement, and maintain complex data systems supporting millions of customers with Cloud Native principles and best practices to ensure highly available, secure, performant and scalable database systems 

Build and maintain CI/CD pipelines in Jenkins 

Build and deploy services in Kubernetes cluster using helm, kustomize, etc 

Contribute to infrastructure changes to AWS with deep understanding of AWS services 

Engage in on-call for pre-production and production systems supporting multi-million users 

Write/Review RCA docs to prevent recurrence of Incidents in future and share the learnings 

Contribute to major system upgrades, deployment automation, monitoring enhancements and Production changes 

Create operational playbooks, contribute to how-to articles, and gain domain knowledge to drive changes in the team 

Participate and contribute in FMEA/Chaos testing, Security remediations, etc 

Share best practices and patterns for operational excellence and cost optimization 

Reduce or eliminate manual steps by automating as much as possible 

Continuously look for opportunities to increase developer velocity and productivity 

Qualifications 

Bachelors or masters degree in computer science or a related technical field. Equivalent experience will be considered 

4+ years of hands-on development & operational experience with building and maintaining infrastructure in AWS 

Extensive performance monitoring, troubleshooting & tuning experience 

Experience with AWS services and hands-on knowledge of hosting on Cloud 

Experience with scripting languages for DevOps automation 

Experience with any one of the programming languages: Java/Python/Ruby 

Knowledge of Docker & Kubernetes, ArgoCD, 

Experience with monitoring and observability using Splunk, Wavefront, AppDynamics, Prometheus, Tracing, etc 

Keywords: continuous integration continuous deployment
Site Reliability Engineer (SRE) - Mountain View, California(Hybrid)- need locals
[email protected]
[email protected]
View all
Wed Oct 30 01:49:00 UTC 2024

To remove this job post send "job_kill 1889570" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,