Job Details

Home

Senior SRE (Site Reliability Engineer) || Onsite || San Francisco, CA at Francisco, Indiana, USA

Email: [email protected]

From:

Riya,

IDC Technologies

[email protected]

Reply to: [email protected]

Hi

I came across your profile on our resume database and wanted to reach out regarding a job opportunity. if interested please reply with your updated resume, contact details, and best time to discuss regarding the opportunity.

Job Title: Senior SRE (Site Reliability Engineer)

Location: Onsite || San Francisco, CA

Duration: Contract

Experience: 7+ Years

Position Details/Requirements:

Required Experience:

1. Cloud Concepts: Extensive hands-on experience in AWS/GCP, Kubernetes

2. Infrastructure as code : Terraform, CI/CD: GitHub, Jenkins

3. Monitoring Tools: Experience in using New Relic, Grafana, Prometheus.

5+ years managing and monitoring Incident/Crisis management

3+ years experience monitoring with various tools like Grafana, NewRelic etc.

1+ years experience programming in a programming language such as Python and Go

Infrastructure as Code and Terraform

On call experience

Lead the on-call teams and processes to improve site reliability

Focus on managing large scale systems with high loads 24/7

Support our SRE and engineering teams in their day to day

Build, enhance and maintain runbooks working with various teams cross-functionally

Thrive on automating processes as much as possible

Observability and Monitoring with services like Prometheus, Grafana, New Relic

Additional other duties and responsibilities, as assigned.

Lead the NOC tools, runbooks, processes and teams

Automation of runbooks as necessary

Work with our development teams on improving the system

Attention to detail and ability to manage multiple projects

Strong analytical skills and ability to present complex data on site reliability and other factors

Demonstrated ability to work with 3rd parties and collaborate on solutions.

Experience in Monitoring using NewRelic/Grafana/Prometheus.

Experienced in scripting languages Python/Go

7-8 years Experience

1. Cloud Concepts: Extensive hands-on experience in AWS/GCP, Kubernetes

2. Infrastructure as code : Terraform, CI/CD: GitHub, Jenkins

3. Monitoring Tools: Experience in using New Relic, Grafana, Prometheus.

Experience in Incident/Crisis management, Support experience, On Call experience

Team Culture:

Candidate with Support and On call experience

Attention to detail and ability to manage multiple projects

Strong analytical skills and ability to present complex data on site reliability and other factors

Demonstrated ability to work with 3rd parties and collaborate on solutions

Ability to work with Offshore team, set goals and ensure 100% adherence on the process and deliverables

Willingness to provide on call support and join calls to troubleshoot major outages.

Best Regards,

Keywords: continuous integration continuous deployment golang California

[email protected]
View all

Mon Feb 06 21:16:00 UTC 2023

To remove this job post send "job_kill 332291" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

riya.t@idctechnologies.com wrote:
From:

Riya,

IDC Technologies

riya.t@idctechnologies.com

Reply to:   riya.t@idctechnologies.com

I came across your profile on our resume database and wanted to reach out regarding a job opportunity. if interested please reply with your updated resume, contact details, and best time to discuss regarding the opportunity.

Job Title: Senior SRE (Site Reliability Engineer)

Location: Onsite || San Francisco, CA

Duration: Contract

Experience: 7+ Years

Position Details/Requirements:

Required Experience:

1. Cloud Concepts: Extensive hands-on experience in AWS/GCP, Kubernetes

2. Infrastructure as code : Terraform, CI/CD: GitHub, Jenkins

3. Monitoring Tools: Experience in using New Relic, Grafana, Prometheus.

5+ years managing and monitoring Incident/Crisis management

3+ years experience monitoring with various tools like Grafana, NewRelic etc.

1+ years experience programming in a programming language such as Python and Go

Infrastructure as Code and Terraform

On call experience

Lead the on-call teams and processes to improve site reliability

Focus on managing large scale systems with high loads 24/7

Support our SRE and engineering teams in their day to day

Build, enhance and maintain runbooks working with various teams cross-functionally

Thrive on automating processes as much as possible

Observability and Monitoring with services like Prometheus, Grafana, New Relic

Additional other duties and responsibilities, as assigned.

Lead the NOC tools, runbooks, processes and teams

Automation of runbooks as necessary

Work with our development teams on improving the system

Attention to detail and ability to manage multiple projects

Strong analytical skills and ability to present complex data on site reliability and other factors

Demonstrated ability to work with 3rd parties and collaborate on solutions.

Experience in Monitoring using NewRelic/Grafana/Prometheus.

Experienced in scripting languages Python/Go

7-8 years Experience

1. Cloud Concepts: Extensive hands-on experience in AWS/GCP, Kubernetes

2. Infrastructure as code : Terraform, CI/CD: GitHub, Jenkins

3. Monitoring Tools: Experience in using New Relic, Grafana, Prometheus.

Experience in Incident/Crisis management, Support experience, On Call experience

Team Culture:

Candidate with Support and On call experience

Attention to detail and ability to manage multiple projects

Strong analytical skills and ability to present complex data on site reliability and other factors

Demonstrated ability to work with 3rd parties and collaborate on solutions

Ability to work with Offshore team, set goals and ensure 100% adherence on the process and deliverables

Willingness to provide on call support and join calls to troubleshoot major outages.

Best Regards,

Keywords: continuous integration continuous deployment golang California

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 60

Location: San Francisco, California