Job Details

Home

SRE, Production Support Analyst, No H1B at Remote, Remote, USA

Email: [email protected]

From:

Shikha,

KPG99

[email protected]

Reply to:   [email protected]

Hi,

Hope you are doing well.

Please find the job description below and let me know your interest.

Position

:

SRE/Production Support Analyst, No H1B

Location:

Hybrid in

Dallas, Tampa, Jersey City or Boston

Duration: 6+ Month

MOI: Phone and Video

The location can be: Dallas, Tampa, Jersey City or Boston.

A Site Reliability Engineer is responsible for monitoring, automating, and improving the reliability, performance, and availability of software systems in an organization

.

**

NOTE: Must be an SRE, not DevOps. If they did DevOps years ago and are now SRE that is okay.

Your Primary Responsibilities:

1.

Join all project stakeholders planning and design sessions, sprint zero and stand-ups for all new delivery fully understanding the changes and impact.

2

Attend and present operational readiness with application support (EAS L2) at each project management meeting - raise any operational risks and concerns.

3.

Partner with IT Embedded Risk Managers to identify strategic solutions for risk incidents.

4.

Metrics and Reporting demonstrate operational improvements through defined KPIs.

5.

Ensure NFRs are raised, properly defined and prioritized as part of delivery.

6

Review all Controls and Alerting for new delivery and ensure it meets operational standards.

7.

Test NFRs in UAT environments to validate effectiveness and completeness of operational capabilities.

8.

Partner with ETE to drive resiliency testing scenarios.

9.

Evaluate how the application behaves for hardware failures in the middle of processing.

10.

Make design recommendations that will allow the application to recover without cleanup activities or create a recovery runbook for application support team to follow for improved application recovery times.

11.

Ensure avoidance of creation of control of controls or alert of alerts instead of improving application controls and alerting.

12.

Participate in daily EAS RDS L2 activities to understand what can be improved to make support of RDS applications more efficient and organized.

Talents needed for success:

1.

Understanding of SRE Principles/Practices and metrics as well as Traceability

2.

Excellent Problem Solving skills and passion for automation

3

Hand-on Experience in SQL/PLSQL, Unix, Linux, Windows

4

Working experience in Shell Scripting, Java or Python, Perl, JavaScript

5

Failure Mode Analysis to - evaluate how the application behaves for hardware failures in the middle of processing

6.

Good knowledge in AutoSys, ServiceNow, JIRA

7

Demonstrate knowledge of DevOps toolchains and process

8.

Monitoring / Big data tools such as Splunk, Dynatrace

9.

Knowledge in maintenance and support of AWS functionalities and services

10.

Leadership experience is a plus

Keywords: information technology

[email protected]
View all

Thu Jan 18 01:33:00 UTC 2024

To remove this job post send "job_kill 1024418" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

shikha@kpgtech.com wrote:
From:

Shikha,

KPG99

shikha@kpgtech.com

Reply to:   shikha@kpgtech.com

Hi,

Hope you are doing well.

Please find the job description below and let me know your interest.

Position

SRE/Production Support Analyst, No H1B

Location:

Hybrid in

Dallas, Tampa, Jersey City or Boston

Duration: 6+ Month

MOI: Phone and Video

The location can be:  Dallas, Tampa, Jersey City or Boston.

A Site Reliability Engineer is responsible for monitoring, automating, and improving the reliability, performance, and availability of software systems in an organization

NOTE:  Must be an SRE, not DevOps.  If they did DevOps years ago and are now SRE that is okay.

Your Primary Responsibilities:

Join all project stakeholders planning and design sessions, sprint zero and stand-ups for all new delivery fully understanding the changes and impact.

Attend and present operational readiness with application support (EAS L2) at each project management meeting - raise any operational risks and concerns.

Partner with IT Embedded Risk Managers to identify strategic solutions for risk incidents.

Metrics and Reporting  demonstrate operational improvements through defined KPIs.

Ensure NFRs are raised, properly defined and prioritized as part of delivery.

Review all Controls and Alerting for new delivery and ensure it meets operational standards.

Test NFRs in UAT environments to validate effectiveness and completeness of operational capabilities.

Partner with ETE to drive resiliency testing scenarios.

Evaluate how the application behaves for hardware failures in the middle of processing.

10.

Make design recommendations that will allow the application to recover without cleanup activities or create a recovery runbook for application support team to follow for improved application recovery times.

11.

Ensure avoidance of creation of control of controls or alert of alerts instead of improving application controls and alerting.

12.

Participate in daily EAS RDS L2 activities to understand what can be improved to make support of RDS applications more efficient and organized.

Talents needed for success:

Understanding of SRE Principles/Practices and metrics as well as Traceability

Excellent Problem Solving skills and passion for automation

Hand-on Experience in SQL/PLSQL, Unix, Linux, Windows

Working experience in Shell Scripting, Java or Python, Perl, JavaScript

Failure Mode Analysis to - evaluate how the application behaves for hardware failures in the middle of processing

Good knowledge in AutoSys, ServiceNow, JIRA

Demonstrate knowledge of DevOps toolchains and process

Monitoring / Big data tools such as Splunk, Dynatrace

Knowledge in maintenance and support of AWS functionalities and services

10.

Leadership experience is a plus

Keywords: information technology

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 33

Location: , Indiana