Home

Regarding:- SRE engineer for Monroe, MA at Monroe, New York, USA
Email: [email protected]
Hi ,

Please go through below job role and do let me know if you want to submit your candidature.

Location:- Monroe, MA

Role: Site Reliability Engineer

SRE Role Overview: We are looking for a Lead Software Engineer to join our Public Sector Core Framework platform team. This individual will play a SRE (Site Reliability Engineer) role in an Azure / Kubernetes ecosystem and help enable stability for the system, leading to the continued success of client.

Responsibilities:

Strengthen the teams SRE practices, starting from service level indicator definitions, objectives, error budgets, thresholds, alerting and error management systems.

Site Planning SRE will have to work with dev and testing teams to plan changes to production and other systems.

Optimizing planned outages This includes optimizing dev. Ops area and any other activity resulting in a planned outage.

Toil management Identify areas of high toil and find solutions for improvement.

Leverage automation wherever possible to minimize workload, enhance stability, and improve the overall functionality of the environment.

Alert management Strengthening areas with alerting, including establishing goals, criteria, alert recalls, reset, enable/disable revising error budget based on the toil undergone by teams.

Prevention of outages respond to non-critical alerts and work closely with development and testing teams.

Verification Work closely with Load and Performance teams in redefining parameters like load and concurrent users.

Incident management Chair meetings with development and operations teams in the event of an incident.

Post Incident Reviews Derive learnings from issues and alerts along with teams, inclusive of RCAs. Work on long term solutions which could include changes in code, configuration, change in design/architecture or capacity planning.

Reporting with Reliability Metrics This includes set of derived metrics which includes Availability, Mean Time to Restore, Mean Time Between Repairs and Probability of Failure.

Continuous improvement Development and maintain a backlog of SRE improvements opportunities.

With company sponsorship, underdo necessary background checks to obtain and maintain U.S. Federal Government Public Trust suitability clearance.

Requirements:

Knowledgeable within the Site Reliability Engineering discipline with a proven track record of success.

Proficient with administering Azure systems.

Proficient with Kubernetes systems. Familiarly with Podman/Docker and Helm Charts.

Proficient with Python.

Experience with GitHub.

Knowledgeable with resiliency / reliability design patterns.

To be a match fit, youll have experience with or have interest in:

Prometheus

AKS Monitoring

Grafana

Automation

Somnath Anand

Sr. Technical Recruiter

www.e-solutionsinc.com

[email protected]

Disclaimer: E-Solutions Inc. provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. We especially invite women, minorities, veterans, and individuals with disabilities to apply. EEO/AA/M/F/Vet/Disability.

--

Keywords: information technology golang Massachusetts
Regarding:- SRE engineer for Monroe, MA
[email protected]
[email protected]
View all
Wed Apr 17 04:13:00 UTC 2024

To remove this job post send "job_kill 1316767" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 32

Location: , Massachusetts