SRE Lead with migration experience from AppDynamics to Datadog, SLI/SLO concepts, Gremlin, chaos engineering scenarios, and SAS rule validation and deployment experience at Rule, Texas, USA
Email: [email protected]
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=2322581&uid=

From: Narayana Rao, 3MK Software Solutions LLC

[email protected]

Reply to: [email protected]

Hello,
Greetings for the day!
Please review the role below and let me know the best time to connect with you. If you are interested, you can reach me on LinkedIn: www.linkedin.com/in/narayanarao2 and send your resume to [email protected]

Hiring: SRE Lead with migration experience from AppDynamics to Datadog, SLI/SLO concepts, Gremlin, chaos engineering scenarios, and SAS rule validation and deployment experience

Location: Remote

Contract: Long Term

Rate: Open / hr

JD:

Lead the AMS card operations team toward SRE and automation efforts for the Fraud Authorization and Authentication (FAA) area.

Create observability solutions by building AppDynamics single-pane-of-glass dashboards for both the Fraud Authorization and Authentication (FAA) area and the Collections Value Stream (CVS).

Build single-pane-of-glass dashboards in Datadog for the Collections Value Stream.

Migrate Collections Value Stream alerts from AppDynamics to Datadog.

Migrate dashboards from AppDynamics to Datadog and create monitors for the Recovery application in Datadog.

Create ServiceNow reports on incidents, RRTs, problem tickets, and deployments for tracking and leadership visibility across the FAA and CVS areas.

Create an aggregator framework for incident monitoring to reduce false positives and noise-generated incidents. Eliminated 1,500 false-positive tickets.

Create automation for the IRIS health check and design a framework for automatically reporting database issues to the Recovery DBA team.

Create runbooks and postmortem reports for all alerts and RRTs.

Issue triage previously took 30-40 minutes; after the observability framework was implemented, the AIL MTTD improved by 85%, bringing issue detection down to 5-10 minutes.

Create an enterprise-level SRE roadmap, introducing SLI/SLO concepts at the component level and Gremlin for chaos engineering.

Serve as Scrum Master for SRE sprint planning, risk identification and mitigation, and capacity and velocity planning.

Create chaos engineering scenarios and assist the team in getting Gremlin agents installed on pre-prod servers.

Guide the team in implementing automation for critical processes such as SAS rule validation and deployment.

Thanks & Regards,

Narayana Rao

Sr Manager (Recruitments)

3MK Software Solutions LLC

Email: [email protected]

Website: http://3mkllc.com/

Connect with me on LinkedIn for daily updates and requirements: linkedin.com/in/narayanarao2

Note: Want to get my direct client requirements daily? Please click on the link below and click on "Ask to join group".

03:42 AM 08-Apr-25

