| SRE Lead with migration experience from AppDynamics to Datadog, SLI/SLO concepts, Gremlin, chaos engineering scenarios, and SAS rule validation and deployment experience at Rule, Texas, USA |
| Email: [email protected] |
|
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2322581&uid=
From: Narayana Rao 3MKLLC, 3MK Software Solutions LLC [email protected]
Reply to: [email protected]

Hello, greetings for the day!

Please review the role below and advise the best time to connect with you. If you are interested, you can reach me on LinkedIn at www.linkedin.com/in/narayanarao2 and share resumes to [email protected]

Hiring: SRE Lead with migration experience from AppDynamics to Datadog, SLI/SLO concepts, Gremlin, chaos engineering scenarios, and SAS rule validation and deployment experience
Location: Remote
Contract: Long Term
Rate: Open / hr

JD:
Leading the AMS card operations team towards SRE and automation efforts for the Fraud Authorization and Authentication (FAA) area.
Responsible for creating observability solutions by building AppDynamics single-pane-of-glass dashboards for both the FAA area and the Collections Value Stream (CVS).
Build single-pane-of-glass dashboards in Datadog for the Collections Value Stream.
Worked on migrating Collections Value Stream alerts from AppDynamics to Datadog.
Work on migrating dashboards from AppDynamics to Datadog, and created monitors for the Recovery application in Datadog.
Create ServiceNow reports on incidents, RRTs, problem tickets, and deployments for tracking and leadership visibility across the FAA and CVS areas.
Create an aggregator framework for incident monitoring to reduce false positives and noise-generated incidents. Reduced 1500 false-positive tickets.
Work on creating automation for the IRIS health check, and designed a framework for automatically reporting database issues to the recovery DBA team.
Created runbooks and postmortem reports for all the alerts created and RRTs.
Navigating issue triage used to take 30-40 minutes; after the observability framework was implemented, the AIL MTTD improved by 85%, bringing issue detection down to 5-10 minutes.
Created the enterprise-level SRE roadmap, and introduced SLI/SLO concepts at the component level and Gremlin for chaos engineering.
Scrum Master for SRE sprint plans, risk identification and mitigation, and capacity and velocity planning.
Working on creating chaos engineering scenarios and assisting the team in getting Gremlin agents installed on pre-prod servers.
Guided the team in implementing critical automation such as SAS rule validation and deployment.

Thanks & Regards,
Narayana Rao
Sr Manager (Recruitments)
3MK Software Solutions LLC
Email: [email protected]
Website: http://3mkllc.com/
Connect with me on LinkedIn for daily updates and requirements: linkedin.com/in/narayanarao2

Note: To get my direct client requirements daily, please click on the link below and click on Ask to join Group.

Keywords: SRE Lead with migration experience from AppDynamics to Datadog, SLI or SLO concepts, Gremlin, chaos engineering scenarios, SAS rule validation and deployment experience |
| 03:42 AM 08-Apr-25 |