Site Reliability Engineer Remote Need Only USC, GC, GC EAD, H4 EAD, OPT, TN at Remote, Remote, USA |
Email: [email protected] |
From: Suresh, Tech Rakers [email protected]
Reply to: [email protected]

Senior Site Reliability Engineer
Duration: 6 - 12+ months
Location: 100% Remote out of CST, but open to other time zones

Required:
- Building dashboards and alerting for application teams across Dynatrace and Grafana
- Monitoring applications (primarily Java/WebLogic based) deployed on Kubernetes, on-prem VMware and Azure cloud
- Establishing SLOs with development teams
- Writing scripts to contextualize alerts and logs
- Automating SRE processes, including but not limited to alert management automation, capturing application metrics, and application health checks
- Reducing redundant manual processes (called toil)

Skills Needed:
- Senior-level SRE engineering skills for alert management, including setting up alerts, automating alerting processes and building dashboards with Dynatrace and Grafana
- Experience with Azure Cloud, Kubernetes, VMware (on-prem), Dynatrace, Grafana, WebLogic and possibly Kafka, MongoDB and/or Hazelcast
- Extensive experience monitoring applications (preferably Java/WebLogic based) deployed on Kubernetes, Azure and VMware (on-prem) at enterprise scale
- Strong scripting experience for alerting and logging purposes
- Well versed in automating a variety of SRE processes for alert management and application health
- Experience reducing redundant manual processes (toil)
- Experience working with development teams to author SLOs
- Must have a mindset that is eternally curious and constantly learning, with a proactive approach to work, problem solving and solutioning

Thanks and Regards,
Suresh Kumar, [email protected]
Keywords: |
Mon Oct 02 19:27:00 UTC 2023 |