Observability & Monitoring Manager/Lead at Remote, USA |
Email: jhanvi.katyal@tekinspirations.com |
https://jobs.nvoids.com/job_details.jsp?id=2317498&uid=
From: Jhanvi Katyal, Tek Inspirations LLC, jhanvi.katyal@tekinspirations.com
Reply to: jhanvi.katyal@tekinspirations.com

Job Description

Need candidates on PV W2.
Position: Observability & Monitoring Manager/Lead
Location: Atlanta, Georgia (Remote)
Need LinkedIn.
Must be able to go full time after 3 to 4 months.
Need candidates with heavy experience managing observability/monitoring platforms.
Need candidates with 10+ years of experience.
They are looking for someone who can work with client-facing teams and manage onshore and offshore teams for them.

Must Have:
Tivoli ITM, Dynatrace, AppDynamics, Quantum Metric, Grafana, SolarWinds, 7signal, OBM, SiteScope, Splunk, ServiceNow ITOM, SCOM, xMatters, Prometheus, ClickHouse, Vector

Key Responsibilities:
Tool Management: Oversee the implementation and operation of observability and monitoring tools (e.g., AppDynamics, Splunk, OBM, Tivoli, OpenTelemetry, SolarWinds, xMatters, Prometheus, Grafana) to track metrics, logs, and traces.
System Health Monitoring: Develop strategies for real-time monitoring of systems to ensure high availability, performance, and fault tolerance.
Incident Management: Lead incident detection and resolution by ensuring timely alerts and diagnosis of system issues, minimizing downtime.
Data-Driven Insights: Use monitoring data to identify trends, anomalies, and potential bottlenecks in systems to optimize performance.
Collaboration: Work with DevOps, SRE (Site Reliability Engineering), and engineering teams to design monitoring strategies and ensure proper instrumentation of applications.
SLAs & SLOs Management: Ensure that Service Level Agreements (SLAs) and Service Level Objectives (SLOs) are met through proactive monitoring and reporting.
Process Improvement: Continuously evolve and refine observability processes and tools to meet the growing demands of the infrastructure.
Team Leadership: Manage a team of monitoring engineers, assign tasks, and oversee the execution of observability initiatives.
Automation: Drive automation in monitoring and alerting to reduce manual effort and improve the reliability of alerts.

Skills and Qualifications:
Expertise in monitoring tools and observability platforms (AppDynamics, Splunk, OBM, Tivoli, OpenTelemetry, SolarWinds, xMatters, Prometheus, Grafana, etc.).
Strong understanding of system architecture, cloud infrastructure (AWS, GCP, Azure), and containerization (Kubernetes, Docker).
Experience with incident management and troubleshooting in complex environments.
Leadership experience with cross-functional teams.
Proficiency in scripting languages such as Python, Bash, or similar for automation.
Knowledge of best practices in logging, monitoring, and distributed tracing.
Strong analytical skills with a focus on data interpretation for performance tuning.

Keywords: golang wtwo
11:36 PM 04-Apr-25 |