12+ Only -- Senior SRE/Observability Engineer in Dallas TX (Day-1 Onsite) Local Only at Dallas, Texas, USA |
Email: [email protected] |
Hi,

I hope you are doing well. We have the below requirement open. Please send your candidate's updated resume to [email protected]

Role: Senior SRE/Observability Engineer
Location: Dallas, TX (Day-1 Onsite)

NOTE: Passport number and LinkedIn ID are required. Please also mention the candidate's current location and visa status when sending the profile.
NOTE: Candidates local to TX ONLY, with DL / State ID.

Mandatory Skills:
- In-depth knowledge of observability tools such as Prometheus, Grafana, Splunk, Netcool, ELK, AIM, Sumo Logic, and New Relic.
- Strong understanding of licensing mechanisms and MELT.
- Proven experience in creating dashboards, establishing design patterns, and understanding application flows in containerized/microservice environments.

Job Description:
We are seeking a highly skilled Senior SRE/Observability Engineer with a deep understanding of SRE principles and observability, and extensive experience in Prometheus and Grafana Enterprise Metrics (GEM). This role requires bridging the gap between application teams and SRE, managing observability solutions, and optimizing performance monitoring.

Key Responsibilities:
- Implement and maintain observability solutions using Prometheus as the backend and GEM as the middle tier.
- Develop and manage Grafana dashboards for visualizing metrics and performance data.
- Optimize and configure licensing mechanisms for observability tools.
- Write and manage complex queries and alert definitions for alerts, dashboards, and reporting.
- Bridge the gap between application development teams and SRE operations.
- Manage and optimize OpenShift, Linux environments, and Grafana Enterprise Metrics.
- Utilize MELT (Metrics, Events, Logs, and Traces) and plan for long-term data migration to AWS S3.
- Configure and manage monitoring, alerts, and observability using a range of tools including Splunk, Netcool, ELK, and AIM.
- Maintain deep technical knowledge and operational experience with tools like AppDynamics, Datadog, Dynatrace, New Relic, Sumo Logic, Splunk, Prometheus, and Grafana.
- Understand and write code (Java, Python, Ruby, Node.js, etc.), programs, config files, and complex queries.
- Implement and manage Infrastructure as Code (IaC) using Terraform.
- Manage and optimize cloud platforms (AWS/Azure) and Kubernetes environments.
- Establish design patterns for monitoring and benchmarking application uptime and performance.
- Provide thought leadership and strategy in implementing and maintaining observability solutions.
- Onboard new teams and data sources into the observability solutions.
- Create and maintain operational process documentation for observability solutions.
- Optimize the Observability Suite for monitoring applications and infrastructure.

Qualifications:
- 10+ years of experience in AWS, configuring alerts, monitoring, the OpenTelemetry framework, Terraform, and scripting.
- In-depth knowledge of observability tools such as Prometheus, Grafana, Splunk, Netcool, ELK, AIM, Sumo Logic, and New Relic.
- Strong understanding of licensing mechanisms and MELT.
- Experience with cloud platforms (AWS/Azure), Kubernetes, CI/CD (Jenkins), and Infrastructure as Code (Terraform).
- Ability to read and write code in Java, Python, Ruby, Node.js, and other relevant languages.
- Proven experience in creating dashboards, establishing design patterns, and understanding application flows in containerized/microservice environments.
- Excellent communication skills and the ability to work effectively across teams.

Thanks and Regards,
Ramya Sri
https://www.linkedin.com/in/ramya-sri-66152a23b/
Email: [email protected]
Web: www.canopyone.com
Thu Aug 15 22:23:00 UTC 2024 |