Site Reliability Engineer (SRE) Lead || Phoenix, AZ Local at Phoenix, Arizona, USA |
Email: [email protected] |
Job Title : Site Reliability Engineer (SRE) III Job Location : Phoenix, AZ ( Need local candidate ) Work Schedule: Hybrid Onsite: onsite 3 days per week in Phoenix, AZ Responsibilities: Experience in leading Observability initiatives as Lead Engineer. Lead the Observability Ingestion team. Development and implementation of build release pipelines with accountability for managing deployment schedules, issues, risks, and impediments. Agile development experience with team member accountability for commitment and delivery each sprint. Ensure that all implementations of observability meet the requirements prescribed by IT Services through the effective implementation or use of approved processes, methodologies, and deliverables. Provide expertise and design solutions for observability applications as well as system integration with internal systems and external vendors. Provide technical leadership in design, development, and testing of solutions. Track infrastructure delivery and dependencies to implementation. Prepare and present potential technical solutions and advise the teams on approach and tradeoffs Defines the structure of systems, their interfaces, and the principles that guide its organization, software design and implementation. Defines and supports reusable application components from a business and technology perspective. Able to provide coding and technical direction to less experienced staff or develops highly complex original code. Qualifications: 12 15 years of IT experience Experience with gathering and organizing large volume of data to use for instrumentation into an Enterprise Observability solution. Experience with recommending baseline monitoring thresholds, and performance monitoring KPIs and SLAs. Experience with installing agents, forwarders, APIs, performance monitoring alerts, dashboards, and data trend analysis. Good Knowledge and understanding of Azure foundation components e.g., App GW, APIM, Virtual Network, NSG, Load Balancer, Azure VM etc. is required. Skills: 5+ years Tech lead experience required. * 8+ years development experience required. Cloud (GCP) experience 10+ years of experience on integration engineering related to Observability/Monitoring framework and on two or more APM Tools (AppDynamics, Datadog, Splunk, Dynatrace, Kibana, Elastic etc.). 5+ years of experience as a System Reliability Engineer is required. Experience working with Open-source platforms and Open Telemetry libraries e.g., Grafana is preferred. Experience must include at least one of the following languages: Java (required), Desired (Python, Go, C, C++.) Experience with Databases Azure SQL, PostgreSQL, MySQL, MongoDB, TSDB or similar databases. Experience on one of cloud platforms Microsoft Azure and GCP cloud is required. Experience on PCF, Docker, Kubernetes platform is required. Experience with DevOps and CI/CD tools and processes is required. Experience in high-performance and high-frequency data streaming (using Kafka etc.) and handling large volume of batch data is strongly preferred, but not required. Experience with Agile/Scrum methodologies is required. Regards, Sandy M | 1Point System LLC Lead Technical Recruiter Direct: (803)-828-2974 Email: [email protected] Fax: 803-832-7973 www.1pointsys.com 115 Stone Village Drive Suite C Fort Mill, SC 29708 : https://www.linkedin.com/in/sandy-m-74b06b212/ An E-Verified company | An Equal Opportunity Employer -- Keywords: cprogramm cplusplus continuous integration continuous deployment information technology golang Arizona South Carolina Site Reliability Engineer (SRE) Lead || Phoenix, AZ Local [email protected] |
[email protected] View all |
Tue Jun 04 19:47:00 UTC 2024 |