SRE Lead at Fort Mill, South Carolina, USA |
Email: [email protected] |
From: Santhoshi, HAN IT Staffing [email protected] Reply to: [email protected] Role: SRE Lead - Remote Client : Capgemini Location : Fort Mill, SC Visa Types: USC and GC Only Job Description: The Site Reliability Lead would be responsible for assisting/ensuring the reliability, scalability, and performance. Industry Experience 15+ Years, Domain experience 10+ years Overall responsibility to perform SRE Assessment & built Capability Roadmap for the client Establish Observability - Single Pane of view, Alerts, Runbooks, Logs and Telemetry Organization & Governance alignment, drive cultural change & Identify training requirements and drive in establishing SRE enabled teams Implement Automation across including Alerting as Code, Logging Code libraries, Telemetry Libraries, Synthetic monitoring Operational Excellence - Toil Reduction, Reselliency, E2E Engineering support Continuous Automation & Asset maintenance Provide inputs into SRE assessment planning, availability of stakeholders, existing documentation, maturity artifacts and execution Provide SRE adoption support Provide direction and innovation by enabling core platform capabilities, building cloud connectivity, infrastructure, and shared services at scale Design and implement scalable and highly reliable solutions, build tools and services, full-stack observability, monitoring and event management integrations to monitor and advance the reliability and quality of services Skills Required. Strong background in software engineering, systems architecture, and infrastructure design. Proficiency in programming languages commonly used in infrastructure and automation (e.g., Python, Shell scripting). In-depth understanding of AWS, Containers. Strong leadership skills to inspire and motivate the SRE team Ability to set a clear vision and strategy for the team and align it with organizational goals. Skill in conducting post-incident reviews and implementing improvements based on lessons learned. Proficiency in automation tools and frameworks for infrastructure and configuration management (e.g., Ansible, Terraform, Chef). Experience designing and implementing effective monitoring and alerting systems. Familiarity with monitoring tools and platforms (e.g., Grafana, ELK stack). Understanding of security principles and practices in the context of reliability. Keywords: information technology green card South Carolina SRE Lead [email protected] |
[email protected] View all |
Mon Apr 01 22:48:00 UTC 2024 |