SRE Lead - Frisco, TX / Irving, TX at Frisco, Texas, USA |
Email: [email protected] |
From: nagesh, yochana [email protected]
Reply to: [email protected]

Hello,

Hope you are doing well.

Role: SRE Lead
Location: Frisco, TX / Irving, TX

Job Description:
Lead and mentor a team of SREs, ensuring they have the resources and support needed to succeed.
Foster a culture of reliability and continuous improvement within the team.

System Reliability:
Ensure the availability, performance, and scalability of systems and services.
Develop and implement strategies for monitoring and maintaining system health.

Incident Management:
Oversee the response to incidents, ensuring quick resolution and minimal downtime.
Conduct post-mortems to identify root causes and prevent future incidents.

Automation and Tooling:
Develop and maintain automation tools to reduce manual work and improve efficiency.
Implement and manage CI/CD pipelines to streamline deployments.

Collaboration:
Work closely with development, operations, and product teams to ensure alignment on reliability goals.
Communicate effectively with stakeholders about system performance and reliability.

Risk Management:
Identify and mitigate potential risks to system reliability.
Implement strategies to handle failures and ensure disaster recovery.

Skills:

Technical Expertise:
Experience with cloud platforms (AWS), containerization technologies (Docker and Kubernetes), API management (Apigee), databases (NoSQL: Cassandra; SQL: Oracle, PostgreSQL, and DB2), and CI/CD (Jenkins, GitHub).
Other technologies: ELK Stack, APM (New Relic), and Terraform.
Proficiency in scripting languages such as Python or Bash.

Problem-Solving:
Strong analytical skills to diagnose and resolve complex system issues.
Ability to design and implement effective monitoring and alerting systems.

Leadership:
Proven experience in leading and growing engineering teams.
Excellent communication and collaboration skills.

Automation:
Expertise in automation tools and practices to reduce manual intervention.
Familiarity with CI/CD processes and tools.

Resilience Engineering:
Knowledge of best practices in building resilient, self-healing systems.
Experience with disaster recovery planning and execution.

Keywords: continuous integration, continuous deployment, Texas |
Thu Nov 14 03:25:00 UTC 2024 |