SRE Lead, Irving, Frisco, TX(Onsite) - Looking for Only Locals at Frisco, Texas, USA |
Email: [email protected] |
From: Chandra Sekhar Prathipati, Yochana IT Solutions [email protected] Reply to: [email protected] Hi, This is Chandra from Yochana IT Solutions, I Recently found your resume in one of our Job portal and We are looking for SRE Lead, Irving, Frisco, TX(Onsite) with one of our client, I have included the job information below, If you are interested, please share your updated resume. Job Description: Skills: SRE - AWS, Docker, Kubernetes, APIGEE, Cassandra, Oracle, PostgresSQL, Jenkins, Github, ELK, New Relic, Terraform, Python, Bash Responsibilities: Team Leadership: Lead and mentor a team of SREs, ensuring they have the resources and support needed to succeed Foster a culture of reliability and continuous improvement within the team System Reliability: Ensure the availability, performance, and scalability of systems and services Develop and implement strategies for monitoring and maintaining system health Incident Management: Oversee the response to incidents, ensuring quick resolution and minimal downtime Conduct post-mortems to identify root causes and prevent future incidents Automation and Tooling: Develop and maintain automation tools to reduce manual work and improve efficiency Implement and manage CI/CD pipelines to streamline deployments Collaboration: Work closely with development, operations, and product teams to ensure alignment on reliability goals Communicate effectively with stakeholders about system performance and reliability Risk Management: Identify and mitigate potential risks to system reliability Implement strategies to handle failures and ensure disaster recovery Skills: Technical Expertise: Experience with: Cloud platforms (AWS), containerization technologies (Docker & Kubernetes), API management (Apigee), Databases (Non-SQL: Casandra & SQL: Oracle, PostgreSQL & DB2), and CICD (Jenkins, Github) Other technologies, ELK Stack & APM (New Relic, Terraform) Proficiency in scripting languages like Python or Bash Problem-Solving: Strong analytical skills to diagnose and resolve complex system issues Ability to design and implement effective monitoring and alerting systems Leadership: Proven experience in leading and growing engineering teams Excellent communication and collaboration skills Automation: Expertise in automation tools and practices to reduce manual intervention Familiarity with CI/CD processes and tools Resilience Engineering: Knowledge of best practices in building resilient, self-healing systems Experience with disaster recovery planning and execution Keywords: continuous integration continuous deployment information technology Texas SRE Lead, Irving, Frisco, TX(Onsite) - Looking for Only Locals [email protected] |
[email protected] View all |
Wed Nov 13 01:15:00 UTC 2024 |