Hiring Job Title: Site Reliability Engineer SRE with Python Location Alpharetta GA need 8+ yrs at Alpharetta, Georgia, USA |
Email: [email protected] |
From: Mounikanth, siptek [email protected] Reply to: [email protected] Hello Team !! Hope you are doing well !! I hope this message finds you well. Please go through the below requirement and please do send me good profiles. Job Title: Site Reliability Engineer (SRE) with Python Location :Alpharetta GA Rate: 50/Hr on CTC is max Job Description: We are seeking a highly skilled Stability and Resilience Engineer to join our dynamic team. This role is crucial for ensuring the stability, reliability, and resilience of our systems and applications, with a focus on proactive measures to prevent downtime and mitigate risks. The ideal candidate will have a strong technical background, a passion for problem-solving, and a commitment to excellence in system performance. Key Responsibilities: 1. System Monitoring and Analysis: Implement and manage monitoring solutions to track system performance and health. Analyze system metrics to identify patterns and irregularities that may indicate potential failures. 2. Incident Management: Develop and execute incident response plans to address system outages and failures. Conduct root cause analysis to understand incidents and implement corrective actions. 3. Collaboration: Work closely with development, operations, and other engineering teams to ensure systems are designed with stability and resilience in mind. Participate in architectural reviews and design discussions to provide insights on resilience best practices. 4. Documentation and Reporting: Maintain thorough documentation of system configurations, incident reports, and resilience testing results. Prepare reports and presentations to communicate system performance, risks, and improvement strategies to stakeholders. 5. Continuous Improvement: Stay current with industry trends, tools, and techniques in system stability and resilience. Propose and implement innovations to enhance system performance and reliability. Qualifications: Bachelors degree in Computer Science, Engineering, or related field; Masters degree preferred. Proven experience in systems engineering, reliability engineering, or a similar role. Strong understanding of system architecture, cloud infrastructure, and networking concepts. Proficiency in monitoring tools and techniques (e.g., Prometheus, Grafana, Dynatrace, etc.). Experience with incident management and response frameworks (ITIL, SRE principles). Knowledge of scripting and programming languages (e.g., Python, Bash, Go). Familiarity with DevOps practices and CI/CD pipelines. Excellent problem-solving skills and the ability to work under pressure. Strong communication and collaboration skills, with an ability to work effectively in a team environment. Preferred Qualifications: Relevant certifications (e.g., AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, Certified Kubernetes Administrator). Experience with container orchestration and microservices architecture. Previous involvement in disaster recovery planning and Keywords: continuous integration continuous deployment golang Georgia Hiring Job Title: Site Reliability Engineer SRE with Python Location Alpharetta GA need 8+ yrs [email protected] |
[email protected] View all |
Tue Oct 08 23:30:00 UTC 2024 |