Local Senior Site Reliability Engineer needed for Atlanta, GA at Atlanta, Georgia, USA |
Email: [email protected] |
Hi, Greetings of the day! I have an urgent requirement below, please go through JD and let me know if you are comfortable or have any profile. Kindly revert me back with your updated resume as well. Job Title: Senior Site Reliability Engineer Location: Atlanta, GA (Work from Office) Duration: Long Term Contract Qualifications: We are looking for a Senior Site Reliability Engineer who is versed in modern reliability disciplines and can drive cross-team reliability initiatives. These initiatives include improving Delta reliability engineering practices through increased application resiliency, increased uptime/availability and improving application performance. An ideal candidate would have prior experience implementing observability plans around logs, metrics, and traces. YOUR RESPONSIBILITIES IN THIS ROLE: Strong experience setting SLOs / SLIs / error budgets and managing of reliability for infrastructure and applications Proficient in one or more of the following scripting languages: JavaScript, Nodejs, Python, Maven, Ansible, Bash, etc Experience handling large numbers of diverse systems with configuration management systems like Puppet, Chef, Ansible Proven history of toil elimination by leveraging automation Strong background using tools like PagerDuty for managing incidents Strong experience with monitoring and alerting systems like Prometheus, Grafana, Dynatrace Understanding of standard networking protocols and components such as HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing strategies Experience in Serverless Application Framework Experience in containerized workloads and management platforms such as Docker or Kubernetes Familiarity with distributed systems including Microservices Experience in Infrastructure automation tools such as CloudFormation, Terraform Understanding of CI/CD processes and experience with deployment automation tools such as Code Pipeline, Code Deploy, Jenkins, Bamboo Strong debugging, troubleshooting, and problem-solving skills Effective communication, collaboration & negotiation skills with the ability to interface with various business units and third parties Experience liaising with developers, operations staff and third-party resources Experience with API integration projects Ability to coach/mentor team members on multiple aspects of reliability engineering Must Have Expertise: 1. Experience in DevOps practices 2. Hands on experience with AWS Cloud and DevOps principles 3. Experience working on DevOps tools (GitLab CI, AWS-CodePipeline) 4. Experience in Scripting tools (Bash, Python etc.) 5. Experience in developing NodeJS or TypeScript applications. 6. Experience in building and supporting applications in AWS and engineering applications in the AWS infrastructure using their Native services. 7. Experience in AWS CDK 8. Ability to troubleshoot and resolve problems with existing AWS Cloud Controls Nice-To Have Expertise: 1. Experience in Containerization technologies like Kubernetes, OpenShift, Docker 2. Experience in Application Resiliency evaluation using AWS FIS 3. Experience using Litmus for Chaos Engineering methods. 4. Exposure to RedHat OpenShift on AWS (ROSA) Thanks & Regards, Pradeep Email: [email protected] "Caltriko Solutions is an Equal Opportunity Employer. We are committed to creating an inclusive environment for all employees and do not discriminate based on race, color, religion, sex, national origin, age, disability, or any other legally protected status." Keywords: continuous integration continuous deployment golang Georgia Local Senior Site Reliability Engineer needed for Atlanta, GA [email protected] |
[email protected] View all |
Tue Oct 22 23:01:00 UTC 2024 |