
Senior Site Reliability Engineer | Atlanta, GA (Hybrid, Local Only)
Email: [email protected]
Role: Senior Site Reliability Engineer

Location: Atlanta, GA (Hybrid, local only)

Duration: 12+ Months Contract

Visa: USC/GC or H4-EAD only

Qualifications:

We are looking for a Senior Site Reliability Engineer who is well versed in modern reliability disciplines and can drive cross-team reliability initiatives. These initiatives include improving Delta reliability engineering practices through increased application resiliency, increased uptime/availability, and improved application performance. An ideal candidate would have prior experience implementing observability plans around logs, metrics, and traces.

YOUR RESPONSIBILITIES IN THIS ROLE

Strong experience setting SLOs / SLIs / error budgets and managing reliability for infrastructure and applications

Proficient in one or more of the following scripting languages and tools: JavaScript, Node.js, Python, Maven, Ansible, Bash, etc.

Experience handling large numbers of diverse systems with configuration management systems like Puppet, Chef, Ansible

Proven history of toil elimination by leveraging automation

Strong background using tools like PagerDuty for managing incidents

Strong experience with monitoring and alerting systems like Prometheus, Grafana, Dynatrace

Understanding of standard networking protocols and components such as HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing strategies

Experience in Serverless Application Framework

Experience in containerized workloads and management platforms such as Docker or Kubernetes

Familiarity with distributed systems including Microservices

Experience in Infrastructure automation tools such as CloudFormation, Terraform

Understanding of CI/CD processes and experience with deployment automation tools such as CodePipeline, CodeDeploy, Jenkins, Bamboo

Strong debugging, troubleshooting, and problem-solving skills

Effective communication, collaboration & negotiation skills with the ability to interface with various business units and third parties

Experience liaising with developers, operations staff and third-party resources

Experience with API integration projects

Ability to coach/mentor team members on multiple aspects of reliability engineering

Must Have Expertise

1. Experience in DevOps practices

2. Hands-on experience with AWS Cloud and DevOps principles

3. Experience working with DevOps tools (GitLab CI, AWS CodePipeline)

4. Experience in scripting tools (Bash, Python, etc.)

5. Experience in developing NodeJS or TypeScript applications.

6. Experience in building and supporting applications in AWS and engineering applications in the AWS infrastructure using its native services.

7. Experience in AWS CDK

8. Ability to troubleshoot and resolve problems with existing AWS Cloud Controls

Nice-to-Have Expertise

1. Experience in containerization technologies like Kubernetes, OpenShift, Docker

2. Experience in Application Resiliency evaluation using AWS FIS

3. Experience using Litmus for Chaos Engineering methods.

4. Exposure to Red Hat OpenShift Service on AWS (ROSA)

Responsibilities:

As a lead engineer on the Retail Site Reliability Engineering team, you will be at the forefront of Cloud and Big Data technology. In this role you will establish yourself as a technical leader by exposing yourself to a broad range of industry-leading technologies that will help to drive acceleration. The ideal candidate will have expert design and development capabilities and be positioned to contribute to a growing set of services and features for the ecosystem. This role will support highly available, business-critical applications and will serve as the escalation point for complex and hard-to-define issues in both on-premises and AWS environments. We are seeking talented engineers who are well versed in DevOps technologies, automation, infrastructure orchestration, configuration management, continuous integration, and troubleshooting of complex issues, and who are not constrained by how things are usually done.

Thanks & Regards,

Gaurav Jangid

Senior Technical Recruiter

Email: [email protected]

LinkedIn: https://www.linkedin.com/in/gaurav-jangid-97a5241ab/

806, New Castle, Wilmington, DE, US, 19801

Mon Oct 21 22:13:00 UTC 2024
