Job Details

Home

100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed at Strong, Arkansas, USA

Email: [email protected]

From:

Gaurav Gaur,

DMS VISIONS INC

[email protected]

Reply to: [email protected]

Hi,

Hope you are doing well,

Please find the job description given below and let me know your interest.

Position

: 100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed

Location: 100% Remote

Duration :6+ Months

Visa :

Any

Job Description

:

About the job

Environment: DEVops=SRE

AWS

net

Kubernetes

Gravana

KEY REQUIRED SKILLS

Expert: ansible-terraform kubernetes

Expert; AWS devops pro cert Preferred

Good: AWS EKS

Good APM

Overview

We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of our application environments. This position is responsible for overseeing application policies and procedures to ensure the integrity and availability of applications. The Site Reliability Engineer is responsible for working with the product development teams and DevOps teams, focusing on the consideration for web and applications regarding deployment, performance and availability for all applications being developed.

Responsibilities

Drive focused initiatives that improve operational efficiency and scalability of the platform and applications

Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization

Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services Understand modern software security and secure software systems with cloud-based infrastructure

Provide full-stack diagnostics and determine root cause of internal problems

Analyze operational performance which support delivering improvements to critical related system metrics & KPIs

Examine all areas of infrastructure and applications for improvement and suggest changes, rather than wait for direction

Safeguard application information against accidental or unauthorized damage, modification, or disclosure

Build and maintain redundant systems and procedures for high availability and disaster recovery

Develop integrated workflows for our support teams

Own the customer experience think and act in ways that put our customers first, provide them a great digital experience, and make them promoters of our products and services

Respond to and help troubleshoot incidents

Participate in a 24x7 on-call rotation

Key Skills and Competencies Needed

5+ years of extensive experience with Infrastructure as a Code (IaaC) and Desired State Configuration (DSC) tools such as Terraform and Ansible

5+ years of experience packaging, deploying and managing containerized workloads running in common PaaS solutions (i.e. Docker, Kubernetes)

5+ years expertise in managing AWS infrastructure at scale including expertise in the following services: EC2, S3, Elastic Load Balancing, Lambda, Route 53, ECS, SQS, CloudWatch

Prior experience working in a DevOps or SRE environment

Highly experienced with automation and scripting using languages such as: PowerShell, Python, Bash

Large-scale monitoring and reporting experience using ELK stack, Dynatrace (or other APM)

Experience with MS Windows IIS management, troubleshooting, and performance monitoring

Experience managing web farms in a high-traffic SaaS environment

Strong analytical and problem-solving skills including robust troubleshooting skills with a focus on preventative and proactive actions

Extensive experience with .NET applications architecture components (caching, content delivery, high availability, load balancing, etc.)

Understanding of the Software/Application Development Life Cycle process and experience with implementing and maintaining CI/CD technologies including: TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etc.

Knowledge of or experience with most of the following technologies:

Active Directory, SSL, FTP, Big-IP F5, T-SQL, MongoDB, MySQL, SQL Server, Nagios, Git, TeamCity, Octopus Deploy, Codefresh, Chef, Salt, Docker, Kubernetes, Kafka, Azure, Linux Server Administration, Bash, Apache

If you are interested, please share your updated resume and suggest the best number & time to connect with you

Thanks & Regards,

Gaurav Gaur

Email:
[email protected] | Phone :

972-645-9280

LinkedIn:

https://www.linkedin.com/in/gaurav-gaur-hr/

DMS Vision ,INC

4645 Avon Lane, Suite 210

Frisco, TX 75033

Keywords: continuous integration continuous deployment sthree ffive microsoft Texas
100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed
[email protected]

[email protected]
View all

Thu Aug 22 19:33:00 UTC 2024

To remove this job post send "job_kill 1683557" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

gaurav@dmsvisions.com wrote:
From:

Gaurav Gaur,

DMS VISIONS INC

gaurav@dmsvisions.com

Reply to:   gaurav@dmsvisions.com

Hi,

Hope you are doing well,

Please find the job description given below and let me know your interest.

Position

: 100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed

Location: 100% Remote

Duration :6+ Months

Visa :

Any

Job Description

About the job

Environment: DEVops=SRE

AWS

net

Kubernetes

Gravana

KEY REQUIRED SKILLS

Expert: ansible-terraform kubernetes

Expert; AWS devops pro cert Preferred

Good: AWS EKS

Good APM

Overview

We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of our application environments. This position is responsible for overseeing application policies and procedures to ensure the integrity and availability of applications. The Site Reliability Engineer is responsible for working with the product development teams and DevOps teams, focusing on the consideration for web and applications regarding deployment, performance and availability for all applications being developed.

Responsibilities

Drive focused initiatives that improve operational efficiency and scalability of the platform and applications

Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization

Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services Understand modern software security and secure software systems with cloud-based infrastructure

Provide full-stack diagnostics and determine root cause of internal problems

Analyze operational performance which support delivering improvements to critical related system metrics & KPIs

Examine all areas of infrastructure and applications for improvement and suggest changes, rather than wait for direction

Safeguard application information against accidental or unauthorized damage, modification, or disclosure

Build and maintain redundant systems and procedures for high availability and disaster recovery

Develop integrated workflows for our support teams

Own the customer experience  think and act in ways that put our customers first, provide them a great digital experience, and make them promoters of our products and services

Respond to and help troubleshoot incidents

Participate in a 24x7 on-call rotation

Key Skills and Competencies Needed

5+ years of extensive experience with Infrastructure as a Code (IaaC) and Desired State Configuration (DSC) tools such as Terraform and Ansible

5+ years of experience packaging, deploying and managing containerized workloads running in common PaaS solutions (i.e. Docker, Kubernetes)

5+ years expertise in managing AWS infrastructure at scale including expertise in the following services: EC2, S3, Elastic Load Balancing, Lambda, Route 53, ECS, SQS, CloudWatch

Prior experience working in a DevOps or SRE environment

Highly experienced with automation and scripting using languages such as: PowerShell, Python, Bash

Large-scale monitoring and reporting experience using ELK stack, Dynatrace (or other APM)

Experience with MS Windows IIS management, troubleshooting, and performance monitoring

Experience managing web farms in a high-traffic SaaS environment

Strong analytical and problem-solving skills including robust troubleshooting skills with a focus on preventative and proactive actions

Extensive experience with .NET applications architecture components (caching, content delivery, high availability, load balancing, etc.)

Understanding of the Software/Application Development Life Cycle process and experience with implementing and maintaining CI/CD technologies including: TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etc.

Knowledge of or experience with most of the following technologies:

Active Directory, SSL, FTP, Big-IP F5, T-SQL, MongoDB, MySQL, SQL Server, Nagios, Git, TeamCity, Octopus Deploy, Codefresh, Chef, Salt, Docker, Kubernetes, Kafka, Azure, Linux Server Administration, Bash, Apache

If you are interested, please share your updated resume and suggest the best number & time to connect with you

Thanks & Regards,

Gaurav Gaur

Email: 
gaurav@dmsvisions.com | Phone :

972-645-9280

LinkedIn:

https://www.linkedin.com/in/gaurav-gaur-hr/

DMS Vision ,INC

4645 Avon Lane, Suite 210

Frisco, TX 75033

Keywords: continuous integration continuous deployment sthree ffive microsoft Texas 
100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed
gaurav@dmsvisions.com

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 28

Location: , Remote