100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed at Strong, Arkansas, USA |
Email: [email protected] |
From: Gaurav Gaur, DMS VISIONS INC [email protected] Reply to: [email protected] Hi, Hope you are doing well, Please find the job description given below and let me know your interest. Position : 100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed Location: 100% Remote Duration :6+ Months Visa : Any Job Description : About the job Environment: DEVops=SRE AWS net Kubernetes Gravana KEY REQUIRED SKILLS Expert: ansible-terraform kubernetes Expert; AWS devops pro cert Preferred Good: AWS EKS Good APM Overview We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of our application environments. This position is responsible for overseeing application policies and procedures to ensure the integrity and availability of applications. The Site Reliability Engineer is responsible for working with the product development teams and DevOps teams, focusing on the consideration for web and applications regarding deployment, performance and availability for all applications being developed. Responsibilities Drive focused initiatives that improve operational efficiency and scalability of the platform and applications Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services Understand modern software security and secure software systems with cloud-based infrastructure Provide full-stack diagnostics and determine root cause of internal problems Analyze operational performance which support delivering improvements to critical related system metrics & KPIs Examine all areas of infrastructure and applications for improvement and suggest changes, rather than wait for direction Safeguard application information against accidental or unauthorized damage, modification, or disclosure Build and maintain redundant systems and procedures for high availability and disaster recovery Develop integrated workflows for our support teams Own the customer experience think and act in ways that put our customers first, provide them a great digital experience, and make them promoters of our products and services Respond to and help troubleshoot incidents Participate in a 24x7 on-call rotation Key Skills and Competencies Needed 5+ years of extensive experience with Infrastructure as a Code (IaaC) and Desired State Configuration (DSC) tools such as Terraform and Ansible 5+ years of experience packaging, deploying and managing containerized workloads running in common PaaS solutions (i.e. Docker, Kubernetes) 5+ years expertise in managing AWS infrastructure at scale including expertise in the following services: EC2, S3, Elastic Load Balancing, Lambda, Route 53, ECS, SQS, CloudWatch Prior experience working in a DevOps or SRE environment Highly experienced with automation and scripting using languages such as: PowerShell, Python, Bash Large-scale monitoring and reporting experience using ELK stack, Dynatrace (or other APM) Experience with MS Windows IIS management, troubleshooting, and performance monitoring Experience managing web farms in a high-traffic SaaS environment Strong analytical and problem-solving skills including robust troubleshooting skills with a focus on preventative and proactive actions Extensive experience with .NET applications architecture components (caching, content delivery, high availability, load balancing, etc.) Understanding of the Software/Application Development Life Cycle process and experience with implementing and maintaining CI/CD technologies including: TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etc. Knowledge of or experience with most of the following technologies: Active Directory, SSL, FTP, Big-IP F5, T-SQL, MongoDB, MySQL, SQL Server, Nagios, Git, TeamCity, Octopus Deploy, Codefresh, Chef, Salt, Docker, Kubernetes, Kafka, Azure, Linux Server Administration, Bash, Apache If you are interested, please share your updated resume and suggest the best number & time to connect with you Thanks & Regards, Gaurav Gaur Email: [email protected] | Phone : 972-645-9280 LinkedIn: https://www.linkedin.com/in/gaurav-gaur-hr/ DMS Vision ,INC 4645 Avon Lane, Suite 210 Frisco, TX 75033 Keywords: continuous integration continuous deployment sthree ffive microsoft Texas 100% Remote Principal AWS Site Reliability Engineer with Very Strong Ansible-terraform kubernetes 11+ Years Profiles Needed [email protected] |
[email protected] View all |
Thu Aug 22 19:33:00 UTC 2024 |