SRE Engineer at McLean, Virginia, USA |
Email: [email protected] |
From: Abhishek, heliogic [email protected]
Reply to: [email protected]

Job Title: IMS SRE Engineer
Location: McLean, VA (Day 1 onsite)
Must have: IMS experience
Duration: Long-term contract
Experience required: 8+ years only

Please find below the detailed job description for this IMS SRE Consultant requirement, and submit profiles accordingly.

Responsibilities:
Development and maintenance of programs written in ELK and Python.
Cloud deployments and maintenance.
Automate health monitoring of the production environment.
Automate return-to-service procedures for cloud platform components.

The Role offers:
Hire, develop, and retain a team of SREs.
Mentor, grow, and empower your team by giving them the skills, confidence, and motivation to make decisions independently that lead to their personal and professional success, and enable them to become technical leaders.
Coach the team on SRE principles: automation, visibility improvements, toil reduction, self-healing, and root cause analysis.
Manage delivery of high-visibility projects, including estimation, schedule, risks, and dependencies.
Establish standard practices and processes for planning and prioritizing reliability work.
Drive a culture of reliability, and ensure teams are aligned around common priorities and approaches.
Participate in deep technical design discussions within your team and across partner teams, and ensure that we're building the right systems and keeping the quality high.
Drive cross-team alignment across development and other organizations around reliability initiatives.
Meet with internal customers, supporting deep-dive conversations on product capabilities and escalation or problem resolution surrounding incidents.
Participate in a 24x7 management escalation on-call rotation.

Essential Skills:
Strong working experience with Linux administration and build/release/config management.
Experience working with cloud deployments and maintenance.
Experience with cloud monitoring tools like Prometheus, Logstash, and/or the ELK stack.
Experience operating services in AWS/GCP/Azure clouds using Terraform and other technologies.
Knowledge of object-oriented design principles and patterns.
Knowledge of SOA web services: REST, SOAP, XML-RPC, XML, JSON.
Experience operating large-scale distributed systems in first-party and cloud environments.
Experience with Agile development methodologies and test-driven development.
Excellent interpersonal skills and an ability to build effective networks and leverage them appropriately.
Excellent troubleshooting skills, with the ability to learn new technologies in complex distributed systems.
Good experience in any of the scripting/programming languages: Python, GoLang, etc.

Essential Qualification:
Any Bachelor's or Master's degree is a must.

Keywords: Virginia |
Wed Aug 02 20:28:00 UTC 2023 |