| SRE + Java at Atlanta, Georgia, USA |
| Email: [email protected] |
|
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=1292088&uid= From: Utsav (IT Resource Manager), ChabezTech [email protected] Reply to: [email protected] Job Title: Kubernetes Site Reliability Engineer + Java Developer (Combined profile) Location: Atlanta, GA (Primary) or Frisco, TX (Secondary) Duration: Long Term Job Description: As a Site Reliability Engineer on the ACE Platform Engineering team, you will play a crucial role in supporting the critical API Platform and DevOps activities for the Digital Services Group. Your responsibilities will include: Implementing monitoring solutions and creating complex alerts and dashboards for production systems. Conducting capacity analysis and tuning for Cloud applications deployed on LINUX and container platforms. Providing 24x7 on-call support on a rotating basis with other team members. Leading efforts in troubleshooting, recovery, and root cause investigation. Analyzing user requirements and problems to automate or improve systems, while reviewing system capabilities, workflow, and scheduling limitations. Developing and following detailed work plans, schedules, project estimates, resource plans, and status reports. Facilitating Disaster Recovery (DR) exercises to ensure team readiness. Leading root cause analysis sessions to identify and prevent production issues in the future. Ensuring documentation is created and updated for all related work. Demonstrating a strong understanding of UNIX operating systems and proficiency in scripting languages. Evaluating product and service solutions. Skill Requirements: Strong hands-on experience in Kubernetes, infrastructure, and support. Proficiency in DevOps practices for Microservices using Kubernetes as an orchestrator. Extensive experience with Cloud configurations and services. Strong familiarity with API microservices. Proficiency with tools such as NGINX, Docker, PostMan, SOAP UI, ELK, Splunk, App Dynamics, CI/CD tools, and GITLab. Experience in performance measures and tuning, capacity planning and management, and contingency and disaster recovery. Strong scripting knowledge and experience. Good understanding of networking and routing. Thanks & Regards Utsav Manager ChabezTech LLC 4 Lemoyne Dr #102, Lemoyne, PA 17043, USA Email: [email protected] | www.chabeztech.com Keywords: continuous integration continuous deployment user interface information technology Georgia Pennsylvania Texas SRE + Java [email protected] http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=1292088&uid= |
| [email protected] View All |
| 05:14 AM 09-Apr-24 |