Site Reliability Engineer: Atlanta, GA (Need Locals) at Atlanta, Georgia, USA
Email: [email protected]
From:

Satnam Singh,

SPAR Information Systems

[email protected]

Reply to:   [email protected]

Hello All,

Hope you are doing great.

Please go through the job description and let me know your interest.

Role: Site Reliability Engineer

Location: Atlanta, GA (Hybrid from day 1) (Need Locals)

Duration: Long Term Contract

Job Description:

Mandatory Skills: Kubernetes, Java API, Cloud Services, DevOps Tools

Optional Skills: AWS, Agile Scrum, API Gateway

Our client's telecommunications practice is looking for dynamic and driven professionals to join a rapidly growing, high-performance team.

Our client is a leading provider of digital Global System for Mobile Communications/General Packet Radio Service (GSM/GPRS) wireless voice and data technology standards.

The Site Reliability Engineer, ACE Platform Engineering, will support the critical API Platform, DevOps, and other activities for the Digital Services Group. Position duties and responsibilities include, but are not limited to:

Provide consulting services for improved system stability, availability, performance and reliability.

Assist in determining the impact of operational issues and provide input into their resolution via data extraction and quantification.

Work through day-to-day support issues, ensure effective and timely resolution of issues in the production environment, and troubleshoot customer-impacting issues.

Forecast and plan for a rapidly growing environment.

Support multiple applications, specifically Solo Gloo/Kubernetes/PCF/GCP/Java-based systems running in an enterprise environment.

Support Gloo running on Kubernetes, Grafana, Prometheus, Cassandra, Postgres, and Spring Boot or Java-based applications running on PCF and WebLogic.

Apply monitoring and create complex alerts and dashboards for production systems.

Provide capacity analysis and tuning analysis for cloud applications on Linux and container platforms.

Be available to provide 24x7 on-call support on a rotating basis with other team members.

Lead efforts in troubleshooting, recovery, and root cause investigation.

Perform analysis of user requirements and problems to automate or improve systems and review system capabilities, workflow, and scheduling limitations.

Able to follow and develop detailed work plans, schedules, project estimates, resource plans, and status reports.

Facilitate DR (Disaster Recovery) exercises to ensure that the team is fully prepared for any event.

Lead root cause analysis sessions to understand what causes issues in production and come up with solutions that will prevent them from happening in the future.

Ensure documentation is created and remains updated for any related work.

Strong understanding of UNIX operating systems and any scripting language.

Evaluate product and service solutions.

Skill requirements:

Strong hands-on experience in Kubernetes, infrastructure, and support.

Strong experience in DevOps practices for microservices using Kubernetes as the orchestrator.

Strong experience with cloud configurations and services.

Strong experience in API microservices.

Experience with tools like NGINX, Docker, Postman, SoapUI, ELK, Splunk, AppDynamics, CI/CD tools, and GitLab.

Good experience in performance measurement and tuning, capacity planning and management, contingency planning, and disaster recovery.

Strong scripting knowledge and experience.

Good understanding of networking and routing.

PRIMARY REQUIREMENTS:

Master's degree in Information Technology, Computer Science, Computer Information Systems, Computer Applications, a related field, or its equivalent, and 3 years of relevant work experience.

ALTERNATIVE REQUIREMENTS:

Bachelor's degree in Information Technology, Computer Science, Computer Information Systems, Computer Applications, a related field, or its equivalent, and 5 years of relevant work experience.

Thanks & Regards,

Satnam Singh

Direct: 201 623 3660

Email : [email protected] 

Keywords: continuous integration continuous deployment user interface golang Georgia
Wed May 01 02:03:00 UTC 2024

