Home

Site Reliability Engineer at USA-California Remote at California, Maryland, USA
Email: [email protected]
From:

Devyani Kumari,

Absolute IT

[email protected]

Reply to:   [email protected]

Position: Site Reliability Engineer

Location: USA-California Remote

Duration: 6+ months

Looking for a highly motivated Site Reliability Engineer, who is capable of build and run large-scale, massively distributed, fault-tolerant systems. Individual to work with teams across the organization and ensures core services reliability and keep an eye on capacity and performance.

This is for a migration from AWS into GCP. Knowledge and experience with GCP is mandatory, knowledge of AWS is nice to have.

Responsibilities:

Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement.

Experience in Designing and Deploying multi-data center Large Scale Web Applications.

Work closely with dev, and ops teams to build highly available, cost-effective systems.

Create new tools and scripts designed for auto-remediation of incidents.

Design/Implementation of Big Data technologies, including Hadoop, MongoDB, Kafka, RabbitMQ, Zookeeper, Spark, ELK, etc.

Responsible for establishing end-to-end monitoring and alerting on all critical aspects to ensure SLAs and get proactive notifications of possible issues for all systems.

Design platforms for extremely high uptime metrics.

Works well independently and requires little or no supervision.

Work with cloud operations team to resolve trouble tickets, developing and running scripts, and troubleshooting.

Fully understand the application, microservices interactions.

Design/Implementation containers/applications in scalable HA/DR multi-tier cloud environments, including new system design, documentation, implementation, and deployment.

Participate in 24x7 an on-call rotation.

Job Description:

Job Requirements (7+ years of experience in the following areas):

Experience in providing L4 technical support for production 24x7.

Strong experience in production support and operations.

Design/Implementation of network and presentation tier technologies, including F5, Apache, Nginx, etc.

Experience in Performance Testing/Tuning/Monitoring, maximizing system uptime and availability, ensuring functional and performance SLAs.

Experience with monitoring Application/Infrastructure Performance, and availability.

Automation Experience with Build/deployment, Software Configuration/Continuous Integration/Continuous Delivery/Release Engineering related tasks in an JavaEE/C++ Environments.

Experience in automating manual processes using Python, Ruby, Unix Shell (bash, ksh), perl, Ant, etc.

Installing, Configuring, Administering, and Tuning of JavaEE Application Servers/Containers like Tomcat, WebSphere, etc.

Installing/maintaining/Administering software on Unix Linux, Windows servers.

Experience with Web service technologies, including REST, SOAP, JSON, XML.

Experience with Cloud Platforms and virtualization Technologies.

Deploying and automating infrastructure/applications in cloud environment using Chef, RPM, etc.

Working closely with Development, QA, Product Management, and Production Ops teams to make sure Product Releases on-time with quality.

Hands on experience Configuring and Administering SCM (GIT, SVN), Build (CMake, Make files, Maven), CI(Jenkins), CD Automation Tools.

Keywords: cplusplus continuous integration continuous deployment quality analyst information technology ffive
[email protected]
View all
Wed Dec 06 21:25:00 UTC 2023

To remove this job post send "job_kill 914191" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,