
Site Reliability Engineering (SRE) with Strong Experience in Kubernetes - Austin, TX (Onsite), Only Locals, at Austin, Texas, USA
Email: [email protected]
https://tinyurl.com/4fj4s2y9
https://jobs.nvoids.com/job_details.jsp?id=2420306&uid=e56ac9659088461f9b63eca9d1d4c7aa
From:

Manohar Reddy,

Procorp Systems Inc

[email protected]

Reply to:   [email protected]

Site Reliability Engineering (SRE) with Strong Experience in Kubernetes

Location: Austin, TX (Onsite) Only Locals

Tools & Technologies Required

Python, Java, AWS, Kubernetes, Jenkins, Docker, Splunk, Golang

Design, implement, and maintain highly available and scalable distributed systems.

Develop automation tools and scripts using Java, Python, or other relevant technologies to improve system reliability and efficiency.

Work on AWS-based Kubernetes deployments, optimizing infrastructure reliability.

Identify and resolve platform bugs, providing enhancements to improve Kubernetes efficiency.

Implement best practices in reliability, automation, and performance tuning to streamline operations.

Handle incident tickets ranging from P0 to P4, troubleshooting and resolving critical issues within SLAs.

Investigate recurring issues, conduct root cause analysis (RCA), and implement preventive measures.

Ensure the uptime, reliability, and performance of production systems.

Automate operational processes and eliminate manual intervention.

Collaborate with developers to build scalable and resilient infrastructure.

Monitor and troubleshoot systems, identifying and resolving issues proactively.

Implement and maintain monitoring, logging, and alerting systems.

Participate in on-call rotation for production incident response.

Monitor, troubleshoot, and resolve production incidents, ensuring system uptime and performance.

Optimize infrastructure by implementing best practices in observability, logging, and monitoring (Prometheus, Grafana, ELK, etc.).

Collaborate with development teams to enhance CI/CD pipelines, automate deployments, and improve software delivery processes.

Additional responsibilities

Ensure all application components run smoothly in the Kubernetes and AWS environments.

Support the components (patches, upgrades, issues, configurations) on the application platform.

Manage CI/CD pipelines for the application tools and components.

Automate tasks to improve efficiency and reduce effort.

Create and publish comprehensive dashboards for observability.

Configure and monitor health checks.

User provisioning.

Experience with cloud platforms (AWS, Google Cloud Platform, Azure)

Proficient in programming/scripting languages (Python, Go, etc.)

Strong knowledge of Linux/Unix systems and networking

Familiarity with containerization and orchestration tools (Docker, Kubernetes)

Solid understanding of CI/CD, automation, and infrastructure-as-code principles

Strong problem-solving and troubleshooting skills

Monitoring and remediation of alerts

Thanks & Regards

Manohar Reddy

Senior Technical Recruiter

Procorp Systems Inc

2222 W Spring Creek Pkwy, STE 202, Plano, Texas 75023

E-mail: [email protected]

04:16 AM 13-May-25

