Home

SRE engineer || O Fallon, MO (Onsite - Only share Local profiles) at Fallon, Nevada, USA
Email: [email protected]
From:

Ambika,

Technosphere Inc.

[email protected]

Reply to:   [email protected]

Exp - Minimum 9+ years exp

Job Description

As a Site Reliability Engineer manages our production environment, providing a highly available and scalable platform for Ekata to serve our customers. The infrastructure team provides a resource for Engineering to help diagnose production issues and provide guidance on improving the availability and performance of our applications. This position also develops systems, automation, and tools to help make it easier for Engineering teams to deploy services in a fast, automated, and reliable fashion. 

In this role You will:

Build, scale and support high-availability Ubuntu Linux production and development systems in a public cloud environment.

Work with tools such as Jenkins, Ansible, Argo CD, Terraform, CloudFormation, Resource Manager and many more to ensure that our stack is well represented as Infrastructure as Code.

Manage and Improve security and availability monitoring for all services, ensure defined security policies are consistently implemented across all environments.

Deploy workloads to multiple cloud environments, proven experience with all of the core services within AWS, Azure or GCP, including instance management, IAM configuration, Database, Caching and general support/troubleshooting.

Have a developed understanding of the core components required to run Kubernetes and be able to build a cluster from scratch if needed.

Have perfected the fundamentals of load balancing, service mesh and always looking for ways to improve availability and uptime.

Maintain quality documentation for systems owned by the Infrastructure team.

Use monitoring tools to identify and resolve issues before they happen. Have familiarity with Prometheus.

Help other teams troubleshoot and solve failures and performance problems, participate in on-call rotations.

Have a passion for working with Go, Python, Rust or even Bash to build custom tools and improve system integration. Take code ownership to the

next level and act as an advocate for writing code that aligns with industry best practice.

Have a solid grasp on networking fundamentals and can easily explain how DNS, DHCP and routing work in most environments.

All About Candidate:  

Excellent spoken and written English skills. Is a team player and values collaboration.

BS degree in Computer Science or equivalent experience.

Proven skills with Linux or UNIX systems and related protocols/software with 3+ years experience.

A command of Linux systems including troubleshooting, memory management, tuning, I/O subsystem, RAID, and security.

Experience with provisioning tools such as Ansible/Chef/Terraform.

Experience with Jenkins or other CI/CD tools.

Programming aptitude in Go, Python, and Bash.

Working knowledge of database systems such as MySQL or PostgreSQL.

Experience building and deploying Containers, including orchestration tools such as Kubernetes, Mesos, or Docker Swarm.

Experience with cloud providers (AWS, Azure, GCP) 

has context menuComposeParagraph

Keywords: continuous integration continuous deployment information technology golang
[email protected]
View all
Tue Sep 12 23:49:00 UTC 2023

To remove this job post send "job_kill 630041" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,