Home

Only H1B|H4ead Hiring SRE Observability Engineer in Dallas TX at Dallas, Texas, USA
Email: [email protected]
From:

Samir Priya Darshan,

Netonics Inc

[email protected]

Reply to: [email protected]

Mandate Skills:

Help build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams.

Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually.

Able to troubleshoot complicated issues handling OS, Networking, Database in a cloud-based SaaS environment/on-premises environment and handle live production incidents, debug/troubleshoot application, and infrastructure issues, follow and implement SRE best practices.

Monitor application performance, take steps to improve overall application performance and stability and follow through with implementation;

JD:

The Site Reliability Engineer is a fundamental piece of the Site Reliability Engineering team. Site Reliability Engineering is accountable for the availability, reliability, and performance of the services and platforms in a highly transactional 24x7 environment.

What you will do:

Monitor application performance, take steps to improve overall application performance and stability, and follow through with implementation.

Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually.

Able to troubleshoot issues handling OS, Networking, databases in a cloud-based environment/on-premises environment and handle live production incidents, debug/troubleshoot application, and infrastructure issues, follow and implement SRE best practices.

Coordinate with Product owners/business representatives to define Service Level Objectives and error budgets for key functionalities of the projects

Participate in design reviews of software/components with build teams to ensure that they are built right.

Review products prior to production deployments to validate compliance with Service level objectives

Conduct system analysis, and configuration management and develop improvements for system software performance, availability, and reliability.

Work closely with software engineers and QA to ensure the system is responding properly to non-functional requirements such as performance, security, and availability.

Document system knowledge as acquired over time, create runbooks and ensure critical system information is readily available to those who need it.

Maintain and monitor deployment of the servers, docker containers, databases, and general backend infrastructure.

Participate in production feedback sessions, problem management calls to identify opportunities for product improvement.

What youll bring:

Bachelors Degree in Computer Science or related; or equivalent combination of education and experience

5+ years experience in full-stack application support/SRE role

Experience in Javascript, Typescript and web development technologies

Proficient in scripting languages such as Powershell and/or Python

Troubleshooting experience of complex application incidents built in AWS stack

Experience in conducting design reviews of software components and leading performance, capacity and chaos experiments.

Extensive Experience with observability platforms (Data dog) is required. Experience with built-in browser side diagnostic tools is expected.

Knowledge of DevOps methodologies and the tools involved such as CI/CD concepts, CI/CD tools (Jenkins, CodePipeline, etc.), and automation and configuration tools (Puppet, Ansible, etc) a plus.

Hands on experience with AWS public cloud is a must, Project implementation experience on public cloud is a plus.

Ability and willingness to adapt to new application stacks and new technology concepts as the business evolves over time

Excellent communication skills, both verbal and written

Ability to collaborate with local and remote teams in different time zones

Ability to present/lead technical discussions with product, cloud COE, security and other support teams

Keywords: continuous integration continuous deployment quality analyst information technology
Only H1B|H4ead Hiring SRE Observability Engineer in Dallas TX
[email protected]
[email protected]
View all
Fri Aug 16 18:45:00 UTC 2024

To remove this job post send "job_kill 1665789" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 0

Location: ,