Home

Site Reliability Engineer - Charlotte NC - NEED ONLY LOCAL CANDIDATES at Charlotte, North Carolina, USA
Email: [email protected]
From:

Abhishek Garg,

USG Inc.

[email protected]

Reply to: [email protected]

Site Reliability Engineer

Charlotte NC - Hybrid Onsite from day 1

12 Months

Lately weve seen several profile rejections in our internal tech screening. Hence, weve requested and received below updates from the client on what their expectations are. Kindly have a look; also, please ensure that the resumes are well updated with their roles showcasing these technologies in their projects rather than just the one liners.

1. Splunk

Understanding of important commands like index, dedup, rex, uniq, lookup, timechart, transaction, etc.

Hands on experience on creating new dashboards.

Analyze and troubleshooting using Splunk tool.

They should know process of optimizing Splunk queries for large scale data. Also discuss the technique.

Hands-on expertise in building monitoring dashboards and setting up alerts using Splunk.

2. AppDynamics

Primary purpose

Creating health rules,

Hands on experience

3. Grafana

Knowledge of Grafana

4. Mongo DB and SQL Queries

Hands-on experience in writing Oracle SQL queries and MongoDB queries.

5. Basic Java

Basic understanding of Java, OOPs concept.

Able to analyze and trouble shoot production issues by reading stack trace and exceptions.

Please find JD/Expectation from client:

Site Reliability Engineer:

5-10 years of experience in Production support/SRE teams with continued focus on improving Platform health

Experience working in Micro service architecture.

Hands-on Java coding Exp and able to analyze and trouble shoot production issues by reading stack trace and exceptions.

Familiar with Agile or other rapid application development practices

Hands-on expertise in building monitoring dashboards and setting up alerts using Splunk.

Hands-on experience in writing Oracle SQL queries and MongoDB queries.

Experience with distributed (multi-tiered) systems, algorithms, and relational databases.

Must have working knowledge of APM tools such as Splunk, ELK, Grafana, Prometheus etc.

Knowledge & Exposure caching tools (Redis, memcache) or messaging tools such as MQ, Kafka is a plus.

Working knowledge of CICD is a plus Source control like Git/Bitbucket , Continuous Integration Jenkins / UCD Release etc.

Ability to work with Engineering teams across the ecosystem such as Security , Networking & Infrastructure challenges which can impact platform health & resiliency.

Shell Scripting / DevOps tools like Ansible with good knowledge of YAML file to write playbooks .

Experience with distributed storage technologies like NFS as well as dynamic resource management frameworks PCF, Kubernetes / Open Shift.

A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.

Expectations:

You will be a core member of a SRE support team, will be utilizing the latest technology tools to write code, test cases, working with API specs and automate to maintain the resiliency, performance and availability of Digital Sales & Marketing platforms.

Strong & relevant experience in supporting Web/API platforms built using Java/java script Stack (Spring/Spring boot, Javascript -Angular/react)

Proficiency in dealing with Legacy infrastructure along with cloud infrastructure (on prem & 3rd party) such as PCF or Azure.

Identifying opportunities to adopt to new technologies while improving the efficiency by removing toil and continues to drive efficiency & optimization.

Proactive monitoring of app performance through Splunk, App dashboards, App dynamics & Dynatrace etc.

Represent Platform engineering teams during production outages and collaborate with engineering teams to resolve production outages. Collaborate with stake holders across engineering function to own/derive RCA & work towards permanent resolution.

Plan, support, execute and comply with governance programs/processes in support of a strong control environment in your functional area. Leverage process documentation to improve operational controls and identify and remediate process deficiencies.

Proactively identify, communicate, mitigate, and escalate risk originating from non-compliance of processes, operational errors, and data integrity issues in all applicable processes.

Ability to influence SRE practices within and outside teams to enable a strong DevOps culture within the organization.

Responsible for working with Engineering teams to maintain the SLAs & SLOs. Constantly looking out for opportunities to improve platform metrics & communicate the same to stakeholders.

Tech Stack : Java/J2EE ( Spring, spring boot, python, shell scripting).

Exposure and proficiency in different API styles such as SOAP, REST, Micro services etc.

Regards,

Abhishek Garg

|[email protected]

Keywords: message queue database North Carolina
Site Reliability Engineer - Charlotte NC - NEED ONLY LOCAL CANDIDATES
[email protected]
[email protected]
View all
Mon May 20 22:37:00 UTC 2024

To remove this job post send "job_kill 1410433" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 38

Location: , North Carolina