Urgent hiring for Site Reliability Engineering || Philly, PA (Day 1 Onsite)
Email: [email protected]
Role: Site Reliability Engineering
Location: Philly, PA (Day 1 Onsite)
Duration: Contract

Responsibilities:

1. Observability and Monitoring:

   o Develop and implement robust observability strategies, including logging,
     metrics, and tracing, to gain deep insights into the performance and
     health of our systems.
   o Collaborate with cross-functional teams to establish and enforce best
     practices for instrumentation, logging, and monitoring throughout the
     software development lifecycle.

2. Site Reliability Engineering:

   o Lead initiatives to improve the reliability, availability, and
     scalability of our applications and infrastructure.
   o Collaborate with development teams to design and implement systems that
     are resilient to failures and capable of quick recovery.
   o Drive the adoption of SRE principles and practices across the
     organization.

3. Incident Management:

   o Develop and refine incident response processes, ensuring timely
     detection, analysis, and resolution of incidents.
   o Collaborate with teams to conduct post-incident reviews, identify root
     causes, and implement preventive measures.

4. Automation and Tooling:

   o Build and maintain automation tools for deployment, monitoring, and
     incident response to streamline operational processes.
   o Evaluate and integrate third-party tools to enhance observability and
     SRE capabilities.

5. Collaboration and Leadership:

   o Provide technical leadership and mentorship to the engineering team.
   o Collaborate with product managers, architects, and other stakeholders to
     align observability and SRE initiatives with business goals.

Qualifications:

1. Bachelor's or higher degree in Computer Science, Software Engineering, or
   a related field.
2. Extensive experience in software engineering with a focus on
   observability, monitoring, and SRE.
3. Strong expertise in designing and implementing distributed systems for
   high availability and reliability.
4. Proficiency in APM (application performance monitoring), RUM (real user
   monitoring), synthetics, correlation, and alert and incident management
   is required (e.g., OpenTelemetry, Jaeger, Kloudfuse, ServiceNow).
5. Proficiency in one or more programming languages (e.g., Java, Python, Go).
6. Experience with cloud platforms (e.g., AWS, Azure, GCP) and container
   orchestration (e.g., Kubernetes).
7. In-depth knowledge of observability tools and frameworks (e.g.,
   Prometheus, Grafana, ELK stack, Datadog, Aternity) and incident
   management processes.
8. In-depth knowledge of ML and AI techniques (e.g., anomaly detection,
   outlier detection, AIOps, LLMs).
9. Excellent communication and collaboration skills.
10. Demonstrated ability to lead technical initiatives and mentor team
    members.

Preferred Qualifications:

1. Certifications in relevant areas such as AWS Certified DevOps Engineer,
   Certified Kubernetes Administrator (CKA), or equivalent.
2. Previous experience in a leadership or management role.
3. Familiarity with Infrastructure as Code (IaC) tools such as Terraform,
   Packer, and Crossplane.

Anurag Kumar

[email protected]

848-209-8381

linkedin.com/in/anurag-kumar-bbb127232

www.siriinfo.com

3 Ethel Rd, Suite #302, Edison, NJ 08817

CPUC Certified


We are an equal opportunity employer with a diverse workforce.

Thu Dec 14 21:00:00 UTC 2023
