Job Details

Need 11+ years Lead SRE Local to CA only USC GC GCEAD HYBRID at Newport Beach, California, USA

Email: [email protected]

From:

Amandeep Kaur,

Techrooted INC

amandeep@techrooted.com

Reply to: amandeep@techrooted.com

Hi,

This is Aman, Technical Recruiter at Techrooted. We have an excellent job opportunity for you. If you are interested, please share your updated resume along with your contact details so that we can proceed further.

Lead SRE

Location: Newport Beach, CA
(Hybrid; only candidates local to CA who can go onsite 2-3 days per week)

USC, GC, or GC-EAD only

A solid LinkedIn profile is a must

Mention your current, exact location, visa status, and LinkedIn profile when sharing your resume; otherwise I will not consider it.

TECHNICAL SKILLS

Must Have

Advanced experience with algorithms, data structures, complexity analysis and software design

Demonstrated expertise in microservices lifecycle management (integration, testing, deployment)

Expert knowledge of release software tooling (e.g. Jenkins or Jenkins X, Spinnaker, Harness, Azure DevOps Services, or other cloud-specific tooling)

Expert-level knowledge of containerization technologies, including experience optimizing Docker images and managing the Docker image lifecycle

Expert-level knowledge of Kubernetes preferred, but experience with other orchestration solutions will be considered

Expert-level experience with Linux/Unix/Windows operating systems

Strong experience with several of the following logging and monitoring tools: ELK stack, Prometheus, Stackdriver, New Relic, Datadog, Dynatrace, Splunk, AWS logging and monitoring

Subject matter expert in designing for and supporting one of the three major public cloud providers; AWS is a plus, but experience with any other public cloud provider will be considered

Job Description:

LEAD SRE

As a Lead SRE, you will provide technical leadership, direction, and accountability for platform engineering, system design, and end-to-end implementation to meet and exceed the product or platform non-functional requirements, including quality, security, reliability, availability, and performance. The main responsibilities include, but are not limited to, optimizing design and engineering for new systems and enhancements, including processes and day-to-day activities, to reliably support product rollout and operation in production. The role includes both oversight of production operations for our portfolio of systems and the development and engineering of solutions that improve system reliability and automation.

How you'll help move us forward:

Lead the design, build, and implementation of orchestration and tooling solutions so that repetitive administration tasks are performed efficiently and free of defects

Establish best practices for structuring, automating, building, deploying and monitoring complex distributed software products and environments.

Ensure the reliability and traceability of software releases and deployments of software and infrastructure changes.

Create and maintain platform architecture and design specifications to aid development, testing and maintenance of software environments

Design and implement monitoring and recovery tools to provide for site high availability (HA) and disaster recovery (DR)

Design and develop highly available infrastructure and platform components to meet the needs of our growing and evolving product lines

Design and implement security engineering best practices in all our deployed platforms and environments

Triage alerts, diagnose and resolve critical issues, and manage the implementation of changes

Manage the coordination, documentation, and tracking of critical incidents and the corresponding root cause analysis, ensuring rapid and complete issue resolution and closing the loop appropriately with customers and other key stakeholders.

Collaborate with Delivery Engineers and DevExp Engineers to enhance and implement continuous integration/continuous deployment orchestration systems that reduce friction in software delivery to production

Lead, grow, and mentor other SRE team members.

Evangelize the DevSecOps culture and SRE mindset, and mentor others about reliability and best practices.

Identify and work with other engineering disciplines to implement opportunities for:

Automation

Signal to noise reduction

Prevention of recurring issues, and other actions to reduce time to mitigate service-impacting events and increase the productivity of cloud operations and development resources

Maintain a strong understanding of IaaS, PaaS, and SaaS offerings while building and maintaining a state-of-the-art, cloud-based environment for large-scale data processing

Design and implement processes, technology, and automation for performance testing.

Ensure that implementations and solutions are fully documented, and that solutions are deployed with fully operationalized processes to support the solution lifecycle

The experience you bring:

10-15 years of experience in infrastructure, systems engineering, and software engineering

Advanced knowledge of software engineering in test, test automation frameworks, and tools for applications and/or anything-as-code (infrastructure, configuration, and development tooling such as documentation or diagrams as code)

Advanced knowledge in at least 3 of the following key areas: cloud-native and IaaS architecture (performance testing, monitoring, operations), design (compliance, security), cloud engineering (planning, provisioning), and container orchestration solutions.

Strong understanding of business technology drivers and their impact on architecture design, performance, and monitoring

Advanced knowledge of observability engineering, with hands-on experience implementing and integrating at least 2-3 monitoring and observability platforms such as AppDynamics, Dynatrace, Splunk, Grafana Cloud, or cloud-based observability services in AWS or Azure

A systematic problem-solving approach, coupled with strong communications skills and a sense of ownership and drive.

Hands-on experience in designing, analyzing, scaling, and troubleshooting medium to large scale distributed systems.

Practiced and well versed in SRE methodologies, and passionate about solving operational problems through automation and software engineering.

Ability to communicate technical strategy effectively, vertically and horizontally within the organization, in clear, concise, understandable terms appropriate to the audience's technical understanding and expertise

Demonstrated ability to conceptualize, launch and deliver multiple engineering projects on time and within budget

Demonstrated ability to understand and troubleshoot complex problems under pressure

What makes you stand out:

Subject matter expert in designing for and supporting one of the three major public cloud providers; AWS is a plus, but experience with any other public cloud provider will be considered

Demonstrated expertise in microservices lifecycle management (integration, testing, deployment)

Strong experience with several of the following logging and monitoring tools: ELK stack, Prometheus, Stackdriver, New Relic, Datadog, Dynatrace, Splunk, AWS logging and monitoring

Expert knowledge of release software tooling (e.g. Jenkins or Jenkins X, Spinnaker, Harness, Azure DevOps Services, or other cloud-specific tooling)

Expert-level knowledge of containerization technologies, including experience optimizing Docker images and managing the Docker image lifecycle

Regards

Amandeep Kaur

Sr. Technical Recruiter

E: Amandeep@techrooted.com

TechRooted Inc.

14 Wall Street 20th Floor | New York, NY 10005

www.techrooted.com/

Keywords: information technology golang green card California New York

Mon Nov 11 23:17:00 UTC 2024
