Need - Operations Engineer-SRE at Remote, Remote, USA |
Email: [email protected] |
From: bhavani, Brillius [email protected] Reply to: [email protected] Role : Operations Engineer-SRE Experience (Years): 8-10 Austin, TX / Sunnyvale, CA Role Description: We are looking for an operations engineer tjoin the CryptServices SRE team. The CryptServices SRE team is responsible for systems and services that support a vast number of both Apples internal services as well as services that Apple users directly use. As an Operations Engineer, you will play a crucial role in helping ensure our systems and services are reliable, scalable, performant and operating optimally. You will have the opportunity tlearn and contribute tvarious aspects of site reliability engineering including monitoring, automation, incident response, and infrastructure optimization while making contributions that will help make Apple users experiences better. Key Skills: Knowledge of Linux/Unix fundamentals and network concepts. Hands on Shell scripting, interpreted or compiled languages such as bash, zsh, Perl, Python, C/C++, Go, Java Configuration management/Infrastructure as Code - Ansible, Puppet, Terraform/Terragrunt, CloudFormation Basic understanding of containerization technologies such as Docker or Podman and container orchestration technologies like Kubernetes or Apache Mesos. Strong communication and collaboration skills with the ability twork across functional teams. Awareness of key security principles including encryption and keys (types and exchange protocols) Basic understanding of SRE principles including monitoring, alerting, error budgets, fault analysis, and automation. Responsibilities: Creating tooling tassist in the implementation, maintenance and support of monitoring, observability, alerting and logging systems tensure they remain available and highly reliable. Help and participate in the design and implementation of automated processes and tooling like writing Ansible playbooks, writing tooling monitor different API endpoints. Help in monitoring key performance metrics and proactively identify opportunities for optimization and efficiency gains. Collaborate with cross functional teams troubleshoot incidents, identify root causes and help implement effective solutions prevent recurrence. Help with documenting workflows and procedures, and writing and validating run books Keywords: Python, Shell Scripting, Puppet, Linux/Unix, Docker Keywords: cprogramm cplusplus golang California Texas Need - Operations Engineer-SRE [email protected] |
[email protected] View all |
Wed Jul 24 01:10:00 UTC 2024 |