Hiring for Site Observability Engineer (Remote) at Remote, Remote, USA |
Email: [email protected] |
From: Lokesh Shiv, Paramount Software Solutions Inc [email protected] Reply to: [email protected] Site Observability Engineer (USC only) Location: Remote Duration: 12+ months (possibility of conversion to FTE after 6 mos) Project: HANA Cloud Build Team Role Summary: We are seeking a skilled and experienced Site Observability Engineer to join the SAP Observability team. The ideal candidate will be responsible for improving our monitoring and alerting posture for Cloud Infrastructure. The role requires a strong understanding of observability tools and practices, with a focus on Prometheus, Grafana, Gardener Kubernetes, and Splunk. Experience with Dynatrace is a plus. Skills: 1. Implement, manage, and improve monitoring solutions that use Prometheus, ensuring high availability and accurate alerting for our systems. 2. Contribute to the development of observability strategies to improve our Cloud monitoring posture. 3. Collaborate with development teams to integrate observability into the CI/CD pipeline and throughout the application lifecycle. 4. Respond to and investigate incidents, providing thorough post-mortem analyses and implementing preventive measures. 5. Stay current with the latest trends and best practices in site reliability and observability. 6. Work with cross-functional teams to ensure system reliability, scalability, and performance. Qualifications: 1. Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience. 2. Proven experience with observability tools such as Prometheus, Grafana, and Splunk. 3. Hands-on experience with Kubernetes and container orchestration, preferably with Gardener Kubernetes. 4. Familiarity with logging, monitoring, and application performance management (APM) tools; experience with Dynatrace is a plus. 5. Strong understanding of cloud infrastructure, networking, and distributed systems. 6. Excellent problem-solving and analytical skills, with the ability to work independently and as part of a team. 7. Strong communication skills and the ability to work effectively with both technical and non-technical stakeholders. 8. Experience with scripting and automation tools. (Python, Terraform, Ansible, etc.) Regards, Lokesh Shiv Email: [email protected] Keywords: continuous integration continuous deployment Hiring for Site Observability Engineer (Remote) [email protected] |
[email protected] View all |
Thu Jun 06 05:02:00 UTC 2024 |