Site Reliability Engineer With Kubernetes - Hybrid - San Jose, CA -ONLY GC/CITIZEN at Remote, Remote, USA |
Email: [email protected] |
Greetings of the day Kubernetes SRE - Please Share Kubernetes Engineer resume. 6-month temporary contract (with possible extension/conversion Hybrid (3 days in office, 2 remote) in San Jose, CA' Pay range: $65/hr ctc. USC/GC- Hybrid locals only Client - Zscaler About the Role: We are seeking a skilled and experienced Site Reliability Engineer (SRE) to join our team. The primary focus of this role is to develop and maintain a comprehensive observability solution for our Kubernetes-based applications. The ideal candidate will be proficient in using various monitoring and logging tools to ensure the reliability and scalability of our services. Key Responsibilities: Design and Implementation : Develop and implement observability solutions for Kubernetes-based applications using Fluentbit, Cloud Watch, StackDriver, Grafana Loki, Grafana Tempo, Prometheus, Envoy Health Probes, Open Telemetry, and ArgoCD. Monitoring and Logging : Configure and maintain logging pipelines using Fluentbit to collect process, and route logs for storage and analysis. Metrics and Tracing : Set up Prometheus for metrics collection and Grafana Tempo for distributed tracing. Integrate these with Grafana for real-time monitoring and alerting via open telemetry. Telemetry : Utilize Open Telemetry to instrument applications for better traceability and observability. CI/CD : Use ArgoCD for continuous deployment and ensure observability tools are integrated into the CI/CD pipeline to deploy the observability suite. Observability Optimization : Analyze and optimize the performance of the observability stack to ensure minimal overhead and maximum efficiency. Troubleshooting : Proactively identify and resolve issues related to the observability infrastructure. Collaborate with development and operations teams to troubleshoot and resolve incidents. Documentation and Training : Document observability processes and best practices. Provide training and support to other team members on observability tools and techniques. Required Skills and Qualifications: Experience : Proven experience as an SRE or in a similar role, with a strong focus on observability in Kubernetes environments supporting applications in EKS in AWS. Technologies : Hands-on experience with Fluentbit, Cloud Watch, StackDriver, Grafana Loki, Grafana Tempo, Prometheus, Envoy Health Probes, Open Telemetry, and ArgoCD. Kubernetes : In-depth knowledge of Kubernetes and container orchestration. Scripting and Automation : Proficiency in scripting languages such as Python, Bash, or similar for automation tasks. Monitoring and Logging : Strong understanding of monitoring, logging, and tracing concepts and best practices. Problem Solving : Excellent analytical and problem-solving skills. Collaboration : Strong communication skills and the ability to work effectively in a team environment. Continuous Improvement : A proactive attitude towards identifying opportunities for improvement and implementing solutions. Preferred Qualifications: Certifications : Relevant certifications such as Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD) Cloud Platforms : Experience with cloud platforms such as AWS and EKS. DevOps Practices : Familiarity with DevOps practices and tools. -- Thanks & Regards, Arshad Ali Technical Recruiter. Nascent Global LLC. D: 469-844-7115 Mail id : [email protected] http://nascentglobal.com/ Enabling Digital Transformation 3838 Oak Lawn Avenue Suite 1000 PMB #233 Dallas, TX 75219. https://www.linkedin.com/in/arshad-ali-68729170 -- Keywords: continuous integration continuous deployment information technology green card California Idaho Texas Site Reliability Engineer With Kubernetes - Hybrid - San Jose, CA -ONLY GC/CITIZEN [email protected] |
[email protected] View all |
Tue Jul 30 21:24:00 UTC 2024 |