Urgent Need SRE / Reliability Engineer/ Hybrid in Plano TX /F2F interview / Need local at Plano, Texas, USA |
Email: [email protected] |
Hi Sir/Madam, Hi , I just came across your profile over a job portal. Please find below the JD and let me know if youre fine with it Role: Reliability Engineer Location: Hybrid or Plano, TX(Local candidate needs to be go for F2F interview Contract Responsible for ensuring the availability, performance, and reliability of our cloud-based infrastructure and services. The primary focus of this role is designing, implementing, and managing robust monitoring and alerting systems to proactively identify issues and timely incident response. This resource will work closely with the CTP Platform Engineering and Development teams to optimize services and maintain service uptime. Duties include: Develop and maintain comprehensive monitoring solutions for cloud-based services and applications. Configure monitoring tools and systems to collect relevant metrics, logs, and traces. Create custom monitoring dashboards and reports using DataDog or other tools, to provide real-time insights into system performance and health. Continuously monitor the cloud infrastructure's performance and capacity, anticipating and addressing potential scalability issues. Proactively suggest and implement improvements to enhance the system's reliability, resilience, and fault tolerance. Work on automating tasks to streamline operational processes and reduce manual intervention. Collaborate with cross-functional teams to investigate and resolve critical incidents, ensuring minimal impact on end-users. Work with Problem Management team to complete post-mortem analysis of incidents to identify root causes and implement preventive measures. Ideal Qualifications: 3+ years experience working with cloud platforms and services (AWS, Azure, GCP, etc.) in a production environment. Solid understanding of monitoring and logging tools, such as Prometheus, Grafana, ELK stack, Splunk, etc. Experience with infrastructure as code (IaC) tools, like Terraform, CloudFormation, or Ansible. Strong scripting and automation skills (e.g., Python, Bash) to facilitate operational tasks. Knowledge of containerization technologies (Docker, Kubernetes) and microservices architecture. Familiarity with DevOps practices and Agile methodologies. I look forward to hear from you and work with you at the earliest. Regards, Krishnakant Tripathi Team Recruitment Work: (201) 425-1319 Ext.978 Email: [email protected] Address: 270 Davidson Ave Suite 704, Somerset, NJ, 08873 Website: www.net2source.com Microsoft Gold Certified Partner | Cisco Certified Premier Partner | Oracle Gold Partner | IBM Business Partner | ISO 9001:2008 Certified Company | NASSCOM Certified Company | E-Verified Employer here Keywords: information technology golang New Jersey Texas |
[email protected] View all |
Wed Aug 16 03:29:00 UTC 2023 |