| Monitoring and Alerting Engineer || F2F in Texas || DevOps at Fort Worth, Texas, USA |
| Email: [email protected] |
|
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=2095744&uid=

From: Rajeev, Tek Inspirations LLC [email protected]
Reply to: [email protected]

Job Title: Monitoring and Alerting Engineer
Location: Fort Worth, TX (Hybrid, 4 days onsite)
Duration: 6+ months
Mode of Interview: 1 Zoom interview, followed by a final onsite interview

Position Overview:
We are looking for a Monitoring and Alerting Engineer to join our team. This is a forward-facing engineering role that involves working closely with the application team to ensure continuous monitoring and management of critical IT systems and applications. The ideal candidate has hands-on experience with Dynatrace and AWS CloudWatch, as well as a strong background in alerting systems, incident response, and performance optimization.

Key Responsibilities:
System Monitoring: Implement and maintain monitoring solutions to track the health and availability of IT systems, applications, and networks.
Alert Management: Configure and manage alert systems to notify the team of any anomalies, failures, or performance issues.
Incident Response: Work with support and operations teams to analyze, resolve, and lead event resolution processes during incidents and outages.
Root Cause Analysis: Investigate the root causes of incidents and implement corrective actions to prevent recurrence.
Optimization: Identify opportunities for system optimization based on data analysis and trend identification.
Tool Evaluation and Integration: Evaluate and recommend new monitoring and alerting tools to improve monitoring capabilities.
Documentation and Reporting: Develop and maintain documentation on monitoring configurations, incident reports, and performance metrics.
Collaboration: Work closely with application, infrastructure, and DevOps teams to ensure smooth operations and efficient incident handling.

Required Skills and Qualifications:
Proficiency in Dynatrace, CloudWatch, Datadog, or Splunk.
Solid understanding of IT infrastructure, including servers, networks, databases, and cloud environments.
Experience with incident, problem, and change management processes.
Strong troubleshooting and problem-solving abilities.
Excellent communication and collaboration skills.
Familiarity with ITIL best practices and service management frameworks.
Flexibility to support after-hours and weekend work, as required.

Preferred Qualifications:
Bachelor's degree in Computer Science, Information Systems, or Engineering.
Experience with distributed systems and scripting/programming (e.g., Python, Node.js, Ruby, Perl, Bash).
Experience with ServiceNow.
Familiarity with DDU and DEM licensing.
Knowledge of session replay.

Work Environment & Schedule:
Hybrid position, requiring 4 days per week onsite.
Flexibility for after-hours/weekend work, including periodic Saturday or Sunday shifts.
Availability for on-call support is necessary.
Operating in a 24/7 environment.

Additional Information:
This role is a backfill position.
Zoom virtual interview required, followed by an onsite interview.

Best regards,
Rajeev Kharwar
Sr. Technical Recruiter | DevOps Specialist
TEK Inspirations LLC
13573 Tabasco Cat Trail, Frisco, TX 75035
Desk: 469-393-0216 | Email: [email protected]
WhatsApp: 752-589-4499 | LinkedIn: linkedin.com/in/predestined-rk |
| 01:37 AM 21-Jan-25 |