Home

Monitoring Engineer, Remote at Atlanta, Georgia, USA
Email: [email protected]
From:

Navnish Kumar,

Stellent It

[email protected]

Reply to:   [email protected]

Monitoring Engineer

Location: Atlanta, Georgia( Remote)

Interview: Phone + Skype

Job Description

Principle Duties and Responsibilities:

Essential Functions:

Ensure the Infrastructure Tools environment is maintained in accordance with company standards and leading practices.

Lead and participate in Infrastructure Tools related engagements/meetings that have a strong business & technology focus .

Manage Infrastructure Tools incident resolution within the environment, facilitating collaboration among other infrastructure and business service support teams as required.

Serve as the Infrastructure Tools escalation point for major incidents, and escalate to the Mgr. of the TCC and/or appropriate Gulfstream stakeholders as required.

Ensure Infrastructure Tools root cause analysis is performed for any incident affecting GAC enterprise monitoring tools in accordance with Gulfstream policies and standards.

Collaborate with the Technology Command Center to evaluate and recommend tools and processes that enhance the stability and functionality of the monitoring environment, ensuring that any new technologies are compatible with existing Gulfstream architecture and standards.

Transform customer inputs and feedback into concept visualizations leveraging agile methodologies to develop and deliver iterative designs and enhance or streamline existing monitoring designs.

Practical experience in APM working with application & infrastructure teams to define/interpret monitoring/alerting requirements for managing end user experience, fault isolation, and proactive environment health management including alerting, analysis, and reporting.

This person should be an Expert in Application Performance Management & Systems Performance Management.

Ensure Infrastructure Tools supporting enterprise monitoring are maintained in accordance with company standards and leading practices.

Actively mentor technical skill development within the Enterprise Systems Monitoring (ESM) team.

Tools this person will Use

NetCool

ManageEngine

Foglight

Dynatrace

Others - ThousandEyes, Lakeside SysTrack .

Perform other duties as assigned.

Other Requirements:

Strong knowledge of IT infrastructure and underlying technologies i.e., operating systems (UNIX, Linux, and Windows), networks, storage, databases and application components.

Holistic view of enterprise solutions, including a sound appreciation of operational costs, security, performance engineering, application development and systems management.

Knowledge of the ServiceNow Tool Suite and administration and tool support.

Knowledge of leading-edge products/technology and industry in order to effectively participate on business unit and architectural boards.

Knowledge of local and remote data center design/management and cloud computing.

Work Service Now Tickets (Incidents and Requests).

Customers words - Heres what I need: 

Someone very familiar with DevOps principles that are applied to modern IT monitoring platforms covering the following 4 major monitoring categories (synthetic transactions, Infrastructure Health, APM and log file monitoring) like Dynatrace, AppDynamics, DataDog, Logic Monitor, Splunk, etc. This person should understand concepts like event correlation, incident ticket storms, self-healing and of course automation, automation, and automation.  The ideal lead will be a hands-on, teaching leader who does so by example and has a background in scripting, development and system administration.

ROLES AND RESPONSIBILITIES:

Provide system engineering for Enterprise Monitoring Systems (AppDynamics, DataDog, Splunk, Foglight, Thousand Eyes) including systems architecture, monitoring strategy, operational deployments, application design and maintenance/administration.

Engage with subject matter experts ranging from network to applications to define, deploy and maintain system and service monitors.

Engage Technical Command Center management to analyze current monitoring processes and use of tools identifying gaps and opportunities for innovation and automation.

Help build a strategic roadmap for the next 3 to 5 years.

Work with other IT departments to plan and implement new features, enhancements, and upgrades.

Document supporting policies, processes and procedures.

Provide training as needed to operations teams regarding alarm correlation and threshold setting.

Assists in the installation, maintenance, and general support of monitoring systems.

Routinely review monitoring systems and services to ensure stability and security.

Assist in interpretation of diagnostic data obtained from monitoring solutions.

Provide implementation support for custom monitoring requirements.

Create and test all monitoring scripts.

Manage the installation of new software releases and patch installs that resolves monitoring related software problems.

Provide planning and monitoring guidance to support teams.

Identify, diagnose, and resolve technical monitoring problems.

Provide Capacity, Performance and Availability reports for monitoring platforms.

Research/Design new monitors that meet the needs of the enterprise

Assist in building a team with strong skills and a can do attitude.

Required Technical Skills

Minimum 3+ years' system administration of Enterprise Monitoring Systems.

Minimum 3+ years of experience in Server Management products, i.e.: AppDynamics, Nagios, OpsManager.

Minimum 3+ years networking experience in an enterprise environment.

Minimum 3+ years of experience with Javascript, Python or other programming languages.

Minimum 3+ years working with both Windows and UNIX based systems in an enterprise environment, including advanced shell scripting.

Advanced knowledge of Enterprise Monitoring metrics, reporting, logging and best practices.

Experience with SNMP, TCP/IP and core LAN/WAN principles.

Ability to perform network traffic analysis using network capture tools.

Keywords: information technology
[email protected]
View all
Tue Jan 10 02:42:00 UTC 2023

To remove this job post send "job_kill 263999" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 8

Location: Atlanta, Georgia