Home

Splunk engineer with Azure tools: remote: 60 per hr at Remote, Remote, USA
Email: [email protected]
From:

Jay,

Brillius

[email protected]

Reply to:   [email protected]

As a monitoring specialist, you are responsible to maintain 100% uptime of tool and mission critical applications by meeting Service Level Agreements (SLA) and process compliance. You will perform business impact analysis and risk assessments to reduce the likelihood of significant service outage or disasters.

Drives reliability into systems across the enterprise taking a holistic view of system health.

Primary Responsibilities

Maintenance and deployment of Splunk / Dynatrace infrastructure and integrating alerts with ticketing system to maintain minimal service outages.

Perform installation, implementation, customization, operation, recovery, and performance tuning.

Understand customer requirement and develop customize scripts to get monitoring data into system.

Process knowledge on observing, evaluating, and managing the health, performance, and availability of cloud-based applications, architecture, and services. 

Monitor data flowing through multiple locations via various devices. 

Get visibility into user, file, and application behavior to improve the performance of their cloud environment. 

Identify potential vulnerabilities before they become a significant issue.

Prepare security audit reports for compliance purposes.

Scale observability capabilities as architecture grows.

Use monitoring insight to make informed engineering and product decisions.

Take daily, weekly, and monthly backup of your historical data, tool configuration.

Utilize scripting & development skills to reduce operational man-hours and reduce time to restore for incidents. This should include implementing practices that support Agile and Continuous Integration/Continuous Delivery (CI/CD) principles. Provisioning both day to day operations and automation using tools, e.g. Ansible, Jenkins, python, bash, PowerShell.

Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.

Work with both application and operations teams with a focus on developing the self-healing system for alerts triggered in monitoring tool.

Should be highly proactive with a keen focus on improving uptime availability of mission critical application. Leverage modern tools and instrumentation to drive reliability and meet Service Level Objectives (SLOs).

Utilize monitoring tools to track performance and availability of applications and determine trends. Ability to coach others in learning this technology.

Utilize log forwarding technology to troubleshoot problems and identify trends. Ability to coach others in learning this technology.

Encourage collaboration and cohesiveness within their team and drive teams outside their immediate organization in implementing SRE principles.

Required Technical and Professional Expertise

Enabling monitoring for various technologies like OS, Database including MSSQL, MySQL, Oracle, db2, and Middleware applications, Storage, venter, backup, hardware and SNMP monitoring.

Doing health checks of the systems to make sure 100% availability of Splunk / Dynatrace / Zabbix applications.

Experience in implementing Splunk / Dynatrace core in different projects along with integration between Splunk / Dynatrace and Ticketing tool.

Proficient in installation and configuration of Splunk / Dynatrace / Zabbix monitoring tool.

Working knowledge of migration from different monitoring tools to Zabbix along with up gradation of Zabbix version to the latest version

Working knowledge of migration from different monitoring tools along with up gradation to old version to the latest version

Hands-on trouble shooting technical issues an identify solutions.

Strong background in Linux/Unix, windows administration

Basic idea on Networking concept and experience with Bash, PowerShell, Python.

Experience using a wide variety of open-source technologies and cloud services.

Experience with automation orchestration and configuration management tools such as Ansible, Jenkins etc.

Primary Skillset

Splunk, Azure Monitoring Tool, Splunk O

Basic knowledge in any one of the middleware application Server (Jboss/WebLogic/WebSphere)

Should be able to capture/analyze logs with a basic level of troubleshooting.

Basic understanding and troubleshooting skills including file/disk/process/network management.

Hands-on experience in interpreting and writing SQL queries.

Knowledge of ServiceNow ITSM tool

Secondary Skillset

Dynatrace, Zabbix, Graphana, Knowledge on MSSQL, MySQL Database

Knowledge of ITSM tools such as Jira, BMC Remedy, HPSM, , Service Desk, etc.

Should be able to capture/analyze logs with a basic level of troubleshooting.

Hands-on experience of shell scripting for automations purposes

Keywords: continuous integration continuous deployment
[email protected]
View all
Wed Jan 10 22:47:00 UTC 2024

To remove this job post send "job_kill 1002309" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 7

Location: ,