Java Application Support Engineer Need 10+ Years at Remote, Remote, USA |
Email: [email protected] |
From: Patricia, W3Global [email protected] Reply to: [email protected] This is a support role and needs people who have support experience in the current project. The Application Support Engineer is responsible for the overall incident & problem management process, including triaging, troubleshooting, providing resolution/workarounds, applying hot patches, performing root cause analysis, deploying fixes by working with the engineering team. This role will also drive interactions with business & internal stakeholders and deal with incident/business communications. S(he) performs periodic analysis on the incidents reported to identify improvements to address system pain points and automation opportunities. Technical Skills Minimum of 4+ years of hands-on experience working with Java Application support operations and batch schedulers/pipeline with strong technical competency in Java and J2EE technologies Good exposure to AWS services like Lambdas, CloudWatch, CloudTrail, EC2, S3, ECS, EKS, etc., Good exposure to frameworks such as Spring MVC, Springboot, NodeJs, Log4J Good exposure to version control such as GitHub. Good exposure to Kubernetes services and Docker containers Good experience with APM & observability tools and system monitoring platforms such as CloudWatch, Datadog, Prometheus, Grafana is huge plus Strong Analytical skills with strong verbal and written communications skills is required Strong knowledge in production technical architecture (micro services), deployment architecture and design Good knowledge on ITIL and ITSM processes (Event Management, Incident Management, Problem Management and Change Management) Good to have exposure to MySQL and No SQL databases like DynamoDB Experience in working IDEs like Eclipse, STS, IntelliJ is required Experience in working CI/CD platforms like Jenkins, Spinnaker is required Python and Terraform experience will be a huge plus Responsibilities: This position has the primary responsibility of providing production support for APIs built in microservice architecture, analyzing, troubleshooting, and resolving production incidents, log research, recommending code fixes as well as coordinating resolution with the engineering, Database, Infrastructure, and implementation teams Be a first responder, manage day to day daily health checks, regularly communicate the status of the incidents and/or any issues through appropriate communication channel. Primary production support for the APIs running in production environment and ensuring that the incidents addressed timely and meeting the business reporting SLAs Monitor production activities/processes to ensure timely and effective reporting, tracking, follow-up, and communication of incidents to technical resources and impacted teams Focus on processes around the availability, SLA and efficiency of core data services, batch jobs and applications Developing / fine-tuning monitoring tools and log analysis tools to manage operations Suggest best practices in application logging, monitoring, intelligent alerting, and automated self-healing Drive and participate in root cause analysis and preventative actions to avoid recurring incidents Identify and participate in the buildout of automation to prevent problem recurrence, implement continuous process improvement with the goal of automating response to all non-exceptional service conditions Identify and implement operational best practices and process improvements within the following functional areas: Incident Management, Problem management, Planned and unplanned Outage/Event Management, Technology Refresh, Operational Reporting, Tooling, and Application Support Develop and foster a positive relationship with team members, team leads and business partners Encourage and participate in knowledge sharing as necessary Develop and update documentation, departmental technical procedures, and runbooks Be willing to work non-standard business hours on an on-call basis in the follow-the-sun support model Good communication, listening and interpersonal skills Keywords: continuous integration continuous deployment sthree |
[email protected] View all |
Mon Oct 09 19:39:00 UTC 2023 |