Site Reliability Engineer F2F at Atlanta, Georgia, USA
Email: [email protected]
From: Farha Khan, Tek Inspirations LLC [email protected]
Reply to: [email protected]

Job Description

Title: Site Reliability Engineer
Job Type: Hybrid
Location: Atlanta, GA (Local Must)
MOI: Skype + F2F

Candidates need to have:

- Manage and optimize data streaming and API components in OpenShift (On-Premise) and AWS.
- Review APIs and processes to optimize response times across various components.
- Automate testing (including data quality checks), production delivery, and deployments.
- Develop integrations between On-Premise and AWS applications and third-party tools (e.g., ServiceNow, VersionOne, Sumo).
- Collaborate with teams to create SLIs/SLOs.
- Monitor performance, troubleshoot platform issues, and document findings from root cause analysis.
- Evolve cloud infrastructure by experimenting with emerging technologies and creating prototypes.
- Design and maintain CI/CD pipelines for deploying APIs and Data Process Jobs.
- Develop monitoring and alerting solutions for proactive issue resolution.
- Ensure data integrity and access control using AWS security tools (HSM, IAM, etc.).
- Monitor AWS billing, generate reports, and implement cost optimization strategies.
- Work with enterprise security architects to design security measures, encryption, key management, and address vulnerabilities.
- Monitor capacity and performance; collaborate on designing elastic infrastructure to manage irregular user traffic.
- Develop backup strategies and implement disaster recovery solutions.
- Provide input on continuous improvement for design, performance, and security enhancements.
- In-depth knowledge of AWS cloud platforms.
- Expertise in automation, scripting, and monitoring using tools like OpenShift, CloudFormation, Terraform, Ansible, Shell, and Python.
- Strong understanding of infrastructure layers: Linux OS, virtualization platforms, networking, firewalls, load balancers, and monitoring tools.
- Experience managing end-to-end operations for enterprise systems, including issue resolution for mission-critical applications.
- Proficiency with CI/CD tools (GitLab, GitHub, Jenkins, Maven, Gradle, Nexus) and software release management.

Key Responsibilities:

- Lead cloud and big data initiatives to accelerate technology adoption.
- Serve as a technical leader in implementing new services and features.
- Support highly available, business-critical applications across on-premise and AWS environments.
- Act as the escalation point for complex platform issues.
- Automate, operationalize, and improve DevOps/QA processes through CI/CD tools and practices.
- Troubleshoot and resolve infrastructure and application issues without being constrained by traditional approaches.

Keywords: continuous integration, continuous deployment, quality analyst, Georgia
Wed Oct 30 21:35:00 UTC 2024