Home

Need Senior SRE -Site Reliability Engineer || Santa Clara, CA|| Day-1 Onsite at Santa Clara, California, USA
Email: [email protected]
From:

Pavan,

softcom systems Inc

[email protected]

Reply to:   [email protected]

Hi,

Iam Pavan from Softcom Systems

Kindly respond to this requirement with updated profile

Details : 

Client :Infosys

Role: Senior SRE -Site Reliability Engineer

Location: Santa Clara, CA|| Day-1 Onsite

Type : Contract

Job Details:

Role Overview:

We are seeking a skilled and motivated Senior Site Reliability Engineer to join our dynamic support team.

The candidate will play a crucial role in ensuring the successful rollout and continuous operation of our clients Gaming Battle Stations across numerous retail stores in North America, and anticipated expansions across Europe.

Responsibilities and JD:

Design, develop, and maintain bare-metal server and/or docker configurations, automation scripts, tools, and images to provision and manage Linux cache server for the Gaming Battle Stations, and replicating the configuration ensuring consistent and reliable deployments across many locations.

Remote configuration and administration of servers at each retail location, including initial setup, network configuration, and ongoing maintenance.

Implement robust monitoring and alerting solutions to detect and resolve issues promptly, minimizing downtime and ensuring a seamless gaming experience for users.

Perform regular patches, updates, and security enhancements on Battle Station Servers to maintain optimal performance and mitigate vulnerabilities.

Work closely with software development teams to provide input on deployment and operational aspects, ensuring a smooth integration of new features and updates.

Collaborate with network and infrastructure teams to optimize network connectivity and ensure low-latency interactions for online gaming.

Troubleshoot and resolve complex technical issues related to the Gaming Battle Stations, both remotely and on-site when necessary.

Develop and maintain documentation for provisioning, deployment, and configuration procedures, troubleshooting guides, etc.

Participate in an on-call rotation to provide 24/7 support for critical incidents and urgent issues.

Continuously analyze system performance metrics, identify bottlenecks, and implement optimizations to enhance overall system efficiency.

Stay current with industry trends, emerging technologies, and best practices in Site Reliability Engineering and gaming systems administration.

Qualifications:

Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience).

Proven experience as a Senior Site Reliability Engineer or similar production role, preferably in high-availability, large-scale production environments.

Strong expertise in Linux systems administration, including strong command-line proficiency, package management, scripting, hardening, and performance tuning.

Strong expertise in virtualization, Docker, Kubernetes, etc. and high-performance containerized environments and networks (Ex. 10+ Gbit)

Strong proficiency with scripting languages such as Python, Bash, or similar for automation and tooling.

Experience with configuration management tools (e.g., Ansible, Puppet, Chef) and infrastructure-as-code principles.

Solid experience in managing hybrid cloud and on-premise infrastructure, full stack infrastructure administration and support in multiple environments and platforms.

Strong problem-solving skills and the ability to troubleshoot and solve complex issues under pressure.

Excellent communication skills and the ability to collaborate effectively with cross-functional teams.

Willingness to participate in an on-call rotation and provide support during non-standard hours.

Research and development experience and a passion for R&D is preferred, as this role will assist in the completion of full-scale automation and rollout into production.

Keywords: rlang California
[email protected]
View all
Wed Sep 13 18:56:00 UTC 2023

To remove this job post send "job_kill 632650" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 9

Location: Santa Clara, California