Machine Learning Operations Engineer || Remote || no H1B at Remote, Remote, USA |
Email: [email protected] |
Hi Hope you are doing well Requirements: Proficiency in Linux administration, with a strong preference for candidates with deep expertise in Linux environments o Windows experience is acceptable, but a solid grasp of Linux is essential Demonstrated ability to install, modify, and provide support for Linux applications Familiarity with cluster management, particularly in negotiating resources across multiple computers simultaneously Proficiency in Slurm for job scheduling, with any prior experience being an advantage Competence in container management, including expertise with Docker for containerization, pushing, and pulling containers Knowledge of maintaining High-Performance Computing (HPC) systems, encompassing various components that make up this sophisticated infrastructure Desirable Skills Experience with JupyterHub is a plus Knowledge of the Bright software is highly desirable Typical Duties Collaborate with the AI team to customize the environment, ensuring it is optimized for AI development Work closely with the infrastructure team to configure and manage physical hardware and the underlying operating system Implement and manage partitioning on the supernode, allocating resources for different environments (Jupyter, Slurm, Linux shell, Docker containers, etc.) Provide support and administration for Kubernetes, aiding in the integration of various providers Continuously evolve processes and ways of working to maximize the platform's efficiency, ultimately reducing the need for external support Regards Vinay Chaudhary https://www.linkedin.com/in/vinay-chaudhary-550630245lipi=urn%3Ali%3Apage%3Ad_flagship3_profile_view_base_contact_details%3B7jYVgBdtS1afkDGgr6g2cQ%3D%3D Keywords: artificial intelligence information technology golang |
[email protected] View all |
Fri Sep 15 23:09:00 UTC 2023 |