Onsite Contract Position for Platform Ops Enterprise Open AI GPT Engineer in Bellevue, WA at Bellevue, Washington, USA |
Email: [email protected] |
From: Gaurav Sharma, USG Inc. [email protected] Reply to: [email protected] Hi, Hope you are doing great. Please go through the job description given below and if you are interested do share an updated word copy of your resume and best time to reach you over the phone. Position: Platform Ops Enterprise Open AI GPT Engineer Locations: Bellevue, WA Duration: Contract Job Description: We are seeking a highly skilled and motivated Platform Ops Engineer to manage and optimize the operational performance of Open AI GPT and related AI platforms across our enterprise. The role focuses on ensuring efficient, scalable, and secure operations for AI-based products and services, including large language models (LLMs) and custom GPT solutions. The ideal candidate will have a blend of expertise in AI/ML model operations, DevOps, infrastructure management, and strong problem-solving skills to support mission-critical AI applications. Key Responsibilities: 1. AI/LLM Operations & Monitoring: Manage the day-to-day operations of Open AI GPT models and other AI/ML platforms. Implement automated monitoring and alerting for model performance, drift, and infrastructure health. Ensure high availability, reliability, and scalability of deployed GPT models across the enterprise. Optimize resource allocation and scaling for large model deployments, ensuring cost-effectiveness. 2. Automation & CI/CD Pipelines: Design and maintain automated CI/CD pipelines for rapid deployment of AI/ML models. Collaborate with data science and engineering teams to streamline model retraining and updates. Integrate MLOps tools and platforms (e.g., Kubeflow, MLflow, or other AI orchestration tools). 3. Security & Compliance: Implement and manage security policies around data privacy, model access, and infrastructure security. Ensure AI platforms adhere to enterprise-level compliance and governance standards. Identify and mitigate risks related to AI model vulnerabilities and data usage. 4. Infrastructure Management: Administer cloud-based infrastructure (e.g., Azure,) used for AI/ML model deployment. Handle model orchestration, scaling, and optimization in containerized environments (Kubernetes, Docker). Support hybrid cloud/on-prem infrastructure setups where required. 5. Collaboration & Stakeholder Management: Work closely with data scientists, AI engineers, and product teams to align AI Ops activities with business goals. Serve as the central point of contact for troubleshooting AI-related issues, providing root-cause analysis, and addressing performance bottlenecks. Document operational workflows, best practices, and post-mortem analyses for continuous improvement. 6. Proactive Issue Resolution: Use predictive analytics and anomaly detection techniques to prevent AI platform issues before they impact the business. Lead incident management for AI platform disruptions and resolve operational issues in a timely manner. Experience: 5+ years of experience in AI Ops, MLOps, DevOps, or platform operations. Proven expertise with AI/ML platforms, especially Open AI GPT, other LLMs, or enterprise-grade AI services. Technical Expertise: Hands-on experience with cloud platforms Preferably Azure for AI/ML deployments. Proficiency with AI frameworks and libraries (TensorFlow, PyTorch, etc.). Experience with CI/CD tools (Jenkins, GitLab, CircleCI) and infrastructure-as-code (Terraform, Ansible). Familiarity with containerization (Docker, Kubernetes) and orchestration tools. Understanding of AI model lifecycle management, versioning, and governance. Skills: Strong scripting/programming skills (Python, Bash, etc.). Analytical and problem-solving mindset with the ability to address complex operational issues. Excellent communication skills to engage with cross-functional teams and present solutions to stakeholders. Experience in managing high-performance, distributed systems. Thanks with regards, Gaurav Sharma | Sr. Technical Recruiter Email ID: [email protected] Keywords: continuous integration continuous deployment artificial intelligence machine learning golang Idaho Kansas Washington Onsite Contract Position for Platform Ops Enterprise Open AI GPT Engineer in Bellevue, WA [email protected] |
[email protected] View all |
Tue Nov 05 20:02:00 UTC 2024 |