Hiring Now :: GEN AI Data Platform Engineer :: Remote at Remote, Remote, USA |
Email: [email protected] |
From: Amit Kumar Pradhan, Vyze Inc. [email protected] Reply to: [email protected] Job Description - Tittle: GEN AI Data Platform Engineer (BackFill) Visa: All MOI: Skype The person would be working mainly on building application pipelines with AWS services, working with LLM models and then deploying artifacts using CICD pipelines built with Jenkins and Terraform. Experience with ChatGPT, Gemini,CoPilot, ChatPDF and customizations of Retrieval Augmented Generation Pipelines. This role requires versatility and expertise across a wide range of skills. Someone with a diverse background/experience and an engineer at heart will fit into this role seamlessly. The Generative AI team is comprised of a multiple cross-functional group that works in unison and ensures a sound move from our research activities to scalable solutions. You will collaborate closely with our cloud, security, infrastructure, enterprise architecture and data science team to conceive and execute essential functionalities. Responsibilities: Design and build fault-tolerant infrastructure to support the Generative AI Ref architecture (RAG, Summarization, Agent etc). Ensure code is delivered without vulnerabilities by enforcing engineering practices, code scanning, etc. Build and maintain IAC (terraform/Cloud Formation), CICD (Jenkins) scripts, CodePipeline, uDeploy, & GitHub Actions. Partner with our shared service teams like Architecture, Cloud, Security, etc to design and implement platform solutions. Collaborate with the DS team to develop a self-service internal developer Generative AI platform. Design and build the Data ingestion pipeline for Finetuning LLM Models. Create templates (Architecture As Code) implementing Ref architecture applications topology. Build a feedback system using HITL for Supervised fine tuning. Qualifications: Bachelor's degree in Computer Science, Computer Engineering, or a technical field. 4+ years of experience with AWS cloud. At least 8 years of experience designing and building data-intensive solutions using distributed computing. 8+ years building and shipping software and/or platform infrastructure solutions for enterprises. Experience with CI/CD pipelines, Automated Testing, Automated Deployments, Agile methodologies, Unit Testing and Integration Testing tools. Experience with building scalable serverless application (real-time / batch) on AWS stack (Lambda + step function) Knowledge of distributed NoSQL database systems. Experience with data engineering,and conversation UX is a plus. Experience with HPCs, vector embedding, and Hybrid/Semantic search technologies. Experience with AWS OpenSearch, Step/Lambda Functions, API Gateways , ECS/Docker is a plus. Proficiency in customization techniques across various stages of the RAG pipeline, including model fine-tuning, retrieval re-ranking, and hierarchical navigable small-world graph (HNSW) is a plus. Strong proficiency in embeddings, ANN/KNN, vector stores, database optimization, & performance tuning. Extensive programming experience with Python, Java. Experience with LLM orchestration frameworks like Langchain, LlamaIndex etc. Foundational understanding of Chat GPT, Gemini, Copilot, or other Open AI tools, NLP, and Deep Learning. Excellent problem-solving skills and the ability to work in a collaborative team environment. Excellent communication skills Keywords: continuous integration continuous deployment artificial intelligence user experience Hiring Now :: GEN AI Data Platform Engineer :: Remote [email protected] |
[email protected] View all |
Thu Jul 04 04:10:00 UTC 2024 |