**Contract to Hire** Data Scientist :: Parsippany, NJ ( H1B / USC / GC ) at Parsippany, New Jersey, USA |
Email: [email protected] |
Hi , We have a position open for our client , please go through the job description below and share the Relative Profiles. Please share the profiles along with Candidate Full Name : VISA status : LinkedIn ID : Current Location : Please share 100% Genuine profiles , I assure immediate interviews , If you are sharing Fake Resumes for the sake of submission, please dont call me for any updates Job Title : Data Scientist Location : Parsippany, NJ Contract : CONTRACT TO HIRE H1B / USC / GC 1 0 + Years of profiles only please RESPONSIBILITIES Work closely with cross-functional teams, including product managers, data scientists and engineers to understand project requirements and objectives ensuring alignment with overall business goals. Build data ingestion framework and data pipelines to ingest unstructured and structured data from various data sources such as SharePoint, Confluence, Chat Bots, Jira, External Sites, etc. into our existing OneData platform. Design a scalable target state architecture for data processing-based on document content ( Data types may include, but are not limited to: XML, HTML, DOC, PDF, XLS, JPEG, TIFF, and PPT ) including PII/CII handling, policy-based hierarchy rules and Metadata tagging. Design, development, and deployment of optimal data pipelines including incremental data ingestion strategy by taking advantage of leading-edge technologies through experimentation and iterative refinement. Design and implement vector databases to efficiently store and retrieve high-dimensional vectors. Conducting research to stay up to date with the latest advancements in generative AI services and identify opportunities to integrate them into our products and services. Implement data quality and validation checks to ensure accuracy and consistency of data. Build automation that effectively and repeatably ensures quality, security, integrity, and maintainability of our solutions. Monitor and troubleshoot data pipeline performance, identifying and resolving bottlenecks and issues. Define and implement data access policies; implement and maintain data security measures and access policies for cloud storage buckets and vector databases. QUALIFICATIONS REQUIRED Bachelors degree in Engineering, Computer Science or a related field; Masters degree is a plus. 10+ years relevant industry and functional experience in Database and Cloud-based technologies Experience in working with Machine learning and AI concepts related to RAG architecture, LLMSs, embedding and data insertion into a Vector data store. Experience in building data ingestion pipelines for Structured and Unstructured data both for storage and optimal retrieval Experience working with Cloud data stores, noSQL, Graph and Vector databases. Proficiency with languages such as Python, SQL, and PySpark Experience working with Databricks and Snowflake technologies. Experience with relevant code repository and project tools such as GitHub, JIRA and Confluence Working experience with Continuous Integration & Continuous Deployment with hands-on expertise on Jenkins, Terraform, Splunk and Dynatrace . Highly innovative with aptitude for foresight, systems thinking and design thinking, with a bias towards simplifying processes. Detail oriented individual with strong analytical, problem-solving, and organizational skills Ability to clearly communicate to both technical and business teams. Thank You & Regards, Arun Kumar / Sr. US IT Recruiter 8801 Fast Park Drive, Ste.301 Raleigh, NC 27617 | Email:[email protected] www.maintec.com | www.maintectraining.com Linkedin: www.linkedin.com/in/arunkumarpasupula/ Keywords: artificial intelligence information technology golang green card Idaho New Jersey North Carolina |
[email protected] View all |
Fri Feb 09 03:16:00 UTC 2024 |