Multiple Remote Roles for Remote Data Scientist at Remote, Remote, USA |
Email: [email protected] |
From: Syed, TechStar Group [email protected] Reply to: [email protected] Title: Sr Data Scientist Location: Remote Duration: Long Term The resource will come in having experience in modeling work at a production level, this resource will not be doing any research-related type of work. System understanding + modeling deliverables are key stills for this role. This person should have a sense of things they build in preparation for production and be able to articulate that Deliverables: Establishing a single, end-to-end synaptic-based model (for evaluation) and be prepared to hook it up for model deployment for online testing Skills Required: Dive into the data to understand its structure, volume, and any existing preprocessing. This may involve: Data Cleaning: Handling missing values, removing duplicates, and standardizing formats. Exploratory Data Analysis (EDA): Analyzing patterns and distributions, assessing feature relevance, and identifying potential biases. Familiarity with data nuances and readiness to create initial embeddings. Generate embeddings, likely using pre-trained or fine-tuned language models, for candidate retrieval data. Embeddings for retrieval and have evaluation results, evaluating results of model and have that artifact ready for deployment understand the data and build indexes SQL- programming language - required understand Java Programming languages such as Python Code versioning software (Git) Machine learning Deep learning (including large language models and/or computer vision) Data pipeline engineering Model deployment /development ( write a model from scratch, infrastructure pipelining Nice to have: Some experience in search (Lucene) some experience with Synaptic-based Language Modeling and Vector Databases (VectorDB) Keywords: information technology Multiple Remote Roles for Remote Data Scientist [email protected] |
[email protected] View all |
Wed Nov 13 21:26:00 UTC 2024 |