Data Scientist (location: client will disclose) at Remote, USA |
Email: [email protected] |
From: Devyani Kumari, Absolute IT [email protected]
Reply to: [email protected]

Role: Data Scientist
Location: Client will disclose
Duration: 12 months
Experience: 10 years

The candidate must be a US citizen (USC) or green card (GC) holder and willing to relocate to any US state, as the client project requires.

Job Description:

Position Summary
We are looking for an experienced Senior Data Scientist with strong technical skills. The candidate will have prior experience applying deep learning in the Document AI and NLP domains, strong software engineering skills, and experience with data-driven decision making. The candidate is passionate about transforming data, deriving insights, and building solutions. This is an ideal opportunity to become part of an innovative and energetic team that develops analytical solutions to advance oncology care.

Key Responsibilities:

Business Drivers
Be a strong team player who collaborates with internal stakeholders, product owners, and engineering to understand business needs and devise solutions.
Work closely with other data scientists and business partners to create solutions that apply machine learning algorithms to build models on existing data platforms. The Data Scientist role is highly collaborative.
Participate in the full lifecycle of building analytic data solutions, from data visualization and feature engineering through modeling, deployment, and monitoring.
Participate in code reviews and employ software engineering best practices.
Communicate the process, requirements, assumptions, and caveats of advanced ML and NLP concepts and deliverables in lay language to non-technical business leaders.

Technical Responsibilities
Develop data-driven AI solutions for Document AI use cases such as information extraction and document classification from healthcare documents.
Fine-tune existing state-of-the-art pre-trained transformer models, such as Donut, on our data; use large language models such as GPT-based models; and build deep learning models targeted at multimodal content understanding.
Use cloud-based AI services such as Azure Form Recognizer by developing code to interface with their API and/or use their SDK.
Build data extraction, transformation, and post-processing pipelines using Python and Spark.
Design, implement, deploy, and maintain deep learning and ML models using cloud technologies (e.g., Azure Databricks).
Provide mentorship and guidance to junior team members in areas of technical and professional development.

This description is general in nature and is not intended to be an exhaustive list of all responsibilities. Other duties may be assigned as needed to meet company goals.

Typical Minimum Requirements
Typically requires 6+ years of industry experience in ML, deep learning, and/or data science roles.

Critical Skills and Experience
Strong programming abilities in Python.
Solid background in algorithms, data structures, and object-oriented programming.
Prior experience in training and/or fine-tuning large pre-trained deep learning models.
Experience building information extraction models for documents that are scanned images or PDFs, using deep learning, multimodal learning, LLMs, and NLP.
Experience using statistical, machine learning, deep learning, and data visualization libraries such as Pandas, Scikit-Learn, NumPy, Matplotlib, Keras, TensorFlow, and/or PyTorch.
Experience with distributed computing using Spark.
Experience using OCR for layout and information extraction.
Experience using cloud platforms such as Amazon Web Services, Microsoft Azure, or Google Cloud Platform.
Theoretical understanding of advances in deep learning in computer vision and NLP, including transformer architecture, LLMs, zero-shot learning, few-shot learning, and multimodal learning.
Experience with deployment and maintenance in a production environment.
Excellent verbal communication, including the ability to articulate complex concepts to both technical and non-technical audiences.

Education:
A BS/MS/Ph.D. degree in a quantitative field such as Computer Science, Machine Learning, Mathematics, Statistics, or another related field (Master's degree or higher preferred).

Keywords: artificial intelligence, machine learning, information technology, green card, Microsoft |
Wed Dec 06 23:15:00 UTC 2023 |