Urgent need Data Scientist || Hybrid in NC || Need only USC H4 EAD OR OPT visa at Remote, Remote, USA |
Email: rajeev@vyzeinc.com |
https://jobs.nvoids.com/job_details.jsp?id=2118175&uid= From: Rajeev, Vyzeinc rajeev@vyzeinc.com Reply to: rajeev@vyzeinc.com Data Scientist Location: Hybrid in Raleigh, NC open to relocation (I would focus on relocation more so than local) Duration: 12-month CTH Work Auth: OPT/H4/Citizen MOI: 2 round of Video Interview Need proper Linkedin and 2 round of PV Video screening.!! The project: They are processing extreme amount of data Process 15,000 docs per minute. THIS CANDIDATE NEEDS TO HAVE WORKED WITH AROUND THIS AMOUNT OF DATA (Duncan was working with 8 data points per DAY). Trying to process and extract enrichments from the documents They collect news from across the world, take those in real time and run an LP algorithm to do things like identity companies mentioned, product mentioned, etc. then link it to say this company was mentioned and link it to their records, or a person. Sentiment analysis - its complex. They want to know generally if an article is positive or negative about a certain person or company. It may be negative about one company and positive about another They want to be able to extract additional things - when did an event occur Last month When was that Need to be able to categorize based on the timeline. Skills: Minimum 5 years of experience using NLP tools and methods such as OpenNLP, Stanford NLP, LDA, Gensim, spaCy Natural Language Processing (NLP): Relevant experience with NLP tools and techniques. Proficiency in Python: Familiarity with Python and relevant libraries. Nice to have: Databricks: Experience with Databricks or similar platforms. Pyspark: Hands-on experience. Snowflake: Knowledge of this platform. spaCy and Gensim: Leveraging spaCy for Named Entity Recognition (NER) and Gensim for LDA topic modeling to provide contextual insights. PyTorch and Numpy: Proficiency in these libraries. Large Language Models (LLMs): Strong focus on LLMs and fine-tuning expertise. Cloud: Cloud experience is a bonus. ML Ops: Exposure to ML Ops tools and practices. Feedback of Interviews: Candidate 1: While this candidates resum looked really good, his data science knowledge just was not there. He was not able to answer any of our questions very well, and in fact always just went back to what he knows, which is a RAG-enabled LLM chat bot that hes working on. He has some background in MLOps as well. Hes not a good candidate for us. Candidate 2: Answered a lot of technical questions, many correctly, however these were very fundamental, basic questions, and he was often unsure. Candidate was unable to go deep and while showing as Sr he was the middle of his current org Candidate 3: They liked moving forward to next interview.. Regards, Rajeev | Vyze Inc. 25179 Methley Plum Place, Aldie, VA 20105 Email: rajeev@vyzeinc.com Keywords: machine learning information technology golang North Carolina Virginia Urgent need Data Scientist || Hybrid in NC || Need only USC H4 EAD OR OPT visa rajeev@vyzeinc.com https://jobs.nvoids.com/job_details.jsp?id=2118175&uid= |
rajeev@vyzeinc.com View All |
03:43 AM 28-Jan-25 |