Data Engineer at Bethlehem, Pennsylvania, USA |
Email: [email protected] |
From: Vikrama Rao, ValiantIQ INC [email protected]
Reply to: [email protected]

Title: Data Engineer
Client: Guardian Life Insurance
Location: Bethlehem, PA (Hybrid)
Duration: 12+ Months
Visa: GC or USC

Data Engineer: strong Python, SQL, and Spark along with ML/AI experience.

Location: Bethlehem, PA. The resource will be required to work onsite a minimum of 3 days per week in the Bethlehem, PA office. Local candidates preferred. If your candidate is not local to Bethlehem, they will be required to relocate and work onsite from Day 1.

Duration: Through year end (12/31/24), with the possibility of an extension.

You will:
- Collaborate with data scientists and analysts to understand data requirements and translate them into scalable, high-performance data pipeline solutions.
- Support data discovery and data preparation for model development.
- Perform detailed analysis of raw data sources by applying business context, and collaborate with cross-functional teams to transform raw data into curated and certified data assets for ML and BI use cases.
- Collaborate with the data science and data engineering teams to build scalable and reproducible machine learning pipelines for training and inference.
- Implement machine learning models into operations and processes via batch, streaming, and API methods.
- Monitor and troubleshoot data pipeline performance, identifying and resolving bottlenecks and issues.
- Develop, test, and maintain robust tools, frameworks, and libraries that standardize and streamline the data and machine learning lifecycle.
- Contribute to developing and maintaining an end-to-end MLOps lifecycle to automate the development and delivery of machine learning solutions.
- Implement a robust monitoring framework for model performance.
- Collaborate with cross-functional teams across Data Science, Data Engineering, business units, and various IT teams.
- Create and maintain effective documentation for projects and practices, ensuring transparency and effective team communication.

You have:
- Bachelor's or master's degree in Computer Science, Data Science, Engineering, or a related field, with 5+ years of experience.
- 4+ years of experience working with Python, SQL, PySpark, and bash scripts.
- Proficiency in the software development lifecycle and software engineering practices.
- 2+ years of hands-on experience using the Databricks platform.
- 3+ years of hands-on experience operationalizing machine learning solutions that are used in live production processes.
- 2+ years of experience and proficiency in API development using the FastAPI framework, and familiarity with containerization technologies like Docker or Kubernetes.
- 3+ years of experience developing and maintaining robust data pipelines whose data is used by data scientists to build ML models.
- 3+ years of experience working with cloud data warehousing platforms (Redshift, Snowflake, Databricks SQL, or equivalent) and with distributed frameworks like Spark.
- Solid understanding of the machine learning lifecycle, data mining, and ETL techniques.
- Experience with machine learning frameworks (like Keras or PyTorch) and libraries (like scikit-learn, xgboost).
- Hands-on experience building and maintaining tools and libraries that have been used by multiple teams across the organization.
- Proficiency in understanding and incorporating software engineering principles in the design and development process.
- Hands-on experience with CI/CD tools (e.g., Jenkins or equivalent), version control (GitHub, Bitbucket), and orchestration (Airflow, Prefect, or equivalent).
- Excellent communication skills and the ability to work and collaborate with cross-functional teams across technology and business.

Thanks & Regards,
Vikrama Rao
Recruitment Executive, ValiantIQ Inc.
"Searching Best Minds"
Email: [email protected]
P. 704-249-2259
F. (302) 482-3672

Disclaimer: If you are not interested in receiving our e-mails, please reply with "REMOVE" in the subject line for automatic removal.
Please also list all the e-mail addresses to be removed, including any addresses that might be forwarding these e-mails to you. We are sorry for the inconvenience.

Keywords: continuous integration, continuous deployment, artificial intelligence, machine learning, business intelligence, information technology, green card, Pennsylvania |
Wed Jun 19 05:19:00 UTC 2024 |