Data Engineer with ML Databricks At Remote 12+ years at Remote, Remote, USA |
Email: [email protected] |
Hi, Hope you are doing well!!! Role: Data Engineer with Data science with Databricks Location: Remote Contract H1B consultant LinkedIn link Experience: Minimum of 3 years of experience in data-centric roles with a strong focus on analytics, data science, or machine learning. Proven experience with feature engineering, particularly within the Databricks platform. Key Responsibilities : Position Overview: As a Feature Engineering Specialist with expertise in Databricks, you will be pivotal in enhancing data quality and extracting meaningful features from raw data to boost the performance of predictive models on the Databricks platform. Collaborating closely with data scientists and machine learning engineers, you will leverage your deep understanding of Databricks unified analytics platform to streamline workflows and implement advanced feature engineering techniques. Key Responsibilities: Data Analysis and Preprocessing on Databricks: Utilize Databricks for analyzing large datasets, identifying patterns, and correlations. Employ Databricks data handling capabilities to clean, normalize, and preprocess data efficiently. Address missing data, outliers, and anomalies using Databricks robust data processing tools. Feature Development in Databricks Environment: Design and engineer new features using Databricks collaborative notebooks and MLflow for tracking experiments. Innovate feature engineering strategies by leveraging Databricks Delta Lake for high-quality data storage. Apply feature selection and dimensionality reduction directly within the Databricks ecosystem to optimize model training. Collaboration and Strategy Implementation: Integrate feature engineering processes with Databricks collaborative and interactive workspace. Work closely with stakeholders to understand business objectives and leverage Databricks for scalable data solutions. Ensure seamless integration of feature engineering outputs with machine learning pipelines on Databricks. Testing and Optimization on Databricks: Leverage Databricks for setting up scalable testing frameworks to validate new features. Use Databricks capabilities to continuously refine features and optimize data pipelines for performance and efficiency. Documentation and Reporting: Document feature engineering processes and methodologies specifically tailored for Databricks. Utilize Databricks collaborative notebooks to share findings and reports with stakeholders effectively. Minimum Requirements: Education: Bachelors degree in Computer Science, Statistics, Mathematics, or related field. Masters degree preferred. Experience: Minimum of 3 years of experience in data-centric roles with a strong focus on analytics, data science, or machine learning. Proven experience with feature engineering, particularly within the Databricks platform. Technical Skills: Proficiency in Python, SQL, and Scala, with strong programming skills in at least one of these languages. Understanding of the Databricks platform, including Delta Lake, Databricks SQL, and MLflow. Familiarity with Apache Spark and its integration within Databricks. Analytical Skills: Robust analytical and problem-solving skills with a solid foundation in statistical methods. Expertise in translating business issues into data-driven solutions using Databricks. Communication Skills: Excellent verbal and written communication abilities. Proficient in explaining complex technical details and insights, leveraging Databricks collaborative features. Teamwork and Leadership: Proven ability to work in and lead cross-functional teams. Experience in mentoring and guiding teams in a Databricks-centric environment. Thanks & Regards, Rahul Kumar Crox Consulting Inc. Email:- [email protected] LinkedIn:- https://www.linkedin.com/in/rahul-kumar-7ab895a6/ www.thecroxgroup.com ______________________________________________________________________ RPO Solutions || Staffing Solutions || IT Solutions || ______________________________________________________________________ -- Keywords: information technology Data Engineer with ML Databricks At Remote 12+ years [email protected] |
[email protected] View all |
Thu Dec 19 01:20:00 UTC 2024 |