Home

Azure Datanricks Engineer - Rate 40$ (need OPT Candidates) at Remote, Remote, USA
Email: rohitk@insourcetechsolutions.com
https://jobs.nvoids.com/job_details.jsp?id=2303310&uid=
Job Title: Azure Datanricks Engineer - Rate 40$

Location: Remote

Job Description:
We are seeking an experienced Databricks Data Engineer to build and maintain robust, scalable data pipelines and workflows. The ideal candidate will have strong knowledge of Databricks, cloud platforms (Azure), and big data processing frameworks. You will be responsible for working on data processing from raw layers to production-ready datasets, utilizing both Databricks and Azure tools for efficient and reliable data engineering practices.

Key Responsibilities:

Process data from the Raw (Bronze) layer to the Silver layer in Databricks, transforming raw data into structured, clean datasets for analysis.

Pull data from Azure Blob Storage into Databricks and configure the required connections, ensuring data is ingested seamlessly into the platform.

Understand and utilize SQL Pools in Azure Synapse Analytics, applying them to improve the performance of large-scale data queries and transformations.

Implement performance optimization techniques in Databricks, such as caching, partitioning, and tuning Spark configurations.

Utilize Auto Loader in Databricks to automatically ingest data from cloud storage in near real-time.

Integrate and leverage Synapse Serverless SQL Pools for serverless querying and scaling in your data workflows.

Process multiple Parquet files from a control table and append them to Delta tables with schema enforcement, ensuring data integrity.

Integrate and configure connections from various data sources using Databricks, including both structured and semi-structured data.

Work with Oracle databases within data engineering pipelines for data extraction and transformation.

Handle a variety of file formats (e.g., Parquet, CSV, JSON, Delta) to support data workflows and transformation needs.

Collaborate with data scientists, analysts, and other engineers to ensure optimal performance and integration across data systems.

Ensure best practices in data security, governance, and compliance across the data pipeline.

Required Skills and Experience:

Strong experience working with Databricks for building and managing data pipelines, from ingestion to transformation.

Proficiency in using Apache Spark (PySpark/Scala) and Delta Lake for big data processing and management.

Experience working with Azure Blob Storage, configuring data ingestion from Azure to Databricks.

Solid understanding of SQL Pools in Azure Synapse Analytics and their application in large-scale data transformations.

Proven ability to optimize Spark jobs in Databricks for performance (e.g., through partitioning, caching, optimizing Spark configurations).

Hands-on experience with Auto Loader in Databricks for automating real-time data ingestion.

Proficient in integrating Databricks with Synapse Analytics to enable seamless workflows across platforms.

Expertise in working with Parquet and Delta formats, including schema enforcement and schema evolution.

Experience with integrating multiple data sources within Databricks, including databases like Oracle and other third-party data systems.

In-depth knowledge of Azure Data Factory, including its Integration Runtime and data movement capabilities.

Familiarity with various file formats in data pipelines, including CSV, JSON, Parquet, and Delta.

Strong programming skills in Python, Scala, or SQL.

Excellent communication and problem-solving skills, with the ability to collaborate across teams.

Preferred Qualifications:

Experience working with serverless SQL pools in Synapse Analytics.

Knowledge of machine learning integration in Databricks and its use in data pipelines.

Familiarity with version control systems like Git and CI/CD processes.

Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or related field.

--

Thanks & Regards,
Rohit
Technical Recruiter
rohitk@insourcetechsolutions.com
www.insourcetechsolutions.com
Marion rd, Westport, CT 06880,

--

Keywords: continuous integration continuous deployment information technology Connecticut
Azure Datanricks Engineer - Rate 40$ (need OPT Candidates)
rohitk@insourcetechsolutions.com
https://jobs.nvoids.com/job_details.jsp?id=2303310&uid=
rohitk@insourcetechsolutions.com
View All
06:50 PM 01-Apr-25


To remove this job post send "job_kill 2303310" as subject from rohitk@insourcetechsolutions.com to usjobs@nvoids.com. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to rohitk@insourcetechsolutions.com -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at me@nvoids.com


Time Taken: 0

Location: ,