Home

Databricks Unity Catalog Engineer, Princeton,NJ (Onsite) at Princeton, New Jersey, USA
Email: [email protected]
From:

Harish Varma,

R2 Technologies

[email protected]

Reply to:   [email protected]

Databricks Unity Catalog Engineer

Location: Princeton NJ (prefer onsite)

Duration: 6 months

Client:Capgemini

Architect the Unity Catalog to provide centralized access control, auditing, lineage, and data discovery capabilities across Databricks workspaces.

Define and organize data assets (structured and unstructured) within the Unity Catalog.

Design and implement Azure cloud-based Data Warehousing and Governance architecture with Lakehouse paradigm

Define access policies at a granular level (rows, columns, features) to ensure secure and consistent access management across workspaces and platforms.

Leverage Delta Sharing to enable easy data sharing across regions, and platforms.

Ensure that data and AI assets can be securely shared with minimal replication, maintaining a unified experience for users.

Integrating technical functionality, ensuring data accessibility, accuracy, and security.

Enable data analysts and etl engineers to discover and classify data, notebooks, dashboards, and files across clouds and platforms.

Implement a single permission model for data and AI assets.

Monitoring and Observability: utilize AI to automate monitoring, diagnose errors, and maintain data and quality.

Set up alerts for personally identifiable information (PII) detection, and operational intelligence.

Work closely with data scientists, analysts, and engineers to promote adoption of the Unity Catalog.

Provide training and documentation to ensure effective usage and compliance with governance policies

Skills:

Development and configuration of Unity Catalog

Deep understanding of data governance principles, especially related to data cataloging, access control, lineage, and metadata management.

Designed data warehouse and data lake solutions along with data processing Pipeline using PySpark using Databricks

Performed Data Modelling on Databricks [Delta Table] for transactional and analytical need.

Designed and developed pipelines to load data to Data Lake

Databricks Platform Proficiency, including its components like Databricks SQL, Delta Live Tables, Databricks Repos, and Task Orchestration.

Strong SQL skills for querying and managing data

Ability to design and optimize data models for structured and unstructured data.

Understand how to manage compute resources, including clusters and workspaces.

Keywords: artificial intelligence New Jersey
Databricks Unity Catalog Engineer, Princeton,NJ (Onsite)
[email protected]
[email protected]
View all
Thu May 30 19:45:00 UTC 2024

To remove this job post send "job_kill 1438146" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.


Your reply to [email protected] -
To       

Subject   
Message -

Your email id:

Captcha Image:
Captcha Code:


Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]
Time Taken: 14

Location: Princeton, New Jersey