Databricks Unity Catalog Engineer, Princeton,NJ (Onsite) at Princeton, New Jersey, USA |
Email: [email protected] |
From: Harish Varma, R2 Technologies [email protected] Reply to: [email protected] Databricks Unity Catalog Engineer Location: Princeton NJ (prefer onsite) Duration: 6 months Client:Capgemini Architect the Unity Catalog to provide centralized access control, auditing, lineage, and data discovery capabilities across Databricks workspaces. Define and organize data assets (structured and unstructured) within the Unity Catalog. Design and implement Azure cloud-based Data Warehousing and Governance architecture with Lakehouse paradigm Define access policies at a granular level (rows, columns, features) to ensure secure and consistent access management across workspaces and platforms. Leverage Delta Sharing to enable easy data sharing across regions, and platforms. Ensure that data and AI assets can be securely shared with minimal replication, maintaining a unified experience for users. Integrating technical functionality, ensuring data accessibility, accuracy, and security. Enable data analysts and etl engineers to discover and classify data, notebooks, dashboards, and files across clouds and platforms. Implement a single permission model for data and AI assets. Monitoring and Observability: utilize AI to automate monitoring, diagnose errors, and maintain data and quality. Set up alerts for personally identifiable information (PII) detection, and operational intelligence. Work closely with data scientists, analysts, and engineers to promote adoption of the Unity Catalog. Provide training and documentation to ensure effective usage and compliance with governance policies Skills: Development and configuration of Unity Catalog Deep understanding of data governance principles, especially related to data cataloging, access control, lineage, and metadata management. Designed data warehouse and data lake solutions along with data processing Pipeline using PySpark using Databricks Performed Data Modelling on Databricks [Delta Table] for transactional and analytical need. Designed and developed pipelines to load data to Data Lake Databricks Platform Proficiency, including its components like Databricks SQL, Delta Live Tables, Databricks Repos, and Task Orchestration. Strong SQL skills for querying and managing data Ability to design and optimize data models for structured and unstructured data. Understand how to manage compute resources, including clusters and workspaces. Keywords: artificial intelligence New Jersey Databricks Unity Catalog Engineer, Princeton,NJ (Onsite) [email protected] |
[email protected] View all |
Thu May 30 19:45:00 UTC 2024 |