Prerequisites
- Databricks workspace with Unity Catalog enabled
- Metastore created and attached to your workspace
- Appropriate admin privileges to create service principals and grant permissions
Integration Methods
Option 1: Direct Read Access via Service Principal
This method allows Aampe to directly access your Unity Catalog tables using a dedicated service principal with OAuth authentication.Required Permissions
The Aampe service principal needs the following Unity Catalog privileges:- USE CATALOG - Access to view metadata in the catalog
- USE SCHEMA - Access to view metadata in schemas
- SELECT - Read access to tables and views
- BROWSE - Optional: View object metadata in Catalog Explorer
Step-by-step Procedure
- Create the Service Principal
- Go to your Databricks account console
- Navigate to User management > Service principals
- Click Add service principal
- Provide a name (e.g., aampe-integration-sp)
- Click Add
- Create an app registration in Azure Portal
- Note the Application (client) ID
- Add the service principal to your Databricks account
- Generate OAuth Credentials
- In the service principal’s details page, go to the Secrets tab
- Under OAuth secrets, click Generate secret
- Set the secret’s lifetime (maximum 730 days)
- Copy and securely store:
- Client ID (same as Application ID)
- OAuth Secret (shown only once)
- Click Done
- Grant Unity Catalog Permissions
- <catalog_name>: Your Unity Catalog name
- <schema_name>: Your schema/database name
- <table_name>: Specific table names (or use schema-level grant)
- <service_principal_id>: The Application ID of your service principal
- Provide Connection Details to Aampe
- Workspace URL:
https://<workspace-name>.cloud.databricks.com - Client ID: Your service principal’s Application ID
- OAuth Secret: The generated secret (transfer securely)
- Catalog Name: Your Unity Catalog name
- Schema Names: List of schemas containing relevant tables
- Table Names: Specific tables to sync (if not syncing entire schemas)
- Workspace ID: Your Databricks workspace ID
- Account ID: Your Databricks account ID (for account-level access)
Option 2: Push to Cloud Storage
For large-scale data transfers, you can export data to cloud storage that Aampe can access.AWS S3 Setup
-
Create External Location
-
Grant Permissions
- Set up Automated Export