Editing the Cluster Configuration
Note: The instructions mentioned in the section apply only to the Application Protector REST approach.
After the configurator script is executed and the UDFs are created, the cluster must be updated to include the following configurations:
- Inclusion of the environment variables.
- Attach the
BigDataProtector-Init-Script_Linux-ALL-64_x86-64_AWS.Databricks-<DBR_version>-64_<BDP_version>.shscript to the Databricks compute.
Ensure that ESA is started and in a running state before restarting the Databricks cluster after updating the configurations.
To edit the cluster:
Log in to the Databricks portal.
Edit the required cluster.
Expand the Advanced section.
Click the Spark tab.
Under Environment variables, add the variables, with their values, listed in the table:
Variable Value PTY_ESA_IPEnter ESA IP address. PTY_ESA_PORTEnter the port number to connect to ESA. PTY_ESA_TOKENEnter the JWT token to connect to ESA. PTY_ESA_ADMINISTRATOR_USERNAMEEnter the user name to connect to ESA. PTY_ESA_ADMINISTRATOR_PASSWORDEnter the password to connect to ESA. PTY_AUDIT_STORE_IP_PORTEnter the port to connect to the Audit Store. The value is a comma-separated string of <audit_store_ip>:<audit_store_port>. For example, 11.22.33.44:9200, 55.66.77.88:9200PTY_PROTECTOR_CONFIGURATIONSpecify the values as [core]emptystring=empty, [sync]interval=10 Click the Init scripts tab.
From the Source list, select Volumes.
In the File path box, enter the location of the initialization script.
To save the changes and restart the cluster, click Confirm and restart.
Note: If the initialization script fails with a non-zero exit code, enable cluster logging to view the error log files for troubleshooting purposes.
When the cluster is restarted, the initialization script starts the Application Protector REST service on every node in the cluster. After the Application Protector REST service is started, use the Unity Catalog Batch Python UDFs to protect and unprotect data.
Note: The process to execute the initialization script will take some time before the cluster is ready to use for performing protect and unprotect operations. For more information on using the UDFs for protect and unprotect operations, refer to the section Unity Catalog Batch Python UDFs.
Feedback
Was this page helpful?