Installing the Impala UDFs

To use the Impala component, you must first install the UDFs. The UDFs for Impala are available in the pepimpala.so file. This file is available in the /opt/cloudera/parcels/PTY_BDP/pepimpala/ directory after you install the Big Data Protector. To install the Impala UDFs, you must:

  1. Load the pepimpala.so file to HDFS.
  2. Execute the .sql scripts to load the Impala UDFs.

To install the Impala UDFs:

  1. Ensure that the cluster is installed, configured, and running.

  2. To create the /opt/protegrity/impala/udfs/ directory in HDFS, run the following command:

    sudo -u hdfs hadoop fs -mkdir -p /opt/protegrity/impala/udfs/
    
  3. To assign Impala supergroup permissions to the /opt/protegrity/impala/udfs/ directory, run the following command:

    sudo -u hdfs hadoop fs -chown -R impala:supergroup /opt/protegrity/impala/udfs/
    
  4. To navigate to the /opt/cloudera/parcels/PTY_BDP/pepimpala/ directory, run the following command:

    cd /opt/cloudera/parcels/PTY_BDP/pepimpala/
    
  5. To load the pepimpala.so file to the /opt/Protegrity/impala/udfs/ directory, run the following command:

    sudo -u hdfs hadoop fs -put pepimpala<version>.so /opt/protegrity/impala/udfs
    

    In this case, the name of the shared objects file considered as pepimpala.so. Typically, the name of the shared objects file is pepimpala<xx>RHEL.so, where is the version of the file, which needs to be considered.

  6. Navigate to the /opt/cloudera/parcels/PTY_BDP/pepimpala/sqlscripts/ directory.

    This directory contains the SQL scripts to install the Protegrity UDFs for the Impala protector.

  7. If you are not using a Kerberos-enabled Hadoop cluster, then execute the createobjects.sql script to install the Protegrity UDFs for the Impala protector.

    impala-shell -i <IP address of any Impala slave node> -f /opt/cloudera/parcels/PTY_BDP/pepimpala/sqlscripts/createobjects.sql
    
  8. If you are using a Kerberos-enabled Hadoop cluster, then execute the createobjects.sql script to load the Protegrity UDFs for the Impala protector.

    impala-shell -i <IP address of any Impala slave node> -f /opt/cloudera/parcels/PTY_BDP/pepimpala/sqlscripts/createobjects.sql -k
    

Note: For more information about registering the Impala UDFs using the helper script, refer Registering the Impala UDFs


Last modified : February 20, 2026