Creating Protegrity Anonymization Requests
This section walks you through the process of creating Protegrity Anonymization requests to anonymize your data. It describes the steps for using the REST API and creating Protegrity Anonymization Python SDK requests.
A general overview of the process you need to follow to anonymize the data is shown in the following figure:

- Identify the dataset that needs to be anonymized.
- Analyze and classify the various fields available in the dataset. The following classifications are available:
- Direct Identifiers
- Quasi-Identifier
- Sensitive Attributes
- Non-Sensitive Attributes
- Determine the use case by specifying the data that is required for further analysis.
- Specify the quasi-identifiers and other fields that are not required in the dataset
- Specify the required Protegrity Anonymization methods for the data. Some commonly used methods are as follows:
- Generalization
- Micro-Aggregation
- Specify and measure the acceptable statistics and risk levels for the data fields for measuring the statistic before running the Protegrity Anonymization job.
Note: For more information about different risk levels for the data fields, refer to Protegrity Anonymization models.
- Verify that the anonymized data satisfies the acceptable risk threshold level.
- Measure the quality of the anonymized data by comparing it with the original data. If the quality does not meet standards, then work on the data or drop the output.
- Save the anonymized data to an output file.
The anonymized data can now be used for further analysis and as input for machine learning softwares.
Feedback
Was this page helpful?