<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Building the Protegrity Synthetic Data Request on</title><link>https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/</link><description>Recent content in Building the Protegrity Synthetic Data Request on</description><generator>Hugo</generator><language>en</language><atom:link href="https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/index.xml" rel="self" type="application/rss+xml"/><item><title>High-Level Workflow</title><link>https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/hide_high_level_workflow/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/hide_high_level_workflow/</guid><description>&lt;p>The Protegrity Synthetic Data follows a structured pipeline to generate Synthetic Data:&lt;/p>
&lt;ol>
&lt;li>Configuration Validation&lt;/li>
&lt;li>Optimal Real Data Usage&lt;/li>
&lt;li>Automatic Data Preprocessing&lt;/li>
&lt;li>Training of Protegrity Synthetic Data Generator Model&lt;/li>
&lt;li>Evaluation Against Real Data&lt;/li>
&lt;li>Protegrity Synthetic Data Generation&lt;/li>
&lt;li>Machine Learning Operations&lt;/li>
&lt;/ol>
&lt;h2 id="configuration-validation">Configuration Validation&lt;/h2>
&lt;p>Training Protegrity Synthetic Data generators is a slow process, taking from a couple of minutes to several hours depending on the configurations used. To optimize compute time, several validations are proactively done to ensure a valid configuration before any training takes place. If any violations are found, descriptive exceptions are returned to the user.&lt;/p></description></item><item><title>Building the Request Using the REST API</title><link>https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/hide_build_rest_wrap/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://docs.protegrity.com/synthetic-data/1.0.1/docs/building_req/hide_build_rest_wrap/</guid><description>&lt;h2 id="identifying-the-source-and-target">Identifying the Source and Target&lt;/h2>
&lt;p>In this step, you specify the source real dataset from which you wish to produce Protegrity Synthetic Data and a target, where corresponding Synthetic Data will be saved.&lt;/p>
&lt;p>The following file formats are supported:&lt;/p>
&lt;ul>
&lt;li>Comma separated values (CSV)&lt;/li>
&lt;/ul>
&lt;p>The following data storages have been tested for Protegrity Synthetic Data:&lt;/p>
&lt;ul>
&lt;li>Local File System&lt;/li>
&lt;li>Amazon S3&lt;/li>
&lt;/ul>
&lt;p>The following data storage types can also be used for the Protegrity Synthetic Data:&lt;/p></description></item></channel></rss>