Create a Fingerprint
You can create and use fingerprints for structured data or unstructured data. Structured, or Exact Data Matching (EDM) fingerprints, allow you to monitor your organizations in a row and column format typically extracted from a database in CSV format. Unstructured or Index Document Matching (IDM) fingerprints work by content matching indexed documents and images. Once your data is fingerprinted, you can add a DLP Policy rule to leverage that indexed data.
Create a Structured Data Fingerprint
- Choose Policy > DLP Policies > Fingerprints.
- Click Create New Fingerprint > Structured Data Fingerprint.
- Fingerprint Name. Enter a name for the Fingerprint.
- Source File. Enter the path to the source file that holds the data you want to fingerprint. For example:
- Unix. file://localhost/etc/folderA/file.csv
- Unix. file:///folder/fileNameWithoutExtension
- Windows. file:///c:/WINDOWS/file.csv
- Windows. file:///WINDOWS/file.csv
- Windows. file://localhost/WINDOWS/file.csv
- Folder Path to Locally Generated Hashed Files. Enter the path to the folder where your locally generated hashed files are stored. Space allocated for this should be about 10–20% of the size of data to be fingerprinted. For example:
- Unix. file://localhost/etc/folderA
- Unix. file:///etc/folderB
- Windows. file:///c:/WINDOWS/file.doc
- Windows. file:///WINDOWS/file.doc
- Windows. file://localhost/WINDOWS/file.doc
- Column Separator. Select the delimiter that separates the columns of your source file: either Comma, Tab, or Pipe.
- First row of file contains column names. Activate this checkbox if the first row of your source file contains column names.
- Column. If the field names aren't automatically detected, enter the first Field Name for your source file. Then select the Field Type from the menu. Selections include:
- Alphabetic. Alphabetic characters.
- Number. Numbers supported with decimals.
- Alphanumeric. Alphanumeric characters.
- Zip Code. ##### or #####-####
- Email. LOCAL_SUBPART ('.' LOCAL_SUBPART)* '@' DOMAIN_SUBPART ('.' DOMAIN_SUBPART)*;
- Date. ##/##/#### or ##-##-####
- Phone. (###)###-#### or ###-###-####
- Credit Card Number. CCNs formatted with decimals, spaces, or underscores, separated by pipes or semicolons.
- Social Security Number. ###-##-####, or ### ## ####, or ###.##.###, separated by pipes or semicolons.
- Identifier. Letters and numbers with hyphens and decimals.
- Generic. Used for exact cell match.
- Do Not Index. Select this Field Type to mark the column as Do Not Index. (This option is available with DLP Integrator 4.1 and later.)
- Click + to add as many Columns as required.
- Click Save.
The new Structured Data Fingerprint is created and appears on the Fingerprint page. Next, generate an index.
Create an Unstructured Data Fingerprint
- Choose Policy > DLP Policies > Fingerprints.
- Click Create New Fingerprint > Unstructured Data Fingerprint.
- Fingerprint Name. Enter a name for the Fingerprint.
- Machine Path. Enter the path to the computer that holds the files you want to fingerprint. Click + to add more paths, if necessary. For example:
- Unix. file://localhost/etc/folderA
- Unix. file:///etc/folderB
- Windows. file:///c:/WINDOWS/file.doc
- Windows. file:///WINDOWS/file.doc
- Windows. file://localhost/WINDOWS/file.doc
- Folder Path to Locally Generated Hashed Files. Enter the path to the folder where your locally generated hashed files are stored. Space allocated for this should be about 10–20% of the size of input paths/folders. For example:
- Unix. file://localhost/etc/folderA
- Unix. file:///etc/folderB
- Windows. file:///c:/WINDOWS/file.doc
- Windows. file:///WINDOWS/file.doc
- Windows. file://localhost/WINDOWS/file.doc
- File Type Exclusions. Enter any file types to be excluded. (Archive files such as ZIP files are automatically excluded.)
- Click Save.
The new Unstructured Data Fingerprint is created and appears on the Fingerprint page. Next, generate an index.