Create a data classification plan for Sensitive Data Governance to define indexing requirements and the types of entities to detect.
Before You Begin
-
If your end-user data includes scanned documents and you want to include the scanned documents in content indexing or entity detection, review the following topics:
-
For information about scanned document support, see OCR Support for Scanned Documents.
-
To enable content indexing and entity detection for images in additional file types, see Adding OCR Support for Additional File Types.
-
-
If content indexing is configured for Exchange backups, you can select the same Index Server for the data classification plan. To find the Index Server used by Exchange backups, go to the Infrastructure settings section on the Configuration tab of your Exchange application.
Procedure
-
From the navigation pane, go to Manage > Plans.
The Plans page appears.
-
In the upper-right corner of the page, click Create plan, and then click Data classification.
The Data classification plan page appears.
-
Click Sensitive data governance.
The Create data classification plan - Sensitive data governance page appears.
-
On the Configuration tab, complete the following steps:
-
In the Plan name box, enter a unique name for the plan.
-
Create an index server or use an existing index server:
-
To create an index server, click the plus button .
To use a server as a node for the Index Server, the node must have the Index Store package installed.
-
To use an existing index server, from the Index server list, select the index server.
-
-
From the Content analyzer list, select the content analyzers to use for entity detection (PII).
-
From the Entities list, select one or more entity types.
-
-
To add a classification model, under Classification, from the Classifiers list, select the classifier.
Classifiers are trained to recognize types of documents.
-
Click Next.
-
On the Advanced options tab, select from the following options:
-
To include file types for content indexing and entity detection, under Include file types, enter the extension in the Enter file extension box using the format *.ext, and then click Add.
-
To exclude directories from content indexing and entity detection, under Exclude paths, enter the path in the Enter folder path or pattern box, and then click Add.
You can include wildcard expressions in the directory path. For example, to exclude all the files in a temporary directory, enter */temp.
-
To specify the range of file sizes that are content indexed, under File size, enter the minimum file size and the maximum file size in megabytes.
-
To include scanned documents in content indexing and entity detection, select the Extract text from image check box.
-
-
Click Save.
Related Topics
-
For information about index servers for Sensitive Data Governance, see Installing Software Components for Sensitive Data Governance.
-
For information about entity types, see the following topics:
-
For information about creating and training classification models, see Classifier Manager.