To start using Risk Analysis for Sensitive Data Governance, use the setup wizard to create a data classification plan and to create inventory. In the data classification plan, you create an Index Server and define which personally identifiable information (entities) to detect in the data.
Complete this procedure as part of the initial configuration of Data Governance. For more information, see Getting Started with Sensitive Data Governance.
Before You Begin
-
If your end-user data includes scanned documents and you want to include the scanned documents in content indexing or entity detection, review the following topics:
-
For information about scanned document support, see OCR Support for Scanned Documents.
-
To enable content indexing and entity detection for images in additional file types, see Adding OCR Support for Additional File Types.
-
-
If content indexing is configured for Exchange backups, you can select the same Index Server for the data classification plan. To find the Index Server used by Exchange backups, go to the Infrastructure settings section on the Configuration tab of your Exchange application.
Go to the Guided Setup
-
In the navigation pane, click Guided setup.
-
On the Data Insights tab, click Data governance.
The Configuration page of the guided setup appears.
Configuration
General
-
In the Plan name box, enter a unique name for the plan.
-
From the Index server list, select an existing Index Server or create an Index Server.
Steps to create an Index Server
-
Click the plus button (+).
The Create new index server dialog box appears.
-
In the Index Server name field, enter a name for the Index Server.
-
From the Index Server nodes list, select the existing servers or add a new server.
Note
To use a server as a node for the Index Server, the server must have the Index Store package installed.
To add a new server, do the following:
-
Click the plus button (+).
The Add index store software window appears.
-
Specify a name and then select the Add new server option.
-
In the Host name box, type the host name.
-
In the User name and Password boxes, type the credentials for the server.
-
In the Confirm password box, type the password.
-
For OS Type, select the operating system that is installed on the server.
-
Optional: In the Installation location box, enter the installation location path.
-
If the tenant has multiple access nodes configured for a company, from the Software cache list, select the cache source.
If the tenant has only one access node configured for a company, then the access node is selected as software cache source.
-
To reboot the server after the installation, move the Reboot if required toggle key to the right.
-
Click Install.
The Index Store and dependency packages are automatically installed on the server.
-
-
From the Language list, select the language of the content that this Index Server will content index:
-
Chinese
-
English
-
Japanese
During content indexing, text is split into meaningful groups of characters (tokenized). After the text is tokenized, meaningful results are returned when you search the text.
-
-
Click Save.
To use an existing index server, from the Index server drop-down list, select an index server.
-
Entity detection
-
From the Content analyzer list, select the content analyzers to use for entity detection (PII).
To add a new server as a content analyzer, click the plus button (+). In the Add content analyzer window, specify a name, and select the Add new server option. You can also select an existing server and then click Install.
The Content Analyzer and dependency packages are automatically installed on the server.
-
From the Entities list, select one or more entity types.
-
To add a classification model, under Classification, from the Classifiers list, select the classifier.
Classifiers are trained to recognize types of documents.
-
Click Next.
The Advanced options page of the configuration wizard appears.
Advanced Options
-
Optional: Customize what is content indexed and analyzed for entity detection:
-
To include file types for content indexing and entity detection, under Include file types, enter the extension in the Enter file extension box using the format *.ext, and then click Add.
-
To exclude directories from content indexing and entity detection, under Exclude paths, enter the path in the Enter folder path or pattern box, and then click Add.
You can include wildcard expressions in the directory path. For example, to exclude all the files in a temporary directory, enter */temp.
-
-
In the Maximum file size field, enter the maximum size in megabytes for files to be content indexed.
Indexing
-
Optional: To schedule the content indexing job, next to Schedule, click Edit and define the schedule in the Edit schedule dialog box.
-
To include scanned documents in content indexing and entity detection, select the Extract text from image check box.
-
From the Storage pool list, select a storage pool.
-
Click Submit.
What to Do Next
-
After you configure Data Governance, you can perform the following tasks:
-
Processing End-User Access Access Requests (export or delete personal data)
Related Topics
-
When you use the setup wizard to create a data classification plan, only the default entities are available. If you want to manage the types of entities that are available, including custom entities, see Entity Manager.
-
For information about creating and training classification models, see Classifier Manager.
-
After you complete the steps for getting started with Data Governance, you can add individual file servers to your inventories. For more information, see Adding a File Server Asset to an Inventory.