Creating Data Classification Plans for Sensitive Data Governance

Create a data classification plan for Sensitive Data Governance to define indexing requirements and the types of entities to detect.

Before You Begin

  • If your end-user data includes scanned documents and you want to include the scanned documents in content indexing or entity detection, review the following topics:

  • If content indexing is configured for Exchange backups, you can select the same Index Server for the data classification plan. To find the Index Server used by Exchange backups, go to the Infrastructure settings section on the Configuration tab of your Exchange application.

Start the Configuration Wizard

  1. From the navigation pane, go to Manage > Plans.

    The Plans page appears.

  2. In the upper-right corner of the page, click Create plan, and then click Data classification.

    The Create Data Classification Plan configuration wizard appears.

Select Application

  1. Click Sensitive data governance.

  2. Click Next.

    The Configuration page of the configuration wizard appears.

Configuration

General

  1. In the Plan name box, enter a unique name for the plan.

  2. From the Index server list, select an existing Index Server or create an Index Server.

    Steps to create an Index Server
    1. Click the plus button (+).

      The Create new index server dialog box appears.

    2. In the Index Server name field, enter a name for the Index Server.

    3. From the Index Server nodes list, select the existing servers or add a new server.

      Note

      To use a server as a node for the Index Server, the server must have the Index Store package installed.

      To add a new server, do the following:

      1. Click the plus button (+).

        The Add index store software window appears.

      2. Specify a name and then select the Add new server option.

      3. In the Host name box, type the host name.

      4. In the User name and Password boxes, type the credentials for the server.

      5. In the Confirm password box, type the password.

      6. For OS Type, select the operating system that is installed on the server.

      7. Optional: In the Installation location box, enter the installation location path.

      8. If the tenant has multiple access nodes configured for a company, from the Software cache list, select the cache source.

        If the tenant has only one access node configured for a company, then the access node is selected as software cache source.

      9. To reboot the server after the installation, move the Reboot if required toggle key to the right.

      10. Click Install.

        The Index Store and dependency packages are automatically installed on the server.

    4. From the Language list, select the language of the content that this Index Server will content index:

      • Chinese

      • English

      • Japanese

      During content indexing, text is split into meaningful groups of characters (tokenized). After the text is tokenized, meaningful results are returned when you search the text.

    5. Click Save.

    To use an existing Index Server, from the Index server list, select an Index Server.

Entity detection

  1. From the Content analyzer list, select the content analyzers to use for entity detection (PII).

    To add a new server as a content analyzer, click the plus button (+). In the Add content analyzer window, specify a name, and select the Add new server option. You can also select an existing server and then click Install.

    The Content Analyzer and dependency packages are automatically installed on the server.

  2. From the Entities list, select one or more entity types.

  3. To add a classification model, under Classification, from the Classifier list, select the classifier.

    Classifiers are trained to recognize types of documents.

  4. Click Next.

Advanced Options

  1. To include file types for content indexing and entity detection, under Include file types, enter the extension in the Enter file extension box using the format *.ext, and then click Add.

  2. To exclude directories from content indexing and entity detection, under Exclude paths, enter the path in the Enter folder path or pattern box, and then click Add.

    You can include wildcard expressions in the directory path. For example, to exclude all the files in a temporary directory, enter */temp.

  3. In the Maximum file size field, enter the maximum size in megabytes for files to be content indexed.

Indexing

  1. Optional: To schedule the content indexing job, next to Schedule, click Edit edit button outline grey/gray pencil and define the schedule in the Edit schedule dialog box.

  2. To include scanned documents in content indexing and entity detection, select the Extract text from image check box.

  3. From the Storage pool list, select a storage pool.

  4. Click Submit.

Loading...