Creating a User-Defined Subclient to Back Up Hadoop Data

Subclients contain information about what data is backed up. You can create a user-defined subclient to manage and back up specific data.

Before You Begin

You can use wildcards to define the subclient content. For more information, see Wildcards for the UNIX File System Agent.

Procedure

  1. From the CommCell Browser, expand Client Computers > pseudo-client > Big Data Apps.

  2. Right-click the instance that you want to create subclient for, point to All Tasks, and then click New Subclient.

    The Subclient Properties dialog box appears.

  3. Specify the basic settings for the subclient:

    1. In the Subclient Name box, type a name.

    2. On the Data Access Nodes tab, select the data access nodes that you want to add to the subclient, and then click Add.

    3. On the Content tab, click Browse to select the directory or file that you want to back up, and then click Add.

      Repeat this step to include all the files and directories that you want to back up.

      Note: The default subclient does not back up the content that you specify in the user-subclients that are within the same instance.

  4. Click Advanced.

    The Advanced Subclient Properties dialog box appears.

  5. To configure multiple streams for backups, on the Performance tab, specify the number of data streams:

    1. In the Number of Data Readers box, enter the number of data streams.

      Notes:

      • For optimal sharing of the backup load, the number of data readers must be greater than the number of data access nodes.

      • The number of streams configured in the storage policy must be equal to or greater than the value entered in the Number of Data Readers box.

    2. Select the Allow multiple data readers within a drive or mount point check box

    3. Click OK.

  6. On the Storage Device tab, select a storage policy from the Storage Policy list.

  7. To create a new storage policy, click Create Storage Policy, and then follow the instructions in the storage policy creation wizard.

  8. To perform LAN-free backups and restores, select a grid storage policy.

    For more information, see GridStor® (Alternate Data Paths) - Overview.

  9. Optional: Select the subclient options.

    Configuring Backups for Recently Modified or Changed Data

    Retaining Additional Versions of a File During Synthetic Full Jobs
    • On the Retention tab, in Older file versions area, select one of the following options:

      • Keep older versions for n years n months, and n days

        Type a value for the time period for which you want to retain the older versions of the file. All older versions whose modified time is less than the value you specify in the Keep older versions for boxare retained by the synthetic full backup operation.You will see this option only if your client is configured to use Indexing V2.

      • Keep n versions

        Type a value for the number of older versions you want to retain, starting from the latest backup.

    Setting Up Pre-processes and Post-processes

    Important: The pre-process and post-process scripts must be present on the master node.

    • On the Pre/Post Process tab:

      • In the PreBackup Process box, type the full path name for the script.

      • In the PostBackup Process box, type the full path name for the script.

      • To run the post backup process regardless of the job's outcome, select the Run Post Process for all attempts check box.

    Setting Up Network Bandwidth Throttling for a Subclient
    • On the Storage Device tab:

      • On the Data Transfer Option sub-tab, select the Throttle Network Bandwidth (MB/HR) check box, and then enter the throughput limit.

        The Commvault software throttles the transfer on a per-network-agent basis.

    Modifying Software Compression on a Subclient

    If the subclient is associated to a subclient policy, modify the compression settings on a subclient policy.

    For database agents, where third-party backups are run, the software compression settings must be modified in the Instance Properties dialog box.

    • On the Storage Device tab:

      • On the Data Transfer Option sub-tab, under Software Compression, configure the appropriate compression settings:

        • To compress data on the client, click On Client.

        • To compress data on the MediaAgent, click On MediaAgent.

        • To compress data according to the settings in the storage policy copy, click Using Storage Policy Settings.

        • To transfer data without compression, click Off.

    Viewing Data Paths
    • On the Storage Device tab:

      • In the Storage Policy list, click a storage policy name.

      • Click Data Paths.

    Configuring Activity Control
    • On the Activity Control tab

      • Select or clear check boxes to enable or disable activities.

        For a list of activities that you can control in the CommCell, see List of CommCell Activities.

    Configuring Data Encryption
    • Verify that you have encryption enabled on client level. For instructions, see Configuring Data Encryption on a Client.

    • On the Encryption tab, choose one of the following options for data encryption:

      Options

      Description

      None

      Disables encryption on selected subclient.

      Media Only (MediaAgent Side)

      Backup data is transmitted without encryption and then encrypted prior to storage. During restore operation, data is decrypted by the client.

      Network and Media (Agent Side)

      When enabled, backup data is encrypted before transmission and is stored encrypted on the media. During restore operations, data is decrypted by the client.

      This is the default option.

      Network Only (Agent Encrypts, MediaAgent Decrypts)

      Backup data is encrypted for transmission and then decrypted prior to storage on the media. During restore operations, data is encrypted by the MediaAgent and then decrypted in the client.

  10. Click OK.

A subclient with the content that you want to back up is created under the instance that you selected.

Loading...