You can create a user-defined subclient to manage and archive specific data.
Files that are smaller in size than one file system block (16 MB) are archived in the units of subblocks. The archived items are created with extended attributes.
Before You Begin
-
Archiving of IBM Spectrum Scale (GPFS) data is supported only on Linux.
-
You can use wildcards to define the subclient content. For more information, see Wildcards for the UNIX File System Agent.
Procedure
-
From the CommCell Browser, expand Client Computers > pseudo-client > Big Data Apps.
-
Right-click the instance that you want to create a subclient for, point to All Tasks, and then click New Subclient.
The Subclient Properties dialog box appears.
-
Specify the basic settings for the subclient:
-
In the Subclient Name box, type a name.
-
On the Data Access Nodes tab, select the data access nodes that you want to add to the subclient, and then click Add.
-
On the Content tab, click Browse to select the directory or files that you want to archive, and then click Add.
Note: The default subclient does not back up the content that you specify in user-defined subclients that are within the same instance.
-
Select the Only backup files that qualify for archiving check box.
-
-
To specify the retention period for your archived files, on the Retention tab, set the time period.
For more information about retention period, see Retention Options for Archiving.
-
Click Advanced.
The Advanced Subclient Properties dialog box appears.
-
To configure multiple streams for archiving, on the Performance tab, specify the number of data streams:
-
In the Number of Data Readers box, enter the number of data streams.
Notes:
-
For optimal sharing of the backup load, the number of data readers must be greater than the number of data access nodes.
-
The number of streams configured inf Data Readers box.
-
-
Select the Allow multiple data readers within a drive or mount point check box.
-
Click OK.
-
-
On the Archiving Rules - General subtab, select the Enable Archiving with these rules check box to archive your subclient content.
To archive your files, set the rules based on the file access time, modified time, or file size. For more information about archiving rules, see Archiving Rules for GPFS Data.
-
On the Storage Device tab, select a storage policy from the Storage Policy list.
-
To create a new storage policy, click Create Storage Policy, and then follow the instructions in the storage policy creation wizard.
-
To perform LAN-free backup and restores, select a grid storage policy.
For more information, see GridStor® (Alternate Data Paths) - Overview.
-
Optional: Select the subclient options.
Checking Redundancy for Files that Qualify for Archiving
Setting Up Pre-processes and Post-processes
Important: The pre-process and post-process scripts must be present on the master node.
-
On the Pre/Post Process tab:
-
In the PreBackup Process box, type the full path name for the script.
-
In the PostBackup Process box, type the full path name for the script.
-
To run the post backup process regardless of the job's outcome, select the Run Post Process for all attempts check box.
-
-
-
Click OK.
A subclient with the content that you want to archive is created under the instance that you selected.