The big data guided setup guides you through creating a server backup plan and adding the Hadoop cluster.
Go to the Big Data Guided Setup
From the navigation pane, go to Guided setup.
The Welcome page appears.
On the Protect tab, at the bottom of the page, click More.
Click the Big data apps tile.
The Big data app setup page appears.
If you already completed the guided setup, the Apps page appears.
Create a Server Plan That You Can Use for Hadoop
If you already have a server plan that you can use, you can skip this step.
On the Create server backup plan page, specify the settings for a server plan that you can use for Hadoop.
Enter a name for the server plan.
The Add backup destination dialog box appears.
In Name, enter a name for the backup destination.
From the Storage list, select the storage to use for the backups.
If you selected storage that uses Distributed Storages, the Optimize for instant clone toggle key appears.
By default, this setting is turned on to allow the associated Distributed Storage to optimize backups for clones, using Copy Data Management. To turn off the setting, move the Optimize for instant clone toggle key to the left.
The setting does not apply to Hyperscale solutions that use Distributed Storage.
For Retention period, enter the amount of time to retain the backups.
To specify additional backups, such as weekly full backups, move the Extended retention rules toggle key to the right, and then add rules.
For Backup, specify how often and when to run incremental backups.
To run full backups, move the Add full backup toggle key to the right, and then specify how often and when to run full backups.
For Backup window, specify when you want incremental backups to run.
For Full backup window, specify when you want full backups to run.
Folders to back up
To back up only some content, in Content to back up, enter the content to back up.
By default, all content is backed up.
To exclude folders or files from the backup, in Exclude - files/folders/patterns, enter the content to exclude.
Specify whether to include the system state in backups:
To include the system state in all backups, select the Back up system state check box.
To include the system state only in full backups, select the Back up system state check box and the Only with full backup check box.
To use VSS (Volume Snapshot Service, also called Shadow Copy) to back up the system state, select the Use VSS for system state check box.
Specify how to retain snapshots:
To specify a number of jobs to retain on a snapshot copy, select Number of snap recovery points, and then enter the number of jobs to retain.
To specify a retention period, select Retention period, and then enter the amount of time to retain the jobs.
If you don't want to create backup copies, move the Enable backup copy toggle key to the left to turn it off.
For Backup copy frequency, enter how often to run backup copy jobs.
For Log backup RPO, enter how often to run log backups.
To use the disk cache of the logs to the MediaAgent for backups, do the following:
Move the Use disk cache for log backups toggle key to the right.
For Commit every, enter how often to commit the logs to the CommServe computer.
Disk caching of database logs applies to the following agents: Informix, Microsoft SQL Server on Windows, Oracle, Oracle RAC, and SAP HANA.
Add the Hadoop Cluster
When you add a cluster, an instance, an app, and a default subclient are automatically created. You can create additional subclients for content that has different backup requirements.
On the Add a big data app page, from the Application type list, select Hadoop.
In Name, enter a name for the cluster.
From the Access nodes list, select the server or servers that you want to be data access nodes for Hadoop.
In HDFS user, enter the Hadoop user name to use for backups.
From Plan, select the server plan to use for the cluster.
To configure HBase, under Configure HBase, specify the following information: