Warehouse: Step 6. Configure the Destination Using WebHDFS

Document created by RSA Information Design and Development on Sep 14, 2017. Last modified by RSA Information Design and Development on Nov 16, 2017.
Version 5

Make sure that you have:

  • Installed the Warehouse Connector service or virtual appliance in your network environment.
  • Added the Warehouse Connector service to NetWitness. For more information, see 'Add a Service to a Host' in the Hosts and Services Getting Started Guide.
  • Added the hostname (or FQDN) and IP address of the warehouse nodes and Warehouse Connector to the DNS server.  If the DNS server is not configured, add the hostname (or FQDN) and IP address of the warehouse nodes and Warehouse Connector to the file in the host on which the Warehouse Connector service is installed.
  • If you want Kerberos authentication between the Warehouse Connector and the warehouse cluster, make sure that:
    • A Kerberos Key Distribution Center (KDC) server is configured in your network environment, and the Kerberos keytab file is copied to the host on which Warehouse Connector is installed.
    • Kerberos authentication is enabled in the warehouse cluster.
  • If you want to enable checksum validation, which verifies the integrity of the AVRO files transferred from the Warehouse Connector to the destinations, configure SSH key-based access between the Warehouse Connector and the warehouse host or Hadoop node: generate the keys without setting a passphrase, and exchange the keys between the Warehouse Connector and the warehouse nodes. For more information, see 'Configure SSH Keys' in Configure the Destination Using SFTP.
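
As a quick way to confirm the name-resolution prerequisite above, the following Python sketch checks that each hostname resolves to the expected IP address when run on the Warehouse Connector host. It is an illustration only: the function name `check_resolution` is hypothetical, and the `resolver` parameter exists only so that the check can be exercised without a live DNS server.

```python
import socket


def check_resolution(expected, resolver=socket.gethostbyname):
    """Return a dict mapping each problematic hostname to a description.

    expected: dict of hostname (or FQDN) -> expected IPv4 address.
    resolver: callable that takes a hostname and returns an IPv4 string.
    An empty result means every name resolved to the expected address.
    """
    problems = {}
    for hostname, expected_ip in expected.items():
        try:
            actual_ip = resolver(hostname)
        except OSError as exc:  # socket.gaierror is a subclass of OSError
            problems[hostname] = f"does not resolve ({exc})"
            continue
        if actual_ip != expected_ip:
            problems[hostname] = (
                f"resolves to {actual_ip}, expected {expected_ip}"
            )
    return problems
```

Calling `check_resolution` with a dictionary of your own warehouse node names and addresses returns an empty dictionary when every entry resolves as expected. Any other result lists the names that still need a DNS or hosts-file entry.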

Configure Warehouse Connector to Write to a Remote Destination

To configure the destination:

  1. Log on to NetWitness.
  2. In the main menu, select ADMIN > Services.
  3. In the Services view, select the added Warehouse Connector service and select  > View > Config.
    The Services Config view of Warehouse Connector is displayed.
    [Figure: Warehouse Connector Services Config view]
  4. On the Sources and Destinations tab, in the Destination Configuration section, click the add icon.
  5. In the Add Destination dialog, select WebHDFS from the drop-down list.
    [Figure: Add WebHDFS destination]
  6. In the Name field, enter a unique symbolic name for the destination.

    Note: The Name field does not support spaces or special characters except underscore (_).

  7. In the Hadoop IP field, enter the namenode IP address of the warehouse cluster.
  8. In the Hadoop Port field, enter the base port that is used by the namenode web user interface.
  9. In the Username field, enter the owner of the directory in the warehouse to which Warehouse Connector should write the data.
  10. In the Hadoop Path field, enter the path of the directory in the warehouse to which Warehouse Connector should write the data.
  11. If you want the Warehouse Connector to communicate securely with the warehouse using Kerberos authentication, select the Kerberos Authentication checkbox and perform the following:
    [Figure: WebHDFS Kerberos authentication]
    1. In the Kerberos Principal field, enter the KDC Principal used for Kerberos authentication.
    2. In the Kerberos Keytab File Path field, enter the path of the Kerberos keytab file on the Warehouse Connector host.
  12. Click Save.
  13. (Optional) If you want to enable checksum validation, perform the following:
    1. In the main menu, select ADMIN > Services.
    2. In the Services view, select the added Warehouse Connector service and select  > View > Explore.
      The Explore view of Warehouse Connector is displayed.
    3. In the options panel, navigate to warehouseconnector/destinations/webhdfs/config.
    4. Set the parameter isChecksumValidationRequired to 1.
      [Figure: WebHDFS config parameter update]
    5. Restart the respective stream.
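
The destination fields entered in the procedure above map naturally onto a WebHDFS REST request, which can help when checking the values independently of NetWitness. The Python sketch below validates a destination name against the rule in the note for the Name field, and builds a `LISTSTATUS` URL from the Hadoop IP, Hadoop Port, Username, and Hadoop Path values. It makes several assumptions: the cluster accepts plain HTTP without Kerberos, "no special characters" means ASCII letters, digits, and underscores only, and the Hadoop path is absolute. The function names are hypothetical, and this guide does not describe how Warehouse Connector itself calls WebHDFS.

```python
import re
from urllib.parse import quote, urlencode

# Assumption: "no spaces or special characters except underscore" is read as
# ASCII letters, digits, and underscores only.
_NAME_PATTERN = re.compile(r"[A-Za-z0-9_]+")


def is_valid_destination_name(name):
    """Return True if the name has only letters, digits, and underscores."""
    return _NAME_PATTERN.fullmatch(name) is not None


def webhdfs_liststatus_url(hadoop_ip, hadoop_port, username, hadoop_path):
    """Build a WebHDFS LISTSTATUS URL for the destination directory."""
    if not hadoop_path.startswith("/"):
        raise ValueError("Hadoop path must be absolute (start with '/')")
    # user.name is the WebHDFS parameter for simple (non-Kerberos) auth.
    query = urlencode({"op": "LISTSTATUS", "user.name": username})
    # Keep "/" separators but percent-encode any other unsafe characters.
    encoded_path = quote(hadoop_path, safe="/")
    return f"http://{hadoop_ip}:{hadoop_port}/webhdfs/v1{encoded_path}?{query}"
```

Opening the resulting URL with any HTTP client and receiving a JSON `FileStatuses` response suggests that the IP address, port, username, and path are consistent with each other. A Kerberos-enabled cluster would require an authenticated client instead.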