000034731 - Unable to locate failed or bad physical drive in a DAC in RSA NetWintess

Document created by RSA Customer Support Employee on Feb 6, 2017Last modified by RSA Customer Support Employee on Apr 21, 2017
Version 2Show Document
  • View in full screen mode

Article Content

Article Number000034731
Applies ToRSA Product Set: Security Analytics
RSA Product/Service Type: SA Archiver, SA Concentrator, SA Packet Decoder, SA Log Decoder, 
RSA Version/Condition: 10.4.x, 10.5.x, 10.6.x
 
IssueIn certain circumstances, failed or bad physical disk drives in an appliance chassis or DAC will not respond to MegaCli64 commands.  
This means that if a drive is failed or bad, you cannot use the MegaCli64 command to flash the LED on the disk drive to identify it for removal.
 
TasksThe tasks required for these steps are:
  1. Logon to the appliance via ssh as root.
  2. Run the nwraidutil.pl script to check the drive status.
  3. Run the specified MegaCli64 commands to identify the suspect drive.
  4. Replace the suspect drive.
  5. Run the nwraidutil.pl script to verify the drive status.
  6. Run the specified MegaCli64 commands stop the LED flashing on all drives. 
  7. Logout of the ssh session when done.
ResolutionTo locate a specific drive that is non-responsive to MegaCli64 flash commands, flash the drive on either side, or to one side of the drive in question.
Example of Bad/Failed Disk 
Adapter 1 (PERC H830 Adapter) enclosure 29 slots found: 15
Encl  Slot  State     P.Fail.Count  Raw Size       Inquiry Data
29     0    (O)       0             2.728 TB       YS18      SEAGATE ST330006CLAR3000YS18Z298AB01
29     1    (B)       0             0 KB           YS18      SEAGATE ST330006CLAR3000YS18Z298AB03
29     2    (O)       0             2.728 TB       YS18      SEAGATE ST330006CLAR3000YS18Z298AC04

In the example above, the drive on RAID Adapter 1, Enclosure 29, Slot 2 is bad and does not respond to flash commands.  
If you are not sure which DAC is Enclosure 29, try to flash Enclosure 29, Slot 1 and Enclosure 29, Slot 3 with the commands below.
/opt/MegaRAID/MegaCli/MegaCli64 -PdLocate -start -physdrv[29:1] -a1
/opt/MegaRAID/MegaCli/MegaCli64 -PdLocate -start -physdrv[29:3] -a1

These two commands start the LEDs flashing on the drives to the right and left of the bad drive thus indicating the location of the bad drive.
After replacing the drive, confirm the status of the drive by flashing it, and by running the nwraidutil.pl script to verify that the drive is in an optimal state and rebuilding. 
Example of Rebuilding or Optimal Drive After Replacing Failed Drive
Adapter 1 (PERC H830 Adapter) enclosure 29 slots found: 15
Encl  Slot  State     P.Fail.Count  Raw Size       Inquiry Data
29     1    (R)       0             2.728 TB       YS18    SEAGATE ST330006CLAR3000YS18Z298QQQ1
       ------- OR -------
Adapter 1 (PERC H830 Adapter) enclosure 29 slots found: 15
Encl  Slot  State     P.Fail.Count  Raw Size       Inquiry Data
29     1    (O)       0             2.728 TB       YS18    SEAGATE ST330006CLAR3000YS18Z298QQQ1

Once the drive has been identified and replaced, disable the flashing LEDs with the following command:
/opt/MegaRAID/MegaCli/MegaCli64 -PdLocate -stop -physdrv[29:1] -a1
/opt/MegaRAID/MegaCli/MegaCli64 -PdLocate -stop -physdrv[29:3] -a1

 

Attachments

    Outcomes