000014609 - Error message 'EDAC i5000 MC0: NON-FATAL ERRORS Found!!!' reported on RSA NetWitness appliances

Document created by RSA Customer Support Employee on Jan 11, 2017Last modified by RSA Customer Support Employee on Apr 21, 2017
Version 2Show Document
  • View in full screen mode

Article Content

Article Number000014609
Applies ToRSA NetWitness NextGen
RSA NetWitness Decoder
RSA NetWitness Log Decoder
RSA NetWitness Concentrator
RSA NetWitness Hybrid
RSA NetWitness Broker
IssueError message "EDAC i5000 MC0: NON-FATAL ERRORS Found!!!" reported on RSA NetWitness appliances.

When viewing /var/log/messages on your RSA NetWitness appliance, you see an excessive amount of the following error messages:


 


Example 1:



Dec 01 13:23:01 NWAPPLIANCEXYZ kernel: EDAC i5000 MC0: NON-FATAL ERRORS Found!!! 1st NON-FATAL Err Reg= 0x200000
Dec 01 13:23:01 NWAPPLIANCEXYZ kernel: EDAC i5000: THERMAL Error, bits= 0x200000
Dec 01 13:23:02 NWAPPLIANCEXYZ kernel: EDAC i5000 MC0: NON-FATAL ERRORS Found!!! 1st NON-FATAL Err Reg= 0x200000
Dec 01 13:23:02 NWAPPLIANCEXYZ kernel: EDAC i5000: THERMAL Error, bits= 0x200000
Dec 01 13:23:03 NWAPPLIANCEXYZ kernel: EDAC i5000 MC0: NON-FATAL ERRORS Found!!! 1st NON-FATAL Err Reg= 0x80000
Dec 01 13:23:03 NWAPPLIANCEXYZ kernel: EDAC i5000: THERMAL Error, bits= 0x80000
Dec 01 13:23:04 NWAPPLIANCEXYZ kernel: EDAC i5000 MC0: NON-FATAL ERRORS Found!!! 1st NON-FATAL Err Reg= 0x200000
Dec 01 13:23:04 NWAPPLIANCEXYZ kernel: EDAC i5000: THERMAL Error, bits= 0x200000



 


Example 2:



Sep 14 06:57:24 NWAPPLIANCE15925 kernel: EDAC MC0: UE page 0x0, offset 0x0, grain 1073741824, row 3, labels ":": i3200 UE
Sep 14 06:57:25 NWAPPLIANCE15925 kernel: EDAC MC0: UE page 0x0, offset 0x0, grain 1073741824, row 2, labels ":":i3200 UE 
Sep 14 06:57:27 NWAPPLIANCE15925 kernel: EDAC MC0: UE page 0x0, offset 0x0, grain 1073741824, row 2, labels ":": i3200 UE
Sep 14 06:57:27 NWAPPLIANCE15925 kernel: EDAC MC0: UE page 0x0, offset 0x0, grain 1073741824, row 6, labels ":": i3200 UE


CauseThe cause of this error is the BIOS limiting the voltage to the RAM to prevent overheating.
Resolution

 If you are seeing an excessive amount of this error message and wish to supress it, follow the instructions below.


 


There can be several different types of edac. the way to determine which kernel module to unload look carefully at the warning message. In the 1st example above you see i5000 in the error messages. In the 2nd example above you can see the i3200 in the error message. This is how you determine which module to unload.


You can look to see what modules are loaded by running the following command. This will list out the loaded kernel modules.


# lsmod


As root unload the driver:


For Fedora 9/CentOS:


  Example 1:


    # rmmod i5000_edac 


  Example 2:


    # rmmod i3200_edac


 


To ensure that the driver is not loaded again on boot, you can add the following line to the blacklist file located in /etc/modprobe.d/blacklist:


# vi /etc/modprobe.d/blacklist


Example 1:


~ blacklist i5000_edac


Example 2:


~ blacklist i3200_edac

Legacy Article IDa58592

Attachments

    Outcomes