- Slot sensors, sometimes called optical fork sensors because of their 'forked' shape, detect objects that pass between the two arms—one with the emitter, the other with the receiver. The fixed slot width provides reliable opposed-mode sensing of objects as small as 0.30 mm. If you do not see what you are looking for, Ask an Expert.
- Drive 0 - Raid Drive 1 - Raid Drive 2 - Hotspare Drive 2 drive slot sensor drive fault detected Drive 0 drive slot sensor drive fault detected After Reboot and Scan with the Megaraid-Administration Tool all three Drives where detected as ready Drives and the System seems to be work. At this time i have no response from the Dell support.
Sun Fire X4150, X4250, and X4450 Servers Diagnostics Guide |
The Drive StorageVolumeElementName has been disabled due to a detected fault. (Drive 1) 806f010d-0402xxxx The Drive StorageVolumeElementName has been disabled due to a detected fault. (Drive 2) 806f010d-0403xxxx The Drive StorageVolumeElementName has been disabled due to a detected fault. (Drive 3) 806f010d-0404xxxx.
C H A P T E R 2 |
Troubleshooting DIMM Problems |
This chapter describes how to detect and correct problems with the server’s Dual Inline Memory Modules (DIMMs). It includes the following sections:
Note - Refer to the service manual or service label for the system that you are servicing for information on DIMM population rules. |
DIMM Replacement Guidelines
Replace a DIMM when one of the following events takes place:
- The DIMM fails memory testing under BIOS due to Uncorrectable Memory Errors (UCEs).
- UCEs occur and investigation shows that the errors originated from memory.
- More than 24 Correctable Errors (CEs) originate in 24 hours from a single DIMM and no other DIMM is showing further CEs.
Note - If more than one DIMM has experienced multiple CEs, other possible causes of CEs must be ruled out by a qualified Sun Support specialist before replacing any DIMMs. |
Retain copies of the logs showing the memory errors to send to Sun for verification prior to calling Sun.
How DIMM Errors Are Handled by the System
This section describes the following topics:
Uncorrectable DIMM Errors
For all operating systems, the behavior is the same for uncorrectable errors (UCEs):
Slot Sensor Drive Fault Detected In Car
1. When a UCE occurs, the memory controller causes an immediate reboot of the system.
2. During reboot, the BIOS checks the Machine Check registers and determines that the previous reboot was due to a UCE.
The uncorrectable ECC error is displayed in the service processor’s system event log (SEL) as shown here:
Memory | Uncorrectable ECC | Asserted | DIMM A0
Correctable DIMM Errors
If a DIMM has 24 or more correctable errors (CE)s in 24 hours, it is considered defective and should be replaced.
CEs will be captured in the SEL and light the fault LED after 24 single bit errors are detected in 24 hours. They are reported or handled in the supported operating systems as follows:
- Windows server:
a. A Machine Check error-message bubble appears on the task bar.
b. Open the Event Viewer to view errors.
Access the Event Viewer through this menu path:
Start-->Administration Tools-->Event Viewer
Slot Sensor Drive Fault Detected Free
c. View individual errors (by time) to see the details of the error.
- Solaris:
Solaris FMA reports and sometimes retires memory with correctable Error Correction Code (ECC) errors. See your Solaris documentation for details.
To view ECC errors, use the following command:
fmdump -eV
DIMM Fault LEDs
When you press the Remind button on the motherboard (or memory tray for x4450), the LEDs next to the DIMMs flash to indicate that the system has detected 24 or more CEs in a 24-hour period on that DIMM.
- DIMM fault LED is off: The DIMM is operating properly.
- DIMM fault LED is flashing (amber): At least one of the DIMMs in this DIMM pair has reported 24 CEs within a 24-hour period or a UE (uncorrectable error).
See FIGURE 2-1 and FIGURE 2-2 for the locations of the Remind button and LEDs on the motherboard.
FIGURE 2-1 DIMMs and LEDs on Motherboard (X4150 and X4250)
FIGURE 2-2 .DIMMs and LEDs on Mezzanine (x4450)
Isolating and Correcting DIMM ECC Errors
If your log files report an Error Correction Code (ECC) error or a problem with a DIMM, complete the following steps until you can isolate the fault.
In this example, the log file reports an error with the DIMM in D0. The fault LED on DIMM D0 is on.
To isolate and correct DIMM ECC errors:
1. If you have not already done so, shut down your server to standby power mode and remove the cover.
2. Inspect the installed DIMMs to ensure that they comply with the DIMM population rules in your product service manual.
3. Press the Remind button and inspect the DIMM fault LEDs. See FIGURE 2-1 and FIGURE 2-2.
For CEs and UCEs, a flashing LED identifies the DIMM where the error is located.
4. Disconnect the AC power cords from the server.
Caution - Before handling components, attach an antistatic wrist strap to a chassis ground (any unpainted metal surface). The system’s printed circuit boards and hard disk drives contain components that are extremely sensitive to static electricity. |
Note - To recover fault information, look in the SP SEL, as described in the Sun Integrated Lights Out Manager 2.0 User's Guide. |
5. Remove the DIMMs from the DIMM slots.
Refer to your server’s service manual for details.
6. Visually inspect the DIMMs for physical damage, dust, or any other contamination on the connector or circuits.
7. Visually inspect the DIMM slot for physical damage. Look for cracked or broken plastic on the slot.
8. Dust off the DIMMs, clean the contacts, and install them.
Caution - Use only compressed air to dust DIMMs. |
9. If there is no obvious damage, replace any failed DIMMs.
For UCEs, if the LEDs indicate a fault with the pair, replace both DIMMs. Ensure that they are inserted correctly with ejector latches secured.
10. Reconnect AC power cords to the server.
11. Power on the server and run the diagnostics test again.
12. Review the log file.
If the tests identify the same error, the problem is in the CPU, not the DIMMs.
Sun Fire X4150, X4250, and X4450 Servers Diagnostics Guide | 820-4213-11 |
Copyright © 2009 Sun Microsystems, Inc. All rights reserved.
Drive 4 drive slot sensor drive fault detected?

Drive 4 drive slot sensor drive fault detected?
I have a Poweredge 6650 that has been running fine with three 72GB drives in a RAID-1 stripe set. It is only powered on when needed as it hosts several VM machines that are used to test and verify things. Anyway on boot up today it complained about a failed drive and just my luck I only created a RAID-1 stripe set so no redundancy.Anyway the data was backed up so not too big a deal. I pulled the failed drive and re-inserted it and it was recognised but showed 'Failure Predicted', so I brought it online. Through OpenManage I removed the old Virtual Disk and then created a new RAID-5 set (so only 136GB instead of 204GB). I then pulled the Failure Predicted drive and replaced it with a spare.
It is now rebuilding (taking ages..) but there is an error showing that appeared 25-seconds after the replacement was installed:
CODE
Thu Aug 21 13:23:26 2008 Drive 4 drive slot sensor drive ok
I find it hard to believe the replacement drive is also faulty so is it possibly something else? I am not sure what the 'drive slot sensor drive fault detected' error actually means?
Cheers
Andy
Comments are closed.