RAID 5 Two Drives Failed
A RAID 5 array with two failed drives is not automatically unrecoverable. The second failure is often a statistical consequence of URE math during rebuild, an SMR timeout ejection, or a controller firmware mismatch rather than true mechanical death. We recover double-degraded RAID 5 arrays by imaging every member offline and assembling the volume virtually on PC-3000 RAID Edition. The original chassis is never written to. Free evaluation. No data recovered means no charge.

What Happens When Two Drives Fail in a RAID 5 Array?
The controller stops servicing reads and writes because it cannot satisfy either operation. For a read, it needs all data blocks or parity plus all but one data block to XOR the missing piece; with two missing blocks, the math has two unknowns and one equation. For a write, it cannot update parity without reading the old data and old parity, which is now impossible because two members are unreadable.
The critical distinction is that the array is inaccessible, not destroyed. Every block that was readable before the second failure is still on the platters. The problem is geometric: the controller no longer knows which blocks form complete stripes, and even if it did, it lacks the parity data to fill the gaps. Recovery means extracting those blocks and reconstructing the stripe layout without the controller.
Not all dual failures are equal. If the two drives failed at different times, the older failure may contain data that was current when it dropped out. The overlap window between the first failure and the second failure determines how much data is fully reconstructible versus how much spans stripes with two permanently missing blocks.
Why the Second Failure Was Statistically Likely
Hard drives are sold with a URE specification, also called the bit error rate. Consumer SATA drives such as the WD Blue, Seagate Barracuda, and Toshiba P300 spec one URE per 10^14 bits read, which works out to about one unreadable sector per 12.5 TB of sequential reads. Enterprise drives such as the WD Ultrastar and Seagate Exos spec one URE per 10^15 bits, ten times better, or roughly one unreadable sector per 125 TB.
During a RAID 5 rebuild the controller must read every sector of every surviving member in order to XOR them together and reconstruct the missing data. A four-member RAID 5 with 8 TB drives, after losing one member, has three surviving members of 8 TB each. The controller must read 24 TB sequentially under sustained load. That is 1.92 times the URE budget of a single consumer drive. The expected number of UREs across the full rebuild pass is approximately 1.92 on consumer media and 0.19 on enterprise media.
When the controller hits a sector it cannot read, it cannot compute the missing XOR for that stripe. Different controllers handle this differently. Some halt the rebuild entirely and mark the array failed. Others write garbage parity for the affected stripe and continue, which produces a rebuild that completes but leaves silent corruption. Either outcome converts a single-fault degraded array into a double-fault offline array.
This is why we do not attempt live rebuilds on degraded arrays. We image each member offline through PC-3000 Express or DeepSpar Disk Imager with adaptive retry settings. If a sector fails on the first pass, the imager retries with adjusted read parameters. Sectors that remain unreadable after exhaustive retries are flagged, and the missing data for those stripes is reconstructed from parity during offline assembly rather than during a live rebuild under controller timeout pressure.
When the "Second Failure" Is Actually an SMR Timeout
Drive-managed Shingled Magnetic Recording (SMR) drives use a small persistent conventional-recording (CMR) cache zone for incoming writes, then reorganize that data onto overlapping shingled tracks during idle periods. A RAID rebuild is the opposite of idle. It forces continuous sequential writes to the replacement drive while the surviving members are read at sustained sequential rates.
Once the CMR cache fills, the SMR drive must pause to flush its accumulated writes into the shingled zones. That flush can stall the drive for several seconds while tracks are rewritten in band order. Hardware RAID controllers expect responses inside a Time-Limited Error Recovery (TLER) or Error Recovery Control (ERC) window that defaults to 7 to 14 seconds on enterprise cards, sometimes as low as 7 seconds on consumer cards. When the SMR pause exceeds that budget, the controller interprets the silence as drive death and drops the SMR member from the array.
Specific models known to ship as drive-managed SMR include the WD Red EFAX series (2 TB through 6 TB capacities), the Seagate Barracuda ST2000DM008 and ST4000DM004, and the Toshiba L200 and P300 families. None of these belong inside a parity RAID array. If one was placed as the replacement during a rebuild, the rebuild does not just slow down; it actively converts the array from single-fault degraded to double-fault failed.
The ejected SMR drive is not mechanically failed. When removed from the controller and connected to a direct SATA port or to PC-3000 Express, it reads normally. The array is recoverable by imaging the ejected SMR member at its own pace, repairing the mechanically failed first member if needed, and assembling the array virtually.
Distinguishing a True Dual Failure from a Timeout Ejection
| Symptom | Timeout Ejection | True Mechanical Failure |
|---|---|---|
| Spins up on power | Yes, normal speed | No, or spins then stops (stuck spindle) |
| SMART self-test | Passes | Fails or timeouts |
| Direct SATA read | Reads normally | Extensive bad sectors or no response |
| Controller status | Unconfigured-Bad or Offline | Failed or Missing |
| Next power cycle | Often returns to Unconfigured-Good | Same failure, no change |
| Recovery path | Logical imaging, no mechanical repair | Head swap, platter work, or donor transplant |
Hex-Level Disk Ordering Reconstruction
RAID metadata lives in controller-specific structures: LSI MegaRAID and Dell PERC store SNIA Disk Data Format (DDF) metadata in the trailing sectors of each member, typically a 512 MB reserved area at the end of the drive. HP Smart Array writes its proprietary RAID Information Sectors (RIS) at the beginning of each member. Linux mdadm v1.2 superblocks sit at the 4 KiB offset. When these structures are intact, we read stripe size, parity rotation, and member order directly.
When metadata has been overwritten by prior recovery attempts or controller auto-initialization, we fall back to hex analysis. The filesystem that sat on the array left signatures at known offsets. EXT4 superblock copies carry the magic signature 0x53EF at offset 0x400 from the start of each block group. XFS allocation group headers begin with 0x58465342 (XFSB). NTFS boot sectors start with NTFS at LBA 0 of the volume.
In a correctly ordered array, these signatures appear at the same logical offset across member boundaries because the stripe size places each filesystem block on the member where it belongs. In a misordered array, the signatures are scattered or absent because each member contains the wrong subset of stripes. By rotating the member order virtually and testing which permutation produces coherent filesystem headers at the expected offsets, we determine the correct assembly without relying on surviving superblocks.
Missing Parity Block Recovery via PC-3000 Virtual Assembly
RAID 5 distributes parity across all members in a rotating pattern. For any stripe, XORing all data blocks and the parity block produces zero. When one member is missing, XORing the remaining blocks reproduces the missing one. When two members are missing, the equation has two unknowns and cannot be solved algebraically.
However, if the two failures occurred at different times, the member that failed first may still contain parity blocks that were current at the moment it dropped out. The second failure happened later, after the array had continued writing with one member missing. The parity on the first-failed drive matches the data state from before the second failure for stripes that were not written after the first drive dropped.
PC-3000 RAID Edition loads the cloned images and assembles the virtual array using detected or captured metadata. For each stripe, it checks how many blocks are readable. Stripes with zero or one missing block are fully reconstructible. Stripes with two missing blocks are flagged. We then use filesystem-level analysis to determine which files span only reconstructible stripes versus which files touch permanently lost stripes. Priority data (databases, virtual machines, shared folders) is verified first.
This approach only works because we assemble the array virtually from cloned images. The original drives are never written to. The reconstruction happens in RAM, stripe by stripe, with no controller timeouts and no stress on marginal media.
Why Competitors Claim Two-Drive-Failure RAID 5 Is Unrecoverable
DiskInternals recommends connecting failing disks to a local workstation and using software to rebuild the array. This ignores URE physics, controller timeout drops, and the difference between a logical disk drop and a mechanical head crash. Running software reconstruction against a double-degraded array without imaging first risks further stress on marginal drives and produces incomplete or corrupted output.
Secure Data Recovery states that RAID 5 distributes data across multiple drives along with parity information. Parity information removes the need for a dedicated drive. This is technically true at the RAID level but provides no guidance for a sysadmin facing two simultaneous failures. It does not explain parity rotation, stripe size detection, or virtual assembly.
Ontrack claims that RAID 5 data recovery is possible in most cases... provided the failed drive is replaced quickly. This framing conflates single-fault recovery with dual-fault recovery and implies that speed of replacement is the deciding factor. The deciding factor is whether the two failures occurred at different times and whether at least one of the failed members is still mechanically readable.
The reality is that double-degraded RAID 5 recovery is a forensic exercise, not a software wizard. It requires imaging every member, parsing RAID metadata or reconstructing it from hex signatures, and performing stripe-by-stripe parity analysis on cloned images. PC-3000 RAID Edition and UFS Explorer Professional are the tools that perform this work, not generic RAID recovery software running against live drives.
Commands That Destroy Double-Degraded Arrays
If your RAID 5 has two failed drives: power down the chassis and stop. The commands below are the ones most often recommended on forums for recovering a double-degraded array. Every one of them writes to the member drives and forecloses on a clean forensic recovery.
megacli -PDMakeGood -PhysDrv [E:S] -aALLWhat it does: changes the DDF state of an Unconfigured-Bad drive to Unconfigured-Good and frequently triggers an immediate background initialization. Why it destroys data: the initialization overwrites the existing metadata that records which stripes belong to which array.MegaCli -CfgForeign -Clear -aALLWhat it does: tells the LSI controller to discard the Foreign Configuration metadata it found on the drives. Why it destroys data: the array geometry is in that metadata. Clearing it leaves the drives with valid user data but no record of how to assemble it.mdadm --create --assume-clean --level=5 --raid-devices=N ...What it does: creates a new mdadm superblock on every member and assumes parity is already consistent. Why it destroys data: the v1.2 superblock at offset 4 KiB is rewritten with new UUIDs and a new event count; the array geometry from the original create call is lost unless it happens to be identical, and silent corruption follows on the next write.- Synology DSM Storage Manager Repair button on a crashed volumeWhat it does: runs a Synology-authored script that calls mdadm and lvm with parameters intended to bring the array back online. Why it destroys data: the script can overwrite md superblocks and LVM metadata on partition 3 of the surviving members. Read-only inspection on a separate Linux workstation is the safe alternative.
- Force Online or Make Optimal in LSI or PERC BIOS menusWhat it does: overrides the controller's decision that the array is offline. Why it destroys data: writes pending in the cache flush to the drives even though parity and data are inconsistent.
Our Image-First, No Live Rebuild Process
- Power down immediately. Do not retry the rebuild, do not click Repair, and do not run any controller commands. Every additional power-on cycle increases the risk of head crash on marginal drives.
- Free evaluation and documentation. Record the controller model, RAID level, member count, filesystem (ext4, XFS, Btrfs, ZFS, NTFS, VMFS), and every prior rebuild or repair attempt and the commands run. This step is free.
- Label every drive bay. Each drive is marked with its physical slot number before removal and bagged individually. Slot order is required to validate stripe layout during virtual assembly.
- Capture RAID metadata from each member. Metadata location varies by controller family: LSI MegaRAID and Dell PERC store DDF in the trailing sectors; HPE SmartArray writes RIS at the beginning of the drive; Linux mdadm v1.2 superblock sits at offset 4 KiB. Metadata capture runs against cloned images, not the originals.
- Write-blocked forensic imaging. Each member is connected through a hardware write-blocker to PC-3000 Express or DeepSpar Disk Imager. Adaptive retry and head-map analysis pull marginal sectors that the controller had given up on inside its TLER window. Mechanical members receive donor head transplants on the 0.02 micron ULPA-filtered laminar-flow clean bench before imaging.
- Offline virtual assembly. PC-3000 RAID Edition loads the cloned images and assembles the array virtually using the captured metadata. The stripe size, parity rotation, and member order are read from the on-disk metadata or determined by hex analysis if metadata is damaged.
- Parity recalculation and filesystem extraction. Stripes with missing data are reconstructed from parity. The assembled volume is mounted read-only. R-Studio and UFS Explorer handle filesystem-level recovery if the filesystem itself sustained damage.
- Delivery and secure purge. Recovered data is copied to your target media. After you confirm receipt, working copies are securely purged on request.
How Much Does RAID 5 Two-Drive-Failure Recovery Cost?
Per-Member Imaging
- Logical or firmware-level issues: $250 to $900 per drive. Covers filesystem corruption on the array, firmware module damage that prevents normal reads, and SMART threshold failures.
- Mechanical failures (head swap, motor seizure): $1,200 to $1,500 per drive with a 50% deposit. Donor parts are consumed during the transplant. Head swaps are performed on a validated laminar-flow clean bench before write-blocked cloning.
- Timeout-ejected SMR or desktop drives: $250 per drive. The drive is mechanically healthy and images cleanly once removed from the controller.
Array Reconstruction
- $400 to $800 depending on member count, filesystem type, and whether RAID parameters must be detected from raw data versus captured from surviving DDF or mdadm superblocks.
- PC-3000 RAID Edition performs parameter detection and virtual assembly from cloned member images. R-Studio and UFS Explorer handle filesystem-level extraction after reconstruction.
No Data = No Charge: if we recover nothing from your array, you owe $0. Free evaluation, no obligation.
Example: a four-member array with one mechanically failed member and one timeout-ejected member costs approximately $1,200 (head swap) + $250 (logical imaging) + $400-$800 (reconstruction) = $1,850 to $2,250.
+$100 rush fee to move to the front of the queue. Full HDD pricing is published at our HDD recovery service page.
RAID 5 Two Drives Failed Recovery Questions
Is a RAID 5 with two failed drives unrecoverable?
Why did the second drive fail during the RAID 5 rebuild?
Can an SMR drive look like a failed drive when it is actually healthy?
How do you tell the difference between a timeout ejection and a real drive failure?
What is virtual assembly in RAID 5 recovery?
What is hex-level disk ordering reconstruction?
How do you recover missing parity blocks?
Should I force the failed drives online in the controller BIOS?
How much does RAID 5 two-drive-failure recovery cost?
How long does RAID 5 two-drive-failure recovery take?
Data Recovery Standards & Verification
Our Austin lab operates on a transparency-first model. We use industry-standard recovery tools, including PC-3000 and DeepSpar, combined with strict environmental controls to make sure your hard drive is handled safely and properly. This approach allows us to serve clients nationwide with consistent technical standards.
Open-drive work is performed in a ULPA-filtered laminar-flow bench, validated to 0.02 µm particle count, verified using TSI P-Trak instrumentation.
Transparent History
Serving clients nationwide via mail-in service since 2008. Our lead engineer holds PC-3000 and HEX Akademia certifications for hard drive firmware repair and mechanical recovery.
Media Coverage
Our repair work has been covered by The Wall Street Journal and Business Insider, with CBC News reporting on our pricing transparency. Louis Rossmann has testified in Right to Repair hearings in multiple states and founded the Repair Preservation Group.
Aligned Incentives
Our "No Data, No Charge" policy means we assume the risk of the recovery attempt, not the client.
Technical Oversight
Louis Rossmann
Louis Rossmann's well trained staff review our lab protocols to ensure technical accuracy and honest service. Since 2008, his focus has been on clear technical communication and accurate diagnostics rather than sales-driven explanations.
We believe in proving standards rather than just stating them. We use TSI P-Trak instrumentation to verify that clean-air benchmarks are met before any drive is opened.
See our clean bench validation data and particle test videoTwo drives failed. Power down before doing anything else.
Free evaluation. No data = no charge. Mail-in from anywhere in the U.S.