What Is NAND Degradation?
NAND degradation is the gradual physical wearing of the memory cells inside your SSD. Every time data is written and erased, the insulating layer inside each cell gets thinner. Once that layer is too damaged, the cell can no longer reliably store data. The SSD's built-in error correction compensates for a while, but once too many cells degrade past the correction threshold, the drive starts losing data or stops working.
This is a normal end-of-life process for all SSDs. The drive's rated lifespan (expressed as TBW, or terabytes written) is the manufacturer's estimate of how much data can be written before degradation causes failures. Heavy workloads, high operating temperatures, and frequent small writes accelerate the process.
Consumer recovery software cannot help once NAND degradation reaches the point where the controller rejects reads. The controller returns I/O errors to the operating system, and no software running on top of that OS can override the controller's decision. Professional recovery tools like PC-3000 SSD communicate with the controller through vendor-specific diagnostic channels, adjusting how the controller reads the degraded cells.
How Do You Know Your SSD's NAND Is Failing?
NAND degradation produces specific, observable symptoms before total failure. Recognizing these early gives you the best chance of a complete recovery.
- ●Read-only mode. The SSD switches itself to read-only to prevent further writes from destroying remaining data. Files are visible but you cannot save, delete, or modify anything. This is a firmware-level protection triggered when the reserved block pool is depleted.
- ●Intermittent slowdowns. The drive pauses for seconds or minutes during read operations as the controller retries failed NAND pages. SMART attribute 1 (Raw Read Error Rate) or attribute 187 (Uncorrectable Error Count) spikes.
- ●File corruption without warning. Files open but contain garbled data, truncated images, or zero-filled blocks. The controller returned data from cells where the voltage state was misread due to degradation.
- ●BIOS detection, OS failure. The drive appears in BIOS with correct model and capacity, but the operating system cannot mount the file system. The firmware is functional, but too many NAND pages return uncorrectable errors for the file system to parse.
- ●SMART warnings. CrystalDiskInfo or smartmontools reports "Caution" or "Bad" health. Media Wearout Indicator (SMART 233) near zero, Percentage Lifetime Used above 95%, or Available Reserved Space depleted.
If the drive shows any of these symptoms, power it off. Continued read attempts accelerate garbage collection and can trigger block erases that destroy recoverable data.
How We Recover Data from Degraded NAND
NAND degradation recovery is a firmware-level process. The PC-3000 SSD module enters the controller's diagnostic mode and reads NAND pages with adjusted parameters that the controller would never use on its own. SSD recovery is board-level electronics work, not mechanical.
- Free evaluation. We assess the drive's SMART data, controller model, and failure mode. You receive a firm price quote before any recovery work begins.
- Controller identification and diagnostic entry. PC-3000 SSD identifies the controller family (Phison, Silicon Motion, Samsung, Marvell, Maxio, Realtek) and enters vendor-specific diagnostic mode to halt background operations.
- Baseline error assessment. The technician runs a full-surface read pass to map which NAND blocks are readable, marginal, and unreadable at default settings.
- Read retry calibration. For marginal and unreadable blocks, PC-3000 adjusts the read retry count and voltage reference thresholds. Each retry uses a slightly different voltage level to resolve ambiguous cell states.
- Multi-pass imaging. The drive is imaged across multiple passes, each with different read parameters. Sectors recovered in later passes fill gaps from earlier attempts.
- File system reconstruction. The composite image is assembled and the file system is parsed. You receive a file listing before final delivery.
If the controller itself is also damaged (dead, not detected in BIOS), board-level microsoldering repair is required before NAND reads can begin. On unencrypted drives where the controller cannot be repaired, chip-off extraction may be attempted as a last resort. Drives with hardware AES encryption require a functional controller to decrypt, so board repair is mandatory.
SSD Recovery Pricing
NAND degradation recovery is covered by our standard SSD recovery pricing tiers. Most degraded-NAND cases fall into the firmware recovery tier because they require PC-3000 low-level access with custom read parameters. SATA SSD recovery ranges from $200–$1,500. NVMe SSD recovery ranges from $200–$2,500.
Free evaluation, firm quote, no data = no charge. +$100 rush fee to move to the front of the queue. Tiers requiring donor drives include additional donor cost (A donor drive is a matching SSD used for its circuit board. Typical donor cost: $40–$100 for common models, $150–$300 for discontinued or rare controllers.).
Simple Copy
Low complexityYour drive works, you just need the data moved off it
$200
3-5 business days
Functional drive; data transfer to new media
Rush available: +$100
File System Recovery
Low complexityYour drive isn't showing up, but it's not physically damaged
From $250
2-4 weeks
File system corruption. Visible to recovery software but not to OS
Starting price; final depends on complexity
Circuit Board Repair
Medium complexityYour drive won't power on or has shorted components
$450–$600
3-6 weeks
PCB issues: failed voltage regulators, dead PMICs, shorted capacitors
May require a donor drive (additional cost)
Firmware Recovery
Medium complexityMost CommonYour drive is detected but shows the wrong name, wrong size, or no data
$600–$900
3-6 weeks
Firmware corruption: ROM, modules, or system files corrupted
Price depends on extent of bad areas in NAND
PCB / NAND Swap
High complexityYour drive's circuit board is severely damaged and requires NAND chip transplant to a donor PCB
$1,200–$1,500
4-8 weeks
NAND swap onto donor PCB. Precision microsoldering and BGA rework required
50% deposit required; donor drive cost additional
50% deposit required
Hardware Repair vs. Software Locks
Our "no data, no fee" policy applies to hardware recovery. We do not bill for unsuccessful physical repairs. If we replace a hard drive read/write head assembly or repair a liquid-damaged logic board to a bootable state, the hardware repair is complete and standard rates apply. If data remains inaccessible due to user-configured software locks, a forgotten passcode, or a remote wipe command, the physical repair is still billable. We cannot bypass user encryption or activation locks.
No data, no fee. Free evaluation and firm quote before any paid work. Full guarantee details. NAND swap requires a 50% deposit because donor parts are consumed in the attempt.
Rush fee: +$100 rush fee to move to the front of the queue.
Donor drives: A donor drive is a matching SSD used for its circuit board. Typical donor cost: $40–$100 for common models, $150–$300 for discontinued or rare controllers.
Target drive: The destination drive we copy recovered data onto. You can supply your own or we provide one at cost plus a small markup. All prices are plus applicable tax.
Estimate Your SSD Recovery Cost
Select your symptoms and drive type for a preliminary cost range. Final pricing comes after a free evaluation at our Austin, TX lab.
What type of SSD do you have?
This determines the recovery method and pricing.
Not sure which type you have? Call (512) 212-9111 and we can help identify it.
P/E Cycle Exhaustion: Tunnel Oxide Degradation in NAND Cells
Every program/erase cycle forces electrons through the tunnel oxide layer via Fowler-Nordheim tunneling. Each pass leaves behind trapped charge in the oxide and physically weakens the dielectric. After enough cycles, the oxide can no longer hold a consistent charge, and the threshold voltage distributions for each cell state widen until they overlap.
The endurance ceiling depends on NAND density. SLC NAND, storing 1 bit per cell with only two voltage states, tolerates the widest voltage margins and survives the most cycles. Each additional bit per cell halves the available voltage window between states.
| NAND Type | Bits per Cell | Voltage States | Typical P/E Endurance |
|---|---|---|---|
| SLC | 1 | 2 | 100,000 cycles |
| MLC | 2 | 4 | 3,000 to 10,000 cycles |
| TLC | 3 | 8 | 1,000 to 3,000 cycles |
| QLC | 4 | 16 | 100 to 1,000 cycles |
Consumer SSDs sold today use TLC or QLC NAND. A 1TB TLC drive rated at 600 TBW allows roughly 600 full-drive writes before the manufacturer expects degradation failures. Write amplification from garbage collection, wear leveling, and journal writes means the NAND sees 2x to 10x more physical writes than the host reports, depending on workload pattern and controller efficiency.
How Write Amplification Accelerates Wear
Write amplification is the ratio of physical NAND writes to logical host writes. A write amplification factor (WAF) of 3.0 means the NAND receives 3 bytes of physical writes for every 1 byte the host sends. The Flash Translation Layer (FTL) running on the controller manages this process through garbage collection, wear leveling, and metadata journaling.
- Garbage Collection
- NAND flash can only be erased in full blocks (typically 256 to 512 pages). When valid and invalid pages are mixed in a block, the controller must copy valid pages to a new block before erasing the old one. This internal copy-and-erase adds write cycles that the host never requested. Small, random writes produce the worst garbage collection overhead because they invalidate individual pages across many blocks.
- Wear Leveling
- The FTL distributes writes across all NAND blocks to prevent any single block from wearing out prematurely. Dynamic wear leveling moves data between frequently and infrequently written blocks. This spreads the P/E cycle count evenly, but the redistribution itself consumes additional erase cycles.
- FTL Journal Writes
- The controller maintains a mapping table that translates logical block addresses (LBAs) to physical NAND page locations. Updates to this map are journaled to NAND to survive power loss. On drives without a DRAM cache, the map is written more frequently to flash, adding P/E cycles. This is why DRAM-less SSDs (common in budget models) often degrade faster under sustained random write workloads.
The SMART attribute "Total LBAs Written" (SMART 241) tracks host writes. Comparing this to the NAND-level write count (sometimes exposed as "Total NAND Writes") reveals the actual WAF. A WAF above 5.0 indicates a workload pattern that is consuming NAND endurance at an accelerated rate.
What Is Read Disturb?
Read disturb is an unintended charge injection into NAND cells caused by read operations on neighboring cells in the same block. Every read applies a pass-through voltage to unselected word lines. Over millions of reads, this accumulated voltage slowly shifts the threshold voltage of unselected cells, eventually flipping bits.
The controller tracks read counts per block and triggers a background scrub (read-refresh) before the disturb accumulation reaches a dangerous level. The scrub reads all pages in the block, corrects any bit errors with ECC, erases the block, and rewrites the corrected data. This consumes one P/E cycle per scrub.
On drives with degraded NAND, the safety margin between the read disturb threshold and the ECC correction limit narrows. A block that could tolerate 500,000 reads on fresh NAND may fail after 100,000 reads on worn NAND. Surveillance systems, database servers, and caching layers that generate sustained read-heavy workloads trigger read disturb faster than consumer desktop usage.
Read Disturb During Recovery Attempts
Running consumer recovery software on a drive with marginal NAND compounds the problem. Each scan pass adds read disturb to every block it touches. A full-surface scan of a 1TB drive reads every page, applying pass-through voltage to every word line in every block. If the drive is already near its disturb threshold, the recovery scan itself can push cells past the point of no return. This is why the first step in professional recovery is to halt background controller operations and image using controlled, selective reads rather than brute-force full-surface scans.
Data Retention Failure in Powered-Off SSDs
NAND flash stores data as trapped electrons. Without power, those electrons slowly leak through the tunnel oxide via quantum mechanical tunneling. The rate of leakage follows the Arrhenius equation: it doubles for every 10 degrees Celsius increase in storage temperature. On degraded NAND where the oxide is already thinned from P/E cycling, leakage accelerates further.
JEDEC standard JESD218A defines retention requirements. A consumer SSD at end-of-life should retain data for 52 weeks at 30 degrees Celsius. Enterprise SSDs are rated for 3 months at 40 degrees. These are minimum specifications for new drives at their rated endurance limit. A drive that has exceeded its rated P/E cycles will fall short of these numbers.
| NAND Type | Retention at 30°C | Retention at 40°C | Recovery Difficulty |
|---|---|---|---|
| SLC (end-of-life) | Years | 12+ months | Low; wide voltage margins |
| MLC (end-of-life) | 12 months | 6 months | Moderate |
| TLC (end-of-life) | 52 weeks | ~26 weeks | High; 8 states compressed |
| QLC (end-of-life) | 26+ weeks | ~12 weeks | Very high; 16 states, minimal margins |
For recovery, retention-failed drives are candidates for thermal stabilization. Controlled cooling can temporarily slow electron leakage and raise apparent threshold voltages back into a readable range while the technician images the drive through PC-3000.
What Happens When ECC Correction Capacity Is Exceeded?
Every SSD controller runs an error correction algorithm on each NAND page read. Modern controllers use LDPC (Low-Density Parity-Check) codes, which correct more errors than the older BCH codes used in pre-2016 drives. LDPC operates in two modes: hard-decision decoding (fast, limited correction) and soft-decision decoding (slower, reads each cell at multiple voltage levels for higher accuracy).
The raw bit error rate (RBER) of NAND increases as the cells degrade. On worn TLC NAND, the RBER climbs as the tunnel oxide thins, and the voltage distributions for each cell state overlap more with each P/E cycle. JEDEC mandates an uncorrectable bit error rate (UBER) of 10⁻¹⁵ or better for consumer SSDs. When the RBER exceeds the LDPC correction ceiling required to maintain that UBER, the page is flagged as uncorrectable. The controller retries the read using different internal voltage offsets, but these retries use the controller's own default retry tables, which are conservative.
PC-3000 SSD goes further. It allows the technician to set custom retry tables with voltage offsets outside the controller's default range, testing additional voltage levels against marginal pages. This is the difference between a consumer drive that declares a page "unrecoverable" and a professional tool that finds a voltage window where the page resolves.
SMART Attributes That Indicate NAND Degradation
SMART monitoring provides early warning of NAND degradation. Not all controllers expose the same attributes, and interpretation varies by manufacturer. The following table covers the attributes most relevant to degradation assessment.
| SMART ID | Attribute | Concern Threshold | What It Means |
|---|---|---|---|
| 5 | Reallocated Sector Count | Any non-zero value | NAND blocks retired to the spare pool. Rising count means active degradation. |
| 170 | Available Reserved Space | Below 10% | Spare block pool nearly exhausted. No room to remap further failures. |
| 173 | SSD Wear Leveling Count | Vendor-specific | Average P/E cycle count across all blocks. Compare to rated endurance. |
| 187 | Uncorrectable Error Count | Any non-zero value | Errors that exceeded the controller's ECC capacity. Direct evidence of degradation past the correction limit. |
| 202 | Percentage Lifetime Used | Above 90% | Counts up from 0 to 100. Values above 90% indicate the tunnel oxide is near end-of-life. |
| 233 | Media Wearout Indicator | Below 10 | Counts down from 100 to 0. Near-zero values mean the NAND has consumed its rated endurance. |
| 241 | Total LBAs Written | Compare to TBW rating | Total host writes. If approaching or exceeding the manufacturer's TBW rating, expect degradation. |
SMART data is a guide, not a guarantee. Some drives fail from firmware bugs or power events with perfect SMART readings. Others exceed their rated TBW by 2x with no issues. SMART helps the lab estimate how much read retry tuning the recovery will require.
PC-3000 SSD Recovery Workflow for Degraded NAND
The PC-3000 SSD module provides controller-specific access to the internal firmware command set. For degraded NAND, the critical capabilities are read retry table manipulation and direct NAND page addressing. The workflow varies by controller family, but the core approach is consistent across Phison, Silicon Motion, Samsung, and Marvell platforms.
- Halt background operations. PC-3000 sends vendor-specific commands to disable garbage collection, wear leveling, and TRIM execution. On Phison controllers, this is done through the vendor-specific SATA/NVMe command set that enters "vendor mode." Silicon Motion controllers use a separate "ISP mode" entry. This prevents the controller from erasing blocks or rewriting the FTL during imaging.
- Read retry table expansion. The controller's default read retry table contains a fixed set of voltage offsets it applies when a page read fails ECC. PC-3000 replaces this table with an expanded set that tests more voltage levels across a wider range than the controller would attempt on its own.
- Soft-decision read activation. For controllers that support it, PC-3000 forces the LDPC decoder into soft-decision mode, where each cell is read at 3-7 voltage levels instead of a single threshold. The probability distribution of each bit state feeds the LDPC decoder, which achieves higher correction rates than hard-decision reads. This is the same technique the controller uses internally, but PC-3000 makes the read voltages configurable.
- Block-by-block imaging. Rather than a sequential full-surface read, PC-3000 images blocks categorized by their error rate. Low-error blocks image first (fastest, highest yield). Marginal blocks are imaged with progressively more aggressive retry settings. Unreadable blocks are flagged for thermal-assisted reads or skipped entirely if no voltage window resolves them.
- Composite image assembly. Sectors recovered across all passes and parameter sets are merged into a single image. Cross-references between the FTL map and physical NAND addressing resolve logical-to-physical mapping for any sectors read outside the normal controller pipeline.
When degraded NAND is compounded by thermal sensitivity, the workflow integrates with thermal stabilization techniques. The technician applies controlled temperature changes to the NAND packages using hot air rework equipment (Atten 862) while monitoring sector error rates through PC-3000, imaging at the temperature that produces the lowest RBER for each block.
FTL Metadata Corruption from NAND Degradation
The Flash Translation Layer (FTL) is the firmware mapping table that translates logical block addresses (how the operating system sees files) to physical NAND page locations (where the electrons are stored). The controller updates this map constantly. Because the FTL metadata pages are written far more often than user data, they degrade faster than the rest of the NAND.
When the NAND blocks storing the FTL degrade past the ECC correction threshold, the drive loses its map. The user data is still physically present on the NAND chips, but the controller can no longer locate it. The controller enters a diagnostic state: Phison PS3111-based drives report as "SATAFIRM S11" with 0MB capacity; Silicon Motion controllers drop to 8MB or 32MB; Samsung drives may show a generic "SAMSUNG" model string with no partition table.
This is the most common manifestation of NAND degradation in the lab. PC-3000 SSD handles it by uploading a firmware loader into the controller's RAM, bypassing the corrupted on-NAND firmware. The loader provides direct access to the physical NAND blocks. PC-3000 then scans the raw blocks, locates surviving metadata markers, and reconstructs a virtual translator. The virtual map replaces the corrupted FTL, allowing the data to be imaged sector by sector to a target drive. See our firmware corruption recovery page for controller-specific details.
Frequently Asked Questions
Can data be recovered from a worn-out SSD?
What causes NAND flash to degrade?
How do I know if my SSD's NAND is degraded?
Why does consumer recovery software fail on degraded SSDs?
What is read disturb and how does it cause data loss?
How long can a powered-off SSD retain data?
How much does NAND degradation recovery cost?
Does TRIM accelerate NAND degradation?
Related Recovery Services
Full SSD recovery service overview
Temperature-controlled imaging for degraded cells
Last-resort physical NAND extraction
TRIM-related data loss recovery
FTL and controller firmware recovery
Firmware-level recovery hardware
SSD showing wear warnings or read errors?
Free evaluation. PC-3000 SSD read retry calibration for degraded NAND. SATA SSD from From $200, NVMe from From $200. No data, no fee.
