If your SSD stopped working after a power surge, lightning strike, or ESD event, do not reconnect it. A shorted component on the PCB can draw excess current through the controller or NAND power pins, causing secondary damage each time power is applied. Do not attempt recovery software; the drive isn't visible to the OS when the power path is broken. Call (512) 212-9111 for a free evaluation.
What Is SSD Electrical Failure?
SSD electrical failure is physical damage to the circuit board components that deliver power to the controller & NAND flash memory. Burnt TVS diodes, blown voltage regulators, and shorted capacitors cut the power path. The NAND chips still hold your data, but the controller can't boot without clean power on the correct voltage rails.
This is different from firmware corruption, where the controller powers on but its internal mapping table is scrambled. Electrical failure means the hardware itself is broken. The drive doesn't show up in BIOS, doesn't spin (SSDs don't spin), doesn't enumerate on the bus at all. Recovery software can't see a drive that has no power.
How Do You Know If Your SSD Has Electrical Damage?
Electrical damage produces specific symptoms that differ from firmware corruption or NAND degradation. The common thread: the drive receives no power or receives corrupted power, so the controller never initializes.
- ●Drive is completely invisible to BIOS and Device Manager after a power surge, lightning strike, or PSU failure
- ●Visible burn marks, discoloration, or a burnt smell on the SSD circuit board near the power connector or PMIC area
- ●System freezes or reboots when the SSD is connected (shorted component drawing excess current from the power rail)
- ●Drive worked before a known ESD event (static shock during installation, handling without grounding)
- ●Drive not detected after liquid spill that reached the M.2 slot or SATA port
- ●Laptop or desktop no longer recognizes the SSD after a power outage, but other drives work in the same slot
If your SSD shows SATAFIRM S11, 0 bytes capacity, or the wrong model name but still enumerates in BIOS, the controller is powering on. That's firmware corruption, not electrical failure. Different fix, different price.
How Much Does SSD Electrical Failure Recovery Cost?
SSD circuit board repair for electrical damage costs $450–$600 for SATA SSDs and $600–$900 for NVMe drives. If the controller itself is destroyed beyond repair and NAND chips must be transplanted to a donor board, the cost is $1,200–$1,500 (SATA) or $1,200–$2,500 (NVMe), plus donor drive cost. A donor drive is a matching SSD used for its circuit board. Typical donor cost: $40–$100 for common models, $150–$300 for discontinued or rare controllers.
Every case starts with a free evaluation & a firm quote before any paid work begins. No data recovered means no charge. +$100 rush fee to move to the front of the queue.
| Failure Severity | SATA SSD Price | NVMe SSD Price | Typical Timeline |
|---|---|---|---|
| TVS diode / capacitor replacement | $450–$600 | $600–$900 | 3-6 weeks |
| Voltage regulator / PMIC repair | $450–$600 | $600–$900 | 3-6 weeks |
| Controller BGA reflow / reball | $600–$900 | $900–$1,200 | 3-6 weeks |
| NAND transplant to donor PCB (controller destroyed) | $1,200–$1,500 | $1,200–$2,500 | 4-8 weeks |
NAND transplant requires a 50% deposit. Donor drive cost is additional. All prices exclude tax & target drive.
How Do We Recover Data from Electrically Damaged SSDs?
The goal is to restore the original power delivery path so the native controller boots & decrypts your data. We fix the support circuitry around the controller, not the controller itself. With clean power restored, the original controller initializes & decrypts the NAND through its hardware AES-256 encryption pipeline.
- 01
Visual Inspection & Thermal Imaging
We photograph the PCB under magnification, checking for burnt, cracked, or discolored components. FLIR thermal imaging with a low-voltage bench supply identifies shorted components that draw excess current. A burnt TVS diode near the SATA power connector shows up as a thermal hotspot before any other test.
- 02
Voltage Rail Verification
Multimeter measurements on each power rail confirm which regulators failed. Consumer 2.5" SATA SSDs draw power from the 5V rail on the SATA power connector; onboard regulators step this down to 1.8V, 1.2V, and 0.9V core rails for the controller, DRAM, & NAND. NVMe drives pull 3.3V from the M.2 slot. If any rail reads short to ground or produces the wrong voltage, the regulator or its output filter capacitors are damaged.
- 03
Component-Level Repair
Using Hakko FM-2032 microsoldering irons & Atten 862 hot air rework, we remove the failed component and solder a replacement. TVS diodes, capacitors, and voltage regulators are surface-mount packages (0402, 0603, SOT-23) that require precision hand soldering under magnification. If BGA controller joints have fractured from thermal cycling, the Zhuo Mao precision BGA rework station reflows or reballs the package.
- 04
Controller Boot & Data Imaging
With clean power restored, the original controller initializes and decrypts the NAND through its hardware AES-256 encryption pipeline. We connect the drive to PC-3000 SSD to image the data sector-by-sector. If the firmware was also corrupted by the power event, we reconstruct the Flash Translation Layer before imaging.
Which PCB Components Fail on SSDs?
SSD circuit boards pack the controller, NAND flash, DRAM cache, and power management into a compact PCB. The power delivery components are the first to fail during a surge because they sit between the external power source and the protected silicon.
- TVS (Transient Voltage Suppressor) Diodes
- TVS diodes clamp overvoltage spikes on the SATA power connector or M.2 3.3V rail. During a surge, they absorb the excess energy and burn out to protect downstream components. A shorted TVS diode pulls the entire power rail to ground, preventing the drive from enumerating. Removing the shorted TVS diode and replacing it restores the rail. The drive was never damaged beyond the TVS diode itself; it did its job.
- Voltage Regulators & PMICs
- DC-DC converters (buck regulators) and LDOs step down the input voltage to 1.8V, 1.2V, and 0.9V core rails for the controller, DRAM, and NAND. On many SSDs, a single multi-output PMIC handles all rails. When it fails, the controller receives no power or incorrect voltage. Replacing a failed PMIC requires hot air removal of the old chip and precise alignment of the replacement under magnification.
- Filter Capacitors
- Ceramic capacitors on the output of each voltage rail filter high-frequency noise and store charge for transient current demands. A cracked or shorted capacitor on the 1.2V core rail creates a dead short that prevents the regulator from starting. These are 0402 or 0603 packages, measuring 1mm or 1.6mm long.
- Controller BGA Solder Joints
- The main controller IC (Phison, Silicon Motion, Samsung, Marvell) connects to the PCB through hundreds of solder balls in a Ball Grid Array package. Thermal cycling from repeated heat-cool cycles or a sudden thermal shock can fracture these joints. The controller loses contact with the PCB traces on specific pins, causing intermittent detection or complete failure. BGA rework with the Zhuo Mao rework station reflows or reballs the solder joints.
- ESD-Damaged Controller Pins
- Electrostatic discharge during handling (installing an M.2 drive without a grounding strap) can damage the controller silicon's input protection diodes on specific data or power pins. The drive may partially enumerate but fail during data transfer, or fail to initialize entirely depending on which pins were affected.
What Is the Diagnostic Process for Electrically Damaged SSDs?
Diagnosis follows a systematic voltage-rail-by-rail approach. The goal is to identify which specific component failed before applying any power to the drive through the system bus, preventing secondary damage from shorts.
- Visual inspection under 20x-40x magnification. Check for burn marks, cracked components, flux residue from liquid damage, or discolored solder joints. Document all visible damage before powering.
- Cold resistance measurements on each power rail. Measure resistance to ground on the 3.3V, 1.8V, and 1.2V rails without applying any voltage. A short to ground (reading under 1 ohm) indicates a shorted component on that rail.
- FLIR thermal imaging with current-limited bench supply. Apply 3.3V at 100mA current limit. The shorted component heats up first because it draws all available current. FLIR identifies the hotspot within seconds. This prevents damage to healthy components.
- PC-3000 SSD controller identification & communication test. With power restored, connect the drive to PC-3000 SSD and verify the controller responds to vendor-specific diagnostic commands. Select the correct loader module (Phison or Silicon Motion utility) based on the controller IC marking. Controllers outside PC-3000 SSD coverage (Samsung proprietary, SK hynix, Kioxia) boot on their own silicon once rails are clean and image through the standard bus.
How Does a PMIC Failure Collapse the SSD Power Tree?
The Power Management IC is the single point where a surge or ESD event cascades into a dead drive. One PMIC takes the input rail from the host (3.3V on M.2 NVMe, 5V on SATA, 12V on U.2 enterprise) and steps it down to every voltage the controller and NAND need. When it fails, nothing downstream sees clean power.
Most consumer NVMe and SATA SSDs integrate a multi-output buck/LDO PMIC from Richtek, Monolithic Power Systems, Silergy, or Texas Instruments storage-class parts. A single package generates the controller core rail (typically 0.9V-1.2V), the NAND Vcc rail (3.3V), the NAND Vccq I/O rail (1.8V on older 2D NAND, 1.2V on modern 3D TLC and QLC), and the DRAM rail where a separate DDR cache is present. When any one of those outputs shorts internally, the regulator either shuts down through its own over-current protection or fails open and stops switching entirely.
The failure mode we see most often on post-surge drives is a shorted TVS diode on the input clamping the whole rail to ground. On a current-limited bench PSU at 3.3V, the drive draws the full limit current with zero voltage development, and the bench ammeter pegs at the limit with the TVS package visibly heating within a few seconds on FLIR. That signature is a dead short, not a dead controller, which is why lifting the TVS alone often restores the drive.
A shorted Vccq decoupling capacitor behaves similarly but localizes the heat to a different area of the board; the FLIR image separates the two within one bench-PSU ramp. A failed PMIC itself lands in the middle: the chip heats uniformly across its package rather than at one pin, and the output rails read as either collapsed or incorrect under probe.
What Voltage Rails Does an SSD PMIC Generate, and What Does Each One Do?
A single PMIC package on a consumer SSD generates four to five distinct rails from the host input. Each rail powers a different block on the drive, and each one has a characteristic failure signature on the bench. Knowing which rail collapsed is the shortest path from a dead drive to a target component.
On M.2 NVMe drives the input is 3.3V from the slot. On 2.5" SATA the input is 5V from the SATA power connector and an onboard buck steps it down to 3.3V before the PMIC. Common parts in this class include Richtek RT-series multi-output PMICs, MPS storage PMICs, and Silergy equivalents engineered for solid-state drive applications. The rail breakdown is consistent across vendors because the controller and NAND voltage requirements are fixed by the silicon process.
| Rail | What it powers | Failure symptom on the bench |
|---|---|---|
| 3.3V host interface | Input rail from M.2 slot or post-buck on SATA. Feeds the PMIC, the input TVS diode network, and the NAND Vcc on most modern parts | Cold short to ground on the input rail reads under 1 ohm. Bench PSU pegs at the current limit immediately, FLIR shows the input TVS diode or input decoupling cap heating within seconds. No internal rail develops because the PMIC never gets clean input |
| 0.9V-1.2V controller core (Vcore) | Core logic supply for the controller silicon on Phison, Silicon Motion, Marvell, and Realtek parts. Modern lithography (28nm/16nm/12nm) requires sub-1.5V Vcore to avoid transistor breakdown. The 1.8V rail is reserved for analog blocks, PHY signaling (SATA/PCIe), and legacy NAND I/O | Cold resistance reads short on the Vcore rail. With the input rail current limited the PMIC may chirp into hiccup mode rather than holding off, and FLIR localizes the heat to a 0402 or 0603 decoupling cap on the controller side of the board |
| 1.2V NAND Vccq I/O | NAND I/O rail on modern 3D TLC and QLC. Older 2D NAND used 1.8V Vccq; current Toggle and ONFI parts use 1.2V to cut switching power on the data lines between controller and NAND packages | Cold short on the Vccq rail reads through the NAND I/O bus. PSU clamps at the current limit, and FLIR shows the heat on a decoupling cap near the NAND packages rather than near the controller. Drive does not enumerate because the controller cannot complete the NAND boot handshake without clean Vccq |
| 3.3V NAND Vcc | Bulk supply for the NAND core. Often shared with or derived from the host input rail through a separate LDO or PMIC output | Rail collapse on Vcc starves the NAND array. Controller may enumerate briefly with vendor-specific identifiers but every NAND read fails. On bench probe the rail is missing or sagging well below 3.3V under load |
| 1.1V-1.35V DRAM (DRAM-cached drives only) | Separate rail for the DDR3L/DDR4 cache chip on DRAM-cached SSDs. Absent on DRAM-less HMB designs that borrow host RAM through PCIe | Drive enumerates but the controller fails its DRAM training pass and refuses to mount the FTL. PC-3000 SSD logs the DRAM init error during loader handoff, which is the cleanest tell that this specific rail is the fault |
The decoupling capacitor population on each rail is what fails most often. A 0402 ceramic cap on the 1.2V Vccq rail that develops an internal short is enough to keep the drive from ever enumerating, even though the controller silicon, NAND, and PMIC are all healthy. Lifting the shorted cap with the Hakko FM-2032 and confirming the cold resistance returns to a sane reading often restores the drive in a single bench session. PMIC replacement itself, when needed, runs through the Atten 862 hot air station on a controlled thermal profile to avoid disturbing the adjacent NAND packages during rework.
No software product can substitute for this work. Recovery utilities require an enumerated drive on the SATA or NVMe bus. A drive with a collapsed Vccq rail is invisible to the host controller, which means it is invisible to every consumer recovery tool ever shipped. The repair has to happen at the rail before any firmware-level work becomes possible.
What Is the Bench Sequence for Isolating an Electrical Fault?
The goal is to identify the failed component before applying system-bus power, so a shorted rail does not take out adjacent parts during diagnosis. Every step below runs on the same bench, on a drive that is disconnected from any host.
- Cold resistance sweep. Meter each power rail to ground with the drive unpowered. A reading under 1 ohm on the 3.3V input rail flags a shorted TVS or input cap. A short on an internal rail (1.8V, 1.2V, 0.9V) points to a downstream capacitor or the PMIC itself.
- Current-limited bench PSU ramp with FLIR overlay. Apply the input rail through a bench supply set to 100mA current limit. A healthy idle draw is tens of milliamps; a shorted component pulls the full limit immediately and heats within seconds. The FLIR thermal camera pinpoints the hot component to a specific 0402 or 0603 package without guesswork.
- Component lift with the Hakko FM-2032. Remove the suspected failed part with a fine-tip microsoldering iron on the Hakko FM-2032. For small TVS diodes and decoupling capacitors the lift is direct. Repeat the cold resistance sweep: if the short clears with the part removed, the fault is isolated.
- PMIC or BGA rework with hot air and a BGA station. When the failed part is the PMIC itself or the controller BGA has fractured joints, the Atten 862 hot air station removes the package with a controlled thermal profile. For BGA reball or controller reflow the Zhuo Mao BGA rework station holds the profile across preheat, soak, and reflow stages to avoid damaging the adjacent NAND packages.
- Replacement from a donor board. TVS diodes and decoupling caps come from a parts library. A PMIC or controller pull comes from a model-matched donor PCB so the package, pinout, and firmware compatibility are identical. A donor drive is a matching SSD used for its circuit board. Typical donor cost: $40–$100 for common models, $150–$300 for discontinued or rare controllers.
- Power restoration and PC-3000 SSD handoff. Once the rails come up clean on the bench, the drive goes onto the PC-3000 SSD complex. The original controller boots on its own silicon, which preserves the hardware AES-256 Media Encryption Key, and the imaging pass reads the NAND through the normal translator rather than through a raw chip-off path.
When Does Board Repair Work and When Do We Switch to Chip-Off?
Board-level repair keeps the original controller on the PCB. Chip-off removes the NAND packages and reads them raw. The decision is not preference; it is whether the controller silicon is still alive.
| Failure signature | Repair path | Why |
|---|---|---|
| Shorted TVS or cap on input rail, controller silicon intact | Board repair | Lift the shorted part, rail restores, controller boots on its own key |
| Failed PMIC, controller silicon intact | Board repair with donor PMIC | PMIC replacement restores clean rails to the original controller |
| Fractured BGA joints on controller, die not cracked | Board repair with BGA reflow or reball | Reseating the controller on the PCB restores its connections without replacing the die |
| Controller silicon cracked, burned through, or ESD-killed internally | Chip-off NAND, only if the drive does not use hardware encryption | The original controller is gone, so its AES-256 key is gone with it. Raw NAND reads are only useful when no hardware key is required to decrypt the data |
| Controller destroyed on a hardware-encrypted SSD (Opal, TCG, manufacturer AES) | Unrecoverable | The key lived in the controller silicon and cannot be reconstructed from chip-off reads. We tell you this during the free evaluation rather than billing for a run that cannot succeed |
The electrical failure path on this page covers the top three rows. The chip-off NAND page covers the fourth and explains the hardware-encryption limit in more detail.
Why Can't Most Data Recovery Labs Fix Electrically Damaged SSDs?
The data recovery industry grew up around firmware tools. PC-3000, MRT, and similar platforms communicate with functioning controllers to extract data from corrupted NAND. When the controller is electrically dead, these tools have nothing to talk to.
Fixing the power delivery path requires a different skill set: board-level component identification, surface-mount soldering on 0402-package components, BGA rework for controller reflow, and thermal profiling to avoid damaging adjacent NAND chips during hot air work. Most data recovery labs don't employ technicians with microsoldering training and don't stock Hakko FM-2032 irons, hot air rework stations, or BGA rework equipment.
Rossmann Repair Group started as a board repair operation in 2008. MacBook logic board repair, component-level microsoldering, and BGA rework were the business before data recovery was added.
The soldering infrastructure and trained technicians were already in place. Electrical SSD failure is where those two capabilities intersect: the firmware tools to read the NAND and the soldering skills to make the controller boot.
What Is the Difference Between Electrical Damage and Power Loss Corruption?
Both problems involve power, but the damage is in different layers. Electrical failure breaks the hardware: TVS diodes, regulators, or capacitors fail and the drive goes completely invisible to BIOS. Power loss corrupts the firmware: the controller powers on but the Flash Translation Layer mapping is scrambled. The diagnostic path and repair procedure are different.
| Characteristic | Electrical Failure | Power Loss Corruption |
|---|---|---|
| What's damaged | Physical components (TVS diodes, regulators, caps) | Flash Translation Layer mapping in DRAM |
| BIOS detection | Drive invisible | Drive shows as SATAFIRM S11 or 0 bytes |
| Visible damage | Burn marks, discoloration possible | PCB looks physically intact |
| Repair method | Microsoldering component replacement | PC-3000 FTL reconstruction |
| SATA SSD cost | $450–$600 | $600–$900 |
| NVMe SSD cost | $600–$900 | $900–$1,200 |
Some power surges cause both: the surge blows the PMIC (electrical failure) and the resulting unclean shutdown corrupts the FTL (power loss corruption). In that case, we fix the hardware first, then reconstruct the firmware. The power loss recovery page covers the firmware side in detail.
Why Does Board Repair Preserve Encrypted Data?
Modern SSDs use AES-256 hardware encryption where the Media Encryption Key is bound to the original controller silicon. If someone replaces the entire controller, the new chip has a different key and the NAND data is unreadable ciphertext.
Board-level repair preserves the original controller and its encryption key. We replace the support components around the controller (TVS diodes, regulators, capacitors), not the controller itself. When the original controller boots with clean power, it decrypts the NAND using the key that was baked into its silicon at the factory.
This is the only viable path for encrypted SSDs with electrical damage. If the controller silicon is cracked or destroyed, the encryption key is lost and the data is unrecoverable. We'll tell you that during the free evaluation.
For cases where the controller is destroyed and the drive doesn't use hardware encryption, chip-off NAND extraction is the last-resort option: desolder the NAND chips, read them raw, and reconstruct the file system from flash page data.
Why an Electrically Dead SSD Needs Microsoldering, Not a Cleanroom
SATA and NVMe SSDs are factory-sealed BGA and TSOP packages on a PCB. Nothing about an electrical failure on the power tree calls for cleanroom-grade air filtration; the work happens at an ESD-safe bench under a stereo microscope with hot air and fine-tip microsoldering. The relevant constraint is not particulate contamination. It is power delivery and the cryptographic binding of the data to one specific controller die.
Named failure modes on the SSD power tree
- PMIC blowout
- A surge or unstable PSU rail drives the Power Management IC into permanent protection, latches it into hiccup mode, or burns the package outright. With the PMIC down, the controller core rail (0.9V-1.2V), NAND Vccq (1.2V on modern 3D TLC, 1.8V on older 2D), and NAND Vcc (3.3V) never come up. The controller silicon is healthy; it just sees no power.
- TVS diode short on the host rail
- Transient Voltage Suppression diodes on the SATA 5V pin or the M.2 3.3V pin clamp surge energy by sacrificing themselves into a dead short. On a current-limited bench PSU the rail pegs at the current limit with no voltage development, and FLIR localizes the heat to the TVS package within seconds. Lifting the shorted TVS often restores the drive on its own.
- Controller VCC rail collapse
- A cracked 0402 or 0603 ceramic decoupling cap on the controller Vcore rail presents as a sub-1-ohm short to ground. The PMIC enters over-current protection and refuses to start the rail, so the controller never boots even though the silicon is intact. Same pattern on the 1.2V Vccq line into the NAND bus.
- Blown LDO feeding 3.3V, 1.8V, or 1.2V
- Discrete LDOs that derive analog or PHY rails from the PMIC output can fail open or fail short. Open LDOs leave a downstream block unpowered; shorted LDOs pull the upstream rail down. Either way the controller fails enumeration on the SATA or PCIe bus and the drive is invisible to the host.
Why chip-off does not substitute for board repair
Modern Phison, Silicon Motion, Samsung, and Marvell controllers run always-on AES-256 in XTS mode. The Media Encryption Key is generated by the controller at manufacture and wrapped by a Hardware Unique Key fused into the controller silicon with one-time-programmable eFuses. The MEK never leaves the die in plaintext. Pull the NAND off the board and the bytes coming back are ciphertext; bond them to a donor controller of the identical part number and the donor still cannot unwrap the MEK, because its HUK is different. The hardware-encryption page has the full key-binding breakdown. The practical consequence on this page is simple: reviving the original PCB is the only path that yields readable data.
Equipment mapped to the diagnostic step it serves
- FLIR thermal camera. Run with a current-limited bench PSU to localize the shorted component on the cold board before any solder iron touches it. The hotspot tells you whether the fault is a TVS on the input, a decoupling cap on a controller rail, or the PMIC itself.
- Hakko FM-2032 microsoldering iron. Lifts 0402 and 0603 caps, TVS diodes, and discrete LDOs from the board. After the lift, the cold resistance sweep tells you whether the short cleared.
- Atten 862 hot air rework station. Removes shorted PMIC packages on a controlled thermal profile that does not disturb adjacent NAND packages. Used for the actual PMIC swap once the cold bench work has isolated the failed part.
- Zhuo Mao BGA rework station. Holds preheat, soak, and reflow stages for controller reflow or reball when thermal cycling has fractured the BGA joints. Bottom-side preheat prevents PCB warping during the reflow.
- PC-3000 SSD complex. Comes in after the rails are clean. The original controller boots on its own silicon and decrypts the NAND through the standard translator, and PC-3000 SSD handles the imaging and any FTL reconstruction needed if the surge also corrupted firmware.
The full SSD service catalog and pricing live on the SSD data recovery hub. If the controller die itself is cracked, burned through, or ESD-killed at the silicon level, the encryption key is gone with it and we will tell you that during the free evaluation rather than bill for a run that cannot succeed.
How Do You Diagnose the SSD Power Section at the Rail Level?
Power-section diagnostics on a dead SSD work from the input rail inward: confirm the host rail, then probe each downstream rail at its inductor output or dedicated test point. The goal is to find which rail is missing, sagging, or short to ground before any rework, so the lift target is one specific 0402 cap, one LDO, or one PMIC package rather than guesswork.
Voltage rail measurement procedure
The drive sits on an ESD-safe bench, disconnected from any host. A current-limited bench PSU supplies the input rail (3.3V on M.2, 5V on a 2.5" SATA drive). A FLIR thermal sweep runs first so a dead short is localized before any voltage probe touches the board. Once the cold short is either ruled out or lifted, a DMM in DC mode probes the inductor outputs on each buck converter and the dedicated test pads where the PMIC vendor exposes them. Reading the inductor output rather than the bulk cap pad avoids loading the rail through the probe tip.
| Rail | Normal range | What it feeds |
|---|---|---|
| VCC core (controller logic) | 0.9V to 1.2V | Controller SoC core, ARM cores, FTL accelerator, AES engine. Modern parts on 12nm silicon (Phison E18/E26, SMI SM2264, Samsung Elpis and Pascal, Marvell) sit at the low end of this range, while older 28nm parts such as the SMI SM2267 sit slightly higher within the envelope |
| VCCQ (NAND I/O) | 1.2V or 1.8V | I/O between the controller and the NAND packages. Modern Toggle and ONFI 3D TLC or QLC negotiates 1.2V to cut switching power on the data bus. Older 2D NAND uses 1.8V |
| VPP (NAND program/erase bias) | 2.5V to 3.3V external | External bias the NAND charge pumps boost internally toward 15V to 20V for Fowler-Nordheim tunneling during program and erase. The intermediate 9V to 12V range is used as the pass voltage on unselected wordlines rather than for the tunneling itself. Sector reads do not need the VPP boost, so a partial PMIC fault that kills only VPP can still allow read-only imaging through PC-3000 SSD |
A rail that sits at the wrong voltage with no thermal hotspot points to a buck regulator out of regulation or an LDO that has failed open. A rail at zero with a thermal hotspot points to a downstream short. A rail at the host input voltage instead of the regulated voltage points to a buck FET failing closed and passing the input directly through; that condition destroys whatever silicon sits on the downstream rail within seconds, so the bench PSU stays current-limited until the rail is confirmed clean.
TVS diode and PMIC failure modes
TVS diodes sit on the host input rail in 0402, 0603, or SOD-323 packages. M.2 2280 and 2230 PCBs use 0402 and 0603 footprints because of the spatial constraint; 2.5" SATA and U.2 enterprise boards often use SOD-323 because the 5V or 12V input rails carry more energy and need a part with higher peak power dissipation. Vishay and ProTek Devices engineering documentation classifies TVS failure into three modes.
- TVS: closed circuit (short)
- Most common mode. Thermal runaway during a surge fuses the junction into a dead short to ground. The drive will not power on until the shorted TVS is lifted. The Atten 862 hot air station or Hakko FM-2032 removes the part; the cold resistance sweep then confirms the input rail is no longer short.
- TVS: open circuit
- Catastrophic surge vaporizes the metallization. The drive may still operate because downstream silicon survived, but the input rail no longer has surge protection. The TVS is replaced before the drive goes back into service.
- TVS: degraded (leaky)
- The most subtle mode. Sub-lethal surges or sustained heat degrade the junction over time, leaving the diode with elevated reverse leakage at the nominal rail voltage. The drive runs at idle but drops out under heavy workload as the leakage current pulls the rail down. On the bench the TVS shows abnormal forward and reverse readings under diode-test mode; under load the FLIR shows the package warming when it should be cold.
- PMIC: catastrophic short to ground
- An internal switching MOSFET or LDO pass element fails closed, pulling the host input rail directly to ground. The host motherboard sees the short and refuses to start, or the bench PSU pegs at the current limit immediately. FLIR shows the PMIC package heating uniformly across its body rather than at a single pin.
- PMIC: loss of regulation (overvoltage)
- The PMIC fails to regulate a lower-voltage rail and passes the full input through to the downstream block. A 3.3V or 5V input arriving on what should be a 1.2V core rail will destroy the controller silicon. This mode is why bench bring-up runs current-limited; without the limit, a single PMIC fault takes out the controller during the diagnostic step.
- PMIC: LDO failure (missing rail)
- A single internal LDO dies open. The host input rail is fine, the drive does not short, but one downstream rail (commonly VCCQ on the NAND bus) is missing. The controller starts to boot, fails the NAND handshake, and does not enumerate on the bus. Diagnosis is a DMM sweep across the regulated outputs, looking for the one rail that reads zero with no short to ground.
PLP capacitor discharge and replacement
Enterprise and prosumer SSDs (for example the Samsung PM9A3 and Micron 7450 class of drive) carry Power Loss Protection capacitor banks: tantalum polymer arrays that hold the rails up for roughly 20 to 50 milliseconds after an unclean shutdown, long enough for the controller to flush the in-flight DRAM FTL state to NAND. The dominant failure mode is dielectric breakdown into a short. A shorted tantalum on the PLP rail trips the PMIC overcurrent protection and the drive refuses to initialize.
Before any rework on a PLP-equipped drive, the capacitor bank has to be safely discharged. Healthy PMICs include an active discharge path that bleeds the tantalum bank when the input rail drops below the undervoltage lockout. When the PMIC itself is the failed part, that path cannot be trusted and the discharge has to happen manually.
- Measure first. Probe each tantalum across its terminals with a DMM in DC volts. A bank that reads near zero is already discharged; a bank that reads at or near the rail voltage is live and needs to be brought down before any iron touches the board.
- Bleed through a resistor. Place a 100 ohm, 5W bleed resistor across the capacitor terminals for several seconds. Never short a charged tantalum with metal tweezers; the discharge current can weld the tips, lift adjacent pads, and pit the PCB copper.
- Confirm zero before rework. Re-probe with the DMM and confirm the bank is at zero volts before the soldering iron or hot air goes near the area.
- Two-sided heat for the lift. PLP capacitor pads sit on large copper thermal planes that wick heat away faster than a single-sided tool can deliver it. A Zhuo Mao precision bottom-side heater holds the board at a controlled preheat while the Atten 862 hot air station works the top side. Top-only heat warps the PCB and lifts adjacent pads on a PLP array.
- Replace from a parts library. Tantalum polymer replacements are sourced by capacitance, voltage rating, and package. The new part goes on under the same two-sided thermal profile to avoid the same warp on installation.
Board-level repair versus chip-off NAND
The choice is dictated by whether the controller silicon is still alive. Board repair works when the controller is intact but a support component on the power tree has failed: TVS, decoupling cap, LDO, PMIC, or BGA solder joint. Each of those is a discrete part on the PCB that can be lifted and replaced while the controller die stays in place. Once the rails come up clean, the controller boots on its own silicon, the Hardware Unique Key is still on the die, and the Media Encryption Key unwraps normally for imaging through PC-3000 SSD.
Chip-off NAND is the path when the controller die itself is destroyed: cracked silicon from a heavy impact, ESD-killed core, or a die burned through from a loss-of-regulation event that put the host input voltage directly on the core rail. With the controller gone, there is nothing to revive. On modern hardware-encrypted controllers (Phison E18/E26, SMI SM2264, Samsung Elpis and Pascal, Marvell with AES-256 XTS), raw NAND reads come back as ciphertext and no donor controller can decrypt them, because the HUK was fused into the original die. The hardware-encryption page covers the key-binding mechanics in full; chip-off NAND covers the path for older non-encrypted parts where raw reads do yield usable data.
The decision happens at the free evaluation, not at the quote stage. If the controller is alive, the work goes onto the board repair path at $600–$900 for NVMe or $450–$600 for SATA. If the controller die is gone and the drive is hardware-encrypted, we tell you the data is unrecoverable rather than bill for a chip-off run that cannot succeed.
Frequently Asked Questions
Can data be recovered from an SSD with a burnt circuit board?
What is the difference between electrical failure and firmware corruption on an SSD?
Why do most data recovery labs return electrically damaged SSDs as unrecoverable?
How much does SSD electrical failure recovery cost?
Can recovery software fix an SSD that won't power on?
What causes electrical damage to an SSD?
My SSD draws current but doesn't show up. Is it shorted?
How is electrical failure on a USB-bridged SSD different from a native SATA or NVMe SSD?
Can a no-power SSD be recovered without the original controller?
Related SSD Recovery Services
Power Loss Recovery
FTL corruption from power outages. Firmware-level repair when the controller powers on but can't find your data.
Chip-Off NAND Extraction
Last resort when the controller is destroyed. Raw NAND read with honest limits on encrypted drives.
Firmware Corruption Recovery
SATAFIRM S11, 0GB capacity, wrong model name. Controller works but firmware mapping is corrupted.
Hardware Encryption Recovery
AES-256 encrypted SSDs where the controller must be repaired to preserve the encryption key.
SSD won't power on after a surge or ESD event?
Free evaluation. SATA PCB repair: $450–$600. NVMe PCB repair: $600–$900. No data, no fee.
