Jump to content

The issue:

Yesterday, my home server suddenly unexpectedly turned off, after a reboot everything ran fine for almost an hour when it turned off again.

Restarting with the power button does not work until I power cycle the PSU.

 

Things I have tried today:
Disconnect ssd and hard disks, boot an Ubuntu live cd and run a CPU stress test.
I did stop the test after 2 hours. Nothing exceptional happened.

 

Then I turned everything off, connected the hard disks and went back in live cd.
After importing my ZFS pool i started a scrub while also running the cpu test, this resulted in a system crash after 10 minutes.

 

I rebooted, imported the zfs pool and tried to copy a couple configs and docker compose files off the zfs array.

This initialy worked but when i tried it a second time the system scrashed again.

 

This time I did not power cycle the PSU but disconnect and reconnect the ATX power cable to the motherboard.

This also allowed me to boot again. Going from this it seems the motherboard needs to be power cyceled, not the PSU.

 

How do I go further in diagnosing this problem?
As far as I can tell the problem can be PSU but more likely the motherboard.
Or can a HDD suddenly take down the whole system without any other sign of anything being wrong?

 

Any advice on further diagnosing or resolving this issue would be greatly appreciated!

 

Hardware Configuration:

  • CPU: AMD Ryzen 5 3600
  • Motherboard: Asus ROG STRIX X470-F GAMING X470
  • Memory: 2 x Corsair Vengeance LPX, 16 GB DDR4-3600 CMK16GX4M2Z3600C18
  • Storage:
    • 2 x SSD 120GB MP300 (RAID1 using mdadm)
    • 6 x WD Red 8TB WD80EFAX (RAIDZ2)
  • GPU: MSI GT 710 2GB
  • PSU: Antec Earthwatts Gold Pro 550W

 

Link to post
Share on other sites

I know you ran a cpu test but did you run a memory stress test? Im leaning towards a potential memory issues since it re-occurred when the zfs pool was being imported or being scrubbed and it would have been loading into memory.

 

also what OS? I may have just overlooked it in the post. 

Link to post
Share on other sites

Under normal conditions I run Ubuntu Server 24.04, my testing was done using an Ubuntu 24.04 Live Cd.
I am running Memtest now, however with bad memory I would expect a kernel panick, system hang or reboot, not a hard turn off.

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×