Jump to content

Hey All,

 

So I recently built a brand new system from scratch, something I've been saving a few years for. However, it's having a complete system lock-up that I've been having issues tracking down. I'll be as brief, but as complete, as I can.

 

Build

Quirk

   Not sure if this is important, but going to mention it anyway. Most of the time when the PC locked up, I'd have to reboot the computer twice. First reboot to get it out of the freeze, it would typically hang on Windows load with an error ranging from "Load VGA BIOS" to "Memory Test" to "Detect HDD". After rebooting a second time, typically it would load into Windows with no issue.

 

History

  • The first time the system locked up was while attempting to use ClockTuner. Trying to run the "Diagnostic" or even click the "CPU Stability" button instantly locked up the PC, but it ran fine if I didn't use the program.
  • The second time was when I was running the Asus AI Suite 3 software. It actually did just fine through the CPU overclocking each time, but locked up during the "fan" phase. It always did this at 51%.
    • I eventually realized my PC case wasn't set to let the mobo control the fans. After changing that, no more lockup, and it ran through AI Suite 3's system tuning every time with no lockup.
  • There were two instances the PC seemed to lock up while going into or after going into a sleep mode, but honestly I wasn't around to see what happened. Just found it locked up and screens dark.

While this sounds like a lot, they were spread out, and eventually 'stopped' when I didn't try messing with anything. Ran Cinebench multiple times without incident, even while overclocked through the Asus software. However, to be on the safe side, I uninstalled the Asus AI Suite 3, and later disabled auto-tuning from the board core boost and the precision boost, and removed OC settings.

  • A while later after getting software and everything set up, I tried to run a game. Within five minutes, the system locked up.
  • Tried multiple games/graphic programs, including the Heaven benchmark, locked up each time within a max of 10 minutes, typically less than 5.
  • Once locked up from opening, closing, and reopening Chrome too quickly (might indicate more of something was in use from two instances of chrome being 'active'?)

Working with a friend, he suggested the GPU was bad so I swapped in my old 1070Ti. Lockups indeed stopped. It ran all the same programs without a hitch. I went and RMA'd the GPU thinking we'd found the problem.

  • Next day, I decide to try some Warframe. System locked up in 5 minutes.
  • This time I had to reboot the system 3 times, and it showed new errors, including "Check NVRAM" and "POST Error" which hadn't occurred before.
  • I cleared the CMOS and re-flashed the BIOS to default, basically only applying the DOCP for the RAM and leaving everything else be.
  • Went back, played Warframe for several hours with no problem, and another game for a few hours with no problem.

A friend suggested just to be safe I should run Prime95 as a stress test while I slept that night to see if it locked up and flush out any lurking problems. So I did, and ran it before bed. I woke up to find the PC locked up, but no clear indicator when. After rebooting the system, I ran it a second time, and it locked up my system almost immediately.

 

Final Thoughts

  • Before I go run out and RMA anything else, I really need to try and determine what actually needs to be RMA'd if anything.
  • RAM has been tested and didn't show any errors in memtest.
  • Friend suggested that something was needing more power and not getting it. In that case, the 90-degree adapter might be a culprit, or the Strimer cable? Hard to test the strimer cable as having that extension is the only way the mobo is getting power at this point, but I'll think of something if you all believe it might be at fault.
  • CPU has been running hot, even for a 3960x, but it is nowhere near max when lockups occur.

 

Hopefully this all... "helps." After dumping everything I'd saved into this PC just to have this happen I've been pretty torn up as you might imagine. I can't exactly run out and grab replacements. I'm going to try removing the 24-pin power adapter since it's the one thing that's "easy" to test, but any thoughts you all have in the meantime would be helpful.

Link to comment
https://linustechtips.com/topic/1345817-need-help-tracking-system-freeze/
Share on other sites

Link to post
Share on other sites

8 minutes ago, Brandon Shinobu said:

128Gb G.Skill RAM, 3600Mhz 14-15-15-35

Are you running the RAM at DOCP or stock? If DOCP, repeat tests at stock.

Main System (Byarlant): Ryzen 9 5950X | Asus B550-Creator ProArt | EK 240mm Basic AIO | 32GB G.Skill DDR4 3600MT/s CL16 | XFX Speedster SWFT 210 RX 6600 | Samsung 990 PRO 2TB / Samsung 990 EVO Plus 4TB | Corsair RM750X | StarTech 4× USB 3.0 Card | Realtek RTL8127 10G NIC | Hyte Y60 Case | Dell U3415W Monitor | Keychron K12 Blue (RGB backlight)

 

Laptop (Narrative): Lenovo Flex 5 81X20005US | Ryzen 5 4500U | 16GB DDR4 3200MT/s (soldered) | Vega II 384SP Graphics | SKHynix P31 1TB NVMe SSD | Intel AX200 Wifi | Asus 2.5G USB NIC | Asus ProArt PA278QV | Keychron K4 Brown (white backlight)

 

Proxmox Server (Veda): Ryzen 7 3800XT | ASRock Rack X470D4U | Corsair H80i v2 | 128GB Micron DDR4 ECC 3200MT/s | 2× Samsung PM963a 960GB SSD / 4× WD 10TB / 4× Seagate 14TB Exos / 4× Micron MX500 2TB / 8× WD 12TB (custom external SAS enclosure) | Seasonic Prime Fanless 500W | Intel X550-T2 10G NIC | LSI 9300-8i HBA | Adaptec 82885T SAS Expander | Fractal Design Node 804 Case

 

Proxmox Server (La Vie en Rose)GMKtec Mini PC | Ryzen 7 5700U | 32GB Lexar DDR4 (SODIMM) | Vega II 512SP Graphics | Lexar 1TB 610 Pro SSD | 2× Realtek 8125 2.5G NICs


Media Center/Video Capture (Jesta Cannon): Ryzen 5 1600X | ASRock B450M Pro4 R2.0 | Noctua NH-L12S | 16GB Crucial DDR4 3200MT/s | EVGA GTX750Ti SC | UMIS NVMe SSD 256GB / TEAMGROUP MS30 1TB | Corsair CX450M | Viewcast Osprey 260e Video Capture | TrendNet (AQC107) 10G NIC | LG WH14NS40 BD-ROM | Silverstone Sugo SG-11 Case | Sony XR65A80K

 

Workbench (Doven Wolf): Lenovo m715q | Ryzen Pro 3 2200GE | 16GB Crucial DDR4 3200MT/s (SODIMM) | Vega 8 Graphics | SKHynix (OEM) 256GB NVMe SSD | uni 2.5G USB NIC | HDMI add-in module

 

Network:

Spoiler
                       ┌─────────────── Office/Rack ───────────────────────────────────────────────┐
Google Fiber Webpass ── Cloud Gateway Max ══╦═ Pro XG 8 ══╦═ Flex 2.5-8 ══╦═ Doven Wolf
                      La Vie en Rose (DNS) ═╬═ Narrative  ╠═ Veda-NAS     ╠═ La Vie en Rose (vmbr)
                                Veda (DNS) ─┘             ╠═ Veda (vmbr)  ├─ Ptolemy (vmbr)
╔═════════════════════════════════════════════════════════╩═ Ptolemy-NAS  ├─ Veda (Mgmt)
║   ┌ Closet ┐      ┌───────── Bedroom ─────────┐                         └─ Veda (IPMI)
╚═══ Flex XG ══╦╤═══ Flex XG ══╤╦═ Byarlant
       (PoE)   ║│              │╠═ Narrative 
Kitchen Jack ══╣└─ Dual PoE ┐  │╚═ Jesta Cannon*
   (Testing)   ║┌─ Injector ┘  └── Work Laptop
     Bedroom ══╝│        ┌─────── Media Center ────────────────────────────┐
     Jack #2    └──────── Switch 8 ────────────┬─ nanoHD Access Point (PoE)
Notes:                                         ├─ Sony PlayStation 4 
─── is Gigabit / ═══ is Multi-Gigabit          ├─ Pioneer VSX-S520
* = cable passed from Bedroom to Media Center  └─ Sony XR65A80K (Google TV)
Link to post
Share on other sites

10 minutes ago, Brandon Shinobu said:

128Gb G.Skill RAM, 3600Mhz 14-15-15-35

This timing is very tight, try reducing the number of modules and retest.

Ryzen 5700g @ 4.4ghz all cores | Asrock B550M Steel Legend | 3060 | 2x 16gb Micron E 2666 @ 4200mhz cl16 | 500gb WD SN750 | 12 TB HDD | Deepcool Gammax 400 w/ 2 delta 4000rpm push pull | Antec Neo Eco Zen 500w

Link to post
Share on other sites

9 minutes ago, AbydosOne said:

Are you running the RAM at DOCP or stock? If DOCP, repeat tests at stock.

 

6 minutes ago, SupaKomputa said:

This timing is very tight, try reducing the number of modules and retest.

 

Just to clarify, the RAM itself was rated for those timings and speed at stock. I only activated the DOCP so the RAM would operate at the 'stock' speed instead of the mobo's default 2133Mhz. However, I have cleared the DOCP settings and I'll run Prime95 again here in just a moment.

Link to post
Share on other sites

16 minutes ago, SupaKomputa said:

This timing is very tight, try reducing the number of modules and retest.

 

19 minutes ago, AbydosOne said:

Are you running the RAM at DOCP or stock? If DOCP, repeat tests at stock.

 

Okay, Prime95 is running without the DOCP, and thusfar the system hasn't locked up yet. I'll have to let it run for a while before I feel I'm "safe", and I'll update this thread later with the results.

 

That being said, I had specifically purchased the higher rated RAM for those speeds. Does this mean that the RAM itself is bad, or the motherboard? If not, is there any way to get the speed that was listed on the packaging? Ultimately, anything/everything will be an upgrade from the 666.5Mhz RAM on my old machine here, but i'll be disappointed if I spent extra money for something I won't be able to utilize.

Link to post
Share on other sites

2 minutes ago, Brandon Shinobu said:

RAM itself was rated for those timings and speed at stock

Doesn't matter, if the mobo doesn't like it, you need manual tuning, especially for running more than 4 modules on high speed at the same time (or full slot).

One way is to increase the voltage, the other is reducing the speed.

Zen 2 officially support up to 3200mhz.

Ryzen 5700g @ 4.4ghz all cores | Asrock B550M Steel Legend | 3060 | 2x 16gb Micron E 2666 @ 4200mhz cl16 | 500gb WD SN750 | 12 TB HDD | Deepcool Gammax 400 w/ 2 delta 4000rpm push pull | Antec Neo Eco Zen 500w

Link to post
Share on other sites

1 minute ago, SupaKomputa said:

Doesn't matter, if the mobo doesn't like it, you need manual tuning, especially for running more than 4 modules on high speed at the same time (or full slot).

One way is to increase the voltage, the other is reducing the speed.

Zen 2 officially support up to 3200mhz.

Understood, and I think I may see where I went wrong then. When I was originally planning my build, I was going to be using a 5950x, but I later decided I needed the extra PCIe lanes for my work and switched over to a threadripper for rendering and load times. I remembered several places mentioning that 3600 was the "sweet spot", but that was probably for the 5950x, not the threadripper, and I never thought to check that.

 

And now I just need for Asus to get back to me and tell me "we didn't find anything wrong with your GPU, here, have it back." I suppose there are harder lessons than $60 RMA shipping costs. Like overpaying for RAM, heh.

 

I really appreciate the help. And thusfar the system is still running without locking up, whereas it was locking up within 5 seconds of Prime95 before. I'll let it run a couple hours to be sure.

 

I honestly don't know how to manual clock my RAM, and for lack of knowledge I'm a bit afraid to, but I do want to learn and also get at least some of the speed I paid for. Do you have any recommendations for speeds/timings that would be safe to do for a 3960x? Or perhaps a tool that would make it easier to do safely?

Link to post
Share on other sites

Yeah 3600 is the sweet spot, but most done it with only 2 modules, 4 if your lucky.

Some motherboard can't run reliably with fully populated slots without having to tweak the settings (primarily the voltage).
AMD only guarantee up to 3200mhz, even on 5th gen.

 

To manually underclock the ram, it's generally go to the bios, in memory settings choose manual instead of XMP.

Select 3200mhz in the option, save and test.

No worries, changing memory settings won't break anything.

If failed, just reset the bios settings, it will go back to default.

Ryzen 5700g @ 4.4ghz all cores | Asrock B550M Steel Legend | 3060 | 2x 16gb Micron E 2666 @ 4200mhz cl16 | 500gb WD SN750 | 12 TB HDD | Deepcool Gammax 400 w/ 2 delta 4000rpm push pull | Antec Neo Eco Zen 500w

Link to post
Share on other sites

9 minutes ago, SupaKomputa said:

Yeah 3600 is the sweet spot, but most done it with only 2 modules, 4 if your lucky.

Some motherboard can't run reliably with fully populated slots without having to tweak the settings (primarily the voltage).
AMD only guarantee up to 3200mhz, even on 5th gen.

 

To manually underclock the ram, it's generally go to the bios, in memory settings choose manual instead of XMP.

Select 3200mhz in the option, save and test.

No worries, changing memory settings won't break anything.

If failed, just reset the bios settings, it will go back to default.

Thanks! Should I keep the same timings of 14-15-15-35, or would I need to change those also?

Link to post
Share on other sites

Well you can try, but if you have a problem, just set it in auto.

If that module chip what i think it is (samsung b-die), it can run at that timings.

The motherboard usually will automatically run it at the safest timings (not the fastest though).

For your applications i suggest stability over speed.

Ryzen 5700g @ 4.4ghz all cores | Asrock B550M Steel Legend | 3060 | 2x 16gb Micron E 2666 @ 4200mhz cl16 | 500gb WD SN750 | 12 TB HDD | Deepcool Gammax 400 w/ 2 delta 4000rpm push pull | Antec Neo Eco Zen 500w

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×