Jump to content

Hi All, 

 

I am attempting to build a rig for Data Science. My main question here is which processor would be the best for single thread work/price. Python and R are usually the most commonly used languages in the Data Science community. However, they only use a single core and thread unless you code in the Multi threading and Multiprocessing. Most data crunching is done before optimization and production of an application, hence I am chewing through some good size data on a single core before I put any efficiency in the code. The next main question is that Google's Tensor Flow, which is a machine learning library is able to make use of a GPU, is becoming more commonly used. So some moderately priced GPU advice would be helpful as well.  

 

I have been watching Linus tech tips for a couple years now and the videos that include the Intel Pentium G3258 dual core processor seemed intriguing for the data science purpose due to the high clock speeds and the price. However, with the release of the 7th gen Intel Core i3-7350K I am leaning toward the newer product. Also this is a machine that my IT department will have control over, so any risky overclocking is out of the question. 

 

There are a number efficient pieces of hardware I will be adding the the building including an M.2 SSD over PCI Express 3.0. Here is a what I have in my New Egg shopping cart as of now. Someone please stop me if something isn't going to work. I am also thinking running Ubuntu (or Peppermint) rather than windows.

 

LINUS!!!!!!!!! NEW BUILD IDEA

 

My Build

-CPU: Intel Core i3-7350K

-Mother Board: ASUS Z170-E ATX

-Memory: 64GB CORSAIR Vengeance DDR4 (4 X 16)

-GPU: ASUS GeForce GTX 1060-DUAL 3GB

-Power Supply: CORSAIR CX-M series 750W ATX

-CPU Cooler:Cooler Master RR-212X

-M.2: Toshiba OCZ RD400 128GB PCI Express 3.0

-SSD: 2 Sandisk SSD Plus 1Tb

-Case: Fractal Design Define R5 Black Window Silent ATX Midtower

Link to comment
https://linustechtips.com/topic/743864-data-science-rig-single-core-single-thread/
Share on other sites

Link to post
Share on other sites

I'd just like to point out that python CAN do true (e.g. non-time shared) concurrency. It's not simple and there is no shared memory between threads, but it's possible. If you're looking to make learning models in python, it is possible to do concurrently. I've not used R though.

 

I don't know a ton about data science professionally (I took a graduate course in college, that's it) but I'm pretty sure there are ways to get use of multiple cores. Lots of people I know who do use data science have clusters of large machines to do their crunching, and many research domains are taking advantage of AWS services through amazon for cloud computing. 

 

tl;dr im no expert, but are you SURE you only need single core performance when teaching models? That seems really backwards to me. 

Gaming build:

CPU: i7-7700k (5.0ghz, 1.312v)

GPU(s): Asus Strix 1080ti OC (~2063mhz)

Memory: 32GB (4x8) DDR4 G.Skill TridentZ RGB 3000mhz

Motherboard: Asus Prime z270-AR

PSU: Seasonic Prime Titanium 850W

Cooler: Custom water loop (420mm rad + 360mm rad)

Case: Be quiet! Dark base pro 900 (silver)
Primary storage: Samsung 960 evo m.2 SSD (500gb)

Secondary storage: Samsung 850 evo SSD (250gb)

 

Server build:

OS: Ubuntu server 16.04 LTS (though will probably upgrade to 17.04 for better ryzen support)

CPU: Ryzen R7 1700x

Memory: Ballistix Sport LT 16GB

Motherboard: Asrock B350 m4 pro

PSU: Corsair CX550M

Cooler: Cooler master hyper 212 evo

Storage: 2TB WD Red x1, 128gb OCZ SSD for OS

Case: HAF 932 adv

 

Link to post
Share on other sites

A 7700K is probably your best bet, it has reached over 7GHz on liquid nitro with only 1 core enabled. I'd get a Kraken X62 with a 7700K if your budget allows

-----> Official Unofficial Favorite Keyswitch Type Survey <-----

 OWNER OF THE FASTEST INTEL iGPU ON LTT UNIGINE SUPERPOSITION [lol]

 

GAMING RIG "SNOWBLIND"

CPU i5-13600k | COOLING Corsair H150i Elite Capellix 360mm (White) | MOTHERBOARD Gigabyte Z690 Aero G DDR4 | GPU Gigabyte RTX 3070 Vision OC (White) | RAM  16GB Corsair Vengeance Pro RGB (White)SSD Samsung 980 Pro 1TB | PSU ASUS STRIX 850W (White)CASE  Phanteks G360a (White) | HEADPHONES  Beyerdynamic DT990 Pro | KEYBOARD Zoom75 (KTT Strawberry w/ GMK British Racing Green keycaps) | MOUSE  Cooler Master MM711 (White) MONITOR HP X32 1440p 165hz IPS

 

WORK RIG "OVERPRICED BRICK"

Mac Studio (M2 Ultra / 128GB / 1TB) | HEADPHONES  AirPods Pro 2 | KEYBOARD Logitech MX Mechanical Mini | MOUSE  Logitech MX Master 3S MONITOR 2x Dell 4K 32"

 

SECONDARY RIG "ALCATRAZ"

CPU i7-4770K OC @ 4.3GHz | COOLING Cryorig M9i (review| MOTHERBOARD ASUS Z87-PROGPU Gigabyte 1650 Super Windforce OC | RAM  16GB Crucial Ballistix Sport DDR3 1600 MHzSSD Samsung 860 Evo 512GB | HDD Toshiba 3TB 7200RPMPSU EVGA SuperNOVA NEX 750WCASE  NZXT H230 | HEADPHONES  Sony WH-1000XM3  | KEYBOARD Corsair STRAFE - Cherry MX Brown | MOUSE  Logitech G602 MONITOR LG 34UM58-P 34" Ultrawide

HOLA NIGHT THEMERS

GET YOUR ASS ON NIGHT THEME

OTHER TECH I OWN:

MacBook Pro 16" [M1 Pro/32GB/1TB] | 2022 Volkswagen GTI | iPhone 14 Pro | Sony a6000 | Apple Watch Series 8 45mm | 2018 MBP 15" | Lenovo Flex 3 [i7-5500U, HD5500 (fastest on the forum), 8GB RAM, 256GB Samsung 840 Evo] | PS5, Xbox One & Nintendo Switch [Home Theater setup] | DJI Phantom 3 Standard | AirPods 2 | Jaybird Freedom (two pairs) & X2 [long story, PM if you want to know why I have 3 pairs of Jaybirds]

 

Link to post
Share on other sites

reniat, 

 

You are absolutely correct. Once models are production ready, or real training is to be done, the full spectrum of technology will be implemented. However, in development there are a bunch of tasks like data collection and cleaning that I am just doing on my machine and not looking to write perfectly efficient code. So until I put applications onto production server, a modest machine will do the trick. My current quad core 2.3 GHz laptop isn't overly exciting to work on.  

Link to post
Share on other sites

No matter what you get I would wait until after the full ryzen releas to see if prices change. The 7350K is horibly overpriced and may come down in price once the competition has returned. It would even be worth waiting for the quad core ryzen to see if it can force its price down even more.

 

For what you are doing I think that the 7350K would be the perfect processor if its price is right. For a GPU an RX 460 or GTX 1050 would be a good pick.

Intel Xeon 1650 V0 (4.4GHz @1.4V), ASRock X79 Extreme6, 32GB of HyperX 1866, Sapphire Nitro+ 5700XT, Silverstone Redline (black) RL05BB-W, Crucial MX500 500GB SSD, TeamGroup GX2 512GB SSD, WD AV-25 1TB 2.5" HDD with generic Chinese 120GB SSD as cache, x2 Seagate 2TB SSHD(RAID 0) with generic Chinese 240GB SSD as cache, SeaSonic Focus Plus Gold 850, x2 Acer H236HL, Acer V277U be quiet! Dark Rock Pro 4, Logitech K120, Tecknet "Gaming" mouse, Creative Inspire T2900, HyperX Cloud Flight Wireless headset, Windows 10 Pro 64 bit
Link to post
Share on other sites

7700k is the best single core performance I think if you ignore price.

“Remember to look up at the stars and not down at your feet. Try to make sense of what you see and wonder about what makes the universe exist. Be curious. And however difficult life may seem, there is always something you can do and succeed at. 
It matters that you don't just give up.”

-Stephen Hawking

Link to post
Share on other sites

32 minutes ago, jpauk said:

reniat, 

 

You are absolutely correct. Once models are production ready, or real training is to be done, the full spectrum of technology will be implemented. However, in development there are a bunch of tasks like data collection and cleaning that I am just doing on my machine and not looking to write perfectly efficient code. So until I put applications onto production server, a modest machine will do the trick. My current quad core 2.3 GHz laptop isn't overly exciting to work on.  

i would get the i3, it will be so much faster than the laptop

             ☼

ψ ︿_____︿_ψ_   

Link to post
Share on other sites

what is your cooling situation?

Gaming build:

CPU: i7-7700k (5.0ghz, 1.312v)

GPU(s): Asus Strix 1080ti OC (~2063mhz)

Memory: 32GB (4x8) DDR4 G.Skill TridentZ RGB 3000mhz

Motherboard: Asus Prime z270-AR

PSU: Seasonic Prime Titanium 850W

Cooler: Custom water loop (420mm rad + 360mm rad)

Case: Be quiet! Dark base pro 900 (silver)
Primary storage: Samsung 960 evo m.2 SSD (500gb)

Secondary storage: Samsung 850 evo SSD (250gb)

 

Server build:

OS: Ubuntu server 16.04 LTS (though will probably upgrade to 17.04 for better ryzen support)

CPU: Ryzen R7 1700x

Memory: Ballistix Sport LT 16GB

Motherboard: Asrock B350 m4 pro

PSU: Corsair CX550M

Cooler: Cooler master hyper 212 evo

Storage: 2TB WD Red x1, 128gb OCZ SSD for OS

Case: HAF 932 adv

 

Link to post
Share on other sites

18 minutes ago, done12many2 said:

Did someone say single thread?  :D

 

 

 

 

Capture2.JPG

Capture5.JPG

 

 

 

5 minutes ago, imreloadin said:

Damn son, how high do you have that overclocked lol?

 

Cfu4lXZW4AAjZOU.jpg

 

txoRJ.png

Stuff:  i7 7700k @ (dat nibba succ) | ASRock Z170M OC Formula | G.Skill TridentZ 3600 c16 | EKWB 1080 @ 2100 mhz  |  Acer X34 Predator | R4 | EVGA 1000 P2 | 1080mm Radiator Custom Loop | HD800 + Audio-GD NFB-11 | 850 Evo 1TB | 840 Pro 256GB | 3TB WD Blue | 2TB Barracuda

Hwbot: http://hwbot.org/user/lays/ 

FireStrike 980 ti @ 1800 Mhz http://hwbot.org/submission/3183338 http://www.3dmark.com/3dm/11574089

Link to post
Share on other sites

2 minutes ago, done12many2 said:

 

I'll bet you a $100 that you can't rerun that again right now and post it?  

 

That's right, you'd have to pull the board and hook up all the DICE stuff to do that.  lol  :P

 

Ambient rules!!

I can go beat yours tho on ambient :P

Stuff:  i7 7700k @ (dat nibba succ) | ASRock Z170M OC Formula | G.Skill TridentZ 3600 c16 | EKWB 1080 @ 2100 mhz  |  Acer X34 Predator | R4 | EVGA 1000 P2 | 1080mm Radiator Custom Loop | HD800 + Audio-GD NFB-11 | 850 Evo 1TB | 840 Pro 256GB | 3TB WD Blue | 2TB Barracuda

Hwbot: http://hwbot.org/user/lays/ 

FireStrike 980 ti @ 1800 Mhz http://hwbot.org/submission/3183338 http://www.3dmark.com/3dm/11574089

Link to post
Share on other sites

1 minute ago, done12many2 said:

 

Oh, you said your sub ambient can beat mine on ambient?  I won't argue with that buddy.  Now go find your own sandbox to play in.  xD

No I mean I can go beat your score right now on my ambient setup lol

Stuff:  i7 7700k @ (dat nibba succ) | ASRock Z170M OC Formula | G.Skill TridentZ 3600 c16 | EKWB 1080 @ 2100 mhz  |  Acer X34 Predator | R4 | EVGA 1000 P2 | 1080mm Radiator Custom Loop | HD800 + Audio-GD NFB-11 | 850 Evo 1TB | 840 Pro 256GB | 3TB WD Blue | 2TB Barracuda

Hwbot: http://hwbot.org/user/lays/ 

FireStrike 980 ti @ 1800 Mhz http://hwbot.org/submission/3183338 http://www.3dmark.com/3dm/11574089

Link to post
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×