Jump to content

LTT Folding Team's Emergency Response to Covid-19

Go to solution Solved by GOTSpectrum,

This event has ended and I recommend you guys head over to the Folding Community Board for any general folding conversation. 

 

 

1 hour ago, Inkertus said:

Quick question:  I have a Ryzen 3700x with 12 threads I can spare.  Do you recommend leaving it at the 12 w/1 slot or 6 w/2 slots?

one slot with a higher thread count will produce more points than the same thread count split over two slots because of the Quick Return Bonus (QRB) which gives a disproportianetly larger bonus for Work Units (WUs) returned faster.

 

Having said that on Windows make sure you leave at least one thread free for the OS or it will be very unhappy.

 

The only reason for going with 2 CPU slots is if you get screwed by the "multiple of large primes" bug. In my case I have a 2700x driving 2 GPUs under Linux so 14 threads left over. But 14 is 2 x7 and 7 is considered a large prime so it drops it down to 13 which is also a large prime so it drops it down to 12. So I setup an 8-thread slot and a 6-thread slot as I didn't want to leave any performance on the table

FaH BOINC HfM

Bifrost - 6 GPU Folding Rig  Linux Folding HOWTO Folding Remote Access Folding GPU Profiling ToU Scheduling UPS

Systems:

desktop: Lian-Li O11 Air Mini; Asus ProArt x670 WiFi; Ryzen 9 7950x; EVGA 240 CLC; 4 x 32GB DDR5-5600; 2 x Samsung 980 Pro 500GB PCIe3 NVMe; 2 x 8TB NAS; AMD FirePro W4100; MSI 4070 Ti Super Ventus 2; Corsair SF750

nas1: Fractal Node 804; SuperMicro X10sl7-f; Xeon e3-1231v3; 4 x 8GB DDR3-1666 ECC; 2 x 250GB Samsung EVO Pro SSD; 7 x 4TB Seagate NAS; Corsair HX650i

nas2: Synology DS-123j; 2 x 6TB WD Red Plus NAS

nas3: Synology DS-224+; 2 x 12TB Seagate NAS

dcn01: Fractal Meshify S2; Gigabyte Aorus ax570 Master; Ryzen 9 5900x; Noctua NH-D15; 4 x 16GB DDR4-3200; 512GB NVMe; 2 x Zotac AMP 4070ti; Corsair RM750Mx

dcn02: Fractal Meshify S2; Gigabyte ax570 Pro WiFi; Ryzen 9 3950x; Noctua NH-D15; 2 x 16GB DDR4-3200; 128GB NVMe; 2 x Zotac AMP 4070ti; Corsair RM750x

dcn03: Fractal Meshify C; Gigabyte Aorus z370 Gaming 5; i9-9900k; BeQuiet! PureRock 2 Black; 2 x 8GB DDR4-2400; 128GB SATA m.2; MSI 4070 Ti Super Gaming X; MSI 4070 Ti Super Ventus 2; Corsair TX650m

dcn05: Fractal Define S; Gigabyte Aorus b450m; Ryzen 7 2700; AMD Wraith; 2 x 8GB DDR 4-3200; 128GB SATA NVMe; Gigabyte Gaming RTX 4080 Super; Corsair TX750m

dcn06: Fractal Focus G Mini; Gigabyte Aorus b450m; Ryzen 7 2700; AMD Wraith; 2 x 8GB DDR 4-3200; 128GB SSD; Gigabyte Gaming RTX 4080 Super; Corsair CX650m

Link to comment
Share on other sites

Link to post
Share on other sites

42 minutes ago, Shigeru said:

Surpassed the 50k barrier, yay for me XD

imagen.png.6a8941060b2bb7fd6d61c113febf66ac.png

How come your points are that low from a total of 31 WU?  Are you not getting the Bonus(QRB)?

Link to comment
Share on other sites

Link to post
Share on other sites

5 hours ago, J1mjam said:

Hey guys, 

 

Has anyone had any luck running this stuff in AWS?  I have looked into using spot instances, but I am concerned about the possibility of then having those instances removed and losing the WU. 

 

Of course.  I want to help out more, by donating a little more to the resources but I'm not sure I can really afford the on demand prices for some of the GPU boxes, as they are 4 times the price of the spot instance. 

I run on g4dn.xlarge spot instances running inside containers. Yes, I'll lose the WU if I scale them down because a price trigger happens but I've set it up so the price would have to spike pretty good. 

Link to comment
Share on other sites

Link to post
Share on other sites

Just now, J1mjam said:

How come your points are that low from a total of 31 WU?  Are you not getting the Bonus(QRB)?

I think what I have, but aside of my poor GT 1030 all the other WUs are CPU ones and I started with that account in March 28 :)

Link to comment
Share on other sites

Link to post
Share on other sites

1 hour ago, J1mjam said:

So I got everything working, but only on a Promo k80 instance.  But its better than nothing.  I had to convert the Free Trial to a pay as you go.  It means Ill be charged, but only once i use the free £150.  The free trial is limited to only 4 vCPUs, and all the GPU boxes are 6+. 

 

So now I have a 4 core (E5-2690 v3) slot and a K80 slot going too!  Its only about 30% of the performance of my 1080ti, but every little helps!

Everything counts! I'm also using the promo instances.(2x NC12 Promo) both doing CPU and GPU work. And yes! They are far from powerful. 

Link to comment
Share on other sites

Link to post
Share on other sites

My 2070 back online!  I shall rise from the ashes!

El Zoido:  9900k + RTX 4090 / 32 gb 3600mHz RAM / z390 Aorus Master 

 

The Box:  3900x + RTX 3080 /  32 gb 3000mHz RAM / B550 MSI mortar 

Link to comment
Share on other sites

Link to post
Share on other sites

2 minutes ago, Macaw2000 said:

I run on g4dn.xlarge spot instances running inside containers. Yes, I'll lose the WU if I scale them down because a price trigger happens but I've set it up so the price would have to spike pretty good. 

It's not just the price spike thats the problem though, is it?  Spot instances, by design, can be spun down by AWS whenever they want, due to demand from pay as you go customers?  Or am I misunderstanding that?

 

1 minute ago, Shigeru said:

I think what I have, but aside of my poor GT 1030 all the other WUs are CPU ones and I started with that account in March 28 :)

Have you set up the Passkey correctly, yea?  I guess the CPU WU would explain the low score, but Id have thought that 20+ WU would have got you a little more by now!

Link to comment
Share on other sites

Link to post
Share on other sites

9 hours ago, pred1tor83 said:

 

@n0xlf I am glad to hear you are feeling better, how long did it take you to recover?

I'm on day 17 and still have a bit of a cough.  The bad stuff stopped about 5 days ago.

Link to comment
Share on other sites

Link to post
Share on other sites

2 minutes ago, GalaxyNetworks said:

Everything counts! I'm also using the promo instances.(2x NC12 Promo) both doing CPU and GPU work. And yes! They are far from powerful. 

I'm waiting for my quota increase to go through, so i can spin up a NC12,  I'm stuck on a NC6 right now, a I can only use 10 vCPUs.

Link to comment
Share on other sites

Link to post
Share on other sites

Just now, J1mjam said:

It's not just the price spike thats the problem though, is it?  Spot instances, by design, can be spun down by AWS whenever they want, due to demand from pay as you go customers?  Or am I misunderstanding that?

 

Have you set up the Passkey correctly, yea?  I guess the CPU WU would explain the low score, but Id have thought that 20+ WU would have got you a little more by now!

Yep! The passkey matchs in all machines, but the other factor is what my CPUs are old models, and A8-5600k, Phenom II X4 and one intel i5 2310 (testing several old processors what I have, maybe the E2160 could be good, but the rest are nahhh)

Link to comment
Share on other sites

Link to post
Share on other sites

3 minutes ago, J1mjam said:

It's not just the price spike thats the problem though, is it?  Spot instances, by design, can be spun down by AWS whenever they want, due to demand from pay as you go customers?  Or am I misunderstanding that?

Yeah you are correct. It generally doesn't happen though as long as there is plenty of capacity in the availability zone. You piqued my curiosity though so I just filtered the logs on my cluster of 10 containers and indeed:

 

FS01:0x22:Folding@home Core Shutdown: INTERRUPTED 

FS01:0x22:Caught signal SIGINT(2) on PID 753

 

That means a spot instance got nuked. But, considering it's been around 10 spot instance running over four days, I can live with it.

Link to comment
Share on other sites

Link to post
Share on other sites

1 minute ago, Macaw2000 said:

Yeah you are correct. It generally doesn't happen though as long as there is plenty of capacity in the availability zone. You piqued my curiosity though so I just filtered the logs on my cluster of 10 containers and indeed:

 

FS01:0x22:Folding@home Core Shutdown: INTERRUPTED 

FS01:0x22:Caught signal SIGINT(2) on PID 753

 

That means a spot instance got nuked. But, considering it's been around 10 spot instance running over four days, I can live with it.

Interesting!  Obviously, dumped WU have a few concerns.  If you drop too many, you lose your QRB.  And of course, there's the issue with the lost work, that someone else will have to pick up after the fact.  

 

When the spot instances die, do they just get deallocated, or are their VHD destroyed too.  i.e. is there any way of retrieving the work unit, or at least to upload and inform the server that's it's borked so it doesn't sit until the timeout in order to reassign it to someone else. If you follow me.

Link to comment
Share on other sites

Link to post
Share on other sites

Came back from getting something to eat to find I had a full set of WUs, Looks like I'll be breaking 4 million soon! :)

 

Spoiler

Annotation 2020-03-30 115722.png

 

Link to comment
Share on other sites

Link to post
Share on other sites

2 minutes ago, J1mjam said:

Interesting!  Obviously, dumped WU have a few concerns.  If you drop too many, you lose your QRB.  And of course, there's the issue with the lost work, that someone else will have to pick up after the fact.  

 

When the spot instances die, do they just get deallocated, or are their VHD destroyed too.  i.e. is there any way of retrieving the work unit, or at least to upload and inform the server that's it's borked so it doesn't sit until the timeout in order to reassign it to someone else. If you follow me.

 

Yeah it's just a flag: https://aws.amazon.com/premiumsupport/knowledge-center/ami-preserve-ebs-spot/

 

I'm forked from this repo FYI: https://github.com/raykrueger/FoldingOnECS

 

It's a fancier way to fold than just standing up a bunch of servers. Let's me be more "elastic" but it does risk dropping a WU. It looks like I may drop one of out every 300 WU.

Link to comment
Share on other sites

Link to post
Share on other sites

7 minutes ago, J1mjam said:

When the spot instances die, do they just get deallocated, or are their VHD destroyed too.  i.e. is there any way of retrieving the work unit, or at least to upload and inform the server that's it's borked so it doesn't sit until the timeout in order to reassign it to someone else. If you follow me.

Both options are available (destroy entirely or keep).  Keeping the instance (storage) is new for 2020, so it wasn't that way before.  There are multiple ways to bring the same instance back, so you could do it without losing the WU progress.

Link to comment
Share on other sites

Link to post
Share on other sites

Just now, efka112 said:

10 mil points reached 🥳

1.jpg

NICE!

My Folding Stats - Join the fight against COVID-19 with FOLDING! - If someone has helped you out on the forum don't forget to give them a reaction to say thank you!

 

The only true wisdom is in knowing you know nothing. - Socrates
 

Please put as much effort into your question as you expect me to put into answering it. 

 

  • CPU
    Ryzen 9 5950X
  • Motherboard
    Gigabyte Aorus GA-AX370-GAMING 5
  • RAM
    32GB DDR4 3200
  • GPU
    Inno3D 4070 Ti
  • Case
    Cooler Master - MasterCase H500P
  • Storage
    Western Digital Black 250GB, Seagate BarraCuda 1TB x2
  • PSU
    EVGA Supernova 1000w 
  • Display(s)
    Lenovo L29w-30 29 Inch UltraWide Full HD, BenQ - XL2430(portrait), Dell P2311Hb(portrait)
  • Cooling
    MasterLiquid Lite 240
Link to comment
Share on other sites

Link to post
Share on other sites

9 minutes ago, Macaw2000 said:

 

Yeah it's just a flag: https://aws.amazon.com/premiumsupport/knowledge-center/ami-preserve-ebs-spot/

 

I'm forked from this repo FYI: https://github.com/raykrueger/FoldingOnECS

 

It's a fancier way to fold than just standing up a bunch of servers. Let's me be more "elastic" but it does risk dropping a WU. It looks like I may drop one of out every 300 WU.

 

5 minutes ago, n0xlf said:

Both options are available (destroy entirely or keep).  Keeping the instance (storage) is new for 2020, so it wasn't that way before.  There are multiple ways to bring the same instance back, so you could do it without losing the WU progress.

Are you guys able to let me know how you set it up?

 

I looked into raykruegers repo, but I couldnt get it to work as intended.  I find the networks/firewalls easier to configure in Azure.  I also couldnt find a way for the ECS instance to configure the client to not fold as anon with the basic config.xml, without having to manually go and edit the file for each ec2 instance that it created. 

 

Any tips are welcome!

Link to comment
Share on other sites

Link to post
Share on other sites

5 minutes ago, J1mjam said:

 

Are you guys able to let me know how you set it up?

 

I looked into raykruegers repo, but I couldnt get it to work as intended.  I find the networks/firewalls easier to configure in Azure.  I also couldnt find a way for the ECS instance to configure the client to not fold as anon with the basic config.xml, without having to manually go and edit the file for each ec2 instance that it created. 

 

Any tips are welcome!

That repo describes the config in the container so you'd have to fork the container and put in your own config.

 

Easier though might be to just have it copy the config from an S3 bucket using curl on instance launch. When you launch an EC2 under advanced you can put some code.

 

Edit @J1mjam here's an example of what you could put in the that code box when you launch. This would install the FAH client, install the NVIDIA tools, and copy in your config.

apt-get update
apt-get install -y curl ocl-icd-opencl-dev nvidia-opencl-dev
rm -rf /var/lib/apt/lists/*
curl --silent --fail -o /fahclient.deb https://download.foldingathome.org/releases/public/release/fahclient/debian-testing-64bit/${BASE}/fahclient_${VERSION}_amd64.deb \
dpkg -x ./fahclient.deb / \
rm /fahclient.deb

curl -o output.file https://J1mjamsbucket.s3-us-west-2.amazonaws.com/config.xml

 
Link to comment
Share on other sites

Link to post
Share on other sites

5 minutes ago, Plexas said:

This a tough event, almost 0 work today :| 

What's your time until next attempt at grabbing a WU?

 

If it's above 1 hour, pause the offending slot(s) and resume them. Or you can try restarting the client, restarting the computer, deleting and restarting slots (both of mine were hung this morning and only got them to attempt reconnecting again by killing the slots)

Link to comment
Share on other sites

Link to post
Share on other sites

Is it possible to use a free AWS account to fold with?

My Folding Stats

My BOINC Stats

 

 

VelosterN:

AMD Ryzen 9 5950X - Asus ROG Strix X570-E Gaming - Corsair Vengeance RGB Pro 3600Mhz 32GB - Asus ROG Strix Gaming 6750 XT OC

Corsair Crystal Series 680x RGB - Samsung 970 Evo Plus 250GB NVMe - Samsung 970 Pro 512GB NVMe - Samsung 860 Pro 256 GB 2.5" SSD X2

EVGA P2 80+ Platinum 850Watt PSU - BenQ XL2730Z 27.0" 2560x1440 144 Hz - be quiet! Dark Rock 4

Corsair K70 LUX - Logitech G502 Proteus Spectrum - Sennheiser HD599 - Blue Yeti Mic

Windows 11 Professional Version 22H2

BettyBoop:

AMD Ryzen 5 2600X - Asus ROG Strix B450-I Gaming - Corsair Vengeance LPX 3000Mhz 16GB - Asus ROG Strix Gaming 5500 XT - Fractal Design Core 500

Samsung 860 Pro 512GB Sata - EVGA 550GM 80+ Gold 550Watt SFX PSU - be quiet! Dark Rock 4

Windows 11 Professional Version 22H2

HTPC:

Intel i7 6700K - Asus Maximus Hero VIII - Corsair Vengeance LPX 3000MHz 16GB - MSI RX480 Gaming-X 8GB - Cooler Master 932 HAF

Seagate 250GB HDD - EVGA G2 80+ Gold 650Watt PSU - Corsair H100i

Windows 10 Professional Version 21H2

 

Link to comment
Share on other sites

Link to post
Share on other sites

Has anyone noticed that maybe AWS is folding and not telling anyone?

Essentially, this user is the #1 producer and 3x anonymous. 

Screenshot_2020-03-30 ec2spot User Summary - Folding Home Stats.png

Link to comment
Share on other sites

Link to post
Share on other sites

Guest
This topic is now closed to further replies.


×