Message boards : Number crunching : GPU version tweaking for current WUs
Author | Message |
---|---|
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
The GPU version is becoming more and more CPU-demanding with the current type of work units. This was expected, and it is already affecting owners of GTX 1070 (and faster) GPUs. The fix is to run 2 or more tasks per GPU. My tests have shown that the following app_config.xml <app_config> <app> <name>amicable_2_64</name> <max_concurrent>3</max_concurrent> <report_results_immediately/> <fraction_done_exact/> <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>1</cpu_usage> </gpu_versions> </app> </app_config> is enough to get 100% load on a GTX 1080 Ti now, but I'll have to switch to 4 tasks per GPU soon. |
KaptainBlaZzed Send message Joined: 28 Mar 17 Posts: 1 Credit: 100,600,495 RAC: 0 |
Thanks for this guidance. I was having issues with my 1070 and 1080s getting horrible run times while running a single WU, and reserving more CPU threads did not help. I am now running 3 WUs on my 1070 and 4 WUs on my 1080s. My CPUs are slow Xeons, so I need to run more WUs to keep usage up. I will probably have to bump it up to 4 and 5 WUs respectively to keep usage up. |
Matt Kowal Send message Joined: 12 Feb 17 Posts: 14 Credit: 177,829,581 RAC: 0 |
|
Su_Root@jisaku Send message Joined: 16 May 17 Posts: 3 Credit: 272,027,919 RAC: 0 |
Thanks. My GTX 1050 Ti is at around 75% load; it's fixed. |
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
The GTX 1080 Ti now requires 8 simultaneous tasks to reach 100% load, and my Ryzen 7 1700 @ 3.6 GHz is already at 70% load. Damn, soon it won't be powerful enough to feed the 1080 Ti. |
RobertN Send message Joined: 30 Mar 17 Posts: 7 Credit: 153,642,561 RAC: 0 |
This is with a GTX 1080: GPU load is around 80%. :( |
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
This is with a GTX 1080: I've created a new application, Amicable Numbers up to 2^64 (double check), for selective checking of the current search range. It'll have GPU-friendly work units; one WU will be enough for 100% load even on a 1080 Ti. You can switch to it when it has work to do (hopefully this evening, I'm generating work now). |
RobertN Send message Joined: 30 Mar 17 Posts: 7 Credit: 153,642,561 RAC: 0 |
Works great! :) |
Su_Root@jisaku Send message Joined: 16 May 17 Posts: 3 Credit: 272,027,919 RAC: 0 |
Amicable Numbers up to 2^64: run time 1453.56, CPU time 1453.56 (https://sech.me/boinc/Amicable/result.php?resultid=4303593) ↓↓↓ Amicable Numbers up to 2^64 (double check): run time 559.51, CPU time 8.97 (https://sech.me/boinc/Amicable/result.php?resultid=4312215) Awesome update!!! Thank you! |
tictoc Send message Joined: 4 Feb 17 Posts: 1 Credit: 71,091,343 RAC: 0 |
I'm now up to 12 tasks on my 1070. That puts my 1070 at about 85-90% GPU load. I tried running 14 tasks, but at that setting the tasks errored out within a few seconds. Running this many concurrent GPU tasks, on a single GPU, is uncharted territory for a GPU project, so I am not too surprised that 14x tasks instantly failed. |
Tex1954 Send message Joined: 4 Feb 17 Posts: 4 Credit: 24,049,346 RAC: 0 |
I'm up to 4 tasks on a GTX-980 and still only at 65%-75%... BUT, I can't run anything else because it dramatically decreases the GPU utilization if I do. These new tasks are VERY CPU intensive... I kinda thought the whole point of using a GPU was to offload the CPU and increase speed... maybe we need more than 21 Kernels? 8-) |
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
I'm up to 4 tasks on a GTX-980 and still only at 65%-75%... BUT, I can't run anything else because it dramatically decreases the GPU utilization if I do. These new tasks are VERY CPU intensive... The part that runs on the CPU (generating numbers to check on the GPU) gets harder toward the end of the search. Yes, it can also be done on the GPU, but it's hard to implement without mistakes. Wait a few more days until the next search starts; it will be GPU-friendly, as before. |
cibban Send message Joined: 28 Jun 17 Posts: 4 Credit: 61,849 RAC: 0 |
I can't get BOINC to run more than one WU whatever I do, with GPU at ~30%. I tried some different settings in app_config.xml but nothing changes. I have an old GeForce GTX 560 with 1024 MB RAM. Could that be the reason? |
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
Each WU needs its own share of GPU RAM, so with 1024 MB you can run only one WU at a time. |
cibban Send message Joined: 28 Jun 17 Posts: 4 Credit: 61,849 RAC: 0 |
Thank you! I suspected that. Now I can go on with my day and stop tweaking the untweakable. :) Will the new 10^20 WUs utilize my GPU better? |
cibban Send message Joined: 28 Jun 17 Posts: 4 Credit: 61,849 RAC: 0 |
Since this morning, I get no WU's at all. I'm guessing my GPU RAM is not sufficient for the 10^20 tasks. Could that be it? |
Sergei Chernykh Project administrator Project developer Send message Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
Since this morning, I get no WU's at all. I'm guessing my GPU RAM is not sufficient for the 10^20 tasks. Could that be it? Sorry, but they require at least 1600 MB of RAM each (depending on your kernel size setting). I did my best to fit them into 1024 MB, but it was impossible: the prime number table alone is ~1208 MB in size (all primes up to 14142135624 => 633521375 primes, 2 bytes per prime). |
cibban Send message Joined: 28 Jun 17 Posts: 4 Credit: 61,849 RAC: 0 |
Thanks. I saw that in the eventlog too. I'm just going to have to get a graphics card from this decade. ;) |
klepel Send message Joined: 31 Oct 18 Posts: 3 Credit: 963,602,505 RAC: 425,040 |
The fix is to run 2 or more tasks per GPU. My tests have shown that the following app_config.xml I have two questions: 1) My GTX 1070 does not receive WUs on my Ryzen 1700X with Lubuntu 18.10. What is wrong? 2) On my other system with 2 GTX 970s under Windows 7, the above app_config does not work: it does not run two WUs at the same time. I changed <name>amicable_2_64</name> to <name>amicable_10_20</name>. I should say that I run app_config files on other projects with no problems. Does anybody have any suggestions? Thanks |
klepel Send message Joined: 31 Oct 18 Posts: 3 Credit: 963,602,505 RAC: 425,040 |
The fix is to run 2 or more tasks per GPU. My tests have shown that the following app_config.xml 1) My GTX 1070 does not receive WUs on my Ryzen 1700X with Lubuntu 18.04: STILL DOES NOT DOWNLOAD WUs. 2) Second try. The app_config works on the GTX 970. |
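
For easier copying, here is the app_config.xml from the first post laid out one element per line. The values are the ones quoted in the thread; the comments are added for explanation and are not part of the original file. `<name>` has to match the application you are running (klepel used `amicable_10_20`), and the remark about 0.25 for four tasks per GPU only illustrates how `gpu_usage` works; it is not a tested recommendation.

```xml
<app_config>
  <app>
    <!-- Must match the project's application name; amicable_10_20 was used later in this thread -->
    <name>amicable_2_64</name>
    <!-- Upper limit on tasks of this application running at the same time -->
    <max_concurrent>3</max_concurrent>
    <report_results_immediately/>
    <fraction_done_exact/>
    <gpu_versions>
      <!-- Fraction of one GPU per task: 0.5 allows 2 tasks per GPU, 0.25 would allow 4 -->
      <gpu_usage>0.5</gpu_usage>
      <!-- CPU cores budgeted for each GPU task -->
      <cpu_usage>1</cpu_usage>
    </gpu_versions>
  </app>
</app_config>
```

The file goes in the project's folder inside the BOINC data directory; the client should pick it up after a restart or after re-reading the config files from the BOINC Manager. The comments can be deleted without changing the behavior. Keep in mind the memory limit mentioned above: every additional task needs its own GPU RAM, so cards with little memory may not be able to run more than one.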
©2024 Sergei Chernykh