Message boards : Number crunching : GPU version: kernel size tuning and less UI lags
Author | Message |
---|---|
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 519 Credit: 72,451,573 RAC: 0 |
From what I've heard, the RTX series has separate integer and FP units, and this project is integer only. This is probably why RTX gets such a boost. |
Jozef J Joined: 24 Jan 17 Posts: 20 Credit: 1,193,014,322 RAC: 1 |
"Don't expect much of an increase for RTX cards; nearly half of the GPU core is taken up by the tensor units." Nice numbers :) Thank you for sharing. So it is just as Sergei wrote above. A custom 2080 Ti could maybe get 80-100 sec? Do you also have a 2080 Ti? Sadly, prices for custom and MSRP cards are too high. The GTX 1080 Ti was 699 USD; two used 1080 Tis are about 1000 USD now. |
vseven Joined: 15 Mar 18 Posts: 12 Credit: 587,338,410 RAC: 0 |
No, I only got a 2080, not a Ti. I'm actually really surprised the performance was that good. My 1080 only eked out 300-second WUs, so this was very surprising. I'll have to do more testing with other BOINC projects and see whether the results are similar or whether this is something that only benefits this project. Soooo... about that 24 or 25 kernel size. lol. :) |
Jozef J Joined: 24 Jan 17 Posts: 20 Credit: 1,193,014,322 RAC: 1 |
133 vs 601 sec on some tasks. Host: Linux Fedora Fedora 27 (Twenty Seven) [4.18.7-100.fc27.x86_64|libc 2.26 (GNU libc)] https://sech.me/boinc/Amicable/results.php?hostid=54096&offset=0&show_names=0&state=4&appid= This host with an NVIDIA RTX card averages 133 sec per task, but some tasks take 600 sec. <core_client_version>7.10.2</core_client_version> <![CDATA[ <stderr_txt> Initializing prime tables...done X server found. dri2 connection failed! DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument Assuming 131072kB available aperture size. May lead to reduced performance or incorrect rendering. get chip id failed: -1 [22] https://sech.me/boinc/Amicable/result.php?resultid=17599793 is from a 600 sec task. This one is from a 133 sec task: <core_client_version>7.10.2</core_client_version> <![CDATA[ <stderr_txt> Initializing prime tables...done X server found. dri2 connection failed! DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument Assuming 131072kB available aperture size. May lead to reduced performance or incorrect rendering. get chip id failed: -1 [22] param: 4, val: 0 |
vseven Joined: 15 Mar 18 Posts: 12 Credit: 587,338,410 RAC: 0 |
They probably have a 2080 and a second older card. BOINC only reports back how many cards you have installed and only the model of one, usually the fastest one. So the 133 is coming from the 2080 and the 600 is from whatever other card they have. |
Jozef J Joined: 24 Jan 17 Posts: 20 Credit: 1,193,014,322 RAC: 1 |
https://foldingforum.org/viewtopic.php?f=83&t=31051&sid=3692e2e3ee44efee1f6c8544bea63eae&start=15 The 2080 Ti gets an amazingly impressive 2.2 million PPD in Folding@home. https://docs.google.com/spreadsheets/d/1v5gXral3BcFOoXs5n1M6l_Uo3pZpQYogn6gVlxRPnz0/edit#gid=0 Sorry, this is a bit off topic, but maybe it can help this project in some way, if only to show users the performance. |
MagicEye04 Joined: 26 Apr 18 Posts: 5 Credit: 22,935,430 RAC: 0 |
Hi, I am using a Vega VII, and with the standard kernel size of 21 I got a run time of 12,754.61 s. The GPU load was about 0 to 1%. To me it looks as if the GPU is not being used. Now I am trying 23, but maybe something is completely wrong if others have run times of a few minutes and I need hours? |
Kellen Joined: 14 Nov 17 Posts: 70 Credit: 1,000,005,236 RAC: 0 |
Hi MagicEye04, There is nothing wrong with your computer, so please do not worry about that :) Presently, the calculations for this project are performed almost exclusively on the CPU and almost nothing is done on the GPU. You can run multi-threaded CPU tasks and, with your CPU, your run time should be approximately 30 minutes per task if you are using all threads. Regards, Kellen |
MagicEye04 Joined: 26 Apr 18 Posts: 5 Credit: 22,935,430 RAC: 0 |
But why are tasks sent out that seem to use a GPU? Wouldn't it be better to send only the CPU tasks? |
dannyridel Joined: 5 Feb 19 Posts: 7 Credit: 4,388,836 RAC: 0 |
Hi, I'm using AMD Radeon Vega 8 integrated graphics. It seems that with any kernel size the screen lags and soon the cursor becomes immovable. How can I solve this? |
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 519 Credit: 72,451,573 RAC: 0 |
It's not GPU lag in your case; you just need more than 8 GB of system memory to run GPU tasks. Your notebook is swapping and everything starts to lag. |
Azmodes Joined: 5 Mar 17 Posts: 20 Credit: 3,508,065,235 RAC: 142,695 |
When looking at averaged task times, I noticed that my GTX 1650 appears to be at least as productive as my 1080 Ti, if not even a tad better (both 1.1-1.2 million cr/day). I have both running one task at a time, kernel size 23. Is Turing really that superior or am I doing something wrong with the 1080 Ti? The Ti is even running on a DDR4 Linux system, the 1650 only DDR3 and Windows. Both have ample RAM available and core load is close to 100%. |
Kellen Joined: 14 Nov 17 Posts: 70 Credit: 1,000,005,236 RAC: 0 |
Turing really is that much better for these tasks :) |
Azmodes Joined: 5 Mar 17 Posts: 20 Credit: 3,508,065,235 RAC: 142,695 |
I guess so. Another thing: I'm running two tasks at once on this RTX 2080, and although it runs well (and throughput improves), every once in a while a task gets stuck and runs endlessly. I'm reducing the kernel size to 22 to see if it happens again. |
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 519 Credit: 72,451,573 RAC: 0 |
It doesn't get stuck: https://sech.me/boinc/Amicable/workunit.php?wuid=11804634 - another GPU finished it, you just had to wait a bit more. |
Azmodes Joined: 5 Mar 17 Posts: 20 Credit: 3,508,065,235 RAC: 142,695 |
Huh. Why are these so exceptionally long? |
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 519 Credit: 72,451,573 RAC: 0 |
Probably because the task size is 68*10^12, 100 times more than usual. The task generator was definitely not perfect. |
fzs600 Joined: 23 Jan 17 Posts: 8 Credit: 323,736,694 RAC: 0 |
2 WUs were credited too little: (1) Sent 10 Nov 2019, 21:13:38 UTC; reported 14 Nov 2019, 19:29:48 UTC; Completed and validated; run time 141,192.21 sec; CPU time 4,272.16 sec; credit 6,836.19; Amicable Numbers up to 10^21 v3.02 (opencl_nvidia). (2) Sent 10 Nov 2019, 21:13:36 UTC; reported 12 Nov 2019, 20:26:03 UTC; Completed and validated; run time 137,003.05 sec; CPU time 3,820.35 sec; credit 6,836.19; Amicable Numbers up to 10^21 v3.02 (opencl_nvidia). |
Azmodes Joined: 5 Mar 17 Posts: 20 Credit: 3,508,065,235 RAC: 142,695 |
Quoting Sergei Chernykh: "Probably because task size is 68*10^12, 100 times more than usual. Task generator was definitely not perfect." I guess I got another one of these. One of my GPU tasks has been running for almost three hours now and it's 9.5% done. I'll leave it running. |
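
A rough way to sanity-check the numbers in the last few posts is sketched below. It is only an illustration under stated assumptions: progress is taken to be linear in time, "almost three hours" is rounded to 3.0 hours, and the three figures in fzs600's rows are read as run time, CPU time and credit in that order (the usual BOINC results layout). The function names are hypothetical and are not part of BOINC or of the project's application.

```python
SECONDS_PER_DAY = 86_400


def projected_total_hours(elapsed_hours: float, fraction_done: float) -> float:
    """Extrapolate a task's total run time, assuming progress is linear in time."""
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_hours / fraction_done


def credits_per_day(credit: float, run_time_s: float) -> float:
    """Credit rate implied by one task, assuming tasks run back to back."""
    if run_time_s <= 0:
        raise ValueError("run_time_s must be positive")
    return credit * SECONDS_PER_DAY / run_time_s


if __name__ == "__main__":
    # Azmodes: about three hours elapsed at 9.5% done (3.0 h is an assumption).
    print(f"projected total: {projected_total_hours(3.0, 0.095):.1f} h")

    # fzs600: 6,836.19 credits for a run time of 141,192.21 s (assumed columns).
    print(f"credit rate: {credits_per_day(6836.19, 141192.21):.0f} cr/day")
```

Under these assumptions the long task would need a bit over 31 hours in total, and the oversized work units pay on the order of a few thousand credits per day, far below the 1.1-1.2 million cr/day reported earlier in the thread for normal tasks.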
©2024 Sergei Chernykh