Message boards : News : App update 17 April 2017
Author | Message |
---|---|
Dear All, | |
ID: 46981 | Rating: 0 | rate: / Reply Quote | |
I've checked this on one of my Windows XP x64 hosts, and it has received the v9.18 (CUDA8.0) client. | |
ID: 46983 | Rating: 0 | rate: / Reply Quote | |
I have received two 9.18 CUDA 8.0 tasks on my xp computer, they errored out: | |
ID: 46985 | Rating: 0 | rate: / Reply Quote | |
I have 2 GTX460 cards (Hosts 317733 & 208443) that have been unable to get work since the changes over the weekend. | |
ID: 46989 | Rating: 0 | rate: / Reply Quote | |
I'm getting the same error of not receiving tasks for a GTX 570 with 381.65 Windows 8.1 However no problems with my other cards: GTX 770, and GTX 660TI. | |
ID: 46990 | Rating: 0 | rate: / Reply Quote | |
Based on the recent information that GPUGRID will run on Windows XP for one more year, I tried to continue crunching on my XP machine (driver 361.75). | |
ID: 46998 | Rating: 0 | rate: / Reply Quote | |
What can I do to get the thing work? I tried again late afternoon, and could download new tasks :-) | |
ID: 47015 | Rating: 0 | rate: / Reply Quote | |
I confirm too, that my Windows XP x64 hosts have received the 8.49 app, and it's working. | |
ID: 47023 | Rating: 0 | rate: / Reply Quote | |
Suspending, with the 918 app, is not working correctly! It sometimes takes ~20+ seconds to see the app respond to a suspend request. It should be < 3 seconds! | |
ID: 47040 | Rating: 0 | rate: / Reply Quote | |
Suspending, with the 918 app, is not working correctly! It sometimes takes ~20+ seconds to see the app respond to a suspend request. It should be < 3 seconds! I have the same issue on my windows 10 computer, running GTX980ti cards. | |
ID: 47042 | Rating: 0 | rate: / Reply Quote | |
Suspending, with the 918 app, is not working correctly! It sometimes takes ~20+ seconds to see the app respond to a suspend request. It should be < 3 seconds! Suspending has taken up to 5min on Win8.1. Resume/suspend an issue where (15) WU error upon resume on same or different GPU. WU need to run without interruption to get a proper validation. | |
ID: 47049 | Rating: 0 | rate: / Reply Quote | |
I upgraded both of my PCs to the 381.65 driver, and I am no longer getting work. It is a bit unclear to me whether I should be getting work. From what I understand, my 460 and 580 are fermi based, but from the announcement, I am lead to believe that Fermi is still supported. I do realize that there are others using similar cards and that they are not receiving work, either, so I am guessing that these cards are now no longer supported. | |
ID: 47066 | Rating: 0 | rate: / Reply Quote | |
is it official that Fermi is no longer supported for gpugrid?Yes. Fermi (GTX 4xx & GTX 5xx series) is CC2.0 and CC2.1 which is below the minimum required CC3.0 therefore they are no longer supported for GPUGrid. Due to some undisclosed "compiler problems" the the lesser Kepler based cards (CC3.0, GTX 660Ti) will fail all tasks. See the compute capability (CC, also known as Shader Model (SM)) table on Wikipedia. | |
ID: 47067 | Rating: 0 | rate: / Reply Quote | |
MJH said: For some reason the sm 3.0 support (and only that sm version) is broken. 17 Apr 2017 | 19:49:15 UTC http://www.gpugrid.net/forum_thread.php?id=4551&nowrap=true#46981 The peculiar exception for sm 3.0 devices is due to a compiler problem with CUDA 80 that affects only that hardware version. When that's fixed, hosts with a non-XP Windows will get 918. ..... But I don't know what that means! Is it a problem that GPUGrid must fix, or is it a problem that NVIDIA must fix? I feel like nobody is trying to fix it. | |
ID: 47129 | Rating: 0 | rate: / Reply Quote | |
is it official that Fermi is no longer supported for gpugrid?Yes. Thanks, Retvari. The link will help me pick a new card later this year. I'll probably get something off of e-bay, but I am not sure what yet. I will, however, be considering cards that have more recent SM implementations. ____________ | |
ID: 47136 | Rating: 0 | rate: / Reply Quote | |
is it official that Fermi is no longer supported for gpugrid?Yes. What is interesting is that my GTX 660TI is working just fine. Name e84s30_e74s25p0f47-PABLO_P04637_0_IDP-0-1-RND6466_1 Workunit 12522600 Created 27 Apr 2017 | 14:41:13 UTC Sent 27 Apr 2017 | 16:27:39 UTC Received 28 Apr 2017 | 21:56:18 UTC Server state Over Outcome Success Client state Done Exit status 0 (0x0) Computer ID 151018 Report deadline 2 May 2017 | 16:27:39 UTC Run time 70,132.96 CPU time 16,657.83 Validate state Valid Credit 210,625.00 Application version Long runs (8-12 hours on fastest card) v8.49 (cuda65) Stderr output <core_client_version>7.4.36</core_client_version> <![CDATA[ <stderr_txt> # GPU [GeForce GTX 660 Ti] Platform [Windows] Rev [3212] VERSION [65] # SWAN Device 0 : # Name : GeForce GTX 660 Ti # ECC : Disabled # Global mem : 2048MB # Capability : 3.0 # PCI ID : 0000:02:00.0 # Device clock : 1045MHz # Memory clock : 3004MHz # Memory width : 192bit # Driver version : r381_64 : 38165 # GPU 0 : 65C # GPU 0 : 67C # GPU 0 : 68C # BOINC suspending at user request (exit) # GPU [GeForce GTX 660 Ti] Platform [Windows] Rev [3212] VERSION [65] # SWAN Device 0 : # Name : GeForce GTX 660 Ti # ECC : Disabled # Global mem : 2048MB # Capability : 3.0 # PCI ID : 0000:02:00.0 # Device clock : 1045MHz # Memory clock : 3004MHz # Memory width : 192bit # Driver version : r381_64 : 38165 # GPU 0 : 50C # GPU 0 : 55C # GPU 0 : 59C # GPU 0 : 61C # GPU 0 : 64C # GPU 0 : 65C # GPU 0 : 66C # GPU 0 : 67C # GPU 0 : 68C # Time per step (avg over 12035000 steps): 5.614 ms # Approximate elapsed time for entire WU: 70171.554 s # PERFORMANCE: 44910 Natoms 5.614 ns/day 0.000 ms/step 0.000 us/step/atom 14:49:34 (5348): called boinc_finish </stderr_txt> ]]> | |
ID: 47148 | Rating: 0 | rate: / Reply Quote | |
What is interesting is that my GTX 660TI is working just fine. I think that's because you have a nice simple setup with just one GPU per machine. BOINC can see exactly what you've got, and GPUGrid have set things up properly to avoid sending that machine the app which is causing the problems. Instead, you got Application version Long runs (8-12 hours on fastest card) v8.49 (cuda65) It seems to be the people who are running multiple cards with mixed compute capabilities in the same computer that are hitting snags. It's a long-standing and well known weakness in BOINC that the client only tells the server about the 'best' card in a system. If a CC 3.5 or higher card is present alongside the GTX 660Ti, BOINC will assign the application suitable for that newer card instead, and that's what causes the problem if the task is run on the secondary card. | |
ID: 47149 | Rating: 0 | rate: / Reply Quote | |
MJH (et. al): | |
ID: 47167 | Rating: 0 | rate: / Reply Quote | |
Month 2. | |
ID: 47219 | Rating: 0 | rate: / Reply Quote | |
Suspending, with the 918 app, is not working correctly! It sometimes takes ~20+ seconds to see the app respond to a suspend request. It should be < 3 seconds! 13 days ago I updated to 382.05 from 381.89 - today my 1080/1070/1060/970/970 (z87 system) has 121 consecutive valid 9.18 tasks. Suspend / resume working without issue. WU are taking 1~8 seconds to suspend. Maybe suspend problem been resolved in most recent 381.99.* branch driver? 382.05 is the best driver for Win8.1 in several months (369.00). I'm now able to leave my system unsupervised for extended periods of time. (Just in time for humid summertime crunching.) Prior couple of branches I had a daily random reset - previously thinking might of been a hardware issue or OS problem since it's only a bone stock 2015 Win8.1 with no updates. IMHO: Win8.1 is faster than Win 7 or 10 running (multi) GPU compute. | |
ID: 47272 | Rating: 0 | rate: / Reply Quote | |
what happens to my two hosts on Windows 10 with acemd 918.80 and driver 381.65, you can read here: | |
ID: 47273 | Rating: 0 | rate: / Reply Quote | |
OK, finally know why I am not getting any work for GT 610. So, deleting the project from my client, no more reason to have it there. | |
ID: 47280 | Rating: 0 | rate: / Reply Quote | |
GPUGrid: | |
ID: 47283 | Rating: 0 | rate: / Reply Quote | |
Hello? | |
ID: 47315 | Rating: 0 | rate: / Reply Quote | |
*tap tap* Is this thing on? | |
ID: 47350 | Rating: 0 | rate: / Reply Quote | |
5 weeks have now gone by, with no response to my questions! GPUGrid: | |
ID: 47497 | Rating: 0 | rate: / Reply Quote | |
I'd love to help, somehow. | |
ID: 47540 | Rating: 0 | rate: / Reply Quote | |
I think that the only possible way to help the team is making a lot of donations. They for sure need a lot of money to hire someone with good programming skills. Could we as a community aford that? Probably. Would we? I don't think so. | |
ID: 47543 | Rating: 0 | rate: / Reply Quote | |
And they appear to have enough support, even with the reduced output on Windows. The unsent tasks are down to zero, probably due to the summer lull. There is no point in increased crunching power for the moment. | |
ID: 47544 | Rating: 0 | rate: / Reply Quote | |
I think that the only possible way to help the team is making a lot of donations. They for sure need a lot of money to hire someone with good programming skills. Could we as a community aford that? Probably. Would we? I don't think so. Well put! ____________ Cruncher/Learner in progress. | |
ID: 47580 | Rating: 0 | rate: / Reply Quote | |
Jacob I'm really sorry for this. Matt has effectively left our team so now we have hired a new person to work on Acemd and it's replacement in Acemd3. I doubt it will be very efficient to solve this issue when we can hopefully transition to Acemd3 which will use a more stable code base. If it's something urgent or really annoying such as this case feel free to PM me where I will definitely see it sooner. | |
ID: 47585 | Rating: 0 | rate: / Reply Quote | |
Jacob I'm really sorry for this. Matt has effectively left our team so now we have hired a new person to work on Acemd and it's replacement in Acemd3. I doubt it will be very efficient to solve this issue when we can hopefully transition to Acemd3 which will use a more stable code base. If it's something urgent or really annoying such as this case feel free to PM me where I will definitely see it sooner. I'm sorry you've had to take on the PR role as a scientist. I think having a meeting with the other scientists to allow them to more frequently check the forums and server status would mitigate many of these problems. | |
ID: 47587 | Rating: 0 | rate: / Reply Quote | |
Jacob I'm really sorry for this. Matt has effectively left our team so now we have hired a new person to work on Acemd and it's replacement in Acemd3. I doubt it will be very efficient to solve this issue when we can hopefully transition to Acemd3 which will use a more stable code base. If it's something urgent or really annoying such as this case feel free to PM me where I will definitely see it sooner. Thanks for the reply, Stefan. It was helpful to me! If you decide to solve it, and would like help to reproduce a problem or test a fix, I'm an all-star at both, and would do my best to help. I'm also an active BOINC alpha tester, and work with the BOINC devs sometimes. In the meantime, we eagerly await the promising Acemd3. | |
ID: 47588 | Rating: 0 | rate: / Reply Quote | |
Yeah, I talked with Adria and Pablo today. They should be having a more active role from now on in the forums. | |
ID: 47589 | Rating: 0 | rate: / Reply Quote | |
That would gen great; nothing worst then silence. It's ok to have problems, but sharing it will reduce noise and can activate support. | |
ID: 47590 | Rating: 0 | rate: / Reply Quote | |
That would be great; nothing worst then silence. It's ok to have problems, but sharing it will reduce noise and can activate support. | |
ID: 47591 | Rating: 0 | rate: / Reply Quote | |
Matt has effectively left our team so now we have hired a new person to work on Acemd and it's replacement in Acemd3. As you may remember, last April Matt announced that Windows XP support will definitely be stopped in April 2018. I strongly had the feeling that this was a decision made by himself, as he probably planned not take his time to put together another crunching software for XP. Now that Matt has left GPUGRID, I hope that the team and the new person see the situation differently, and that the crunchers who still use Windows XP for good reason will get a crunching software beyond April 2018. | |
ID: 47592 | Rating: 0 | rate: / Reply Quote | |
If we manage to make the switch to Acemd3 it will be based on OpenMM so it will only depend on where OpenMM can run on. | |
ID: 47594 | Rating: 0 | rate: / Reply Quote | |
Sounds interesting, I hope you can do it Stefan. I read briefly on OpenMM's site, it suggests a support for AMD, so maybe you will begin support for that? Not that I have any AMD cards myself, but it could open up for a larger userbase, which could give a shorter turnaround time for workunits in the end. A plus for your research I take it. | |
ID: 47595 | Rating: 0 | rate: / Reply Quote | |
As for windowsXP, I don't use it myself, but maybe the more tech savvy users on XP, should consider a switch to linux eventually. ... As I wrote in another thread recently, obviously Swan_Sync is NOT possible with Linux (in Windows - this experience I've made myself - it's essential for GPUGRID crunching). That's why some crunchers have not switched so far. On the other hand, one of the replies I got for this recent posting was that Swan_Sync is NOT necessary with Linux. So, no idea what's correct and what not. | |
ID: 47596 | Rating: 0 | rate: / Reply Quote | |
The main concern with GPUGrid is GPU utilization. SWAN_SYNC is a way to mitigate this with windows but definitely not solve it. From what I've found, using linux under the 9.14 application you will see 90%+ on high end GPUs with an appropriately fast cpu. On windows it is hard to match this, even with these techniques. It almost doesn't matter that SWAN_SYNC isn't on linux because the utilization is so high. | |
ID: 47597 | Rating: 0 | rate: / Reply Quote | |
My experience with (or without) Swan_Sync in Windows is the following: | |
ID: 47598 | Rating: 0 | rate: / Reply Quote | |
After I had set Swan_Sync, acemd.exe was using nearly 100% of a CPU core. That would be a show-stopper for me: I use my CPUs for other purposes. And I grumble at developers who wrote OpenCL apps for NVidia cards, for the same reason. Not saying that either of us is right or wrong: it's just that different users have different priorities. It might be nice to have a definitive FAQ on Swan_Sync (if there isn't one already), but it should be left up to each user to apply it or not as they choose. | |
ID: 47599 | Rating: 0 | rate: / Reply Quote | |
Yup, Swan_Sync on Windows was also something I tried, verified worked, then immediately undid. I like my CPUs keeping as busy as possible doing other projects, rather than "spin-waiting" on GPU kernels to make them only slightly faster. | |
ID: 47600 | Rating: 0 | rate: / Reply Quote | |
can we exspect short run tasks to come in the near future? | |
ID: 47601 | Rating: 0 | rate: / Reply Quote | |
After I had set Swan_Sync, acemd.exe was using nearly 100% of a CPU core. Well, on all my PCs (all having a different number of CPU cores) with which I do GPU crunching, I dedicate 1 CPU core to 1 GPU. All other CPU cores are free for other activities. | |
ID: 47602 | Rating: 0 | rate: / Reply Quote | |
After I had set Swan_Sync, acemd.exe was using nearly 100% of a CPU core. Yes. And we're saying that you end up "spending" some CPU cycles, in order to get GPU gains... whereas we are on the opposite end of the spectrum, wanting to use as little CPU as possible to keep the GPU going, so that the CPU (even the unused cycles from the GPU jobs) can be utilized. | |
ID: 47603 | Rating: 0 | rate: / Reply Quote | |
The main concern with GPUGrid is GPU utilization.The main concern of GPUGrid depends on the user's mind, not on GPUGrid. SWAN_SYNC is a way to mitigate this with windows but definitely not solve it.GPU utilization without SWAN_SYNC is improved over newer CUDA drivers, but it depends on many other factors like the WDDM (which can't be turned off, that makes SWAN_SYNC less effective on modern Windows OSes) or the workunit itself (some workunits could gain up to 15% performance boost in the past by applying SWAN_SYNC even on Linux, so I assume that it's possible that some future workunits will be similar) From what I've found, using linux under the 9.14 application you will see 90%+ on high end GPUs with an appropriately fast cpu. On windows it is hard to match this, even with these techniques.Wrong: I have 96% GPU utilization on Windows XP x64, i7-4770k, GTX 980Ti, PABLO_P01106_0_IDP workunit, 1 CPU task; 98% GPU utilization on Windows XP x64, i3-4360, GTX 980Ti, PABLO_all_data_goal_KIX_CMYB-0 workunit, no CPU tasks It almost doesn't matter that SWAN_SYNC isn't on linux because the utilization is so high.It's almost true, because the present workunits do not use that much CPU-GPU interaction. Since we can't turn on SWAN_SYNC under Linux, we can't measure the performance gain, but there could be some, even with the present workunits. The main reason for me to still use Windows XP for crunching is that I want to optimize all resources of my PCs to make the GPUGrid app run as fast as it could, so the lack of SWAN_SYNC under Linux is a show-stopper for me. | |
ID: 47604 | Rating: 0 | rate: / Reply Quote | |
I haven't seen much difference(in tasks's execution speed) between using SWAN_SYNC and increasing priority(High:13) of process acemd-918-80.exe. | |
ID: 47606 | Rating: 0 | rate: / Reply Quote | |
Sorry to interrupt, I just want to say that I have made a small (10 euros) donation. When I am sure that everything goes well I will make another. | |
ID: 47607 | Rating: 0 | rate: / Reply Quote | |
It's good to see more forum participation from project admins/scientists. Even a "We'll take a look at xx issue" comment here and there gives end users the confidence to keep donating processor cycles. | |
ID: 47608 | Rating: 0 | rate: / Reply Quote | |
Now that Matt has left GPUGRID, I hope that the team and the new person see the situation differently, and that the crunchers who still use Windows XP for good reason will get a crunching software beyond April 2018. If we manage to make the switch to Acemd3 it will be based on OpenMM so it will only depend on where OpenMM can run on. do any of the experts here know whether OpenMM can be run on WindowsXP ? I was searching for this kind of information, but did not find anything. | |
ID: 47609 | Rating: 0 | rate: / Reply Quote | |
I still find it odd that a setting like swan_sync needs to be used at all. swan_sync isn't needed at all - I run GPUGrid under Windows without using it. But it is available for use, for those members who wish to arrange their processing priority that way. I also use Process Lasso - there's one particular application, from one other BOINC project, which gains something like a six-fold increase in productivity from being run at real time priority, without having any ill-effects on the general use of the computer (Both GPUGrid and that other application are running on this computer as I type). So I have nothing against using productivity tweaks where available: but they are choices, rather than requirements. | |
ID: 47610 | Rating: 0 | rate: / Reply Quote | |
I still find it odd that a setting like swan_sync needs to be used at all.It doesn't need to be used. This is merely an option put in the hands of the user, to use it if they want to optimize their system to maximize the performance of GPUGrid (at the expense of a CPU task). Other GPU apps do not need it. BOINC or FAH.High GPU usage readings of other GPU apps are usually misleading. You should compare the power consumption of your GPU while running different GPU apps to get a more realistic measurement of the amount of operations done per second. Other GPU apps have lower power consumption while showing higher GPU utilization than GPUGrid. | |
ID: 47611 | Rating: 0 | rate: / Reply Quote | |
I think I read somewhere that the utilisation figures for NVidia GPUs are taken from the first shader multiplex (or render output unit, which seems to be the latest terminology): the rest of the card isn't measured or reported. | |
ID: 47612 | Rating: 0 | rate: / Reply Quote | |
I hope that the team and the new person see the situation differently, and that the crunchers who still use Windows XP for good reason will get a crunching software beyond April 2018. Absolutely it does NOT make any sense to crunch with XP in 2017. 2018.....why not 2020 with XP? | |
ID: 47624 | Rating: 0 | rate: / Reply Quote | |
do any of the experts here know whether OpenMM can be run on WindowsXP ? From OpenMM site: OpenMM is optimized for the latest generation of compute hardware, including AMD (via OpenCL) and NVIDIA (via CUDA) GPUs. We also heavily optimize for CPUs using intrinsics Latest hw has very bad support for XP. Maybe it runs, but with very poor performances | |
ID: 47625 | Rating: 0 | rate: / Reply Quote | |
Sounds interesting, I hope you can do it Stefan. I read briefly on OpenMM's site, it suggests a support for AMD, so maybe you will begin support for that? Not that I have any AMD cards myself, but it could open up for a larger userbase, which could give a shorter turnaround time for workunits in the end. A plus for your research I take it. It's a long story :-( And today there are a lot of tools to help the opencl development from Cuda For example: http://gpuopen.com/whats-new-hip-hcc-rocm-1-6/ But if team has no developers.... | |
ID: 47626 | Rating: 0 | rate: / Reply Quote | |
You cannot install a pascal GPU into an XP machine, and there is no way to hack it like with the 980ti. For the future is 14nm and below, I don't see a point in running these 28nm GPUs for any longer than they have to. | |
ID: 47628 | Rating: 0 | rate: / Reply Quote | |
I still find it odd that a setting like swan_sync needs to be used at all. Fine I'll word it differently. It shouldn't be needed to fully utilize a GPU. As I mentioned, other projects take what is needed before CPU apps. GPUGrid should be no different. | |
ID: 47629 | Rating: 0 | rate: / Reply Quote | |
Fine I'll word it differently. It shouldn't be needed to fully utilize a GPU.In an ideal world that would be this way. Unfortunately the real world is far from the ideal, in many aspects. The GPUGrid app is one of these aspects. However, I still think that even other projects show 99% GPU usage, the GPUGrid app does more FLOPS than the others while showing "only" 90% GPU usage. There are users who have to throttle the GPUGrid client to avoid overheating their GPU. As I mentioned, other projects take what is needed before CPU apps. GPUGrid should be no different.Why not? The science and the methods are different for each project. | |
ID: 47630 | Rating: 0 | rate: / Reply Quote | |
Absolutely it does NOT make any sense to crunch with XP in 2017.Well, if you take a look at the performance page, you'll be surprised. Check the rankings of the PABLO_P01106_2 batch, and you'll find my GTX 980Ti at the 9th place, right above a GTX 1080, and a GTX TitanX (Pascal). If you check the rankings above my GTX 980Ti, you could see that a GTX 1080Ti (which costs 2.5 times as a 2nd hand GTX980Ti) is only 10-15% faster. So much for the WDDM (introduced in Windows Vista and all later versions have it). Oh, that GTX 980Ti runs under the obsolete Windows XP. I would instantly switch to Linux if the Linux app would have the SWAN_SYNC option. To be able to compare the performance of Windows XP with SWAN_SYNC, and Linux without SWAN_SYNC I will install Linux to one of my hosts, and put the same make / model GPU in it (MSI GTX980Ti Gaming 6G) as the one of my Windows XP host has. | |
ID: 47631 | Rating: 0 | rate: / Reply Quote | |
I have noticed an issue where GPUGRID fails to resume properly after being suspended. When BOINC suspends all current tasks due to the CPU being busy for a few seconds (like when launching a large program), your task doesn't resume when the CPU is free again. It says it is running and the elapsed time changes correctly, but no progress is made. However, it will successfully resume after a GPU exclusive task has been run. | |
ID: 47638 | Rating: 0 | rate: / Reply Quote | |
... GPUGRID fails to resume properly after being suspended. ... The task in question was a 918 version.It's a know bug, seems to depend on the workunit batch. A similar bug is when you (or the BOINC manager) suspend(s) the task, it could take a while when it really happens. Is there any additional information I can provide?No. | |
ID: 47639 | Rating: 0 | rate: / Reply Quote | |
It's really too bad that in April Matt put together, in a hurry, a rather buggy software, and short time later he left GPUGRID without eliminating all these bugs :-( | |
ID: 47641 | Rating: 0 | rate: / Reply Quote | |
Fine I'll word it differently. It shouldn't be needed to fully utilize a GPU.In an ideal world that would be this way. Unfortunately the real world is far from the ideal, in many aspects. The GPUGrid app is one of these aspects. However, I still think that even other projects show 99% GPU usage, the GPUGrid app does more FLOPS than the others while showing "only" 90% GPU usage. There are users who have to throttle the GPUGrid client to avoid overheating their GPU. The science and methods have nothing to with with the priority of the executable. 90% is a dream. Try in the 60s with a single task and low 80s with two tasks. Think of the flops that could be achieved if GPU util was above 95%. | |
ID: 47709 | Rating: 0 | rate: / Reply Quote | |
90% is a dream. Try in the 60s with a single task and low 80s with two tasks. Think of the flops that could be achieved if GPU util was above 95%. I think it has long been recognized on some projects that the GPU% is just the value that some counter gives somewhere on the chip, but no one really knows what it is. If the software runs the desired science application as well as the compilers allow (of which I know nothing about), then it works well enough. And the scientists, worthy as they are, are not compiler specialists insofar as I can see, and they probably have better ways of spending their time anyway. | |
ID: 47710 | Rating: 0 | rate: / Reply Quote | |
Think of the flops that could be achieved if GPU util was above 95%. I think the total flops of a GPU driving all multiplexes at 85-90% (probably running into thermal or power throttling) would be much greater than a different app driving the first multiplex at 95% unthrottled, but leaving the rest to idle. | |
ID: 47712 | Rating: 0 | rate: / Reply Quote | |
The science and methods have nothing to with with the priority of the executable.The GPUGrid app (just like any other GPU app) runs at "below normal" process priority, while CPU apps run at "low" process priority (which is one lower than "below normal"). Other processes usually run at "normal" piority. SWAN_SYNC does not change the process priority level, it changes the behavior of the app. With SWAN_SYNC off: 1. the app gives the control back to the OS 2. the GPU gives an interrupt (signaling that it has finished the tiny part of the calculation), and due to this interrupt the OS gives the control back to the GPUGrid app 3. The app reads the result, does some calculations in double precision (if needed), then sends the next piece of the calculation to the GPU. 4. it starts all over again, until the number of iterations has been reached With SWAN_SYNC on: 1. the GPUGrid app continuously polls the GPU waiting for the GPU to finish the actual piece of calculation 2. the app reads the result, does some calculations in double precision (if needed), then sends the next piece of the calculation to the GPU. 3. it starts all over again, until the number of iterations has been reached Now, with modern Windows OSes there's an extra time needed in every CPU-GPU interactions due to Windows Display Driver Model. That's why the GPUGrid app runs faster (has higher GPU utilization) under Windows XP and Linux. This cycle has to run a couple of thousand times per second. If the given batch needs a lot of double precision calculations, the difference of overall processing speed between WDDM and non-WDDM OS will be larger, and with SWAN_SYNC on this difference is even higher (due to the missing step in the processing cycle) Other GPU projects (like SETI@home, or Einstein@home) do not interact that frequently with the GPU, hence they can reach higher GPU utilization. 90% is a dream. Try in the 60s with a single task and low 80s with two tasks. Think of the flops that could be achieved if GPU util was above 95%.I have 95% GPU usage under Windows XP, so it's not the fault of the GPUgrid app alone. | |
ID: 47713 | Rating: 0 | rate: / Reply Quote | |
Excellent detail there, especially the juicy innards of process priorities and the SWAN_SYNC variable! | |
ID: 47714 | Rating: 0 | rate: / Reply Quote | |
Excellent detail there, especially the juicy innards of process priorities and the SWAN_SYNC variable! According to Process Explorer, my CPU apps are running at a priority of 1, on a scale that goes up to 16. | |
ID: 47715 | Rating: 0 | rate: / Reply Quote | |
Excellent detail there, especially the juicy innards of process priorities and the SWAN_SYNC variable!You are right. This is my mistake, as I use a localized OS (in Hungarian) and it calls this priority level "alacsony" which I've translated back to English ("low"), while originally this priority level is called "Idle". | |
ID: 47717 | Rating: 0 | rate: / Reply Quote | |
Assuming an OpenMM based Acemd3 app turns up, all & any user side tweaks, modifications & optimizations will need to be reassessed, primarily to see if they work. Coolbits, nice, SWAN_SYNC, process lasso, priority, HT ON/OFF, freeing up a CPU/thread... | |
ID: 47718 | Rating: 0 | rate: / Reply Quote | |
The importance of PCIE & CPU rates/strengths/weaknesses might change as might the operating systems we can use... no idea whether WDDM would then any longer play a role; and, subsequently, whether WindowsXP, due to it's lack of WDDM, would any longer have an advantage over the newer systems. Besides the question whether OpenMM would run on WindowsXP at all. | |
ID: 47719 | Rating: 0 | rate: / Reply Quote | |
Don't know what the situation is regarding a new app but if an OpenMM app was compiled using VS 2010, in theory it might still work on XP. The latest VS version doesn't support XP IIRC. Other factors (CUDA Drivers, a mainstream Volta release) might still prevent it working and if the WDDM overhead isn't an issue with OpenMM then supporting XP might be unnecessary. | |
ID: 47920 | Rating: 0 | rate: / Reply Quote | |
... and if the WDDM overhead isn't an issue with OpenMM then supporting XP might be unnecessary. good point! The question though is whether WDDM overhead indeed is NOT an issue with OpenMM. Is there anyone who can answer this question for sure? | |
ID: 47921 | Rating: 0 | rate: / Reply Quote | |
Message boards : News : App update 17 April 2017