Message boards : News : New CUDA4.2 applications are out for Kepler GPUs
Author | Message |
---|---|
We have finally uploaded the new applications for Kepler supporting cuda4.2. | |
ID: 25848 | Rating: 0 | rate: / Reply Quote | |
Substantially faster for what GPUs? Only those with the Kepler chips, or the older ones with Fermi chips also? | |
ID: 25854 | Rating: 0 | rate: / Reply Quote | |
Fermi as well. | |
ID: 25855 | Rating: 0 | rate: / Reply Quote | |
I'm still receiving CUDA 4.2 and CUDA 3.1 by turns on the same host. | |
ID: 25856 | Rating: 0 | rate: / Reply Quote | |
I'm still receiving CUDA 4.2 and CUDA 3.1 by turns on the same host. I suggest that the project duplicates, under "Preference of GPUGRID/Run only the selected applications", the possibilities: ACEMD standard: Cuda 3.1; ACEMD standard: Cuda 4.2; ACEMD beta; ACEMD for long runs (8-12 hours on fastest GPU): Cuda 3.1; ACEMD for long runs (8-12 hours on fastest GPU): Cuda 4.2. It is not so much that I care about speed, but in my special case I suspect that the Cuda 3.1 apps cause problems on my GTX570 with driver 301.42, as I have posted before in other threads. | |
ID: 25859 | Rating: 0 | rate: / Reply Quote | |
Interesting... Only one of my two rigs has completed a cuda3.1 WU since this announcement and thus become eligible for the long cuda4.2 app. However, the newly downloaded WU is labeled as using cuda3.1 still. | |
ID: 25863 | Rating: 0 | rate: / Reply Quote | |
Until yesterday evening I let the GTX570 receive only short apps. However, this card got a mix of Cuda 3.1 and Cuda 4.2 work. It seems to me that Cuda 4.2 is slightly more stable than Cuda 3.1 on my configuration. I am currently crunching two long Cuda 3.1 tasks; the first crashed after a few seconds (NAN). The second caused a crash of the computer; luckily, after reboot, the computer continued the same work unit. | |
ID: 25864 | Rating: 0 | rate: / Reply Quote | |
This problem of receiving multiple applications is probably a bug of the server. | |
ID: 25866 | Rating: 0 | rate: / Reply Quote | |
i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage. | |
ID: 25867 | Rating: 0 | rate: / Reply Quote | |
On my 2nd rig, I saw the current cuda3.1 WU getting close to finishing. After it completed and uploaded, I clicked the "reset project" button in BOINC. Now, it could be a complete coincidence, but I did get a WU using cuda4.2 afterward. | |
ID: 25868 | Rating: 0 | rate: / Reply Quote | |
Hmmm, the first 4.2 I received crashed after about 12 min. The workunit seems to have crashed on another host as well. | |
ID: 25870 | Rating: 0 | rate: / Reply Quote | |
Hmmm, the first 4.2 I received crashed after about 12 min. The workunit seems to have crashed on another host as well. I'm sure they are rock solid on Fermi. I have a couple of them running for more than 4 hours now; they will finish soon. However, your GTX 570 is overclocked to 830MHz, and as I wrote about this problem in another thread earlier: The CUDA4.2 failures could be caused by overclocking adjusted for CUDA3.1 tasks. It's possible that many of us have to adjust GPU (and CPU) overclocking (and voltages and cooling) for CUDA4.2 tasks. For example, my GTX 480s need 25mV more for crunching CUDA4.2 tasks without failures; also, my GTX 590s had to be set to 625MHz instead of 725MHz (I don't want to raise their voltage). The CUDA4.2 tasks are less tolerant of overclocking than CUDA3.1 tasks, partly because they are running much faster on the same hardware and on the same clocks. | |
ID: 25871 | Rating: 0 | rate: / Reply Quote | |
Yes, I have seen that other thread and have set the card to default speed now, so now I'm just waiting for the next 4.2 task to arrive :-) | |
ID: 25872 | Rating: 0 | rate: / Reply Quote | |
Wow, almost forgot to bring my clocks back down. My 570 has a 4.2 up next. | |
ID: 25873 | Rating: 0 | rate: / Reply Quote | |
Hello: On my GTX295 with Win 7, CUDA 4.2 tasks are working fine and faster, with OC (666 MHz) | |
ID: 25874 | Rating: 0 | rate: / Reply Quote | |
I will let that run and see if I get any more cuda3.1 WU's. Well, 1st long cuda4.2 finished (and quite a bit faster than cuda3.1, as promised). It immediately downloaded another cuda4.2. All seems well, though 2 WU's hardly indicate a pattern. Will keep an eye on it. On my 2nd rig, I did the same thing after the cuda3.1 WU finished. "Project reset" in BOINC, and got a cuda4.2. There may be something to that, or again maybe just 3 for 3 luck. If anyone isn't getting any cuda4.2's (or a mix), could be worth a shot to try it and see. | |
ID: 25878 | Rating: 0 | rate: / Reply Quote | |
My two 570's (in separate machines) are factory OC'd to 797 Mhz and 780 Mhz. The 780 Mhz card finished a long WU about 30% faster with cuda4.2 than cuda3.1, and I didn't have any issues with errors. Also, given it is summer that card is running warmer than normal. | |
ID: 25879 | Rating: 0 | rate: / Reply Quote | |
Hi All, | |
ID: 25880 | Rating: 0 | rate: / Reply Quote | |
Looks like I just loaded up my first 4.2 app. | |
ID: 25881 | Rating: 0 | rate: / Reply Quote | |
Best work on Cuda 4.2 ! | |
ID: 25882 | Rating: 0 | rate: / Reply Quote | |
Looks like I just loaded up my first 4.2 app. hmmmm, maybe got ahead of myself. after 1:15:00, the countdown to completion is still virtually the same at 15:42:32. I'll be watching closely to see how long this actually takes. | |
ID: 25884 | Rating: 0 | rate: / Reply Quote | |
Hi All, The recursive acronym at its best. http://www.gpugrid.net/host_app_versions.php?hostid=122075 http://www.gpugrid.net/host_app_versions.php?hostid=125945 :) However, you also need to look at each task type individually to see the variation in improvement: I4R101-NATHAN_RPS1120528-13-166-RND7789_0 3518586 24 Jun 2012 | 5:58:48 UTC 24 Jun 2012 | 18:53:21 UTC Completed and validated 38,683.76 1,187.91 60,900.00 Long runs (8-12 hours on fastest card) v6.16 (cuda31) I5R92-NATHAN_RPS1120528-11-166-RND0162_2 3507092 25 Jun 2012 | 21:44:05 UTC 26 Jun 2012 | 6:38:11 UTC Completed and validated 24,334.46 517.02 60,900.00 Long runs (8-12 hours on fastest card) v6.16 (cuda42) GTX470 on Ubuntu 12.04 (195.40), Nathan tasks - 4.2 is 58% faster than 3.1. Anyone wishing to exclusively run CUDA 4.2 tasks (i.e. not both 3.1 and 4.2)? I suggest people select no new tasks, finish any tasks in progress and then reset the project. This will delete the 3.1 app. If you reset while you have a task and are running it on 3.1, it's probably going to start running it again and fail. | |
ID: 25887 | Rating: 0 | rate: / Reply Quote | |
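The 58% figure in the post above can be checked with a little arithmetic. Here is a minimal Python sketch, assuming the first number after "Completed and validated" in each task row is the run time in seconds (the usual layout of the task list). The function and constant names are made up for illustration.

```python
def speedup_percent(old_seconds: float, new_seconds: float) -> float:
    """How much more work per unit time the new app gets through, in percent."""
    return (old_seconds / new_seconds - 1.0) * 100.0


def runtime_reduction_percent(old_seconds: float, new_seconds: float) -> float:
    """How much shorter the new run time is, in percent."""
    return (1.0 - new_seconds / old_seconds) * 100.0


# Figures quoted in the post, assumed to be run times in seconds.
CUDA31_RUNTIME = 38683.76
CUDA42_RUNTIME = 24334.46

if __name__ == "__main__":
    gain = speedup_percent(CUDA31_RUNTIME, CUDA42_RUNTIME)
    saved = runtime_reduction_percent(CUDA31_RUNTIME, CUDA42_RUNTIME)
    print(f"throughput gain: {gain:.1f}%")
    print(f"run time saved:  {saved:.1f}%")
```

The two numbers answer different questions. The throughput gain comes out just under 59%, in line with the quoted 58%, while the run time per task drops by about 37%. The two rows are different work units from the same batch, so this is an estimate rather than a controlled comparison.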
I will let that run and see if I get any more cuda3.1 WU's. Well, crap. After the "reset project" in BOINC, I did get through several long cuda4.2 WU's before getting another cuda3.1 WU. So, seems like it is luck of the draw right now. This problem of receiving multiple applications is probably a bug of the server. Well, sporadic cuda4.2 is better than no cuda4.2 at all. :-) Please do look into the potential server issues sending out both types of WU's to a cuda4.2-eligible machine. | |
ID: 25892 | Rating: 0 | rate: / Reply Quote | |
Doesn't seem like that is a permanent solution. I did that and after receiving 2-3 cuda4.2 WU's, it went back to grabbing a cuda3.1 without intervention. 6/26/2012 9:00:34 AM | GPUGRID | Sending scheduler request: To fetch work. 6/26/2012 9:00:34 AM | GPUGRID | Requesting new tasks for NVIDIA 6/26/2012 9:00:37 AM | GPUGRID | Scheduler request completed: got 1 new tasks 6/26/2012 9:00:39 AM | GPUGRID | Started download of acemd.win.2352 6/26/2012 9:00:39 AM | GPUGRID | Started download of cudart32_31_9.dll 6/26/2012 9:00:42 AM | GPUGRID | Finished download of cudart32_31_9.dll 6/26/2012 9:00:42 AM | GPUGRID | Started download of cufft32_31_9.dll 6/26/2012 9:00:49 AM | GPUGRID | Finished download of acemd.win.2352 6/26/2012 9:00:49 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-LICENSE 6/26/2012 9:00:50 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-LICENSE 6/26/2012 9:00:50 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-COPYRIGHT 6/26/2012 9:00:51 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-COPYRIGHT 6/26/2012 9:00:51 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_1 6/26/2012 9:00:59 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_1 6/26/2012 9:00:59 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_2 6/26/2012 9:01:06 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_2 6/26/2012 9:01:06 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_3 6/26/2012 9:01:11 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_3 6/26/2012 9:01:11 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-pdb_file 6/26/2012 9:01:33 AM | GPUGRID | Finished download of 
I2R137-NATHAN_RPS1120528-12-pdb_file 6/26/2012 9:01:33 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-psf_file 6/26/2012 9:01:54 AM | GPUGRID | Finished download of cufft32_31_9.dll 6/26/2012 9:01:54 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-par_file 6/26/2012 9:01:58 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-par_file 6/26/2012 9:01:58 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-conf_file_enc 6/26/2012 9:01:59 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-conf_file_enc 6/26/2012 9:01:59 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-metainp_file 6/26/2012 9:02:00 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-metainp_file 6/26/2012 9:02:00 AM | GPUGRID | Started download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_7 6/26/2012 9:02:01 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-I2R137-NATHAN_RPS1120528-11-166-RND7611_7 6/26/2012 9:02:20 AM | GPUGRID | Finished download of I2R137-NATHAN_RPS1120528-12-psf_file 6/26/2012 9:02:21 AM | GPUGRID | Starting task I2R137-NATHAN_RPS1120528-12-166-RND7611_0 using acemdlong version 616 (cuda31) in slot 3 | |
ID: 25894 | Rating: 0 | rate: / Reply Quote | |
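For anyone who wants to track which app version each task actually starts with, the "Starting task" line at the end of the log above carries everything needed. A rough Python sketch that pulls the fields out of that message follows. The regular expression is fitted to the one line quoted in this thread, so treat it as an assumption about the message format rather than a documented BOINC interface; `StartedTask` and `parse_start_line` are illustrative names.

```python
import re
from typing import NamedTuple, Optional


class StartedTask(NamedTuple):
    task: str
    app: str
    version: int
    plan_class: str
    slot: int


# Fitted to the "Starting task ..." message quoted in the log above.
_START_RE = re.compile(
    r"Starting task (?P<task>\S+) using (?P<app>\S+) "
    r"version (?P<version>\d+) \((?P<plan>[^)]+)\) in slot (?P<slot>\d+)"
)


def parse_start_line(line: str) -> Optional[StartedTask]:
    """Return the parsed fields, or None if the line is not a task start."""
    m = _START_RE.search(line)
    if m is None:
        return None
    return StartedTask(
        task=m.group("task"),
        app=m.group("app"),
        version=int(m.group("version")),
        plan_class=m.group("plan"),
        slot=int(m.group("slot")),
    )


# The last log line from the post above.
SAMPLE = (
    "6/26/2012 9:02:21 AM | GPUGRID | Starting task "
    "I2R137-NATHAN_RPS1120528-12-166-RND7611_0 using acemdlong "
    "version 616 (cuda31) in slot 3"
)

if __name__ == "__main__":
    print(parse_start_line(SAMPLE))
```

Running this over a saved copy of the event log and counting `plan_class` values would show how often a host gets cuda31 versus cuda42.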
OK, I have the same problem now: I got a cuda 42 app and next I got a cuda 32 app. Interestingly enough, I got no speedup on the cuda 42 WU with the GTX 285 :( It seems there is only a speedup on Fermi (where I got a real speedup) & Kepler. | |
ID: 25897 | Rating: 0 | rate: / Reply Quote | |
You have 301.42, and did get a 4.2 task after resetting, so why the 3.1? I have two systems: A: 1x GTX570 (GPUGrid only) and 1x GT440 (non-GPUGrid, only PrimeGrid, etc.) B: 1x GTX570 (GPUGrid only) and 3x GT240 (non-GPUGrid, only PrimeGrid, etc.) I use the cc_config.xml options to specify which projects get which cards. I devote the 570's to GPUGrid (since I started with you guys), and have merely demoted old GPUGrid cards to other projects that accept them. I am OK getting a mix of cuda4.2 and cuda3.1 tasks, though I hope I don't get ONLY cuda3.1 tasks now, unless I do a reset project. I'd love the 30%+ credit bonus, and I'm sure GPUGrid would love the 30%+ throughput increase from Fermi/Kepler cards. If you guys can identify the server issue (or whatever it is), I think it'll be a win/win. But, for now, I'm happy to continue with the cuda3.2's with the hopes a nice cuda4.2 strolls down my street. :-) | |
ID: 25898 | Rating: 0 | rate: / Reply Quote | |
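The post above mentions using cc_config.xml to decide which projects get which cards, without showing the file. One way to do this in newer BOINC clients is the `exclude_gpu` option. The Python sketch below generates such a file, assuming a client recent enough to support `exclude_gpu`. The device numbers are hypothetical (BOINC lists the real ones in its startup messages), and this is not necessarily the configuration the poster used.

```python
import xml.etree.ElementTree as ET


def build_cc_config(exclusions):
    """Build cc_config.xml text from (project_url, device_num) pairs.

    Each pair tells the client NOT to run that project on that GPU.
    """
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    for url, device_num in exclusions:
        entry = ET.SubElement(options, "exclude_gpu")
        ET.SubElement(entry, "url").text = url
        ET.SubElement(entry, "device_num").text = str(device_num)
        ET.SubElement(entry, "type").text = "NVIDIA"
    return ET.tostring(root, encoding="unicode")


# Hypothetical layout for "system A": device 0 is the GTX570, device 1 the GT440.
# Keep GPUGRID off device 1 so it only ever uses the GTX570.
EXAMPLE = build_cc_config([("http://www.gpugrid.net/", 1)])

if __name__ == "__main__":
    print(EXAMPLE)
```

The output would be saved as cc_config.xml in the BOINC data directory, followed by a client restart or a re-read of the config files. Keeping the other projects off the GTX570 would need one more pair per project, with that project's URL and device 0.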
I would suggest making the long queue CUDA 4.2 only, but I'm not sure how many people have a suitable driver, how many don't, and of those that don't have compatible drivers for CUDA4.2 how many run normal length tasks or both normal and long? | |
ID: 25903 | Rating: 0 | rate: / Reply Quote | |
I would suggest making the long queue CUDA 4.2 only, but I'm not sure how many people have a suitable driver, how many don't, and of those that don't have compatible drivers for CUDA4.2 how many run normal length tasks or both normal and long? As was suggested before, maybe a solution is to have 2 long queues selectable in the preferences: 1 for cuda3.1 long and 1 for cuda4.2 long? Not sure if that is easy/possible, but I know if I could deselect the cuda3.1 long and only select the cuda4.2 long, that'd be great for me. For those that want both or only cuda3.1 long, it still allows full flexibility. | |
ID: 25905 | Rating: 0 | rate: / Reply Quote | |
Wonder how long they will even keep 3.1 with these kinds of results though? | |
ID: 25906 | Rating: 0 | rate: / Reply Quote | |
It's just needed because some people use older drivers, and don't read the forums. | |
ID: 25907 | Rating: 0 | rate: / Reply Quote | |
Why not just post that as of such and such date you will no longer be issuing cuda3.1 work units, and that anybody who hasn't already done so needs to update their driver to 301.xx, and then do it? This will save a lot of aggravation, and will increase overall number crunching totals, even while losing a few crunchers. | |
ID: 25911 | Rating: 0 | rate: / Reply Quote | |
I see that resetting the project when only Long runs are selected just means that 'acemdlong' is used. It does not specify that it's CUDA4.2 or CUDA3.1. | |
ID: 25912 | Rating: 0 | rate: / Reply Quote | |
Given that everyone needs to supply an e-mail address to sign-up, | |
ID: 25914 | Rating: 0 | rate: / Reply Quote | |
Also, just completed | |
ID: 25915 | Rating: 0 | rate: / Reply Quote | |
I'm still receiving CUDA 4.2 and CUDA 3.1 by turns on the same host. Just poking around as a comparison for tasks, and the last MJHarvy I did was around 11.9 hours on my GTX 580. I'm set to finish one in about 5.1 hours on the same card. That is quite an improvement. Kudos, GPU Grid researchers. However, as an FYI, in my poking around I note that this WU http://www.gpugrid.net/workunit.php?wuid=3514448 which is a CUDA 4.2 app WU, was sent to a PC with 296.10 drivers - http://www.gpugrid.net/show_host_detail.php?hostid=89848 | |
ID: 25918 | Rating: 0 | rate: / Reply Quote | |
That card's not doing too well, http://www.gpugrid.net/results.php?hostid=89848 | |
ID: 25920 | Rating: 0 | rate: / Reply Quote | |
I certainly agree that the card is not doing well. Every once in a while, it does do a WU right - the corollary to Murphy's Law in action. LOL | |
ID: 25932 | Rating: 0 | rate: / Reply Quote | |
It distributes cuda4.2 to drivers higher than 295.43, which is the Linux version for cuda4.2. | |
ID: 25937 | Rating: 0 | rate: / Reply Quote | |
Retvari posted that the GPU Grid server would not distribute CUDA 4.2 tasks to PCs running drivers less than 301.42. What I was actually saying is users with the last recommended driver (v285.58 or earlier) won't receive CUDA4.2 tasks. The v295 and v296 drivers are CUDA4.2 capable but not recommended, because of the monitor sleep bug. Because of this bug, many users rolled back to the v285 drivers; that's why it is a good idea to update the drivers to v301.42. Seems this exclusion is not functioning properly. This part is true.... | |
ID: 25939 | Rating: 0 | rate: / Reply Quote | |
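The driver rule described in the last two posts can be written down as a small check. This Python sketch is only an illustration of the rule as stated in the thread, not the GPUGRID scheduler code. Treating 295.43 itself as eligible is an assumption, and the function names are hypothetical.

```python
def parse_driver(version: str) -> tuple:
    """Turn '296.10' into (296, 10) so versions compare as numbers, not text."""
    major, minor = version.split(".")
    return int(major), int(minor)


# Threshold quoted in the thread; treating it as inclusive is an assumption.
CUDA42_MIN_DRIVER = parse_driver("295.43")


def plan_class_for(driver_version: str) -> str:
    """Pick the app version a host would be eligible for under the stated rule."""
    if parse_driver(driver_version) >= CUDA42_MIN_DRIVER:
        return "cuda42"
    return "cuda31"


if __name__ == "__main__":
    for driver in ("285.58", "296.10", "301.42"):
        print(driver, "->", plan_class_for(driver))
```

Under this rule the host with 296.10 drivers is eligible for a CUDA 4.2 task, which agrees with what was observed, while a host rolled back to v285.58 is not. Comparing (major, minor) pairs instead of floats avoids reading "296.10" as 296.1. The check says nothing about why eligible hosts still receive cuda31 tasks in between.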
Do the 4.2 tasks need/make use of as much of a CPU as the 3.1 tasks? I've been watching a 4.2 task run on my GTX 570 and noticed that the CPU utilization by the core I've set aside for GPU tasks is much lower than before. It looks like it uses significantly less RAM, too. | |
ID: 25941 | Rating: 0 | rate: / Reply Quote | |
@ Wiyosaya: my 460 which is clocked at 880 normally gets poala tasks in around 20 hours. It's a bit of a golden card though, they normally don't clock that high. | |
ID: 25946 | Rating: 0 | rate: / Reply Quote | |
Nathans jobs run well as long as I don't use PC for anything else. They are crippling. Remote computer not so good but have no access to it as yet to make adjustments. | |
ID: 25960 | Rating: 0 | rate: / Reply Quote | |
Nathans jobs run well as long as I don't use PC for anything else. They are crippling. Remote computer not so good but have no access to it as yet to make adjustments. Strange, Nathan tasks don't have that effect on my PC, even the new 4.2 tasks that run at 95-99% GPU utilization. Even when I'm running 7 threads of WCG and 1 thread for GPUGrid with a Nathan task I have no problems. A slight lag when changing programs/screens etc., but nothing else. I can sort that slight issue out by freeing up another CPU core. What's your CPU load like? Maybe you need to look at that.... | |
ID: 25963 | Rating: 0 | rate: / Reply Quote | |
I have no slowdown with mine either. All 4.2 tasks on W7 are using a minimum of 95% GPU as well. | |
ID: 25964 | Rating: 0 | rate: / Reply Quote | |
Graphics card is only GTX460 | |
ID: 25970 | Rating: 0 | rate: / Reply Quote | |
Graphics card is only GTX460 I run a 460 as well as a 560, no difference in functionality of the computer when either or both of them are running NATE tasks. Why do you have it in a 1.1 slot?????? Is this card also your primary graphics card? It could be that you are running out of PCIE bandwidth. | |
ID: 26020 | Rating: 0 | rate: / Reply Quote | |
It's a 16x slot; the MB is just a few years old now and I don't have the option of PCIE2 | |
ID: 26027 | Rating: 0 | rate: / Reply Quote | |
Might be due to the high amount of memory required to run these tasks and W7; I'm seeing ~990MB in use. I have a GTX 470 (1279MB), so I have some headroom. However W7 eats some GPU memory leaving you short of 1024MB. Possibly too short. That said Boinc reports 1023MB (maybe another rounding error in the driver), and might not be true anyway (W7 is probably using more, ~60 to 90MB I think). | |
ID: 26029 | Rating: 0 | rate: / Reply Quote | |
I just checked the GPUGrid computing preferences and noticed we still only have 3 queues: long, short and beta. It would be good to find a way to get 4.2 tasks exclusively to machines with the correct mix of hardware and drivers. | |
ID: 26031 | Rating: 0 | rate: / Reply Quote | |
I just checked the GPUGrid computing preferences and noticed we still only have 3 queues: long, short and beta. It would be good to find a way to get 4.2 tasks exclusively to machines with the correct mix of hardware and drivers. I imagine that most/all new tasks will be coded in 4.2. Just have to run the 3.1 hoppers dry. (I hope this is the case anyway. Seems fairly pointless to code in 3.1 now that 4.2 is here and ~40% more efficient) | |
ID: 26043 | Rating: 0 | rate: / Reply Quote | |
There might be a lot of non-4.2 capable drivers in use. | |
ID: 26050 | Rating: 0 | rate: / Reply Quote | |
i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage. 99% usage of what? If it's a CPU core, try telling BOINC to leave one CPU core free for programs other than BOINC workunits. If it's the GPU, I haven't found a usable method yet. | |
ID: 26053 | Rating: 0 | rate: / Reply Quote | |
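Leaving a CPU core free is usually done through the "use at most N% of the processors" computing preference, so the only thing to work out is the percentage. A tiny Python helper for that, with a hypothetical function name:

```python
def max_cpu_percent(total_cores: int, cores_to_free: int = 1) -> float:
    """Percentage of cores BOINC may use so that `cores_to_free` stay idle."""
    if total_cores < 1:
        raise ValueError("total_cores must be at least 1")
    # Always leave BOINC at least one core to work with.
    usable = max(total_cores - cores_to_free, 1)
    return round(100.0 * usable / total_cores, 1)


if __name__ == "__main__":
    for cores in (2, 4, 8):
        print(cores, "cores ->", max_cpu_percent(cores), "%")
```

On a 4-core machine this gives 75%, and on an 8-thread machine 87.5%. As the post says, this only helps when a CPU core is the bottleneck; it does nothing for lag caused by 99% GPU usage.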
i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage. it's the gpu usage | |
ID: 26079 | Rating: 0 | rate: / Reply Quote | |
i got a unit on a gtx260 (win7 x64 latest boinc and drivers), but i had to abort it as it was making my computer completely unusable with 99% usage. Same thing's happening on the GTX275 in my wife's machine. I've simply unchecked "Use GPU while computer is in use" until I get around to upgrading the card. | |
ID: 26080 | Rating: 0 | rate: / Reply Quote | |
Can I receive only cuda 4.2 ? | |
ID: 26089 | Rating: 0 | rate: / Reply Quote | |
Not unless you use the 3.1 to 4.2 workaround | |
ID: 26095 | Rating: 0 | rate: / Reply Quote | |
thanks | |
ID: 26108 | Rating: 0 | rate: / Reply Quote | |
The 4.2 units actually run slower on my old GTX 260, and additionally the newer drivers are still causing threadsafe exit downclocks with other projects. I guess it may finally be time to get some new hardware and relegate the old card to running einstein full time. | |
ID: 26118 | Rating: 0 | rate: / Reply Quote | |
For now, go back to an older driver (285). | |
ID: 26128 | Rating: 0 | rate: / Reply Quote | |
Do the 4.2 tasks need/make use of as much of a CPU as the 3.1 tasks? I've been watching a 4.2 task run on my GTX 570 and noticed that the CPU utilization by the core I've set aside for GPU tasks is much lower than before... Still looking for feedback on this. :-) | |
ID: 26223 | Rating: 0 | rate: / Reply Quote | |
As you observed, the new app appears to use less CPU. Remember that different tasks use different amounts of the CPU, so things could change as and when new tasks come and go. | |
ID: 26227 | Rating: 0 | rate: / Reply Quote | |
As you observed, the new app appears to use less CPU. Remember that different tasks use different amounts of the CPU, so things could change as and when new tasks come and go. Okay, thanks for the feedback. So, perhaps it's still best to reserve a CPU for GPUGrid tasks? | |
ID: 26252 | Rating: 0 | rate: / Reply Quote | |
Yes, if you have plenty of cores/threads (4/8 for example), and for stability reasons as well as performance. | |
ID: 26253 | Rating: 0 | rate: / Reply Quote | |
I am still receiving an occasional 3.1 long run task on my main PC, have selected only to receive long runs. I did the workaround but somehow it reverted to the original 3.1 executable. | |
ID: 26311 | Rating: 0 | rate: / Reply Quote | |