Message boards : Graphics cards (GPUs) : Computation Error and ACEMD Crashes
Author | Message |
---|---|
Before you ask, yes, I read this FAQ: http://www.gpugrid.net/forum_thread.php?id=1314#12178 <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> # Using device 2 # There are 4 devices supporting CUDA # Device 0: "GeForce GTX 590" # Clock rate: 1.22 GHz # Total amount of global memory: 1610612736 bytes # Number of multiprocessors: 16 # Number of cores: 128 # Device 1: "GeForce GTX 590" # Clock rate: 1.22 GHz # Total amount of global memory: 1610612736 bytes # Number of multiprocessors: 16 # Number of cores: 128 # Device 2: "GeForce GTX 590" # Clock rate: 1.22 GHz # Total amount of global memory: 1610612736 bytes # Number of multiprocessors: 16 # Number of cores: 128 # Device 3: "GeForce GTX 590" # Clock rate: 1.22 GHz # Total amount of global memory: 1610612736 bytes # Number of multiprocessors: 16 # Number of cores: 128 MDIO: cannot open file "restart.coor" </stderr_txt> ]]> I have the NVIDIA 296.10 drivers installed. Yesterday I was running the previous version and attempted an upgrade to fix the issue. As you can see from the above output, I run 2x GTX 590s, which is effectively quad SLI; however, I often keep SLI turned off in the NVIDIA control panel since some games I play get very ornery when it's on. The SLI connector, however, is always attached. They are identical ASUS GTX 590s purchased from the same batch. The FAQ referenced above says to disable screen savers and any kind of graphics processing, but I can sit here and watch the jobs fail after running for a short period of time. Another project I'm a part of (for now) is Milkyway@home; they also utilize CUDA, and their GPU tasks fail instantly, also with "Computation Error". The SETI@home and Collatz project GPU tasks have completed without issue every time so far. I hope that if there is a fix, it can apply to the Milkyway@home project as well, or I'll have to drop it as I'm not doing much to help. Thanks in advance.
-Dave | |
ID: 24159 | Rating: 0 | |
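For anyone comparing logs like the one quoted above, here is a minimal Python sketch that summarizes the device listing in a task's stderr output. The function name `summarize_devices` and the shortened sample string are illustrative only and are not part of BOINC or ACEMD; the line format is assumed to match the output quoted in this thread.

```python
import re

def summarize_devices(stderr_text):
    """Return (selected_device_index, {device_index: card_name})."""
    # "# Using device 2" gives the index of the GPU the task ran on.
    selected = re.search(r"# Using device (\d+)", stderr_text)
    # '# Device 0: "GeForce GTX 590"' gives each device index and card name.
    found = re.findall(r'# Device (\d+): "([^"]+)"', stderr_text)
    devices = {int(index): name for index, name in found}
    return (int(selected.group(1)) if selected else None), devices

# Shortened copy of the listing from the post above (clock/memory lines omitted).
sample = (
    '# Using device 2 # There are 4 devices supporting CUDA '
    '# Device 0: "GeForce GTX 590" # Device 1: "GeForce GTX 590" '
    '# Device 2: "GeForce GTX 590" # Device 3: "GeForce GTX 590"'
)

selected, devices = summarize_devices(sample)
print("Task ran on device", selected)
print("Cards seen:", sorted(set(devices.values())))
```

This only confirms which device the failed task used and whether all listed cards report the same name; it says nothing about why the task failed.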
<snipped> Get rid of the 296.x driver. There is a bug in there with the monitor going into sleep mode and killing any task being computed. The 295.x ones have the same problem. The most recent usable drivers are 290.x ones. Some projects have even blocked anyone with those drivers from getting GPU work (Einstein) until Nvidia fixes it. Another option if you really want to use either series of drivers is to change your power setting to never turn off the monitor and set your screen saver to the same. Won't help with Einstein though. ____________ BOINC blog | |
ID: 24161 | Rating: 0 | |
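As a rough illustration of the advice above, a small Python check could flag driver versions from the branches reported as affected in this thread (295.x and 296.x). This is a sketch under that assumption only: the helper name `is_affected_driver` is hypothetical, and apart from 296.10 the version strings below are made-up placeholders rather than real driver releases.

```python
# Driver branches reported in this thread as having the monitor-sleep bug.
AFFECTED_BRANCHES = {295, 296}

def is_affected_driver(version):
    """Return True if a version string such as '296.10' is in an affected branch."""
    # The branch is the part before the first dot, e.g. '296' in '296.10'.
    branch = int(version.split(".")[0])
    return branch in AFFECTED_BRANCHES

# '296.10' is the version from the original post; the others are placeholders.
for version in ("290.0", "295.0", "296.10"):
    status = "affected" if is_affected_driver(version) else "not reported as affected"
    print(version, status)
```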
Thank you, I'll do that :) | |
ID: 24167 | Rating: 0 | |
I have the same problem as you, David. Did these solutions solve your problem? If so, which method did you choose? | |
ID: 24493 | Rating: 0 | |
The 301.24 beta gets rid of the monitor-sleep bug; furthermore, you will need it in order to run the new beta app, which uses CUDA 4.2. If you wish to keep 295-296, all you have to do is change the power settings so that your computer never puts your monitor to sleep (it also shouldn't hibernate). If you turn your monitor off manually, then after changing these settings, everything works fine. | |
ID: 24494 | Rating: 0 | |
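The power-settings workaround described above can also be scripted. Below is a minimal Python sketch that builds the Windows `powercfg /change` commands for "never turn off the monitor" and "never hibernate" on AC power. The setting names `monitor-timeout-ac` and `hibernate-timeout-ac` and the value `0` (meaning never) are standard `powercfg` options, but the exact option spelling can differ between Windows versions, so treat this as an assumption to verify on your own machine. The helper names are hypothetical.

```python
import subprocess

# powercfg settings to disable; a timeout of 0 means "never".
SETTINGS = ("monitor-timeout-ac", "hibernate-timeout-ac")

def build_commands():
    """Build one powercfg command per setting, each with a timeout of 0."""
    return [["powercfg", "/change", name, "0"] for name in SETTINGS]

def apply_commands(commands):
    """Run the commands. Only meaningful on Windows; call this deliberately."""
    for command in commands:
        subprocess.run(command, check=True)

# Print the commands for review instead of running them automatically.
for command in build_commands():
    print(" ".join(command))
```

The script only prints the commands; call `apply_commands(build_commands())` yourself on a Windows machine if you want them applied.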
Did this affect the Linux version of the NVIDIA driver or just the Windows version? | |
ID: 24584 | Rating: 0 | |
Hi everyone. | |
ID: 24823 | Rating: 0 | |
One, I see you're using the 296 driver; I'm assuming you have your machine set to never sleep. Two, I see your card is overclocked; drop the clock to stock, as well as your memory. If you're crunching on all CPU cores, leave one available for GPUGrid. | |
ID: 24830 | Rating: 0 | |
Hi everyone. I'll make one recommendation: If you are using the BOINC screen saver, turn it off. I have noted that when using the BOINC screen saver, the number of GPUGrid WUs that fail with computation error is high. Having turned off the screen saver completely, I get almost no WUs that fail due to computation errors. BTW - I'm running a mix of Win 7 and Win XP machines, and I do not run GPUGrid on the one Linux machine I have. Lastly, I recommend not using a screen saver at all. LCDs have no burn-in problems like CRTs did; given that, screen savers are eye candy and nothing more. For me, I power off my monitor and let the computer run when running any BOINC project for extended periods of time, and GPUGrid WUs failing due to computation errors is now very rare for me. ____________ | |
ID: 24874 | Rating: 0 | |