Message boards : News : Beta testing starting soon
Author | Message |
---|---|
We will start some beta testing today or tomorrow with the new app for cuda4.2. | |
ID: 28114 | Rating: 0 | rate: / Reply Quote | |
New app is on in beta. Nate will be submitting some workunits soon. | |
ID: 28119 | Rating: 0 | rate: / Reply Quote | |
There are 50 new simulations in the beta queue. These are very simple test workunits, but if everything goes well I will submit a whole batch to the long queue tomorrow. This will help test the new app. | |
ID: 28120 | Rating: 0 | rate: / Reply Quote | |
Outcome Computation error | |
ID: 28124 | Rating: 0 | rate: / Reply Quote | |
I already had about 26 beta work units with similar results from one computer. From what I saw of the received beta work units that resulted in errors, others had similar results. | |
ID: 28125 | Rating: 0 | rate: / Reply Quote | |
The same here: I18R1-NATHAN_tstdhfr-0-1-RND4913_3. W7 64bit GTX560Ti CC 2.1. | |
ID: 28126 | Rating: 0 | rate: / Reply Quote | |
???? 22.01.2013 22:23:23 | GPUGRID | Requesting new tasks for NVIDIA ____________ Member of Boinc Italy. | |
ID: 28129 | Rating: 0 | rate: / Reply Quote | |
Select Normal length tasks too - there are a few being generated, possibly resends. | |
ID: 28130 | Rating: 0 | rate: / Reply Quote | |
I have now selected ALL... ACEMD standard, beta and long but nothing is coming in :-( | |
ID: 28131 | Rating: 0 | rate: / Reply Quote | |
as soon as we have finished testing beta. | |
ID: 28133 | Rating: 0 | rate: / Reply Quote | |
Yes, we are a little "dry" right now (slight shortage of tasks). Bear with us for the next 24 or 48 hours. We are trying to change to an updated application, which we are testing in the beta queue. As soon as the new app is installed, we will have plenty of new WUs to send. | |
ID: 28134 | Rating: 0 | rate: / Reply Quote | |
Still getting many failures on two systems, but did get one that finished: | |
ID: 28137 | Rating: 0 | rate: / Reply Quote | |
The only Beta task I completed was called I1R1-NATHAN_tstdhfr3-0-10-RND6104 | |
ID: 28139 | Rating: 0 | rate: / Reply Quote | |
15x error, 1x success, out of daily quota. | |
ID: 28144 | Rating: 0 | rate: / Reply Quote | |
I just destroyed something like 20 of the new beta tasks by getting a computer error on them after only 2 seconds of starting them. I don't overclock my cards, and they are at as low as I can get them in terms of temperature, which is 70 C or less, so I don't know why they failed. | |
ID: 28147 | Rating: 0 | rate: / Reply Quote | |
We're aware of the continued problems. We believe it is an issue with the application, but are checking the simulations as well. We'll update as soon as we know something. | |
ID: 28150 | Rating: 0 | rate: / Reply Quote | |
The NATHAN_tstdhfr4 batch seems to be working fine on all of my hosts. | |
ID: 28159 | Rating: 0 | rate: / Reply Quote | |
However, there's still no detailed info in the stderr output file about the GPU used to process the workunit.... (which would be very useful for figuring out the source of some errors) | |
ID: 28160 | Rating: 0 | rate: / Reply Quote | |
The NATHAN_tstdhfr4 batch seems to be working fine on all of my hosts.The same here. | |
ID: 28163 | Rating: 0 | rate: / Reply Quote | |
BTW I've received some Ann***_r*-TONI_AGGd8 workunits from the long queue, so there is some new batch also. | |
ID: 28167 | Rating: 0 | rate: / Reply Quote | |
There was an issue with the simulations that has been corrected, and most seem to be finishing successfully now. We are having some other issues with the application that need to be addressed before we can deploy the app on Long. Sometime in the next few days, hopefully. | |
ID: 28168 | Rating: 0 | rate: / Reply Quote | |
Recently received two Beta's. One ran successfully, the other did not. | |
ID: 28169 | Rating: 0 | rate: / Reply Quote | |
Project has no tasks available...sinece yestarday...sob.. ____________ Member of Boinc Italy. | |
ID: 28206 | Rating: 0 | rate: / Reply Quote | |
and i still wonder why everybody in Austria except me get work units and make up to half a million points per day O.o | |
ID: 28209 | Rating: 0 | rate: / Reply Quote | |
Are the new workunits now with smaller upload files? Because the appversion didnt change so i wonder ;) | |
ID: 28232 | Rating: 0 | rate: / Reply Quote | |
I'm running another Beta now and the GPU utilization is only 40% (W7x64). | |
ID: 28277 | Rating: 0 | rate: / Reply Quote | |
trypsin_lig_1_-NOELIA_RL_equ-0-1-RND1972_0 finished at 2202s @ W7 64bit GTX560Ti CC2.1/872MHz, driver 310.90. Low GPU load (begun at 40%, finished at 30%), low VRAM load (119 MB), high CPU load (0,7 CPU core of HT 3770K) => low credit. | |
ID: 28279 | Rating: 0 | rate: / Reply Quote | |
My GPU utilization also ended up around 36% for several tasks. | |
ID: 28281 | Rating: 0 | rate: / Reply Quote | |
getting around 36-39% usage on GTX 470 in win7 | |
ID: 28282 | Rating: 0 | rate: / Reply Quote | |
can not force project to send me beta wus, excluded everthing but beta , but nothings been sended. | |
ID: 28283 | Rating: 0 | rate: / Reply Quote | |
must select "Run test applications?" | |
ID: 28284 | Rating: 0 | rate: / Reply Quote | |
must select "Run test applications?" i have selected this, but nothing happens, still LongRuns only on the run. I stop to try, too many stopped wus. | |
ID: 28285 | Rating: 0 | rate: / Reply Quote | |
also, currently seeing 16-19% usage... not a good sign :( lol read this: http://milkyway.cs.rpi.edu/milkyway/forum_thread.php?id=3118 | |
ID: 28286 | Rating: 0 | rate: / Reply Quote | |
I am running a "acemdbeta version 648 (cuda42)" task called "trypsin_lig_1205_run3-NOELIA_RL2_equ-0-1-RND5309_0" | |
ID: 28288 | Rating: 0 | rate: / Reply Quote | |
right, u could use 3 wus at the same time with an "app_config.xml" file. And boinc version 7.0.44. Specified the app name = "acemdbeta". | |
ID: 28289 | Rating: 0 | rate: / Reply Quote | |
... or is there a bug with the app that leads to poor GPU load? | |
ID: 28290 | Rating: 0 | rate: / Reply Quote | |
counldn´t run any beta app right now. | |
ID: 28291 | Rating: 0 | rate: / Reply Quote | |
Rantanplan, please try to post these kinds of issues in "Number Crunching" or "Server and website". Make sure you don't have specific location preferences set for your computer under "GPUGRID preferences" It is explained more here: http://boinc.berkeley.edu/wiki/Preferences#Location-specific_preferences | |
ID: 28310 | Rating: 0 | rate: / Reply Quote | |
Thanks for the confirmation about low GPU usage not being a bug, Nate. | |
ID: 28311 | Rating: 0 | rate: / Reply Quote | |
Pulling out of BETA testing as several Noelia ones on different machines are stalling. This one got to 20 hours before I could abort it! | |
ID: 28316 | Rating: 0 | rate: / Reply Quote | |
6425256 4112331 30 Jan 2013 | 2:46:46 UTC 30 Jan 2013 | 4:18:27 UTC Completed and validated 3,695.83 2,613.81 1,050.00 ACEMD beta version v6.48 (cuda42) | |
ID: 28320 | Rating: 0 | rate: / Reply Quote | |
These Noelia beta units are entirely too dependent on the CPU. I had to free up another CPU to get them to run faster. They also have a habit of starting out fast and then slow down toward the end. I observed GPU usage can be as high as 77% at the beginning of the task, and drop to 13% in the end. | |
ID: 28328 | Rating: 0 | rate: / Reply Quote | |
beta units were working ok, albeit very low gpu usage, and then yesterday they started locking up my system. Had to do a hard reset. Now system locks up on boot when BOINC starts. Host id 125227, win7 x64 driver 310.70. Will try to update driver. | |
ID: 28329 | Rating: 0 | rate: / Reply Quote | |
Seeing LOTS of errors on the Beta tasks right now. | |
ID: 28333 | Rating: 0 | rate: / Reply Quote | |
Note that different types of tasks have to be tested on the new Beta app. | |
ID: 28334 | Rating: 0 | rate: / Reply Quote | |
Yesterday, I opted out of the beta testing because of the problems it was causing on my computers. Yet, I still kept getting beta WU's. I had to abort them, then set GPUGRID to "No new tasks". Forcing beta WU's on us is NOT COOL! You are jeopardizing the integrity of BOINC. Please let us know when it's SAFE to VOLUNTEER on this project again. | |
ID: 28335 | Rating: 0 | rate: / Reply Quote | |
To opt out of these Beta tests you need to go to GPUGRID preferences,
Separate preferences for home Separate preferences for work
| |
ID: 28336 | Rating: 0 | rate: / Reply Quote | |
When I attempted to opt out of the GPUGRID Beta, I did: | |
ID: 28338 | Rating: 0 | rate: / Reply Quote | |
Hi Rick, | |
ID: 28340 | Rating: 0 | rate: / Reply Quote | |
I have Short & Long projects selected for the default, and all 3 other profiles. Only the beta is unchected. This was the situation when I continued to get beta wu's and had to click "No new tasks" in the BOINC Client in order to keep my computers from freezing up and rebooting. I would love to resume GPUGRID work, but I'm concerned I'll still get more beta wu's, like I did yesterday. Is there a long lag between unchecking beta in the profile, and the scheduler getting the message? Rick | |
ID: 28342 | Rating: 0 | rate: / Reply Quote | |
Unless there is some server side work being performed (which there might have been) it shouldn't be long at all. If you suspend the project, make the online changes and then click update that should be enough to get the new profiles settings into Boinc. Then when you resume the project you should be using your new profile. I couldn't replicate your problem, but I guess the scheduler might ask for new tasks before checking your profiles settings. Most people just forget to deselect the "Run test applications" options, but the scheduler's logic has taken some criticism of late. | |
ID: 28343 | Rating: 0 | rate: / Reply Quote | |
Regarding Sponholz's inability to turn the betas off.. I haven't tried it myself so I can't verify that it works or does not work but I know this feature fails at 2 other projects and they claim it's a known bug in the server code. I have no idea if they are correct in saying it's a known bug, whether they have the feature misconfigured, whether they're using the same server code as GPUgrid or whatever. skgiven wrote: All my Beta's fail except those ending in 1: I believe the 1 is referring to the iteration count rather than a particular type of task, if that's what you were thinking. Many but not all of my beta NOELIA are failing too.I don't see any correlation between failure and iteration count. The failed ones all have "run2" in the name for example trypsin_lig_965_run2-NOELIA_RL2_equ-0-1-RND9201 all "run1" NOELIAs seem to have crunched error free. One "run2" completed successfully All of my failed ones say: Stderr output I don't seem to be getting any more "run2" NOELIA so I assume the failure has been noted and the tap turned off. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 28344 | Rating: 0 | rate: / Reply Quote | |
I know its just an iteration count, but thought there might have been something in it's name; for example, the app tried to write to a file thats name was generated presuming the iteration _1. Anyway, it was just coincidence that every other task was failing. When you think about it you are most likely to have successful tasks ending in _0, then _1 and then _2... If a task has already failed, and the more it's failed then the less likely it will succeed. So if you get a task ending in _5 or _6 the chances of a successful run are relatively poor. | |
ID: 28345 | Rating: 0 | rate: / Reply Quote | |
Thanks for the 'undebugable' server code bug tip. Might explain some of the scheduler quirks, or not. hehe, it's debugable, but only by someone who knows for certain whether there are betas in the queue or not. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 28347 | Rating: 0 | rate: / Reply Quote | |
Yesterday I got 3 Beta's and 2 errored out very quickly. By the third the system froze after a few seconds running the Beta and had to reboot to get any response. After the reboot I got the message that the graphic driver was restored after failing. | |
ID: 28363 | Rating: 0 | rate: / Reply Quote | |
Yesterday I got 3 Beta's and 2 errored out very quickly. By the third the system froze after a few seconds running the Beta and had to reboot to get any response. After the reboot I got the message that the graphic driver was restored after failing. Same thing for me :/ | |
ID: 28366 | Rating: 0 | rate: / Reply Quote | |
Looks like the GPUGRID server is hung up, I've had this message in my Acctivity Log; | |
ID: 28373 | Rating: 0 | rate: / Reply Quote | |
Beta 6.48: 4 of 4 successful. | |
ID: 28383 | Rating: 0 | rate: / Reply Quote | |
I was under the impression that fan speed shall be kept at max during crunching, and for me, everything I do. | |
ID: 28385 | Rating: 0 | rate: / Reply Quote | |
Not if GPU temperature is only 44°C.. and I value my ears ;) | |
ID: 28391 | Rating: 0 | rate: / Reply Quote | |
My observation, on running the NOELIA_RC3 beta units, is they running faster and are less CPU dependent the NOELIA_RL2 units, but they still need work. On Windows XP, the gpu usage is about 60% to 80%, and by freeing another cpu, this increased by a few points. On Windows 7, the gpu usage is about 30% to 50%, with no noticeable increase is gpu usage when I free up a cpu. Also on Windows 7, gpu usage decreases as the unit crunches, though on XP, I didn't notice the drop. | |
ID: 28419 | Rating: 0 | rate: / Reply Quote | |
My observation, on running the NOELIA_RC3 beta units, is they running faster and are less CPU dependent the NOELIA_RL2 units, but they still need work. On Windows XP, the gpu usage is about 60% to 80%, and by freeing another cpu, this increased by a few points. On Windows 7, the gpu usage is about 30% to 50%, with no noticeable increase is gpu usage when I free up a cpu. Also on Windows 7, gpu usage decreases as the unit crunches, though on XP, I didn't notice the drop. I have a couple more things to point out. WU I45R1-NATHAN_tstdhfr6-0-1-RND9717_0 finished crunching on a Windows 7 machine. The gpu usage was 88%. There was no decrease in gpu usage from beginning to end. http://www.gpugrid.net/result.php?resultid=6463595 The next thing is when the computers run these NOELIA_RC3 beta units, one after the other over several hours, the gpu usage drops for the later units compared to the previous ones, and the computers (both XP and 7) need to be rebooted, quite frequently (every few hours) to get the gpu usage level back up. | |
ID: 28421 | Rating: 0 | rate: / Reply Quote | |
The next thing is when the computers run these NOELIA_RC3 beta units......The same here (W7 64bit, GTX560Ti driver 310.90, core 6.12.34, i7-3770K one thread free for GPU, CPU process tamed to high). My system doesn't need to be rebooted, suspending and next enabling GPU computing via GUI is enough. I have noticed spontaneous/autonomic(not sure about the right expression) restarts of the acemd.2764.cuda42.exe process; suspend/enable GPU is necessary just after process restarting. I can see restarts by Balloon Message system of Process Tamer. | |
ID: 28422 | Rating: 0 | rate: / Reply Quote | |
6 days after changing all my profiles to NOT accept beta WU, they are still being forced on me. This is disgusting behavior, causing my machines to lock up and requiring a reboot. I've had to place a "No new tasks" embargo on GPUGRID until you get your act together. Moderator, please notify us when it's SAFE to resume volenteering for GPUGRID. Dissappointed, Rick | |
ID: 28423 | Rating: 0 | rate: / Reply Quote | |
Do you have set yes in the "run test apps" check-box? If you do, un-check it. | |
ID: 28424 | Rating: 0 | rate: / Reply Quote | |
Basically I was seeing the same issues on my systems: | |
ID: 28425 | Rating: 0 | rate: / Reply Quote | |
I've had the "ACEMD Beta" UNCHECKED for 6 days. I'm NOT using ANY manager. The only project I'm having unwanted beta WU's is from GPUGRID. However, I would really appreciate you letting me (us) know when the beta testing has stopped, so I can resume accepting ANY GPUGRID tasks. Thanks in advance, Rick | |
ID: 28426 | Rating: 0 | rate: / Reply Quote | |
I've had the "ACEMD Beta" UNCHECKED for 6 days. If skgiven says he can't replicate your problem then that's a pretty good indication you've got the settings wrong. On the other hand, there is a chance he turned off beta tasks just when there were no beta tasks in the queue and mistakenly figured he received no betas because the settings work as intended. Or he didn't even bother trying to replicate your problem. Here's what I'm gonna do... I'll turn off betas and then watch some other hosts' task lists. If I do not get betas while they do then that means there is no bug in the server code in use here and you screwed up. Just to make this interesting and to attempt to force you to double-check your settings.... if it turns out you've screwed up then you have to attach your fastest host to my account via my weak account key and crunch 500,000 credits for me. I'll PM you and skgiven my password, you can verify my settings for yourselves. If it turns out there is a bug then skgiven's gonna attach his fastest host to your account via the weak account key and crunch 500,000 credits for you. (Actually this is just a proposal, he hasn't agreed to this so far.) So who's willing to put their money... errmmm credits.... where their mouth is? Do we have a deal, gentlemen? btw, you're right, I bear no risk and stand only to gain, because I'm the one proposing a way to break the deadlock and I should be duly rewarded for my magnanimous effort ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 28428 | Rating: 0 | rate: / Reply Quote | |
I've had the "ACEMD Beta" UNCHECKED for 6 days. Do you have set yes in the "run test apps" check-box? If you do, un-check it. These two are separate settings. Unchecking "ACEMD Beta" won't stop the server sending you beta workunits. You have to uncheck the "Run test apps" checkbox also. | |
ID: 28429 | Rating: 0 | rate: / Reply Quote | |
Retvari, you are my hero! I also feel real dumb, because I did not see the run test apps box above the other application choices. Thanks all of you for helping me get it right. I'll begin accepting WU's again, and contribute to the cause. Regards,(and embarassed) Rick | |
ID: 28430 | Rating: 0 | rate: / Reply Quote | |
Dagorath & Rick, I have nothing to lose or gain either, other than fun so it's fine by me and I'm quite prepared to 'up the ante', significantly! | |
ID: 28431 | Rating: 0 | rate: / Reply Quote | |
Dagorath & Rick, I have nothing to lose or gain either, other than fun so it's fine by me and I'm quite prepared to 'up the ante', significantly! Lol! I had a hunch you had the confidence to up the ante :) I apologize for doubting everybody's word and now that Rick has solved the problem I hope y'all can understand why I am a doubter. In one of Rick's earlier posts (message 28338) in this thread he assures everybody he unchecked the "Run test apps?" setting. Now it turns out he did not. Not saying I'm any better, I've done much the same or even dumber on many occasions. First the knees go, then the eyes, before you know it you have to carry a map with your home marked with a big red X so you can find your way home. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 28432 | Rating: 0 | rate: / Reply Quote | |
Sometimes you need to double, triple, quadruple, 'and the next-one' check :) | |
ID: 28433 | Rating: 0 | rate: / Reply Quote | |
It does go both ways, I wanted to run some Beta Apps on one of my machines, and it turns out for nearly two weeks I forgot to Check the "Send Test Applications " box. Hopefully I start getting some soon, but its my slowest machine, and since the only tasks have been the Huge Long Runs, it only searches for work every two days or so. | |
ID: 28434 | Rating: 0 | rate: / Reply Quote | |
Besides testing a new application, are we doing any other scientific work with these betas? | |
ID: 28435 | Rating: 0 | rate: / Reply Quote | |
Actually yes, WUs in betaqueue right now are the first step of the simulations which will be sent to short queue. So this is already the real thing. | |
ID: 28436 | Rating: 0 | rate: / Reply Quote | |
I suspended a queued GPUGrid Beta this morning, allowed a running GPUGrid task to finished and then Resumed the GPUGrid Beta, trypsin_lig_127_1-NOELIA_RC3_equ-0-1-RND4222_0 (still running). | |
ID: 28437 | Rating: 0 | rate: / Reply Quote | |
I have had a beta WU that completed without error: | |
ID: 28438 | Rating: 0 | rate: / Reply Quote | |
I suspended a queued GPUGrid Beta this morning, allowed a running GPUGrid task to finished and then Resumed the GPUGrid Beta, trypsin_lig_127_1-NOELIA_RC3_equ-0-1-RND4222_0 (still running). Is this much detail useful for the developers? If it is then I would be willing to create a script that would poll the card for this info periodically, save it to disk and ftp the file to some address. It would have the task name of course and each entry would include % completion, fan speed, load, clocks, whatever info would be helpful. I have a library of Python functions that implement the GUI RPC calls and together with the nvidia-settings and nvcontrol apps anything is possible. Would probably run on Windows too. ____________ BOINC <<--- credit whores, pedants, alien hunters | |
ID: 28439 | Rating: 0 | rate: / Reply Quote | |
Hi, | |
ID: 28450 | Rating: 0 | rate: / Reply Quote | |
Hi, Well I will help to clear the beta queue, but I don't get them often. I have checked "run test applications" and " beta". ____________ Greetings from TJ | |
ID: 28454 | Rating: 0 | rate: / Reply Quote | |
I had particularly bad experience with a beta unit. Most bad WU simply give you computation error message when they crash, and you go on to the next WU, without any reboot or computer crash. This unit ran for a few seconds, froze up the computer, then blue screen, and the computer reboots. It did this few times, before I aborted the unit. It also cause another perfectly good WU to crash as well. | |
ID: 28466 | Rating: 0 | rate: / Reply Quote | |
This task appeared to do something similar; cause a system reboot somehow. | |
ID: 28467 | Rating: 0 | rate: / Reply Quote | |
I think this one belongs in the list too: | |
ID: 28476 | Rating: 0 | rate: / Reply Quote | |
Thanks for bringing it to our attention. There was an issue building a small number of the simulations, which our checks didn't catch before they were sent out. We have cancelled the work units that were crashing machines, but it is possible that there are others so let us know if it happens again. Crashing your machines is obviously the last thing we want to do. In the future we can avoid this with additional checks we'll be doing for this type of work unit. | |
ID: 28482 | Rating: 0 | rate: / Reply Quote | |
This WU: trypsin_lig_904_3-NOELIA_RC3_equ-0-1-RND0962, resultied in the nVidia driver to stop. However it recovered automatically without booting the system. | |
ID: 28483 | Rating: 0 | rate: / Reply Quote | |
Been running a few betas now. GPU load varies between WUs in the range of 2x - 3x% (GTX660Ti). Accordingly, Power consumption, temperature, fan speed and memeory controller load are really low. Runtimes for 1500 credit-WUs vary between 1700 and 4000s. | |
ID: 28486 | Rating: 0 | rate: / Reply Quote | |
Had a repeat of my BSOD, though this time it seemed to be another project which triggered it. | |
ID: 28494 | Rating: 0 | rate: / Reply Quote | |
Here is a beta unit that ran rather slowly. | |
ID: 28502 | Rating: 0 | rate: / Reply Quote | |
I have set my system to accept only beta WU's to help clear the queue. However today I got 9 WU's that error out quickly. All are Noelia's run 2 and run 3 and only one run 4. All the run4-Noelia from yesterday and this morning (11 and 4) finished correctly. | |
ID: 28517 | Rating: 0 | rate: / Reply Quote | |
This one: trypsin_lig_1259_run2-NOELIA_RL3_equ-0-1-RND7950 and 2 more (1 run1) where resulting in an unresponsive system. Mouse pointer was moveable not click-able. All windows freeze for a few minutes then screen blank, and back with a notification that display driver had recovered, but again all windows freeze immediately. I had to abort these step by step in the seconds the system was responsive. | |
ID: 28531 | Rating: 0 | rate: / Reply Quote | |
Hi, OK, we seem to be done. The server status page says there are no tasks in the Beta queue, and my log just got these messages: 17/02/2013 15:07:56 | GPUGRID | Reporting 1 completed tasks So, fastening my seat belt and holding on tight for the next twist in the roller-coaster ride that is beta testing... :-) | |
ID: 28564 | Rating: 0 | rate: / Reply Quote | |
ACEMD beta version 0 248 0.57 (0.17 - 1.26) 111 | |
ID: 28565 | Rating: 0 | rate: / Reply Quote | |
Well, if a task crashes, a replacement is generated. I just got | |
ID: 28569 | Rating: 0 | rate: / Reply Quote | |
Probably this one, trypsin_lig_904_4-NOELIA_RC3_equ-0-1-RND8427_4 | |
ID: 28570 | Rating: 0 | rate: / Reply Quote | |
Message boards : News : Beta testing starting soon