Advanced search

Message boards : Graphics cards (GPUs) : No work available? Please install 6.4.2

Author Message
Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4472 - Posted: 18 Dec 2008 | 13:30:34 UTC

Links available from
gpugrid.net/join.php

Please also do a project reset. The client moved from cpu time to elapsed time estimates based on flops, so we had to correct it on the server too. This create confusion between old and new clints.

BOINC will only try to debug problems on 6.4.x. So, after the update, if you still don't get WUs, write here your host id, so that I can scan it in the log.

many thanks,
gdf

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4478 - Posted: 18 Dec 2008 | 13:52:31 UTC - in response to Message 4472.

OK ..... took host 17415 40 6.4.2. Reset project after version change - checked DCF manually - it was at 1.000000.
Suspended all other project work - manually updated GPUGrid - got 2 x 1 WU estimate to completion 238 hours.
Third attempt to get more WUs (this host has 2 x 260s) received the response below.

18/12/2008 13:47:13|GPUGRID|Sending scheduler request: To fetch work. Requesting 4497293 seconds of work, reporting 0 completed tasks
18/12/2008 13:47:18|GPUGRID|Scheduler request completed: got 0 new tasks
18/12/2008 13:47:18|GPUGRID|Message from server: No work sent
18/12/2008 13:47:18|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
18/12/2008 13:47:18|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

Not sure where to go from here.

P.

Matthew Lei
Avatar
Send message
Joined: 4 Dec 08
Posts: 7
Credit: 2,718,779
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 4479 - Posted: 18 Dec 2008 | 14:04:43 UTC

That method doesn't work for me, I've already tried to downgrade to 6.4.2 and also reseting the project. My host ID is 18432.

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4483 - Posted: 18 Dec 2008 | 15:34:40 UTC
Last modified: 18 Dec 2008 | 15:52:00 UTC

Reverted to 6.4.2 and still getting no work available messages for host ID 8882

Messages:


Thu 18 Dec 2008 15:28:20 GMT||Starting BOINC client version 6.4.2 for x86_64-pc-linux-gnu
Thu 18 Dec 2008 15:28:20 GMT||log flags: task, file_xfer, sched_ops
Thu 18 Dec 2008 15:28:20 GMT||Libraries: libcurl/7.18.0 OpenSSL/0.9.8g zlib/1.2.3.3 c-ares/1.5.1
Thu 18 Dec 2008 15:28:20 GMT||Data directory: /home/richard/Desktop/Boinc/BOINC
Thu 18 Dec 2008 15:28:20 GMT||Processor: 2 GenuineIntel Intel(R) Core(TM)2 CPU 6700 @ 2.66GHz [Family 6 Model 15 Stepping 6]
Thu 18 Dec 2008 15:28:20 GMT||Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm
Thu 18 Dec 2008 15:28:20 GMT||OS: Linux: 2.6.24-19-generic
Thu 18 Dec 2008 15:28:20 GMT||Memory: 1004.55 MB physical, 2.44 GB virtual
Thu 18 Dec 2008 15:28:20 GMT||Disk: 55.84 GB total, 49.99 GB free
Thu 18 Dec 2008 15:28:20 GMT||Local time is UTC +0 hours
Thu 18 Dec 2008 15:28:20 GMT||Not using a proxy
Thu 18 Dec 2008 15:28:20 GMT||CUDA devices found
Thu 18 Dec 2008 15:28:20 GMT||Coprocessor: GeForce 8800 GT (1)
Thu 18 Dec 2008 15:28:21 GMT||Version change (6.4.5 -> 6.4.2)
Thu 18 Dec 2008 15:28:21 GMT|GPUGRID|URL: http://www.gpugrid.net/; Computer ID: 8882; location: (none); project prefs: default
Thu 18 Dec 2008 15:28:21 GMT||General prefs: from GPUGRID (last modified 20-Aug-2008 22:12:44)
Thu 18 Dec 2008 15:28:21 GMT||Host location: none
Thu 18 Dec 2008 15:28:21 GMT||General prefs: using your defaults
Thu 18 Dec 2008 15:28:21 GMT||Preferences limit memory usage when active to 753.42MB
Thu 18 Dec 2008 15:28:21 GMT||Preferences limit memory usage when idle to 904.10MB
Thu 18 Dec 2008 15:28:21 GMT||Preferences limit disk usage to 27.92GB
Thu 18 Dec 2008 15:28:21 GMT||Running CPU benchmarks
Thu 18 Dec 2008 15:28:21 GMT||Suspending computation - running CPU benchmarks
Thu 18 Dec 2008 15:28:52 GMT||Benchmark results:
Thu 18 Dec 2008 15:28:52 GMT|| Number of CPUs: 2
Thu 18 Dec 2008 15:28:52 GMT|| 2784 floating point MIPS (Whetstone) per CPU
Thu 18 Dec 2008 15:28:52 GMT|| 7879 integer MIPS (Dhrystone) per CPU
Thu 18 Dec 2008 15:29:00 GMT|GPUGRID|Resetting project
Thu 18 Dec 2008 15:29:06 GMT|GPUGRID|Sending scheduler request: Requested by user. Requesting 172802 seconds of work, reporting 0 completed tasks
Thu 18 Dec 2008 15:29:11 GMT|GPUGRID|Scheduler request completed: got 0 new tasks
Thu 18 Dec 2008 15:29:11 GMT|GPUGRID|Message from server: No work sent
Thu 18 Dec 2008 15:29:11 GMT|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
Thu 18 Dec 2008 15:29:11 GMT|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
Thu 18 Dec 2008 15:29:13 GMT|GPUGRID|Started download of logops3grid.png
Thu 18 Dec 2008 15:29:13 GMT|GPUGRID|Started download of project_1.png
Thu 18 Dec 2008 15:29:15 GMT|GPUGRID|Finished download of logops3grid.png
Thu 18 Dec 2008 15:29:15 GMT|GPUGRID|Finished download of project_1.png
Thu 18 Dec 2008 15:29:15 GMT|GPUGRID|Started download of project_2.png
Thu 18 Dec 2008 15:29:15 GMT|GPUGRID|Started download of project_3.png
Thu 18 Dec 2008 15:29:17 GMT|GPUGRID|Finished download of project_2.png
Thu 18 Dec 2008 15:29:17 GMT|GPUGRID|Finished download of project_3.png



edit #1 ....and the same for host ID 5889 :-(

/edit #2 ....detach and then re-attach host ID 5889 still receives the same no work messages
____________

Alain Maes
Send message
Joined: 8 Sep 08
Posts: 63
Credit: 1,651,074,002
RAC: 2,160,119
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4485 - Posted: 18 Dec 2008 | 16:07:44 UTC - in response to Message 4483.

Idem for hostID 16551

Kind regards

Alain

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4486 - Posted: 18 Dec 2008 | 16:09:33 UTC

50% succes rate here!

The success:
Host 17875 got a WU at the first attempt and is running with 4 other project's tasks. All 5 tasks are running in high priority mode although nothing is remotely in deadline trouble.

The failure:
Host 117880 CANNOT get a wu now, although with 6.3.21 it never had any trouble.

Both updates were done by suspending all task activity; running the boinc installer; waiting for benchmarks; resetetting GPUgrid; checking that dcf was reset to 1.000000 (it was) before restarting task activity. I've done several manual updates I can't get anything for 17880. It doesn't matter if I suspend the other project either.

I've also tried 6.4.5 on this machine again this afternoon with the same result. First attempt below with other project running; the second with it suspended.

18/12/2008 15:49:14|GPUGRID|Sending scheduler request: Requested by user. Requesting 175 seconds of work, reporting 0 completed tasks
18/12/2008 15:49:19|GPUGRID|Scheduler request completed: got 0 new tasks
18/12/2008 15:49:19|GPUGRID|Message from server: No work sent
18/12/2008 15:49:19|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
18/12/2008 15:49:19|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
18/12/2008 15:56:25|GPUGRID|Sending scheduler request: Requested by user. Requesting 349 seconds of work, reporting 0 completed tasks
18/12/2008 15:56:30|GPUGRID|Scheduler request completed: got 0 new tasks
18/12/2008 15:56:30|GPUGRID|Message from server: No work sent
18/12/2008 15:56:30|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
18/12/2008 15:56:30|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.

To keep this machine in work I'm having to go back to 6.3.21. It is looking as if the effect of the latest changes is to make high-end boards unable to get work.

By the way I note the recommended Nvidia drivers have gone back a version or two, is that a mistake or must we also revert those to make any progress with 6.4.2??


Phoneman1

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4488 - Posted: 18 Dec 2008 | 16:23:37 UTC - in response to Message 4486.

Phoneman1 - is 6.3.21 actually working for you?

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4489 - Posted: 18 Dec 2008 | 16:42:18 UTC - in response to Message 4488.

Burdett - it was. It is not now!

I can't get anything for the GTX260 machine this afternoon on 6.4.2; 6.4.5; or 6.3.21. It doesn't matter if I suspend the other project or not. Always get the messages I posted earlier in this thread.

8600GTS machine is chugging away in 1 plus 4 mode on 6.4.2 with no real problem at all save for all 5 tasks in high priority for no reason I can see. There is no GPU tasks queued up there just the one that is running.

Phoneman1

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4490 - Posted: 18 Dec 2008 | 16:53:13 UTC - in response to Message 4489.

..... frustrating isn't it.
It has got to be server side - you can see the 'results in progress' total dropping as machines start to run dry/low cache.

Wolfram1
Send message
Joined: 24 Aug 08
Posts: 45
Credit: 3,431,862
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwat
Message 4491 - Posted: 18 Dec 2008 | 16:54:16 UTC - in response to Message 4489.
Last modified: 18 Dec 2008 | 16:55:25 UTC

My Host-Id is 7521.
I made a reset of the projekt an have now DCF 1.00000
I have BOINC 6.4.5
Now I get the messages:

18.12.2008 17:47:39|GPUGRID|Sending scheduler request: Requested by user. Requesting 168179 seconds of work, reporting 0 completed tasks
18.12.2008 17:47:44|GPUGRID|Scheduler request completed: got 0 new tasks
18.12.2008 17:47:44|GPUGRID|Message from server: No work sent
18.12.2008 17:47:44|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
18.12.2008 17:47:44|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4492 - Posted: 18 Dec 2008 | 16:57:28 UTC - in response to Message 4491.
Last modified: 18 Dec 2008 | 17:20:13 UTC

I have reduced the number of flops requested by a factor 10 on type of workunits.
To see if things improve.

gdf

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4493 - Posted: 18 Dec 2008 | 17:03:51 UTC - in response to Message 4492.

GDF - can you explain why the amount of work requested &/or a DCF issue could trigger a 'not available for your type of computer' message?

Thanks,
P.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4494 - Posted: 18 Dec 2008 | 17:19:44 UTC - in response to Message 4493.

I think that is because if the estimate is longer than the deadline, the WU is not sent.

gdf

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4496 - Posted: 18 Dec 2008 | 17:25:59 UTC - in response to Message 4492.

I'm now back on 6.4.2. after seeing GDF's post at 16:54.At first no luck again. Same sort of messages. Then I remembered a trick from my days cruncing cpdn...

I suspended all processing. Used notepad to edit client_state.xml and found the dcf tag in the block for GPU. Between the GPU URL and the DCF there is nrpc_failures. That was 6, changed that to 0; saved the file and exited notepad. Resumed boinc processing and updated GPU manually. Bingo! I now have GPU plus 4 other tasks running in high priority (for no good reason) on the GTX260 machine. I also have 2 GPU tasks ready to go when this finishes in 11 hours.

Note. Usual provisio with editing client_state.xml: make sure BOINC can't update it and preserve the < > </> signs or it will take some time on your part to correct it! You shouldn't need to do this edit unless you can't get work and were getting work previously.

Phoneman1

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4497 - Posted: 18 Dec 2008 | 17:31:53 UTC - in response to Message 4496.

........ great tip. Got 4 WUs - but coming down at 240+hours, even though my DCF is 1.000000.

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4498 - Posted: 18 Dec 2008 | 17:50:39 UTC - in response to Message 4497.

Got 4 WUs - but coming down at 240+hours, even though my DCF is 1.000000.


Phew! Yes just gone onto BOINCView for the first time in several hours and seen similar on my machines. I guess that is why the GPU tasks are running high priority. I am still puzzled why the other project's tasks are running in high priority. There is at most two days of work there for a 10 day deadline project. When I couldn't get a GPU task to run they were running in normal mode - very odd.

Phoneman1

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4499 - Posted: 18 Dec 2008 | 18:05:38 UTC

Yippee! For me, that tip works host ID 5889

Boooo! It does not work on host ID 8882


____________

j2satx
Send message
Joined: 16 Dec 08
Posts: 20
Credit: 36,912,780
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4506 - Posted: 18 Dec 2008 | 19:54:09 UTC

Just got this from Beta mail list.

Hi Rom and David,

I just downloaded 6.4.5 to my home XP64 machine. Because I use
firefox it took me to the wrong download page and gave me the Win32
version.

The Win32 version has been fixed to report Win64 when run on a Win64
box. Which is fine for projects that have Win64 applications, but
apparently it doesn't report Win32 as an alternate platform, so I
don't get any work on projects that only support Win32. That's a
substantial problem. Haven't been monitoring the lists to see if that
problem has been reported before.

Eric

Could this be the issue here on GPUGrid..?


Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4507 - Posted: 18 Dec 2008 | 20:21:56 UTC - in response to Message 4499.
Last modified: 18 Dec 2008 | 20:23:11 UTC

It does not work on host ID 8882


Nightlord if it is not the wrong version of Boinc as just suggested, I'd suspend Boinc processing take another look at the client_state.xml file on that machine.

Between the:
<project>
<master_url>http://www.gpugrid.net/</master_url>
<project_name>GPUGRID</project_name>

and the

<duration_correction_factor>1.000000</duration_correction_factor>

we have been looking at you should find these two lines amongst the other stuff:

<nrpc_failures>0</nrpc_failures>
<master_fetch_failures>0</master_fetch_failures>

both need to be 0 - Burdett and I only needed to change the top one. It is just possible your second machine has a number greater than 0 for master fetch failures.

Phoneman1

MMx
Send message
Joined: 18 Dec 08
Posts: 1
Credit: 0
RAC: 0
Level

Scientific publications
wat
Message 4508 - Posted: 18 Dec 2008 | 20:35:48 UTC - in response to Message 4472.

Using version 6.4.2, did project reset and still getting no work units. Host ID is 19685. Also I was not getting the "Full-atom molecular dynamics is not available for your type of computer." message until I did a couple of version changes based on recommendations from this forum. Everything else was pretty much the same though. Btw I am a new user, so this may not be related to the current series of troubles.

18/12/2008 21:22:48||Libraries: libcurl/7.19.0 OpenSSL/0.9.8i zlib/1.2.3
18/12/2008 21:22:48||Data directory: D:\aplw\boinc-data
18/12/2008 21:22:48||Running under account Administrator
18/12/2008 21:22:48||Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU T7500 @ 2.20GHz [x86 Family 6 Model 15 Stepping 10]
18/12/2008 21:22:48||Processor features: fpu tsc sse sse2 mmx
18/12/2008 21:22:48||OS: Microsoft Windows XP: Professional x86 Editon, Service Pack 2, (05.01.2600.00)
18/12/2008 21:22:48||Memory: 2.00 GB physical, 3.85 GB virtual
18/12/2008 21:22:48||Disk: 141.04 GB total, 16.96 GB free
18/12/2008 21:22:48||Local time is UTC +1 hours
18/12/2008 21:22:48||Not using a proxy
18/12/2008 21:22:48||CUDA devices found
18/12/2008 21:22:48||Coprocessor: GeForce 8600M GT (1)
18/12/2008 21:22:48||Version change (6.3.21 -> 6.4.2)
18/12/2008 21:22:48|GPUGRID|URL: http://www.gpugrid.net/; Computer ID: 19685; location: (none); project prefs: default
18/12/2008 21:22:48||No general preferences found - using BOINC defaults
18/12/2008 21:22:48||Reading preferences override file
18/12/2008 21:22:48||Preferences limit memory usage when active to 1023.18MB
18/12/2008 21:22:48||Preferences limit memory usage when idle to 1841.72MB
18/12/2008 21:22:48||Preferences limit disk usage to 5.00GB
18/12/2008 21:22:48||Running CPU benchmarks
18/12/2008 21:22:48||Suspending computation - running CPU benchmarks
18/12/2008 21:23:19||Benchmark results:
18/12/2008 21:23:19|| Number of CPUs: 2
18/12/2008 21:23:19|| 2152 floating point MIPS (Whetstone) per CPU
18/12/2008 21:23:19|| 5050 integer MIPS (Dhrystone) per CPU
18/12/2008 21:23:53|GPUGRID|Resetting project
18/12/2008 21:23:58|GPUGRID|Sending scheduler request: Requested by user. Requesting 60480 seconds of work, reporting 0 completed tasks
18/12/2008 21:24:03|GPUGRID|Scheduler request completed: got 0 new tasks
18/12/2008 21:24:03|GPUGRID|Message from server: No work sent
18/12/2008 21:24:03|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
18/12/2008 21:24:03|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
18/12/2008 21:24:03|GPUGRID|Message from server: Full-atom molecular dynamics is not available for your type of computer.

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4511 - Posted: 18 Dec 2008 | 21:20:34 UTC - in response to Message 4507.
Last modified: 18 Dec 2008 | 21:33:56 UTC

It does not work on host ID 8882


Nightlord if it is not the wrong version of Boinc as just suggested, I'd suspend Boinc processing take another look at the client_state.xml file on that machine.

Between the:
<project>
<master_url>http://www.gpugrid.net/</master_url>
<project_name>GPUGRID</project_name>

and the

<duration_correction_factor>1.000000</duration_correction_factor>

we have been looking at you should find these two lines amongst the other stuff:

<nrpc_failures>0</nrpc_failures>
<master_fetch_failures>0</master_fetch_failures>

both need to be 0 - Burdett and I only needed to change the top one. It is just possible your second machine has a number greater than 0 for master fetch failures.

Phoneman1



Thanks Phoneman1, it's a Linux64 box, as is the other host that worked.

I just verified the client_state file, the nrpc_failures was at 3 and master_fetch at 0. I've played around with a few changes, but to no benefit. At present this host is dead in the water.

/edit: I detached and re-attached with no effect. I then backed up the Boinc installation and made a fresh one, connected to GPUGrid and it gives the same error. This still fails, now with a clean client_state file.

Time for a new Host ID.....

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4513 - Posted: 18 Dec 2008 | 21:33:26 UTC - in response to Message 4511.

....... after initial success, I'm back to having problems getting WUs.
Ho Hum.

P.

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4515 - Posted: 18 Dec 2008 | 22:04:24 UTC

OK, this is not funny any more :-(

With a new host ID 19709 and a clean installation of 6.4.2, it still gives me the same error messages.

I'm at a loss now.

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4517 - Posted: 18 Dec 2008 | 22:46:50 UTC

My boxes are running since yesterday on 6.4.2 and I have set the DCF to 1. The two GTX260² and the GTX280 are running now without any problem, also with a estimated time of over 66 hours instead 7:28h. Only my 8800GT has WUs with a estimated time of over 252 hours, but I have 3 WUs. To get more WUs I use a trick: I manually update, then I get the 24 hours delay, and make a second manually update immediately. Then I get another error message (last call nn seconds) and the 24 hours delay is away. When now the rest time of the running GPU-WU is lower then the workcache (I have 1.5 days) I get new work automatically.
____________

Temujin
Send message
Joined: 12 Jul 07
Posts: 100
Credit: 21,848,502
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 4519 - Posted: 18 Dec 2008 | 23:43:53 UTC - in response to Message 4517.

I have 5 machines running GPUGrid, 3x linux64 (fedora9) core2 quads and 2x winxp32 core2 duals, each has 1 GPU.
I don't think I've had a machine run out of WUs (yet!!).
All 3 linux machines usually manage to maintain at least 3 WUs whereas the 2 windows machines usually get to about 80% of their last WUs before downloading more WUs.
They all run 6.3.19 apart from 1 of the windows machines which runs 6.3.21 and they all run 3+1 (or 1+1 for the duals) and run Seti & RCN in addition to GPUGrid.

Their estimated completion times range wildly from a normal 2 hours upto +70 hours but this doesn't seem to affect their ability to get new work.

I don't know why they get work when a lot of you don't get any. I haven't edited any config files since installing 6.3.19 which removed my requirement to set ncpu-1. On the 2 machines here, the DCFs are 0.062 and 0.137 and have never been much higher that that.

I know a lot of you are running 6.4.x versions of boinc, so is it worth downgrading to 6.3.19 (or 6.3.21) to see if that helps?

j2satx
Send message
Joined: 16 Dec 08
Posts: 20
Credit: 36,912,780
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4520 - Posted: 19 Dec 2008 | 0:20:36 UTC
Last modified: 19 Dec 2008 | 0:21:25 UTC

I just finished my first WU. As soon as it reported, it got another.

Running WXP and 6.45 on a single core and a slow 8600 GT.

Profile JStateson
Avatar
Send message
Joined: 31 Oct 08
Posts: 186
Credit: 3,384,540,878
RAC: 1,348,532
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4521 - Posted: 19 Dec 2008 | 0:48:22 UTC
Last modified: 19 Dec 2008 | 0:57:06 UTC

Hmm - I am running 6.4.5 with nVidia 180.60 and have two WU's on a 9800gtx+. Sorry about your problems. I did get that cell processor out of work for your system message this morning, but a reboot cleared it.

Profile Jack Shaftoe
Send message
Joined: 26 Nov 08
Posts: 27
Credit: 1,813,606
RAC: 0
Level
Ala
Scientific publications
watwatwatwat
Message 4523 - Posted: 19 Dec 2008 | 2:34:06 UTC - in response to Message 4521.
Last modified: 19 Dec 2008 | 2:42:34 UTC

6.4.2 per today's recommendation. Downloaded 2 wu's by resetting the project and suspending everything else this morning. After finishing those 2 tonight, tried the same thing without success.

GTX 260, idle again. Server-side scheduler appears to be fubar.

12/18/2008 9:32:11 PM|GPUGRID|Sending scheduler request: Requested by user. Requesting 1468800 seconds of work, reporting 0 completed tasks
12/18/2008 9:32:16 PM|GPUGRID|Scheduler request completed: got 0 new tasks
12/18/2008 9:32:16 PM|GPUGRID|Message from server: No work sent
12/18/2008 9:32:16 PM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/18/2008 9:32:16 PM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

http://www.gpugrid.net/show_host_detail.php?hostid=17947

6.3.x will at least allow me to DL work by resetting/suspending everything. 6.4.x seems to make it impossible to maintain work.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 4531 - Posted: 19 Dec 2008 | 9:28:17 UTC - in response to Message 4515.
Last modified: 19 Dec 2008 | 9:37:51 UTC

Nightlord, please send your client_state.xml at ps3grid@gmail.com.


Thanks.

gdf

From the server logs you are requesting 1 sec of work.

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4536 - Posted: 19 Dec 2008 | 10:59:17 UTC

If version 6.4.2 is the preferred version, where can we get it. THe BOINC site only has the 6.2.19 and 6.4.5 available for download (as far as I have been able to push the system).

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4537 - Posted: 19 Dec 2008 | 11:05:31 UTC - in response to Message 4536.
Last modified: 19 Dec 2008 | 11:06:14 UTC

If version 6.4.2 is the preferred version, where can we get it. THe BOINC site only has the 6.2.19 and 6.4.5 available for download (as far as I have been able to push the system).



Here you can get all versions:

all versions of the BOINC-Manager
____________

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 221,007,857
RAC: 4,333,521
Level
Leu
Scientific publications
watwatwatwatwatwatwatwat
Message 4538 - Posted: 19 Dec 2008 | 11:07:03 UTC - in response to Message 4536.

If version 6.4.2 is the preferred version, where can we get it. THe BOINC site only has the 6.2.19 and 6.4.5 available for download (as far as I have been able to push the system).


Hi Paul,

you can get it from http://www.gpugrid.net/join.php?sys=gpu or http://boinc.berkeley.edu/dl/.
____________

pixelicious.at - my little photoblog

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4541 - Posted: 19 Dec 2008 | 11:21:23 UTC - in response to Message 4538.
Last modified: 19 Dec 2008 | 11:23:01 UTC

If version 6.4.2 is the preferred version, where can we get it. THe BOINC site only has the 6.2.19 and 6.4.5 available for download (as far as I have been able to push the system).


Hi Paul,

you can get it from http://www.gpugrid.net/join.php?sys=gpu or http://boinc.berkeley.edu/dl/.

Thanks ...

6.5.0 did not work, so let us see about 6.4.2 ...

Nope, no work for your type computer.

So, there is something else going on with the server regardless of the version of BOINC I run ...

{edit}I did run off 11 tasks for SaH last night and at least 3 of them have validated against a wingman... so, slight progress ... but GPU Grid is more like the kind of work I would want to do rather than just rip off SaH nonsense.{/edit}

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4554 - Posted: 19 Dec 2008 | 15:20:07 UTC - in response to Message 4541.


6.5.0 did not work, so let us see about 6.4.2 ...

Nope, no work for your type computer.


When I saw an earlier post about 6.5.0 working well I decided to upgrade both my machines. As I wanted to try and determine what caused all the problems yesterday when it became necessary to reset nrpc_failures in the client_state.xml I thought I'd upgrade using differnet methods on my two machines. On the machine with the faster graphics card I upgraded BOINC the way I used to do it:

Open BOINC Manager
select task tab and check nothing is about to up load
select message tab and make sure no schedule requests outstanding etc.
Select activity / suspend
Select File / exit
Open task manager and check that all boinc; boincmgr and boinctray processes are not in the list.
Run the Boinc Installer

On the machine with the slower graphics card I cut the procedure down to:
Run the Boinc Installer

It looked as if this upgrade had worked but could I get a download of GPU on this machine with the slower graphics card - no.

I had to edit the client_state.xml file on this machine and change the nrpc_failures value back to 0. In fact I had to do this twice because I had not noticed Boinc had a client_state_next.xml file to replace the file I had just edited as soon as I restarted Boinc.

Once I had Boinc stopped and there was no "next" file in the directory (Boinc only creates one in certain instances and deletes it when it is not needed itself). I was able to edit the client_state.xml and restart Boinc.

At that point in the folder view I right-clicked on client_state.xml and selected copy. Then I right-clicked outside the file list and selected paste. Looking at the copy I had taken I could see that nrpc_failures was still 0 and sure enough going into Boinc and selecting GPU in the projects tab and pressing the update button brought me another wu.

It is just possible this "next" file business could be the cause of your problem today Paul and also Nightlord's problem from yesterday.

So the long winded way of upgrading worked for me today but the short-cut turned out to be anything but.

Yes I have 1 plus 4 tasks running in normal mode on both machines using 6.5.0 with estimates that look a lot better than anything else I've tried in the last three weeks with this project.

Phoneman1

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4560 - Posted: 19 Dec 2008 | 16:35:09 UTC - in response to Message 4507.


Between the:
<project>
<master_url>http://www.gpugrid.net/</master_url>
<project_name>GPUGRID</project_name>

and the

<duration_correction_factor>1.000000</duration_correction_factor>

we have been looking at you should find these two lines amongst the other stuff:

<nrpc_failures>0</nrpc_failures>
<master_fetch_failures>0</master_fetch_failures>

both need to be 0 - Burdett and I only needed to change the top one. It is just possible your second machine has a number greater than 0 for master fetch failures.

Phoneman1


@GDF @Guys:

You have to know that - it really works. Phoneman u r great :). Both have to be 0!

I just installed new OS. After many big troubles with getting new WU on my maschines (with really different soft- and hard- ware configuration) i wanted to try new possibilities.

The new possibility was:

XP32 SP2
178.24 (old but checked and stable for sure)
6.5.0 - a fresh breeze ;)

But it doesn`t work. I mean again `no molecular` for me. Sure it was on fresh OS, fresh BM (it means DCF 1 etc.). I tried few times during some hours. Nothing. Then I`ve changed nrpc to 0. Manual update and voila :). But that`s not all. It was fantastic request (as always should be, but wasn`t never/ever in last weeks) which gave me 2 WU`s. After a moment BM requested again - next 2 WU`s arrived :). 1h23min - 1h58min remaining/WU.

Then I made experiment. After some of BM requests (after some minutes) I`ve changed nrpc again to 0 and with 4 GPUG WU`s (max.) and with a lot of Prime WU`s (not stopped! - Prime Resource share - 50%, GPUGrid Resource share - 100%) I made Manual update. Yes, there was correct request :).

Then next experiment. Little risky ;) :P. I`ve aborted one WU (some of you could get killed for one WU and I`m aborting it...!!! ;D ). And I made Manual update. Request correct, but got no new (the 4th one) WU, cause (...)molecular no available bla bla bla(...). Tried twice - same effect. Then I`ve changed nrpc again to 0 and manual update. Request correct, but... (...) molecular not available etc.(...) :O. Restarted comp, checked client_state, changed Resource share to 1000, changed DCF to 0.000001... Request 0... Request 0... Whatever I did, always request 0. Changed DCF back to 1.000000. Request correct, but (...)molecular not available(...). I can reach correct request, but there`s now way for me to receive any more WU`s.

Mby there`s something on server side, some note in database, that this maschine has received an error with one of WU`s and it means that this maschine should not receive new jobs. Or something similar.

Anyway I hope it will help some guys to get some WU`s and mby help you GDF to wonder WTF is going on now :). Remember, there was everything PERFECT, until first error. And you know that because of memory leaks probably every maschine will be at the same place as me now - mby go this way...



____________

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4563 - Posted: 19 Dec 2008 | 17:02:04 UTC

Well, I just tried setting nrpc failures to 0 (it was 3) but, not available for your computer.

I mean, I guess, I am fine if the project does not want my help ...

But, it sure seems that this is a server side problem (at least to me) and there is a curious reluctance to look there ...

I have done my part and have done what they have been asking.

GPU processing is intriguing, but, I guess I will have to wait for EaH to get it right. Or SaH to get Astropulse converted.

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4566 - Posted: 19 Dec 2008 | 17:46:47 UTC
Last modified: 19 Dec 2008 | 17:47:06 UTC

Trying and trying again. Every values correct now (DCF, nrpc etc.). Stopped other projects. Manual update. Got new WU`s :). - Guys, use it!
____________

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4572 - Posted: 19 Dec 2008 | 18:13:22 UTC - in response to Message 4531.

Sorry I was at work all day :-(

Client_state.xml just sent.

lllvette
Avatar
Send message
Joined: 1 Nov 08
Posts: 6
Credit: 16,022,037
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 4573 - Posted: 19 Dec 2008 | 18:42:49 UTC

When I try to run 6.4.2 From this site I get
'Error in setup initialization file " on two systems... dont Know why. Is 6.4.2 the still the preferred version?... 6.4.5 is installed on my sys. now.
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4574 - Posted: 19 Dec 2008 | 19:00:00 UTC - in response to Message 4573.
Last modified: 19 Dec 2008 | 19:00:31 UTC

When I try to run 6.4.2 From this site I get
'Error in setup initialization file " on two systems... dont Know why. Is 6.4.2 the still the preferred version?... 6.4.5 is installed on my sys. now.


6.4.5 is the version with the biggest problems here on GPUGrid. Try the new developer version 6.5.0 or reload the 6.4.2. Be sure to clear the cache browser first, so that you can be sure to get a new download and not the broken file from cache.
____________

lllvette
Avatar
Send message
Joined: 1 Nov 08
Posts: 6
Credit: 16,022,037
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 4577 - Posted: 19 Dec 2008 | 21:49:52 UTC - in response to Message 4574.

Where do I find this Cache file to delete it?

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4582 - Posted: 19 Dec 2008 | 22:04:38 UTC

Now 3 boxes dead in the water, without work, and unable to get any.

Downgraded to 6.4.2, reset project, checked nrpc_fails was at zero, checked master_request fails was at zero, ensured other projects were suspended, manual update...etc...etc... Nothing wants to kick these hosts into life.

Hosts: 15596, 8515 and 8882

If I recall correctly, this problem started for me on a host with 6.3.21 that had been clean for weeks. This seems to be server side, unrelated to Boinc version.

I thank our lucky stars there isn't a weekend coming up followed by a long festive break.......

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4583 - Posted: 19 Dec 2008 | 22:14:30 UTC - in response to Message 4577.

Where do I find this Cache file to delete it?


Use the keywords

delete cache <your browser name> on GOOGLE or any other search machine of your choice

e.g.
delete cache internet explorer
or
delete cache firefox

so you will find a solution. Therefor I don't know your browser type and your native language can't give you a better explanation. My German menu will not fit your interests .. :)



____________

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4585 - Posted: 19 Dec 2008 | 23:14:07 UTC

Ok, I installed 6.5.0 ...

set errors to 0, DCF was 1.0, upped my resource share to 100, increased my buffer size to 0.4 and now I have gotten 5 tasks from GPU Grid with one running as we speak.

Even better it looks like I have 9 tasks running with one on the GPU with 0.03 CPU and 8 tasks on the other virtual CPUs (4 core i7 with HT giving the appearance of 8 CPUs).

*SO* it seems that the error message returned is the same regardless of what the error is on the server side. In other words, even if the reason is that the deadline is too low, or some other reason we get the same misleading error message.

The other possiblility is that the project needs to start the feeder as Carl Christensen said on the BOINC Dev list "...they need to add the "--allapps" flag for the feeder..." and we are making changes and randomly the scheduler all of a sudden gives us a reward and we think we have bothered the problem.

My dog Pixie, for example, knows that if she scratches at the dog door that it will open. She scratches and it opens ... of course the scratching has nothing to do with the timer, but you cannot convince her of that ...

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4586 - Posted: 19 Dec 2008 | 23:41:19 UTC
Last modified: 19 Dec 2008 | 23:54:40 UTC

Hallelujah!

Upgrading to 6.5.0 from 6.4.2 (at least on the Vista box), project reset and manual update finally kicked one host into action! Now to find out if it works on the XP and Linux hosts....


....but it didn't work on the XP box and there is no 6.5.0 available for Linux.

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 389,728,788
RAC: 1,399,945
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4591 - Posted: 20 Dec 2008 | 0:58:10 UTC - in response to Message 4586.

Hallelujah!

Upgrading to 6.5.0 from 6.4.2 (at least on the Vista box), project reset and manual update finally kicked one host into action! Now to find out if it works on the XP and Linux hosts....


....but it didn't work on the XP box and there is no 6.5.0 available for Linux.


It seemed to work for me, and I didn't reset the project or do a manual update.

BOINC 6.5.0 working for me

We'll see if it screws up again, though. There has been a lot of stuff that works for one client and not for another.

Profile Nognlite
Send message
Joined: 9 Nov 08
Posts: 69
Credit: 25,106,923
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwat
Message 4596 - Posted: 20 Dec 2008 | 3:19:40 UTC

6.5.0 was working fine for me until this:

12/19/2008 8:02:39 PM|GPUGRID|Sending scheduler request: To fetch work. Requesting 1067038 seconds of work, reporting 2 completed tasks
12/19/2008 8:02:44 PM|GPUGRID|Scheduler request completed: got 0 new tasks
12/19/2008 8:02:44 PM|GPUGRID|Message from server: Project is temporarily shut down for maintenance

Tried to download two hours later and the dreaded this:

12/19/2008 10:12:14 PM|GPUGRID|Sending scheduler request: Requested by user. Requesting 1068275 seconds of work, reporting 2 completed tasks
12/19/2008 10:12:19 PM|GPUGRID|Scheduler request completed: got 0 new tasks
12/19/2008 10:12:19 PM|GPUGRID|Message from server: No work sent
12/19/2008 10:12:19 PM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/19/2008 10:12:19 PM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

Now back to 24hrs before next call. Again it looks like once the two WU I have are done my computer will be idle for the night.

Pat

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4597 - Posted: 20 Dec 2008 | 5:47:09 UTC - in response to Message 4596.

Now back to 24hrs before next call. Again it looks like once the two WU I have are done my computer will be idle for the night.

You can always force updates in the project pane.

Again, I am getting more suspicious that the single error message is over-riding all other "real" error messages. I have always run a low queue size as is recommended and it is possible that this is the problem. When I did NNW on all projects but GPU Grid upped the size to 0.4 days I got 5 tasks.

So, until I drain my queue again I will not know for sure if this is a "cure" or if it was just a random joke of the demons ....

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4600 - Posted: 20 Dec 2008 | 7:49:07 UTC - in response to Message 4583.
Last modified: 20 Dec 2008 | 7:49:30 UTC

Where do I find this Cache file to delete it?


Use the keywords

delete cache <your browser name> on GOOGLE or any other search machine of your choice

e.g.
delete cache internet explorer
or
delete cache firefox



It`s enough to press CTRL+F5 :)
____________

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4601 - Posted: 20 Dec 2008 | 8:27:56 UTC - in response to Message 4566.
Last modified: 20 Dec 2008 | 8:29:47 UTC


Between the:
<project>
<master_url>http://www.gpugrid.net/</master_url>
<project_name>GPUGRID</project_name>

and the

<duration_correction_factor>1.000000</duration_correction_factor>

we have been looking at you should find these two lines amongst the other stuff:

<nrpc_failures>0</nrpc_failures>
<master_fetch_failures>0</master_fetch_failures>

both need to be 0 - Burdett and I only needed to change the top one. It is just possible your second machine has a number greater than 0 for master fetch failures.

Phoneman1


@GDF @Guys:

You have to know that - it really works. Phoneman u r great :). Both have to be 0!

I just installed new OS. After many big troubles with getting new WU on my maschines (with really different soft- and hard- ware configuration) i wanted to try new possibilities.

The new possibility was:

XP32 SP2
178.24 (old but checked and stable for sure)
6.5.0 - a fresh breeze ;)

But it doesn`t work. I mean again `no molecular` for me. Sure it was on fresh OS, fresh BM (it means DCF 1 etc.). I tried few times during some hours. Nothing. Then I`ve changed nrpc to 0. Manual update and voila :). But that`s not all. It was fantastic request (as always should be, but wasn`t never/ever in last weeks) which gave me 2 WU`s. After a moment BM requested again - next 2 WU`s arrived :). 1h23min - 1h58min remaining/WU.

Then I made experiment. After some of BM requests (after some minutes) I`ve changed nrpc again to 0 and with 4 GPUG WU`s (max.) and with a lot of Prime WU`s (not stopped! - Prime Resource share - 50%, GPUGrid Resource share - 100%) I made Manual update. Yes, there was correct request :).

Then next experiment. Little risky ;) :P. I`ve aborted one WU (some of you could get killed for one WU and I`m aborting it...!!! ;D ). And I made Manual update. Request correct, but got no new (the 4th one) WU, cause (...)molecular no available bla bla bla(...). Tried twice - same effect. Then I`ve changed nrpc again to 0 and manual update. Request correct, but... (...) molecular not available etc.(...) :O. Restarted comp, checked client_state, changed Resource share to 1000, changed DCF to 0.000001... Request 0... Request 0... Whatever I did, always request 0. Changed DCF back to 1.000000. Request correct, but (...)molecular not available(...). I can reach correct request, but there`s now way for me to receive any more WU`s.

Mby there`s something on server side, some note in database, that this maschine has received an error with one of WU`s and it means that this maschine should not receive new jobs. Or something similar.

Anyway I hope it will help some guys to get some WU`s and mby help you GDF to wonder WTF is going on now :). Remember, there was everything PERFECT, until first error. And you know that because of memory leaks probably every maschine will be at the same place as me now - mby go this way...






Trying and trying again. Every values correct now (DCF, nrpc etc.). Stopped other projects. Manual update. Got new WU`s :). - Guys, use it!




Guys, it really, really works. I did it for next maschine (quite different - OS, driver etc.) and got new tasks again. But sure it works just manually. No auto downloading new tasks.
____________

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4602 - Posted: 20 Dec 2008 | 9:53:01 UTC - in response to Message 4601.
Last modified: 20 Dec 2008 | 9:55:45 UTC

Trying and trying again. Every values correct now (DCF, nrpc etc.). Stopped other projects. Manual update. Got new WU`s :). - Guys, use it!




Guys, it really, really works. I did it for next maschine (quite different - OS, driver etc.) and got new tasks again. But sure it works just manually. No auto downloading new tasks.



Sorry, but it doesn't work for everyone. Maybe in your case it does - if so, please send me your WU's!

Other people who tried the same have inconsistent results. Notice Burdett who it worked for once, but then not after: notice myself who it worked for on one host but not another.

Setting the nrpcfails to zero is I think just another strange interaction with the Boinc server code. It's worth a try, but not a magic bullet.

TomaszPawel
Send message
Joined: 18 Aug 08
Posts: 121
Credit: 59,836,411
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4603 - Posted: 20 Dec 2008 | 10:14:41 UTC - in response to Message 4602.
Last modified: 20 Dec 2008 | 10:16:13 UTC

2008-12-20 10:44:53|GPUGRID|Sending scheduler request: To fetch work. Requesting 581438 seconds of work, reporting 0 completed tasks
2008-12-20 10:44:58|GPUGRID|Scheduler request completed: got 0 new tasks
2008-12-20 10:44:58|GPUGRID|Message from server: No work sent
2008-12-20 10:44:58|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
2008-12-20 10:44:58|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

Ja juz mam kurwa dosyć. Działało i przestało. Jam można tak spieprzyć na całej lini działający projekt.

6.3.21 Was working extreme well untill some genious make some mistake!

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4609 - Posted: 20 Dec 2008 | 11:54:41 UTC - in response to Message 4602.


Setting the nrpcfails to zero is I think just another strange interaction with the Boinc server code. It's worth a try, but not a magic bullet.


Agreed, there could be a lot of other issues preventing downloading of wu's.

For instance, yesterday I had a client_state_next.xml file on my machine whilst I was editing client_state.xml to set nrpc_failures to 0. The "next" file kept replacing the file I had just edited as soon as I started Boinc. In the process making my edit a waste of time. Once I realised what was happening I got rid of the "next" file by updating another project. Once I suspended all Boic activity and exited Boinc again I was able to repeat the edit. This time the edited file didn't get overwritten when I re-started Boinc and I got my download.

Why are we having to change nrpc_failurea values? - My hunch is, it is how we are stopping Boinc. I suspect using the installer to stop GPU tasks in particular is causing the problem. Needs some more tedious testing for me to be 100% sure; but it is the only theory I've got at the moment why this variable ends up non-zero on so many machines when we change Boinc versions.

Phoneman1

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4611 - Posted: 20 Dec 2008 | 12:37:41 UTC - in response to Message 4609.
Last modified: 20 Dec 2008 | 12:39:28 UTC


Setting the nrpcfails to zero is I think just another strange interaction with the Boinc server code. It's worth a try, but not a magic bullet.


Agreed, there could be a lot of other issues preventing downloading of wu's.


I agree, but I did it so many times with many maschines and it really works. Maybe it`s just some interaction which helps in receiving WU`s, but without this operation I had no any chances to download WU`s. It means that just for now, just for us (members) it is some way to go until everything will work fine.
____________

Profile Jack Shaftoe
Send message
Joined: 26 Nov 08
Posts: 27
Credit: 1,813,606
RAC: 0
Level
Ala
Scientific publications
watwatwatwat
Message 4618 - Posted: 20 Dec 2008 | 14:57:03 UTC - in response to Message 4582.
Last modified: 20 Dec 2008 | 14:57:20 UTC

This seems to be server side, unrelated to Boinc version.


Agreed. Hopefully the admins are looking at this now.

-Jack (reset the project again this morning after running out of work overnight)

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4622 - Posted: 20 Dec 2008 | 15:49:27 UTC - in response to Message 4618.

Hopefully the admins are looking at this now.


They are.

MrS
____________
Scanning for our furry friends since Jan 2002

rraplee
Send message
Joined: 18 Sep 07
Posts: 3
Credit: 598,897
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4624 - Posted: 20 Dec 2008 | 16:11:48 UTC

Hey!
Having same problem: no work units available, etc.
Running boinc 6.4.5
Host id 18125
Have tried reset of project to no avail.
12/20/2008 9:48:00 AM|GPUGRID|Sending scheduler request: Requested by user. Requesting 226525 seconds of work, reporting 0 completed tasks
12/20/2008 9:48:05 AM|GPUGRID|Scheduler request completed: got 0 new tasks
12/20/2008 9:48:05 AM|GPUGRID|Message from server: No work sent
12/20/2008 9:48:05 AM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/20/2008 9:48:05 AM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
12/20/2008 9:48:07 AM|GPUGRID|Started download of logops3grid.png
12/20/2008 9:48:07 AM|GPUGRID|Started download of project_1.png
12/20/2008 9:48:09 AM|GPUGRID|Finished download of logops3grid.png
12/20/2008 9:48:09 AM|GPUGRID|Started download of project_2.png
12/20/2008 9:48:10 AM|GPUGRID|Finished download of project_1.png
12/20/2008 9:48:10 AM|GPUGRID|Started download of project_3.png
12/20/2008 9:48:11 AM|GPUGRID|Finished download of project_2.png
12/20/2008 9:48:12 AM|GPUGRID|Finished download of project_3.png
12/20/2008 9:49:19 AM|Einstein@Home|Restarting task h1_0737.55_S5R4__327_S5R4a_2 using einstein_S5R4 version 604
12/20/2008 9:49:22 AM|Einstein@Home|Starting h1_0737.55_S5R4__315_S5R4a_1
12/20/2008 9:49:23 AM|Einstein@Home|Starting task h1_0737.55_S5R4__315_S5R4a_1 using einstein_S5R4 version 604
12/20/2008 9:49:31 AM|GPUGRID|Sending scheduler request: Requested by user. Requesting 711936 seconds of work, reporting 0 completed tasks
12/20/2008 9:49:36 AM|GPUGRID|Scheduler request completed: got 0 new tasks
12/20/2008 9:49:36 AM|GPUGRID|Message from server: No work sent
12/20/2008 9:49:36 AM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/20/2008 9:49:36 AM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

Desti
Send message
Joined: 10 Jul 07
Posts: 19
Credit: 1,272,950
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwat
Message 4626 - Posted: 20 Dec 2008 | 16:30:10 UTC

No work here too :-(

http://www.gpugrid.net/show_host_detail.php?hostid=19934
____________
Linux Users Everywhere @ BOINC

Profile Bigred
Send message
Joined: 24 Nov 08
Posts: 10
Credit: 25,447,456
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwat
Message 4638 - Posted: 20 Dec 2008 | 21:07:14 UTC - in response to Message 4626.

I have neither the time or the patience to do the constant manual updates or file changes. Until the problems are sorted out, it looks like Seti will get most of my GPU support.

Wolfram1
Send message
Joined: 24 Aug 08
Posts: 45
Credit: 3,431,862
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwat
Message 4640 - Posted: 20 Dec 2008 | 21:37:38 UTC - in response to Message 4638.

I have neither the time or the patience to do the constant manual updates or file changes. Until the problems are sorted out, it looks like Seti will get most of my GPU support.



How many credits did you get by seti per day? ;-)

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4743 - Posted: 22 Dec 2008 | 18:08:25 UTC - in response to Message 4472.

I'm using BOINC v6.4.1. Did a project reset after my last WU finished this AM. BOINC then downloaded 2 new WUs. Two pieces of good news:

1) The estimated times are much more accurate (only about 20% high, which is fine with me).
2) CPU time has fallen from 13-18% CUDA usage on my quad down to 0%. 5 processes running, 4 at 25% each, CUDA at 0%.

This is very good news as the CPU usage problem in Windows (32bit XP SP3) seems to have been fixed somehow.
The WU seems to be progressing normally percentage-wise.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4746 - Posted: 22 Dec 2008 | 18:54:18 UTC

The changes you're seeing are caused by the new windows app.

MrS
____________
Scanning for our furry friends since Jan 2002

fractal
Send message
Joined: 16 Aug 08
Posts: 87
Credit: 1,248,879,715
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4748 - Posted: 22 Dec 2008 | 19:03:38 UTC - in response to Message 4488.

Phoneman1 - is 6.3.21 actually working for you?

6.3.21 is working for me. Never gave me any grief after it was worked out that I needed to set GPUGRID to 1000 share and not run too many CPU projects. I got in a little trouble when I put three CPU projects on the machine, but GPUGRID @ 1000 + one CPU @ 100 with 3+ days queue and I never run out of work.

Everyone I know who tried 6.4.anything has regretted it, sorry to say.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4751 - Posted: 22 Dec 2008 | 19:27:17 UTC - in response to Message 4746.

The changes you're seeing are caused by the new windows app.

Saw that after my post. Thanks for the heads up though.

fractal, I was doing fine with v6.4.1 until I saw the recommendation on the front page to upgrade to v6.4.5. After going to v6.4.5 nothing worked right.
Reinstalling v6.4.1 fixed some of the problems and resetting the project fixed the rest.

BTW, the project front page is still saying to go to v6.4.5. Shouldn't that be changed?

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4903 - Posted: 26 Dec 2008 | 23:48:24 UTC
Last modified: 26 Dec 2008 | 23:49:54 UTC

is it 6.4.2 or 6.4.5?

I am on 6.4.5 and working a 6.58 job. It now refuses to get new work. The new box that I started tonight will not get work either.

Linux os/9600GSO.

No memory leak:D

mike

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4905 - Posted: 26 Dec 2008 | 23:55:47 UTC - in response to Message 4903.

Try to suspend the other projects to get work.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4906 - Posted: 26 Dec 2008 | 23:59:00 UTC - in response to Message 4905.

I did that and had also tried when GPU was the only project. Tried a reset also with no luck. I'll let it run and see if it will pick some thing up.

mike

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4908 - Posted: 27 Dec 2008 | 0:01:15 UTC - in response to Message 4903.

It now refuses to get new work.


do you mean BOINC is requesting 0 seconds of new work or the dreaded "molecular dynamics for cell processor not available for your system" message?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4909 - Posted: 27 Dec 2008 | 0:05:11 UTC - in response to Message 4908.

It now refuses to get new work.


do you mean BOINC is requesting 0 seconds of new work or the dreaded "molecular dynamics for cell processor not available for your system" message?

MrS


It is asking for a bunch but getting the red letters:(

mike

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4914 - Posted: 27 Dec 2008 | 0:51:29 UTC

I got that too ...

Try increasing your queue size too (if it is small like mine was at 0.1 days the default) ...

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwatwat
Message 4925 - Posted: 27 Dec 2008 | 8:46:51 UTC

I checked when I got up, it had not received any new work. I did a manual update and got 2 wu's for each box,

I have 2 happy gpu's now:D

mike

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4938 - Posted: 27 Dec 2008 | 13:44:16 UTC

I had 6.4.2 running with minor problems, i.e. I had to suspend all CPU projects to make it request new WUs.. but I got them.

Today I got the message "no work sent, cell not available for you" again, whithout any changes on my side. Project reset didn't help, the server has work, setting nrpc=0 didn't help and sending a double request (like Kokomiko suggested) didn't help. Too bad I'm out of house now until tomorrow.

Maybe the problem solved itself until then. Otherwise: any suggestions I should try?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4944 - Posted: 27 Dec 2008 | 14:51:08 UTC - in response to Message 4938.
Last modified: 27 Dec 2008 | 14:51:37 UTC

I had 6.4.2 running with minor problems, i.e. I had to suspend all CPU projects to make it request new WUs.. but I got them.

Today I got the message "no work sent, cell not available for you" again, whithout any changes on my side. Project reset didn't help, the server has work, setting nrpc=0 didn't help and sending a double request (like Kokomiko suggested) didn't help. Too bad I'm out of house now until tomorrow.

Maybe the problem solved itself until then. Otherwise: any suggestions I should try?

MrS

It sounds like someone changed something on the server or maybe the server process is stuck.
Work doesn't seem to be going out this AM at least to NVidia cards. See this thread:

http://www.gpugrid.net/forum_thread.php?id=594

Now we're up to 978 WUs ready to send on the server status page.

Jayargh
Send message
Joined: 21 Dec 07
Posts: 47
Credit: 5,252,135
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwat
Message 4946 - Posted: 27 Dec 2008 | 15:24:11 UTC - in response to Message 4944.

I had 6.4.2 running with minor problems, i.e. I had to suspend all CPU projects to make it request new WUs.. but I got them.

Today I got the message "no work sent, cell not available for you" again, whithout any changes on my side. Project reset didn't help, the server has work, setting nrpc=0 didn't help and sending a double request (like Kokomiko suggested) didn't help. Too bad I'm out of house now until tomorrow.

Maybe the problem solved itself until then. Otherwise: any suggestions I should try?

MrS

It sounds like someone changed something on the server or maybe the server process is stuck.
Work doesn't seem to be going out this AM at least to NVidia cards. See this thread:

http://www.gpugrid.net/forum_thread.php?id=594

Now we're up to 978 WUs ready to send on the server status page.


You can never tell anything on the server status due to lumping GPU & PS3 work together.

This project has turned into a nightmare to run the last few weeks....one problem after another ARGHHHHHH!

abuze
Send message
Joined: 3 May 08
Posts: 5
Credit: 4,074,151
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwatwat
Message 4948 - Posted: 27 Dec 2008 | 15:41:48 UTC

This is strange. I had the same problem described above (Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.) I tried a project reset, suspending all other projects, setting a larger queue, reinstalling BOINC (i have 6.4.5), no luck.

I was just browsing the forum and thought to try again, and lo and behold, i got a new workunit. Are you sure the server queue isn't full of PS3 units?

Donnie
Send message
Joined: 13 Nov 08
Posts: 11
Credit: 11,185,470
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwat
Message 4949 - Posted: 27 Dec 2008 | 15:59:27 UTC - in response to Message 4948.

I also just received WUs!

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4952 - Posted: 27 Dec 2008 | 17:33:09 UTC
Last modified: 27 Dec 2008 | 17:33:21 UTC

I have now also 2 boxes without work. They got no work in the last 24 hours and so this boxes are now idle. Is there really no new work available? So far I got work by manually update, but in the last 24 hours both boxes got every time the same message ... no work for your type of computer ... and the boxes can't call automatically for work, then after this error they go in the 24 hours loop. I will shut down now this 2 boxes and try tomorrow again, no need to run this PCs without a chance for work ...

This boxes are effected:
Linux 65 bit GTX260² 15924
Vista 64 bit GTX260² 17745
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 4956 - Posted: 27 Dec 2008 | 18:11:00 UTC

After trying for an hour I finally got WUs on this both boxes. Looks like really no work on the server was available.
____________

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4962 - Posted: 27 Dec 2008 | 19:22:02 UTC
Last modified: 27 Dec 2008 | 19:38:10 UTC

- "No work sent, molecular dynamics not available for you" again
- 2 different hosts
- Correct BM, correct driver, correct temperature at home, even weather outside is correct (if some1 would like to ask :P) etc.
- Nothing helps (nrpc, DCF, suspending other projects - but request is fine...)
- 1 box got some tasks in queue, 2nd box got nothing in queue (I`m going to try reset project after finishing these 2 WU`s which I`m crunching now, but I don`t believe it will help).

And probably I`m one of not so many who are taking this situation absolutely easy :). I understand this situation. I could understand this situation even for whole next year if... If something what I asked earlier could become soon.

Please. Please change `per-CPU limit` from 1 to 3 and give us possibility to download more than 1-2 WU`s `at once` (5-6 will be OK). After these changes I`ll do not care even if server will go down for 24h. I want to have enough WU`s in stock (queue) to let me crunch at least 24h if some problems will occur. I want to have possibility to DL enough WU`s even if possibility to DL new tasks can occur once per day. And pls change this nonsense that BM is `deferring communication` for next 24h if can`t reach work. That`s really biggest `bunkum` :).

And then I`ll can sleep well.

Mby more of you, crunchers, could sign this supplication? :)
Or mby you disagree?



PS. I`m curious how GPUGrid is working `under` VM. I think I`ll install 3-4 OS`s on each host, just to be sure that I always have enough WU`s. And I`ll switch between them if necessary. Or I`ll install Ubuntu inside Win (by using this option with `filepartition`).
____________

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 4965 - Posted: 27 Dec 2008 | 20:18:48 UTC - in response to Message 4962.

Agreed none of the usual "tricks" seem to force a download of work at the moment - but I am lucky enough to have a queue of 2 waiting to go on my machine with the GTX260 card.

On the machine with the 8600GTS card I have more work units than I can process before their deadline. With this machine trebling the number of wu available to download would just mean I would have to abort more wus every few days, unless the deadline was revised accordingly.

Ideally, the number of wus downloaded should be limited to capability of the card in relation to the deadline; now where have I heard that before...:-)

Looks like some people are getting work though, I guess because other people are aborting wus or wus are passing their deadline and are being re-issued.

Phoneman1

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4966 - Posted: 27 Dec 2008 | 20:48:04 UTC - in response to Message 4965.

On the machine with the 8600GTS card I have more work units than I can process before their deadline. With this machine trebling the number of wu available to download would just mean I would have to abort more wus every few days, unless the deadline was revised accordingly.

Looks like some people are getting work though, I guess because other people are aborting wus or wus are passing their deadline and are being re-issued.

Phoneman1

I think you hit it on the head. The few WUs that people have been able to get are probably ones that are reissued. It seems the new ones are no good.
Could you abort your excess WUs so some of the rest of us can get work? (Same for anyone else that has too many WUs.) Thanks.

Profile [BOINC@Poland]AiDec
Send message
Joined: 2 Sep 08
Posts: 53
Credit: 9,213,937
RAC: 0
Level
Ser
Scientific publications
watwatwatwatwatwat
Message 4967 - Posted: 27 Dec 2008 | 20:49:36 UTC - in response to Message 4965.
Last modified: 27 Dec 2008 | 20:50:29 UTC

I have more work units than I can process before their deadline.(...)I would have to abort more wus every few days, unless the deadline was revised accordingly.


I prefer to have to much WU`s and abort some of them every day instead of fighting ;) every day over hours...
____________

Profile [XTBA>XTC] D.I.Y.Calculat...
Send message
Joined: 2 Sep 08
Posts: 4
Credit: 15,705,436
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 4968 - Posted: 27 Dec 2008 | 21:08:44 UTC - in response to Message 4962.

hi,

>>(ID user 5824) my situation is bad, i'm in stress :
3 computers, 3 nvidia 260, 3 problem's like this :
..........
27/12/2008 21:02:45|GPUGRID|Message from server: No work sent
27/12/2008 21:02:45|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
27/12/2008 21:02:45|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
..........
>>
system : 3 X xp 32
drivers : 3 X forcewave 178.24 / xp
version boinc :
- 6.3.17 : problem's
- 6.4.5 : problem's
- 6.5.0 : problem's

Waiting (i and my GPU's are) some good news !!!


____________

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 4977 - Posted: 28 Dec 2008 | 1:45:53 UTC

Got 2 new WUs :)

MrS
____________
Scanning for our furry friends since Jan 2002

Profile [XTBA>XTC] D.I.Y.Calculat...
Send message
Joined: 2 Sep 08
Posts: 4
Credit: 15,705,436
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwat
Message 4981 - Posted: 28 Dec 2008 | 3:11:54 UTC - in response to Message 4977.

Yes , me to :

version boinc :
- 6.3.17 : 4 WU's
- 6.4.5 : 4 WU's
- 6.5.0 : 4 WU's

Well done !
____________

Profile X1900AIW
Send message
Joined: 12 Sep 08
Posts: 74
Credit: 23,566,124
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5108 - Posted: 31 Dec 2008 | 7:57:20 UTC - in response to Message 4472.

Please also do a project reset. ...
... if you still don't get WUs, write here your host id, so that I can scan it in the log.

http://www.gpugrid.net/show_host_detail.php?hostid=21210

Two systems running Vista/64 (6.5.0) I got workunits without any trouble at all, see here
http://www.gpugrid.net/results.php?hostid=20725
http://www.gpugrid.net/results.php?hostid=18146

Both GTX260 switched to a new (old chipset) motherboard, running Vista got WUs, but there I got the first "compute error", never seen before:
http://www.gpugrid.net/results.php?hostid=21081

Same board with Linux/64bit (openSUSE 11.1, nVidia 177.82) I tried these versions:
- 6.4.5 (first)
- 6.5.0
- 6.4.2
- again 6.5.0, 6.4.2 and so on ...

Now I stay with 6.5.0 and will wait some days, whether I will see a solution or not.

With 6.5.0 I got on first start no more
"Full-atom molecular dynamics on Cell processor is not available for your type of computer."-message,
but afterwards I did, every time.

Do I really have to edit this values manually ? Don't think, because they are ignored,
is it right ?
- Mi 31 Dez 2008 09:25:55 CET||[error] bad value -1.000000 of time stats connected_frac; ignoring
- Mi 31 Dez 2008 09:25:55 CET||[error] bad value -1.000000 of time stats active_frac; ignoring


With suspending another BOINC-project GPUgrid tried to download, if I got two (!) error messages,
would the board crunch two WUs, if it can get some ? Otherwise I switch back to single slot configuration.

Mi 31 Dez 2008 09:50:18 CET|GPUGRID|Sending scheduler request: To fetch work. Requesting 345602 seconds of work, reporting 0 completed tasks
Mi 31 Dez 2008 09:50:23 CET|GPUGRID|Scheduler request completed: got 0 new tasks
Mi 31 Dez 2008 09:50:23 CET|GPUGRID|Message from server: No work sent
Mi 31 Dez 2008 09:50:23 CET|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
Mi 31 Dez 2008 09:50:23 CET|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

Profile rebirther
Avatar
Send message
Joined: 7 Jul 07
Posts: 53
Credit: 3,048,781
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 5109 - Posted: 31 Dec 2008 | 8:03:57 UTC

Problem came back :(

Profile Megacruncher TSBT
Send message
Joined: 7 Aug 08
Posts: 8
Credit: 5,685,236,983
RAC: 9,948,272
Level
Tyr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5110 - Posted: 31 Dec 2008 | 12:50:48 UTC - in response to Message 5109.

Problem back for me too. Regardless of Boinc Manager version, Windows version or card type. Soon all my GPUs are going to be idle.
____________
The Scottish Boinc Team

Profile X1900AIW
Send message
Joined: 12 Sep 08
Posts: 74
Credit: 23,566,124
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5124 - Posted: 31 Dec 2008 | 16:06:11 UTC - in response to Message 5108.

O.k. with the new information given from GDF (thanks for that)
http://www.gpugrid.net/forum_thread.php?id=606&nowrap=true#5111

I will switch back temporarily to other clients. If everything is going well with the servers I'll come back.

Happy new year !

Profile X1900AIW
Send message
Joined: 12 Sep 08
Posts: 74
Credit: 23,566,124
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5164 - Posted: 2 Jan 2009 | 4:05:21 UTC - in response to Message 5124.

This morning I got two new WUs again with the Vista / 6.5.0, wonderful.
http://www.gpugrid.net/results.php?hostid=21081

01/02/09 04:41:12|GPUGRID|Sending scheduler request: Requested by user. Requesting 17282 seconds of work, reporting 0 completed tasks
01/02/09 04:41:17|GPUGRID|Scheduler request completed: got 2 new tasks
01/02/09 04:41:19|GPUGRID|Started download of acemd_6.55_windows_intelx86__cuda.exe

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5165 - Posted: 2 Jan 2009 | 5:49:50 UTC

Ha!

I got three ... :)

Profile K1atOdessa
Send message
Joined: 25 Feb 08
Posts: 249
Credit: 389,728,788
RAC: 1,399,945
Level
Asp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5168 - Posted: 2 Jan 2009 | 12:35:20 UTC

I've been getting new work automatically / regularly now. Only infrequently getting the "not available for your type of processor" message, but it still gets work fairly quickly after several manual tries.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5170 - Posted: 2 Jan 2009 | 14:31:53 UTC

During the last few days I've seen the message more often than before and had to manually retry several times.

MrS
____________
Scanning for our furry friends since Jan 2002

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,258,747,722
RAC: 50,558,045
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 5171 - Posted: 2 Jan 2009 | 14:44:35 UTC

So far since early this morning I've got more work every time I called for it on both my Windoze & Linux Box's & my Caches are full at the moment ... :)

Dukzilla
Send message
Joined: 3 Jan 09
Posts: 6
Credit: 73,310,743
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5198 - Posted: 3 Jan 2009 | 15:47:43 UTC

Hey Everyone,

I just signed up for this project hoping to have better luck than I did with SETI@HOME. So far, not so good. At first, I got the same error messages that everyone else started posting.
1/3/2009 10:06:59 AM|GPUGRID|Message from server: No work sent
1/3/2009 10:06:59 AM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
1/3/2009 10:06:59 AM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

I read the forum posts, reset the project and lo and behold I got the same message but I finally received a work unit. WAHOO!! Wait a sec ... not so fast ... by the time I switched over to the tasks tab it failed with a "Computation error" message. Is there an error log I can look at somewhere?

Here is my computer ID: http://www.gpugrid.net/show_host_detail.php?hostid=21627

I'm hoping that someone can help fix this quickly. Thanks in advance for any and all advice.

Greg

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 221,007,857
RAC: 4,333,521
Level
Leu
Scientific publications
watwatwatwatwatwatwatwat
Message 5199 - Posted: 3 Jan 2009 | 15:57:02 UTC - in response to Message 5198.

Hi Greg, and welcome to GPUGRID!

Your account doesn't show a failed task. Could you do a manual update of GPUGRID to report the task? Only then we can have a look at the stderr.out to see what might be wrong...
____________

pixelicious.at - my little photoblog

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5203 - Posted: 3 Jan 2009 | 16:32:25 UTC - in response to Message 5198.

Hey Everyone,

I just signed up for this project hoping to have better luck than I did with SETI@HOME. So far, not so good. At first, I got the same error messages that everyone else started posting.
1/3/2009 10:06:59 AM|GPUGRID|Message from server: No work sent
1/3/2009 10:06:59 AM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
1/3/2009 10:06:59 AM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

I read the forum posts, reset the project and lo and behold I got the same message but I finally received a work unit. WAHOO!! Wait a sec ... not so fast ... by the time I switched over to the tasks tab it failed with a "Computation error" message. Is there an error log I can look at somewhere?

Here is my computer ID: http://www.gpugrid.net/show_host_detail.php?hostid=21627

I'm hoping that someone can help fix this quickly. Thanks in advance for any and all advice.

Greg


If you look here you can see in the STDERR Out that there is a lack of memory ...

Not sure why that occurred, the computer description says you have 511 M memory ...

Did you play a game while the task was running?

I am out of my depth ... someone else likely will have to jump in ...

Profile DoctorNow
Avatar
Send message
Joined: 18 Aug 07
Posts: 83
Credit: 122,995,082
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5206 - Posted: 3 Jan 2009 | 16:40:07 UTC - in response to Message 5203.

If you look here you can see in the STDERR Out that there is a lack of memory ...

Not sure why that occurred, the computer description says you have 511 M memory ...

Did you play a game while the task was running?

I am out of my depth ... someone else likely will have to jump in ...

It's running on a 64-Bit Windows Version.
As far as I know the 64-Bit app-version has a memory leak.
But it shouldn't happen on the very first WU...
____________
Member of BOINC@Heidelberg and ATA!

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 221,007,857
RAC: 4,333,521
Level
Leu
Scientific publications
watwatwatwatwatwatwatwat
Message 5208 - Posted: 3 Jan 2009 | 17:24:58 UTC - in response to Message 5206.
Last modified: 3 Jan 2009 | 17:26:54 UTC

I thought the memory leak had something to do with the drivers and only on XP 64? On Linux and Vista 64 I never saw something about memeory leaks...

Dukzilla, since you use XP 64bit, could you please install the 180.84 Nvidia drivers and try again?
____________

pixelicious.at - my little photoblog

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5222 - Posted: 3 Jan 2009 | 20:49:35 UTC

For testing purposes I installed 6.4.2. It had the problem that it never requested any GPU work voluntarily, not before I suspended the CPU projects. I accidently had a massive stockpile of CPU work, though.

Now I had a strange situation: I had 14 Milkyway WUs at 48 minutes each.. that's <3h of processing for my 4 cores. No other WUs were running except one GPU-Grid with an estimated runtime of 2h left. Cache setting is 1.25 days and ressource share 50% Milkyway and 37,5% GPU-Grid. Why the hell didn't BOINC request new work? I'm a little worried I'll still have to babysit it..

Is there any client which is more well-behaved? I did not really hear good stuff about the later ones.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5223 - Posted: 3 Jan 2009 | 21:18:22 UTC - in response to Message 5222.

For testing purposes I installed 6.4.2. It had the problem that it never requested any GPU work voluntarily, not before I suspended the CPU projects. I accidently had a massive stockpile of CPU work, though.

Now I had a strange situation: I had 14 Milkyway WUs at 48 minutes each.. that's <3h of processing for my 4 cores. No other WUs were running except one GPU-Grid with an estimated runtime of 2h left. Cache setting is 1.25 days and ressource share 50% Milkyway and 37,5% GPU-Grid. Why the hell didn't BOINC request new work? I'm a little worried I'll still have to babysit it..

Is there any client which is more well-behaved? I did not really hear good stuff about the later ones.

MrS


I does not request new work because it make no distinction between LTD accumulated by the GPU vs the CPU ...

Which is why I tried to make a point about that in this latest fix cycle. We need to be able to properly establish the correct resource shares, THEN make accurate measurements of capabilities, THEN we can accumulate LTD correctly for each resource ... THEN we can fetch work properly ...

THe short answer ... set the R Share to 500 and try again ... I seem to be doing OK with RS of 100 on the i7 for the moment ... of course, I also don't mind going in and resetting LTD to 0 as I have to on that machine to make all things equal again ...

Oh, the other part of the short answer ... yes, you will need to babysit if for no other reason than the site does not issue work correctly ... but that is just my experience ... for you to get work it seems you have to be out or close to out of work before the scheduler issues work ... then you can get, assuming it is sending any at all, two or three tasks one after the other ... then ... tomorrow you get to bite your nails again ...

Phoneman1
Send message
Joined: 25 Nov 08
Posts: 51
Credit: 980,186
RAC: 0
Level
Gly
Scientific publications
watwat
Message 5231 - Posted: 3 Jan 2009 | 22:01:40 UTC - in response to Message 5222.

I've been using 6.5.0 for about two weeks. It seems to keep GPU tasks and CPU tasks in step (providing there is GPU work available).

Only downside I'm seeing (which may or may not be the Boinc client's fault) is the DCF drifts up to around 13.5 after a few units have been completed and stays around there. The estimated time in BoincView is currently 15 days+ ... but it still gets work when it is available. For the last 10 days or so I've stopped resetting the DCF.

I'm running resource shares of 100 / 100 / GPU / Cosmology with 0.5 days cache. I usually have 1 GPU task running and either 2 or three queued on the machine with the GTX260 card. The machine with the 8600GTS card normally has 1 or 2 GPU tasks queued and one running. Both machines are Vista HP 32 bit.

Boinc won't over-commit itself and at the moment treats GPU tasks like any other for work fetch. So your current work buffer must be able to be completed within the individual project deadlines before it will get any more. So, for example, if you had a 1 day cache and ran 4 other projects each of which had a four day deadline it wouldn't request a GPU task until there was time available and the ratio of LTD had shifted in GPU's favour. Paul's suggestion slants things more in GPU's favour but until time becomes available it won't request work. That is why suspending other projects and manually updating GPU brings a request for > 0 seconds of work. But even with a >0 seconds request you may not get any tasks....

A lot of the present work fetch problems though is because GPU does not seem to have work available. The web site doesn't help, it doesn't separate PS3 jobs ready to send from GPU jobs ready to send so people see the number ready to send and get very frustrated!

Phoneman1

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5241 - Posted: 3 Jan 2009 | 23:44:51 UTC

Somehow my 6.4.2 is totally confused. It's got 24 Milkyways now, which amounts to 4.8h of work on my CPU and 9h when factoring in the 50% resource share. There's also a GPU-WU at estimated 10h cpu time. Adding the 10h of GPU-Grid I get a total time of 7.3h. Nevermind the one WU at 10h.. this tells me that all Milkyways will be done in less than 7.3h. They have a 3 days deadline (just got them today) .. but still BOINC runs them in high priority mode. WTF?! My on fraction etc. looks alright to me.

Guess I should try 6.5.0. But then BOINC will only have 1 GPU-Grid WU at a time due to the 15 days estimate, wouldn't it?

MrS
____________
Scanning for our furry friends since Jan 2002

Scott Brown
Send message
Joined: 21 Oct 08
Posts: 144
Credit: 2,973,555
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwatwat
Message 5248 - Posted: 4 Jan 2009 | 3:03:49 UTC - in response to Message 5241.


6.5.0 gave me two when I added a new (albeit slow - 9500GT) card, so at least initially you should get multiple workunits.

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5253 - Posted: 4 Jan 2009 | 5:48:28 UTC - in response to Message 5241.

Somehow my 6.4.2 is totally confused. It's got 24 Milkyways now, which amounts to 4.8h of work on my CPU and 9h when factoring in the 50% resource share. There's also a GPU-WU at estimated 10h cpu time. Adding the 10h of GPU-Grid I get a total time of 7.3h. Nevermind the one WU at 10h.. this tells me that all Milkyways will be done in less than 7.3h. They have a 3 days deadline (just got them today) .. but still BOINC runs them in high priority mode. WTF?! My on fraction etc. looks alright to me.

Guess I should try 6.5.0. But then BOINC will only have 1 GPU-Grid WU at a time due to the 15 days estimate, wouldn't it?

MrS



I have not figured out the rules used ... my 9800 when it is allowed to pull work usually gets two ... at 0.5 buffer and only GPU Grid active the 280 pulls 3, at 1.0 it pulled 6 (giving me 5 spare) ...

But, I have also pulled those same numbers with all tasks in the buffer adding up to a couple days worth of work too ...

RIght now the i7 has buffer of 1.0 and is running SIMAP for which it has a least a days worth of work. Also AI, and Cosmology have a full days work ... along with WCG's days work ... giving me 5 days? I don't know ... heck I just want to keep it busy ...

Dukzilla
Send message
Joined: 3 Jan 09
Posts: 6
Credit: 73,310,743
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5254 - Posted: 4 Jan 2009 | 5:48:39 UTC - in response to Message 5208.

I thought the memory leak had something to do with the drivers and only on XP 64? On Linux and Vista 64 I never saw something about memory leaks...

Dukzilla, since you use XP 64bit, could you please install the 180.84 Nvidia drivers and try again?


I don't know what I did right or wrong but its working now. I woke up this morning to find my 2nd monitor wouldn't accept a setting above 1024x768. The order was also backwards. I rebooted (for the 3rd time in 24 hours). The 2nd monitor came up normally and in the right position. I decided to do a manual update for the project. I watched the message log. It gave me the same initial error messages BUT it downloaded 4 work units. Lo and Behold, its running. I don't know what the heck is going on. I don't care. Its working. Mostly. Shouldn't I see a total of 5 processes running. I have an AMD 9850 quad core with 4GB RAM and the GPU with 512MB.

Is there a setting I need to adjust to run all 5? I've checked all my projects and they are set to 8 CPU. Also, should I see the message (0.05 CPUs, 1CUDA) at the end of the line for the one running GPUGRID work unit?

I'll be really happy when I actually complete a work unit.

Please bear with the newbie questions. I want to get the most out of this project. It will help both of us.

Thanks again for all of your help.

Greg

BTW - I went through the automatic update process and it said I already have the latest drivers. I'm going to do a manual update and see if that helps.

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5259 - Posted: 4 Jan 2009 | 9:44:31 UTC

Depending on the version of BOINC you are running you will see just the same number as your cores or one more ... There are ways to get the earlier version to use the cores plus one or you could try 6.5.0 which will run plus one out of the box.

There are still server issues and the BOINC Version from my observation makes no difference as to that ...

The message (0.05...) is what you should see for the one task running on the GPU...

There are reports on SaH where the death of the task can cause the GPU to go nuts ... Yours is the first report of that for GPU Grid that I have seen ... so, something unusual is going on ...

The best version I have reports on and that *I* personally are using for the Nvidia drivers is 180.48 you may want to snag those and install them. Just like with games, the latest video drivers got real important with BOINC.

I too will be happy when you complete a task ...

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 221,007,857
RAC: 4,333,521
Level
Leu
Scientific publications
watwatwatwatwatwatwatwat
Message 5260 - Posted: 4 Jan 2009 | 10:31:25 UTC - in response to Message 5259.
Last modified: 4 Jan 2009 | 10:33:54 UTC



The best version I have reports on and that *I* personally are using for the Nvidia drivers is 180.48 you may want to snag those and install them...


Yea, it's good that this driver is working fine for you, but with XP 64bit he should really use 180.84. See this forum thread...
____________

pixelicious.at - my little photoblog

Dukzilla
Send message
Joined: 3 Jan 09
Posts: 6
Credit: 73,310,743
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5273 - Posted: 4 Jan 2009 | 18:36:53 UTC - in response to Message 5259.

Depending on the version of BOINC you are running you will see just the same number as your cores or one more ... There are ways to get the earlier version to use the cores plus one or you could try 6.5.0 which will run plus one out of the box.

There are still server issues and the BOINC Version from my observation makes no difference as to that ...

The message (0.05...) is what you should see for the one task running on the GPU...

There are reports on SaH where the death of the task can cause the GPU to go nuts ... Yours is the first report of that for GPU Grid that I have seen ... so, something unusual is going on ...

The best version I have reports on and that *I* personally are using for the Nvidia drivers is 180.48 you may want to snag those and install them. Just like with games, the latest video drivers got real important with BOINC.

I too will be happy when you complete a task ...


I finally completed a task and manually updated so it would post. The remaining tasks all say ~20 hours to complete. This is up from the 2 hours is was waying yesterday. I updated the drivers to 180.48 when I was going through this process with SETI. I did it again last night just to make sure.

My next update will be to 6.5.0. Maybe that will make a difference.

Greg

Garfield
Send message
Joined: 3 Jan 09
Posts: 2
Credit: 1,866,345
RAC: 0
Level
Ala
Scientific publications
watwatwatwat
Message 5319 - Posted: 5 Jan 2009 | 22:40:58 UTC

Hi there,
everything went fine for me, until i tried to OC my 9800GTX+ 1GB...
From that time, not a single ping. I try all mentioned and go to defauls... :-(
I have one more 9800GTX 512 in my bag now, If there will be more troubles than poins in this projec, I will sell it and buy some second hand QUAD instead of it...
____________

Garfield
Send message
Joined: 3 Jan 09
Posts: 2
Credit: 1,866,345
RAC: 0
Level
Ala
Scientific publications
watwatwatwat
Message 5336 - Posted: 6 Jan 2009 | 11:08:48 UTC - in response to Message 5319.

I went to default clocks on my card, stopped other projects, restarted computer, then it loaded from GPUGRID some tests and now i have 4 new WU... :-D
Great. Let us see the future...
(running 6.4.5)

6.1.2009 11:48:26||Starting BOINC client version 6.4.5 for windows_intelx86
6.1.2009 11:48:26||log flags: task, file_xfer, sched_ops
6.1.2009 11:48:26||Libraries: libcurl/7.19.0 OpenSSL/0.9.8i zlib/1.2.3
6.1.2009 11:48:26||Data directory: C:\Documents and Settings\All Users\Data aplikací\BOINC
6.1.2009 11:48:26||Running under account Garfield
6.1.2009 11:48:27||Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz [x86 Family 6 Model 23 Stepping 7]
6.1.2009 11:48:27||Processor features: fpu tsc pae nx sse sse2 mmx
6.1.2009 11:48:27||OS: Microsoft Windows XP: Home x86 Editon, Service Pack 3, (05.01.2600.00)
6.1.2009 11:48:27||Memory: 3.25 GB physical, 3.09 GB virtual
6.1.2009 11:48:27||Disk: 3.72 GB total, 2.02 GB free
6.1.2009 11:48:27||Local time is UTC +1 hours
6.1.2009 11:48:27||Not using a proxy
6.1.2009 11:48:28||CUDA devices found
6.1.2009 11:48:28||Coprocessor: GeForce 9800 GTX/9800 GTX+ (1)
6.1.2009 11:48:28|Milkyway@home|URL: http://milkyway.cs.rpi.edu/milkyway/; Computer ID: 42149; location: (none); project prefs: default
6.1.2009 11:48:28|QMC@HOME|URL: http://qah.uni-muenster.de/; Computer ID: 131092; location: (none); project prefs: default
6.1.2009 11:48:28|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 4738967; location: (none); project prefs: default
6.1.2009 11:48:28|GPUGRID|URL: http://www.gpugrid.net/; Computer ID: 21614; location: (none); project prefs: default
6.1.2009 11:48:28||No general preferences found - using BOINC defaults
6.1.2009 11:48:28||Reading preferences override file
6.1.2009 11:48:28||Preferences limit memory usage when active to 2994.28MB
6.1.2009 11:48:28||Preferences limit memory usage when idle to 3160.62MB
6.1.2009 11:48:28||Preferences limit disk usage to 1.86GB
6.1.2009 11:48:29|GPUGRID|Sending scheduler request: To fetch work. Requesting 207360 seconds of work, reporting 0 completed tasks
6.1.2009 11:48:32||Project communication failed: attempting access to reference site
6.1.2009 11:48:52|GPUGRID|Scheduler request failed: Couldn't resolve host name
6.1.2009 11:48:53||Internet access OK - project servers may be temporarily down.
6.1.2009 11:49:52|GPUGRID|Sending scheduler request: To fetch work. Requesting 207360 seconds of work, reporting 0 completed tasks
6.1.2009 11:49:57|GPUGRID|Scheduler request completed: got 2 new tasks
6.1.2009 11:50:00|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-LICENSE
6.1.2009 11:50:00|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-COPYRIGHT
6.1.2009 11:50:02|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-LICENSE
6.1.2009 11:50:02|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-COPYRIGHT
6.1.2009 11:50:02|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_1
6.1.2009 11:50:02|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_2
6.1.2009 11:50:09|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_1
6.1.2009 11:50:09|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_2
6.1.2009 11:50:09|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_3
6.1.2009 11:50:09|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-complex_full.sol.ionized.pdb
6.1.2009 11:50:13|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-VsX9714-SH2_US-10-40-SH2_US240000_3
6.1.2009 11:50:13|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-complex_full.sol.ionized.psf
6.1.2009 11:50:31|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-complex_full.sol.ionized.pdb
6.1.2009 11:50:31|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-parameters
6.1.2009 11:50:31|GPUGRID|Sending scheduler request: To fetch work. Requesting 205510 seconds of work, reporting 0 completed tasks
6.1.2009 11:50:42|GPUGRID|Finished download of VsX9714-SH2_US-11-40-SH2_US240000-parameters
6.1.2009 11:50:42|GPUGRID|Started download of VsX9714-SH2_US-11-40-SH2_US240000-SH2_US240000
6.1.2009 11:50:42|GPUGRID|Scheduler request completed: got 2 new tasks

____________

Dukzilla
Send message
Joined: 3 Jan 09
Posts: 6
Credit: 73,310,743
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5344 - Posted: 6 Jan 2009 | 13:58:26 UTC - in response to Message 5273.


My next update will be to 6.5.0. Maybe that will make a difference.
Greg


The update (and several reboots) seems to have resolved most of the issues with S@H and GPUGRID. I can now process work units for both. Sometimes. I'm still getting periods where all the remaining CUDA work units fail with Computation Error. A reboot seems to fix this. I have to reboot not just restart the BOINC client. I'll keep monitoring the boards for both projects and see if a solution presents itself. Thanks for all of your help so far.

Greg

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5349 - Posted: 6 Jan 2009 | 17:48:55 UTC - in response to Message 5344.


My next update will be to 6.5.0. Maybe that will make a difference.
Greg


The update (and several reboots) seems to have resolved most of the issues with S@H and GPUGRID. I can now process work units for both. Sometimes. I'm still getting periods where all the remaining CUDA work units fail with Computation Error. A reboot seems to fix this. I have to reboot not just restart the BOINC client. I'll keep monitoring the boards for both projects and see if a solution presents itself. Thanks for all of your help so far.

Greg


I recall notes on SaH that this is a common problem with their application, that once it crashes in some particular way, which may not bring down the system, all remaining work will be CE ...

Just a suggestion, try running only one project at a time. I have had a couple tasks here that died when I first started and zero since ... not conclusive, but nearly two solid weeks I think it has been since my last crash ... so, at the moment, I think the weakness is still on the SaH end ... YMMV

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5415 - Posted: 9 Jan 2009 | 8:38:17 UTC

Geez, 6.4.2 is so nuts! I'll try 6.5.0 now..

Twice this week BOINC decided it would be a good idea not to fetch any CPU work and just run dry until I return from work and discover the disaster.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 5420 - Posted: 9 Jan 2009 | 12:49:36 UTC - in response to Message 5415.
Last modified: 9 Jan 2009 | 12:50:48 UTC

Geez, 6.4.2 is so nuts! I'll try 6.5.0 now..

Twice this week BOINC decided it would be a good idea not to fetch any CPU work and just run dry until I return from work and discover the disaster.

MrS


6.5.0 *MAY* work better, or the same ...

The problem you are likely running into is the work fetch policy not making a distinction between work done on the CPU vs work done on the GPU ...

I configured for resource share of 200 (high, but not highest) and 0.4 esxtra buffer ... and I still watch it like a hawk ... at times I have gone in and reset LTDs to zero to force more work fetch ...

Not sure when we may expect the next version of BOINC Manager that begins to address this ... rumor has mid next month which likely means later than that ...

{edit}Of course I am at home all the time and I have the luxury of being able to hover over my systems ... and not being able to sleep means that I do it at all hours of the day and night ... :) {/edit}

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 5444 - Posted: 10 Jan 2009 | 13:25:43 UTC
Last modified: 10 Jan 2009 | 13:26:53 UTC

I'm happy with the change: 6.5.0 behaves just as expected for me!

I don't have to suspend all cpu tasks any more to make it request new work and it actually tries to establish a cash of the size I request, not only a few hours like 6.4.2 did.

Edit:

The problem you are likely running into is the work fetch policy not making a distinction between work done on the CPU vs work done on the GPU


Yes, that was definitely a reason, coupled with buggy code.

MrS
____________
Scanning for our furry friends since Jan 2002

uBronan
Avatar
Send message
Joined: 1 Feb 09
Posts: 139
Credit: 575,023
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 6734 - Posted: 17 Feb 2009 | 22:23:10 UTC - in response to Message 5420.

after i had the same issues as many here i found myself with help of search engine copy of boinc 6.5.0 and yes all works fine again
version 6.4.5 gave so many problems with other applications that i even tried the newest but those did not really run nicely as well including 6.6.7
i am amazed to see this version running so smooth

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 6740 - Posted: 18 Feb 2009 | 6:19:29 UTC - in response to Message 6734.

after i had the same issues as many here i found myself with help of search engine copy of boinc 6.5.0 and yes all works fine again
version 6.4.5 gave so many problems with other applications that i even tried the newest but those did not really run nicely as well including 6.6.7
i am amazed to see this version running so smooth


Here is a BOINC "tip", download a version of BOINC and try it ... if it works, KEEP A COPY OF THE INSTALLER ...

Keep at least 2 versions back ... 6.5.0 works well, so does 5.10.45 and so I keep copies of both ...

When I find another stable copy and it has run for several months I will think about deleting my old copies of 5.10.45 ...

But, I don't expect a stable version of BOINC for awhile ... and it is more than likely to NOT be 6.7.0 ...

uBronan
Avatar
Send message
Joined: 1 Feb 09
Posts: 139
Credit: 575,023
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 6772 - Posted: 19 Feb 2009 | 9:49:10 UTC - in response to Message 6740.
Last modified: 19 Feb 2009 | 9:56:00 UTC

I was thinking that 6.4.5 was a stable version untill i tried to install the sse optimized clients.
All optimized apps are being totally ignored by this version, when i placed the files in the maps where they must be they did not work.
I was thinking it was the applications who where faulty but after i downloaded other optimized applications found that the issue was with them also.
So i took my old 4.2.19 and installed it and direct after i was done all the optimized applications started :)
Ofcourse i reinstalled the 6.4.5 version to check and poef gone optimized applications and back to originals.
So yes i agree with you i am gonna keep a bit more older versions since newer ones and also the recommended ones not work like they should.
For now i think 6.5.0 runs the best :D

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 6784 - Posted: 19 Feb 2009 | 18:44:41 UTC - in response to Message 6772.

Wow, a 4-series client?! Milkyway optimized certainly works with 6.5.0, so 6.4.5 looks even worse if it doesn't support the anonymous platform properly.

MrS
____________
Scanning for our furry friends since Jan 2002

Post to thread

Message boards : Graphics cards (GPUs) : No work available? Please install 6.4.2

//