Author |
Message |
OtterSend message
Joined: 6 Jan 09 Posts: 11 Credit: 6,376,844 RAC: 0 Level
Scientific publications
|
So now I've got machines that are working on the new WUs, but they are getting messages that they can't have work because they won't be able to finish it..
edit - I spoke too soon, I still have PS3s failing to download as well.
4/21/2009 5:15:15 PM|GPUGRID|Message from server: No work sent
4/21/2009 5:15:15 PM|GPUGRID|Message from server: Full-atom molecular dynamics is not available for your type of computer.
4/21/2009 5:15:15 PM|GPUGRID|Message from server: (won't finish in time) BOINC runs 97.3% of time, computation enabled 99.8% of that
4/21/2009 5:15:15 PM|GPUGRID|Deferring communication for 31 sec
4/21/2009 5:15:15 PM|GPUGRID|Reason: requested by project
4/21/2009 5:15:15 PM|GPUGRID|Deferring communication for 1 min 0 sec
4/21/2009 5:15:15 PM|GPUGRID|Reason: no work from project
4/21/2009 5:50:16 PM|GPUGRID|[file_xfer] Started download of file cellmd2_5.03_powerpc64-ps3-linux-gnu
4/21/2009 5:50:16 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-LICENSE
4/21/2009 5:50:17 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-LICENSE: file not found
4/21/2009 5:50:17 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-LICENSE
4/21/2009 5:50:17 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT
4/21/2009 5:50:18 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT: file not found
4/21/2009 5:50:18 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT
4/21/2009 5:50:18 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1
4/21/2009 5:50:19 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1: file not found
4/21/2009 5:50:19 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1
4/21/2009 5:50:19 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2
4/21/2009 5:50:21 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2: file not found
4/21/2009 5:50:21 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2
4/21/2009 5:50:21 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/21/2009 5:50:22 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3: file not found
4/21/2009 5:50:22 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/21/2009 5:50:22 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/21/2009 5:50:23 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb: file not found
4/21/2009 5:50:23 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/21/2009 5:50:23 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/21/2009 5:50:24 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf: file not found
4/21/2009 5:50:24 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/21/2009 5:50:24 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/21/2009 5:50:25 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-parameters: file not found
4/21/2009 5:50:25 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/21/2009 5:50:25 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
4/21/2009 5:50:26 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000: file not found
4/21/2009 5:50:26 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
4/21/2009 5:50:26 PM|GPUGRID|[file_xfer] Started download of file logops3grid.png
4/21/2009 5:50:27 PM|GPUGRID|[file_xfer] Finished download of file logops3grid.png
4/21/2009 5:50:27 PM|GPUGRID|[file_xfer] Throughput 14378 bytes/sec
4/21/2009 5:50:27 PM|GPUGRID|[file_xfer] Started download of file project_1.png
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Finished download of file cellmd2_5.03_powerpc64-ps3-linux-gnu
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Throughput 320169 bytes/sec
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Finished download of file project_1.png
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Throughput 33860 bytes/sec
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Started download of file project_2.png
4/21/2009 5:50:29 PM|GPUGRID|[file_xfer] Started download of file project_3.png
4/21/2009 5:50:31 PM|GPUGRID|[file_xfer] Finished download of file project_2.png
4/21/2009 5:50:31 PM|GPUGRID|[file_xfer] Throughput 34960 bytes/sec
4/21/2009 5:50:31 PM|GPUGRID|[file_xfer] Finished download of file project_3.png
4/21/2009 5:50:31 PM|GPUGRID|[file_xfer] Throughput 39677 bytes/sec
|
|
|
OtterSend message
Joined: 6 Jan 09 Posts: 11 Credit: 6,376,844 RAC: 0 Level
Scientific publications
|
Hate to bump myself, but this is still an issue. Multiple PS3s failing to download WUs getting file not found errors |
|
|
OtterSend message
Joined: 6 Jan 09 Posts: 11 Credit: 6,376,844 RAC: 0 Level
Scientific publications
|
Tried reseting the specific machines (6 different ones), and restarting BOINC all to no avail.
4/22/2009 8:46:36 PM||Starting BOINC client version 5.10.6 for powerpc64-ps3-linux-gnu
4/22/2009 8:46:36 PM||log flags: task, file_xfer, sched_ops
4/22/2009 8:46:36 PM||Libraries: libcurl/7.16.2 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5
4/22/2009 8:46:36 PM||Executing as a daemon
4/22/2009 8:46:36 PM||Data directory: /opt/boinc
4/22/2009 8:46:36 PM||Processor: 2 PS3PF Cell Broadband Engine
4/22/2009 8:46:36 PM||Processor features: altivec
4/22/2009 8:46:36 PM||Memory: 197.47 MB physical, 512.11 MB virtual
4/22/2009 8:46:36 PM||Disk: 15.21 GB total, 3.44 GB free
4/22/2009 8:46:36 PM|GPUGRID|URL: http://www.gpugrid.net/; Computer ID: 31099; location: (none); project prefs: default
4/22/2009 8:46:36 PM||General prefs: from http://boinc.iaik.tugraz.at/sha1_coll_search/ (last modified 2009-01-28 11:23:21)
4/22/2009 8:46:36 PM||Host location: none
4/22/2009 8:46:36 PM||General prefs: using your defaults
4/22/2009 8:46:36 PM||Preferences limit memory usage when active to 187.60MB
4/22/2009 8:46:36 PM||Preferences limit memory usage when idle to 187.60MB
4/22/2009 8:46:36 PM||Preferences limit disk usage to 0.64GB
4/22/2009 8:46:36 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/22/2009 8:46:36 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/22/2009 8:46:38 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3: file not found
4/22/2009 8:46:38 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/22/2009 8:46:38 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb: file not found
4/22/2009 8:46:38 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/22/2009 8:46:38 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/22/2009 8:46:38 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/22/2009 8:46:41 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf: file not found
4/22/2009 8:46:41 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/22/2009 8:46:41 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-parameters: file not found
4/22/2009 8:46:41 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/22/2009 8:46:41 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
4/22/2009 8:46:43 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000: file not found
4/22/2009 8:46:43 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
4/22/2009 8:47:28 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-LICENSE
4/22/2009 8:47:28 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT
4/22/2009 8:47:29 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-LICENSE: file not found
4/22/2009 8:47:29 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-LICENSE
4/22/2009 8:47:29 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT: file not found
4/22/2009 8:47:29 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-COPYRIGHT
4/22/2009 8:47:31 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1
4/22/2009 8:47:31 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2
4/22/2009 8:47:32 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1: file not found
4/22/2009 8:47:32 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_1
4/22/2009 8:47:32 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2: file not found
4/22/2009 8:47:32 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_2
4/22/2009 8:47:38 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/22/2009 8:47:38 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/22/2009 8:47:39 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3: file not found
4/22/2009 8:47:39 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-BE18971-GAPREV-36-50-GAPREV800000_3
4/22/2009 8:47:39 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb: file not found
4/22/2009 8:47:39 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.pdb
4/22/2009 8:47:41 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/22/2009 8:47:41 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/22/2009 8:47:42 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf: file not found
4/22/2009 8:47:42 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-grama.ionized.psf
4/22/2009 8:47:42 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-parameters: file not found
4/22/2009 8:47:42 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-parameters
4/22/2009 8:47:43 PM|GPUGRID|[file_xfer] Started download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
4/22/2009 8:47:44 PM|GPUGRID|[file_xfer] Temporarily failed download of BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000: file not found
4/22/2009 8:47:44 PM|GPUGRID|Backing off 1 min 0 sec on download of file BE18971-GAPREV-37-50-GAPREV800000-GAPREV800000
|
|
|
OtterSend message
Joined: 6 Jan 09 Posts: 11 Credit: 6,376,844 RAC: 0 Level
Scientific publications
|
Heading on a week of failed downloads (and general trouble with PS3s) - and no response.
Do you plan to keep supporting them or not? If not just let us know and we can move on. |
|
|
VagnSend message
Joined: 5 Sep 08 Posts: 7 Credit: 1,611,207 RAC: 0 Level
Scientific publications
|
Problem is still there.
I just cancelled the download of the next WU that the client had been trying to download for some 4 hours without success.
Here some of the 'messages' from the cancelled wu :
25 apr 2009 09:37:07 CEST|GPUGRID|[file_xfer] Started download of file nr19435-GAPREV-37-50-GAPREV1450000-LICENSE
25 apr 2009 09:37:07 CEST|GPUGRID|[file_xfer] Started download of file nr19435-GAPREV-37-50-GAPREV1450000-COPYRIGHT
25 apr 2009 09:37:08 CEST|GPUGRID|[file_xfer] Temporarily failed download of nr19435-GAPREV-37-50-GAPREV1450000-LICENSE: file not found
That however didn't improve the situation. Instead I'm getting these messages :
25 apr 2009 13:29:54 CEST|GPUGRID|Message from server: Full-atom molecular dynamics is not available for your type of computer.
25 apr 2009 13:29:54 CEST|GPUGRID|Message from server: (won't finish in time) BOINC runs 99.7% of time, computation enabled 100.0% of that
25 apr 2009 13:29:54 CEST|GPUGRID|Deferring communication for 31 sec
Does anyone know what to do ? The present '128000-IBUCH_GRAUS-0-100-RND7684_0' wu is nearly finished.
Or when we / I can expect these issues to be solved ?
regards Vagn
|
|
|
ignasiSend message
Joined: 10 Apr 08 Posts: 254 Credit: 16,836,000 RAC: 0 Level
Scientific publications
|
Have all these errors happened when trying to download these *GAPREV* WUs?
I'm pretty sure these have to do with the server fall two weeks ago. After recovery, loads of data was coming back in and the disks got quickly filled up. We had to manually babysit it and remove data by batches of Gb. Series of WU as these *GAPREV* had been interrupted.
Recently though, we refilled with new job such as these *IBUCH_GRAUS*.
I hope this helps in clarifying the situation.
Thanks for your patience,
ignasi |
|
|
ToniVolunteer moderator Project administrator Project developer Project tester Project scientist Send message
Joined: 9 Dec 08 Posts: 1006 Credit: 5,068,599 RAC: 0 Level
Scientific publications
|
Hi all,
apologizes for the late response. We are investigating, but in the meantime feel free to abort WUs which cause "download error" - there should be a fair number of other types now. Thanks for your patience!
Edit: Ignasi anticipated me :-) |
|
|
OtterSend message
Joined: 6 Jan 09 Posts: 11 Credit: 6,376,844 RAC: 0 Level
Scientific publications
|
Yes they have all been GAPREV-*
My guess is that you have old WUs floating around that are trapped in the system (fail on one machine -> timeout -> resend to new machine -> repeat)
The SHORT WUs all work, but I keep getting the bad ones (probably because I run 17 PS3s, so I have more chances to get bad stuff) |
|
|
ToniVolunteer moderator Project administrator Project developer Project tester Project scientist Send message
Joined: 9 Dec 08 Posts: 1006 Credit: 5,068,599 RAC: 0 Level
Scientific publications
|
Thanks Otter, you helped us spot a tricky problem with the generation of WUs! ... :-)
(We posted an update in the PS3 and GPU forums.) |
|
|