Message boards : Graphics cards (GPUs) : High amount of errors

silence911
Joined: 1 Oct 16
Posts: 4
Credit: 80,984,514
RAC: 0
Message 54679 - Posted: 13 May 2020 | 18:45:32 UTC
Last modified: 13 May 2020 | 18:46:46 UTC

Can someone help me understand why I have so many errors? I am currently running a 2060 KO with no overclocking or undervolting at all. Is it normal to have so many errored units?



Here is the pastebin of one of the units that ended in error:

https://pastebin.com/4MjKFUHV

Thank you in advance,
Son

Keith Myers
Joined: 13 Dec 17
Posts: 1340
Credit: 7,653,573,724
RAC: 13,216,170
Message 54680 - Posted: 13 May 2020 | 19:00:19 UTC - in response to Message 54679.

I don't see anything wrong with your valid task. Look at the wingmen on any of your errored tasks and you will see that everyone failed the task.

We had a big batch of incorrectly formatted work lately that bombs out on every host.

Don't worry about it.

Pop Piasa
Joined: 8 Aug 19
Posts: 252
Credit: 458,054,251
RAC: 0
Message 54681 - Posted: 13 May 2020 | 22:16:39 UTC

The EXCEPTIONAL CONDITION error...

(unknown error) - exit code 195 (0xc3)</message>
<stderr_txt>
20:40:17 (9532): wrapper (7.9.26016): starting
20:40:18 (9532): wrapper: running acemd3.exe (--boinc input --device 0)
EXCEPTIONAL CONDITION: src\mdio\bincoord.c, line 193: "nelems != 1"
20:40:20 (9532): acemd3.exe exited; CPU time 0.000000
20:40:20 (9532): app exit status: 0xc0000409
20:40:20 (9532): called boinc_finish(195)


That's the error that I've been seeing too. Bad units, all. Amazing that you received so many in a row. I only saw 3 in a row.

🤔I'm curious whether they will result in another batch of patched WUs or if this was an anticipated error factor and these are proof of theoretical boundaries.

silence911
Joined: 1 Oct 16
Posts: 4
Credit: 80,984,514
RAC: 0
Message 54682 - Posted: 14 May 2020 | 1:41:05 UTC - in response to Message 54681.

Here's another error that I also see. It usually happens when the tasks are nearly finished.

https://pastebin.com/rJgJdKJn

robertmiles
Joined: 16 Apr 09
Posts: 503
Credit: 755,370,933
RAC: 212,472
Message 54683 - Posted: 14 May 2020 | 4:01:03 UTC - in response to Message 54682.
Last modified: 14 May 2020 | 4:04:50 UTC

Looks like the important part of the messages is the section just after stderr, down at the bottom:

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>3nkuA00_320_2-TONI_MDADpr4sn-9-10-RND1933_0_0</file_name>
<error_code>-240 (stat() failed)</error_code>
</file_xfer_error>
</message>
]]>

The stat() function attempts to find information about a file. I suspect that it could not find the file, and therefore could not upload it.

I did not spot anything that explains why there is a problem with that file.
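
To illustrate what a failed stat() looks like, here is a minimal Python sketch. It is not BOINC's code; the helper name `describe_output_file` and the file names are made up for the example. It only shows that asking for information about a file that is no longer on disk fails before any upload could start.

```python
import os
import tempfile


def describe_output_file(path):
    """Return the file size in bytes, or None if stat() cannot find the file."""
    try:
        info = os.stat(path)  # same idea as the C stat() call: query file metadata
    except FileNotFoundError:
        # The file is gone (deleted, moved, or quarantined), so there is nothing to upload.
        return None
    return info.st_size


# Demonstration with throwaway files (hypothetical names, not real task outputs).
with tempfile.TemporaryDirectory() as tmp:
    present = os.path.join(tmp, "result_present")
    with open(present, "wb") as fh:
        fh.write(b"12345")
    missing = os.path.join(tmp, "result_missing")

    print(describe_output_file(present))  # size of the file that exists
    print(describe_output_file(missing))  # None: stat() could not find it
```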

rod4x4
Joined: 4 Aug 14
Posts: 266
Credit: 2,219,935,054
RAC: 0
Message 54684 - Posted: 14 May 2020 | 6:59:52 UTC - in response to Message 54683.
Last modified: 14 May 2020 | 7:17:45 UTC

The task is completing successfully but not uploading, because the output file is missing when the stat() function looks for it on the disk.
You may need to whitelist your \ProgramData\BOINC folder in your antivirus software. Your AV may be quarantining the file before it can be uploaded.

silence911
Joined: 1 Oct 16
Posts: 4
Credit: 80,984,514
RAC: 0
Message 54685 - Posted: 14 May 2020 | 7:18:49 UTC - in response to Message 54684.

I only use Windows Defender. Currently I don't see any option to whitelist a specific folder. How do I do it?

rod4x4
Joined: 4 Aug 14
Posts: 266
Credit: 2,219,935,054
RAC: 0
Message 54687 - Posted: 14 May 2020 | 13:10:42 UTC - in response to Message 54685.

I'm not a power user of Windows Defender, but this may be a good start:
https://support.microsoft.com/en-au/help/4028485/windows-10-add-an-exclusion-to-windows-security

silence911
Joined: 1 Oct 16
Posts: 4
Credit: 80,984,514
RAC: 0
Message 54694 - Posted: 14 May 2020 | 20:26:01 UTC - in response to Message 54687.

After following the advice to whitelist BOINC, I noticed this error in my log:

5/14/2020 1:01:51 PM | GPUGRID | [error] Can't rename output file slots/3/progress.log to projects/www.gpugrid.net/4h65A02_379_4-TONI_MDADpr4sh-9-10-RND9478_1_0: Error 32
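
For context, Windows error 32 is ERROR_SHARING_VIOLATION: the rename failed because another process had the file open at that moment. The Python sketch below is not how BOINC handles it; `rename_with_retry` and its parameters are hypothetical. It only shows the general pattern of retrying a rename while something else (for example a scanner) briefly holds the file. In Python on Windows a sharing violation is expected to surface as `PermissionError`.

```python
import os
import time


def rename_with_retry(src, dst, attempts=5, delay=0.5):
    """Try to move src to dst, retrying while another process holds the file.

    Returns True on success and False if every attempt was blocked.
    A missing source file is not retried; that error propagates to the caller.
    """
    for attempt in range(attempts):
        try:
            os.replace(src, dst)
            return True
        except PermissionError:
            # File is in use elsewhere; wait and try again unless this was the last attempt.
            if attempt < attempts - 1:
                time.sleep(delay)
    return False
```

If the file stays locked for the whole retry window, the helper gives up and returns False, which is roughly the situation the log line reports.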

Keith Myers
Joined: 13 Dec 17
Posts: 1340
Credit: 7,653,573,724
RAC: 13,216,170
Message 54697 - Posted: 15 May 2020 | 1:03:49 UTC - in response to Message 54694.

You probably did not include the sub-folders of the main BOINC folder in the whitelist.
