Message boards : Graphics cards (GPUs) : New version tasks failing on Windows hosts
Author | Message |
---|---|
I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log. | |
ID: 57318 | Rating: 0 | |
You're getting error code 0xc0000135, which officially means 'status_dll_not_found'. | |
ID: 57319 | Rating: 0 | |
My first successful WU from this particular batch: | |
ID: 57320 | Rating: 0 | |
"My first successful WU from this particular batch:" Looking at the Hosts Ranking page, many other Windows hosts are starting to complete their tasks and are quickly climbing in rank. Nice to see that Windows hosts are back contributing to the project along with Linux ones! Output file 09 is very large, over 472 MB. Just a reminder: if a file exceeds 512 MB, its upload to the server will stall. Ian&Steve C. mentioned this, and it was discussed in the thread "can't upload results. file size too big being blocked?". I experienced the same problem on one of my hosts, see Message #57194. | |
ID: 57321 | Rating: 0 | |
The file size limit is set on the server, but implemented by the client on your own machine. It's possible to inspect the current value in client_state.xml, and even change it if you catch it soon enough. | |
ID: 57322 | Rating: 0 | |
"I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log." Sometimes resetting the Gpugrid project in BOINC Manager helps, since all app-related files are downloaded again. | |
ID: 57323 | Rating: 0 | |
"I'm getting a bunch of new work for my Windows hosts but they are all failing with Error while computing. Here is a sample stderr log." Perhaps worth a try, but I think it's unlikely to help with this one. Another tried-and-tested solution to some rare problems is to perform a full system restart. | |
ID: 57324 | Rating: 0 | |
Another WU is done: | |
ID: 57325 | Rating: 0 | |
I have had three tasks complete successfully. They all ran on Quadro RTX 4000 GPUs. Is it possible that only RTX series cards will work? | |
ID: 57326 | Rating: 0 | |
I have completed two tasks so far, one on an RTX 2080 and another on a GTX 1080 Ti. | |
ID: 57327 | Rating: 0 | |
Can anyone tell whether these new tasks run on Ampere cards, too? | |
ID: 57328 | Rating: 0 | |
"Can anyone tell whether these new tasks run on Ampere cards, too?" Yes, they run fine on Ampere cards also. | |
ID: 57330 | Rating: 0 | |
"Can anyone tell whether these new tasks run on Ampere cards, too?" Thanks, Keith, for the valuable information. Just to make sure: are you talking about Windows too, or just Linux? | |
ID: 57332 | Rating: 0 | |
"Can anyone tell whether these new tasks run on Ampere cards, too?" On both Windows and Linux systems with updated drivers, at last. | |
ID: 57333 | Rating: 0 | |
I found one task that failed and the acemd3 application had crashed. | |
ID: 57334 | Rating: 0 | |
I had this happen with Folding@Home, several weeks ago. Maybe you are talking about the same project anyway :-) | |
ID: 57339 | Rating: 0 | |
I've now got a Windows v218, flavour cuda101, running and saved for posterity. | |
ID: 57345 | Rating: 0 | |
I think the problem is due to older Windows OS versions that don't have the newer Microsoft Visual C++ 2015-2019 Redistributable. The new ACEMD app seems to require this to work. | |
ID: 57353 | Rating: 0 | |
Well, as you can see from my screenshot yesterday (which was taken on a machine running Windows 7, BTW), acemd3 has loaded vcruntime140 from the BOINC slot directory - i.e., from an image circulated by the project via its various downloads. That file is actually v14.28.29325.2, dated Fri Sep 25 2020. | |
ID: 57356 | Rating: 0 | |
Well, welcome to DLL Hell (yes, that's a thing). | |
ID: 57357 | Rating: 0 | |
Good news. The machine which failed yesterday is now running an acemd3 cuda101 task. Here is the full set of C runtime library files, and where they're being run from: | |
ID: 57359 | Rating: 0 | |
"I'll drop a line to Gianni, to pass on to the development / deployment team." Any reaction from their side so far? | |
ID: 57374 | Rating: 0 | |
Sadly, none at all. I might try Toni, after our conversation in the news thread yesterday. | |
ID: 57375 | Rating: 0 | |
This is already fixed in acemdbeta and will be fixed as soon as we deploy the new app. | |
ID: 57413 | Rating: 0 | |
Sadly, I'm not completely convinced by this. | |
ID: 57415 | Rating: 0 | |
I didn't manually intervene with anything and got 3 beta tasks that ran successfully on my Win10 machine. They all finished successfully within a few minutes. Does that help? | |
ID: 57416 | Rating: 0 | |
That's reassuring - thanks. But I'll keep an eye on my Windows 7 machines when the project next ramps up. | |
ID: 57417 | Rating: 0 | |
"This is already fixed in acemdbeta and will be fixed as soon as we deploy the new app." No further explanation? How can we fix it? What is the problem? It is not only C++ (I have all the DLLs and C++ installed manually). Until June everything was working. Now it no longer does, without any change in my config. It seems the problem comes from the WU. | |
ID: 57542 | Rating: 0 | |
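
A note on the 0xc0000135 code from message 57319: the same status may also turn up as a negative decimal number rather than in hex. Below is a minimal Python sketch, assuming the code is treated as an unsigned 32-bit value, that converts between the two forms. The lookup table and the function names are illustrative only; its single entry is the code named in this thread.

```python
# Illustrative lookup table; the only entry is the code discussed in this thread.
KNOWN_NTSTATUS = {
    0xC0000135: "STATUS_DLL_NOT_FOUND",
}


def to_unsigned32(code: int) -> int:
    """Map a signed 32-bit exit code onto its unsigned representation."""
    return code & 0xFFFFFFFF


def describe_exit_code(code: int) -> str:
    """Return the code in hex, plus a name if it is in the table above."""
    unsigned = to_unsigned32(code)
    name = KNOWN_NTSTATUS.get(unsigned, "unknown")
    return f"0x{unsigned:08X} ({name})"


if __name__ == "__main__":
    # The signed decimal form of 0xC0000135
    print(describe_exit_code(-1073741515))  # 0xC0000135 (STATUS_DLL_NOT_FOUND)
```

The hex and negative decimal forms both resolve to the same entry, so either can be pasted in from a log.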
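
On inspecting the upload limit in client_state.xml (message 57322): here is a rough sketch of how the value could be read with Python's standard XML parser. It assumes each file entry has a `<name>` and a `<max_nbytes>` child. Those tag names, the sample XML and the file name in it are assumptions for illustration, so check them against your own client_state.xml. The sketch only reads the value and changes nothing.

```python
import xml.etree.ElementTree as ET

# Made-up sample for illustration; a real client_state.xml is far larger
# and its exact layout should be checked on your own machine.
SAMPLE = """<client_state>
  <file>
    <name>example_output_9</name>
    <max_nbytes>536870912.000000</max_nbytes>
  </file>
  <file>
    <name>example_input</name>
  </file>
</client_state>"""


def upload_limits(xml_text):
    """Return {file name: max_nbytes} for entries that carry both fields."""
    root = ET.fromstring(xml_text)
    limits = {}
    for elem in root.iter():
        name = elem.findtext("name")            # direct child only
        max_nbytes = elem.findtext("max_nbytes")
        if name is not None and max_nbytes is not None:
            limits[name.strip()] = float(max_nbytes)
    return limits


if __name__ == "__main__":
    for name, limit in upload_limits(SAMPLE).items():
        print(f"{name}: limit {limit / (1024 * 1024):.0f} MiB")
```

To try it on a real file, read its contents into a string and pass that to `upload_limits`. Stop the BOINC client first if you plan to edit the file by hand afterwards.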
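
On the runtime DLL question (messages 57353 to 57359): this small sketch lists which runtime files are absent from a given folder, for example a BOINC slot directory. Only vcruntime140 is named in this thread; the second file name in the default list and the function name are assumptions. Windows also searches system directories, so a file missing from the folder does not prove that loading will fail.

```python
from pathlib import Path

# vcruntime140 is named in this thread; msvcp140.dll is an assumed
# companion file, included only for illustration.
REQUIRED_DLLS = ("vcruntime140.dll", "msvcp140.dll")


def missing_dlls(directory, required=REQUIRED_DLLS):
    """Return the required file names not found in `directory`.

    The comparison ignores case, as Windows file names do.
    """
    present = {p.name.lower() for p in Path(directory).iterdir() if p.is_file()}
    return [name for name in required if name.lower() not in present]


if __name__ == "__main__":
    import sys

    # Pass the folder to check as the first argument; defaults to the current one.
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    print("Missing:", missing_dlls(folder))
```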