Message boards : News : Probable access problems on 9th Dec
Author | Message |
---|---|
On 9th Dec we are moving the IP of gpugrid to another network. This means changing the DNS. While the DNS update is propagating around the world, you might find the server unreachable. | |
ID: 42274 | Rating: 0 | rate: / Reply Quote | |
Thanks for the notice! | |
ID: 42276 | Rating: 0 | rate: / Reply Quote | |
Thanks for the notice! +1 :) Thanks Gianni ! ____________ [CSF] Thomas H.V. Dupont Founder of the team CRUNCHERS SANS FRONTIERES 2.0 www.crunchersansfrontieres | |
ID: 42279 | Rating: 0 | rate: / Reply Quote | |
The transition is over, and everything should be working as before. | |
ID: 42347 | Rating: 0 | rate: / Reply Quote | |
Hi, | |
ID: 42349 | Rating: 0 | rate: / Reply Quote | |
Same thing here: 10/12/2015 06:45:22 | | Project communication failed: attempting access to reference site 10/12/2015 06:45:23 | | Internet access OK - project servers may be temporarily down. ____________ | |
ID: 42353 | Rating: 0 | rate: / Reply Quote | |
Same thing here. All is working now ... but zero WU! K. ____________ Dreams do not always come true. But not because they are too big or impossible. Why did we stop believing. (Martin Luther King) | |
ID: 42354 | Rating: 0 | rate: / Reply Quote | |
Yup, we need more WU. Short runs are totally forgotten... | |
ID: 42355 | Rating: 0 | rate: / Reply Quote | |
An alternative to GPUGRID is POEM, which also offers bio GPU work units. | |
ID: 42356 | Rating: 0 | rate: / Reply Quote | |
FYI: I'm still having network access problems: | |
ID: 42363 | Rating: 0 | rate: / Reply Quote | |
Zoltan, perhaps your host has not refreshed its DNS cache. A reboot would help in that case. | |
ID: 42364 | Rating: 0 | rate: / Reply Quote | |
Zoltan, perhaps your host has not refreshed its DNS cache. A reboot would help in that case. I've checked it with ipconfig /displaydns and it was OK. The address in the cache was 84.89.134.145; to make sure the cache refreshed itself, I ran ipconfig /flushdns to clear it. However, it seems that the problem has gone away in the meantime. There's nothing I can do about the extra workunits assigned to my hosts; they will be reassigned to another host after their deadline (5 days). | |
ID: 42373 | Rating: 0 | rate: / Reply Quote | |
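The cache check described above can also be scripted. The following is a minimal sketch, assuming the address quoted in the post (84.89.134.145) is still the one the project expects; the function names are illustrative and not part of BOINC.

```python
import socket

# Address quoted in the post above; treat it as an assumption, it may change again.
EXPECTED_IP = "84.89.134.145"


def resolve_ipv4(hostname):
    """Return the set of IPv4 addresses the local resolver reports for hostname."""
    infos = socket.getaddrinfo(hostname, None, family=socket.AF_INET)
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr is (ip, port).
    return {info[4][0] for info in infos}


def dns_looks_stale(resolved, expected=EXPECTED_IP):
    """True when none of the resolved addresses matches the expected one."""
    return expected not in resolved
```

Calling `dns_looks_stale(resolve_ipv4("www.gpugrid.org"))` on an affected host would show whether the resolver still hands out an old address. A lookup that fails outright raises `socket.gaierror`, which is a different problem from a stale cache.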
Hi, I'm in France and I only speak French. | |
ID: 42380 | Rating: 0 | rate: / Reply Quote | |
Then learn. | |
ID: 42381 | Rating: 0 | rate: / Reply Quote | |
FYI: I'm still having network access problems: I have been having the same problems since the change to the new network. But for me, it happens occasionally, and only during downloads (definitely more often than before), and it involves only one file of the WU being downloaded. To continue with the download, I can either press "Retry now" when the status is "Download retry in xx:xx:xx", or exit BOINC and run it again. Sometimes I have to do this more than once. But most of the time, the downloads go smoothly. I haven't lost a WU yet due to this problem. Knock on wood! | |
ID: 42384 | Rating: 0 | rate: / Reply Quote | |
I have been having the same problems since the change to the new network. But for me, it happens occasionally, and only during downloads (definitely more often than before), and it involves only one file of the WU being downloaded. To continue with the download, I can either press "Retry now" when the status is "Download retry in xx:xx:xx", or exit BOINC and run it again. Same behavior here as well! | |
ID: 42387 | Rating: 0 | rate: / Reply Quote | |
Same thing here. Still receiving this error, and I have flushed DNS and rebooted. Other projects are running normally. | |
ID: 42390 | Rating: 0 | rate: / Reply Quote | |
which domain name are you attaching to? | |
ID: 42400 | Rating: 0 | rate: / Reply Quote | |
which domain name are you attaching to? | |
ID: 42401 | Rating: 0 | rate: / Reply Quote | |
FYI: I'm still having network access problems: I just want to add that this is happening on my Windows XP computer only. The Windows 10 machine is downloading WUs with no problems so far. And sometimes more than one file gets stuck. | |
ID: 42414 | Rating: 0 | rate: / Reply Quote | |
I just want to add that this is happening on my Windows XP computer only. The Windows 10 machine is downloading WUs with no problems so far. I have another "ghost" task on one of my hosts. | |
ID: 42488 | Rating: 0 | rate: / Reply Quote | |
I too just noticed I had a task that timed out without a response on me. I went over the BOINC log and couldn't find its name. I did notice the following in the log for December 29th however (when the task was assigned to me):
29-Dec-2015 13:30:40 [GPUGRID] Sending scheduler request: To fetch work.
29-Dec-2015 13:30:40 [GPUGRID] Requesting new tasks for NVIDIA GPU
29-Dec-2015 13:35:47 [GPUGRID] Scheduler request failed: Timeout was reached
29-Dec-2015 13:35:47 [GPUGRID] Sending scheduler request: To fetch work.
29-Dec-2015 13:35:47 [GPUGRID] Requesting new tasks for NVIDIA GPU
29-Dec-2015 13:35:49 [GPUGRID] Scheduler request completed: got 0 new tasks
29-Dec-2015 13:35:49 [GPUGRID] No tasks sent
29-Dec-2015 13:35:49 [GPUGRID] No tasks are available for Long runs (8-12 hours on fastest card)
29-Dec-2015 13:35:49 [GPUGRID] Project has no tasks available
29-Dec-2015 13:35:51 [---] Project communication failed: attempting access to reference site
29-Dec-2015 13:35:52 [---] Internet access OK - project servers may be temporarily down.
So, it seems to me the request for new tasks did go through to the scheduler, but its response never reached my machine. I am also having the download / upload problems mentioned in this thread. Files eventually do get down / up, but with several retries. This is definitely a network problem on the GPUGRID side of the network - maybe a router close to the project servers has not had its DNS and / or routing tables refreshed? I am wondering how this issue with phantom WU assignments is affecting WU availability and the overall computation progress, especially in this WU season of drought. Just imagine hosts requesting tasks, getting them without knowing it, and after some minutes requesting again. This issue does not need to happen many times to many users to make many tasks disappear... ____________ | |
ID: 42558 | Rating: 0 | rate: / Reply Quote | |
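The pattern in the log above (a work request followed by a scheduler timeout) can be searched for automatically. Here is a rough sketch, assuming log lines in exactly the `DD-Mon-YYYY HH:MM:SS [PROJECT] message` layout quoted in the post; other BOINC log layouts would need a different pattern, and `suspect_requests` is a made-up helper name. It cannot prove that a task was assigned, it only lists requests whose reply never arrived.

```python
import re

# Matches e.g. "29-Dec-2015 13:30:40 [GPUGRID] Requesting new tasks for NVIDIA GPU"
LINE_RE = re.compile(r"^(\d{2}-\w{3}-\d{4} \d{2}:\d{2}:\d{2}) \[(.+?)\] (.*)$")

SAMPLE_LOG = [
    "29-Dec-2015 13:30:40 [GPUGRID] Sending scheduler request: To fetch work.",
    "29-Dec-2015 13:30:40 [GPUGRID] Requesting new tasks for NVIDIA GPU",
    "29-Dec-2015 13:35:47 [GPUGRID] Scheduler request failed: Timeout was reached",
    "29-Dec-2015 13:35:47 [GPUGRID] Sending scheduler request: To fetch work.",
    "29-Dec-2015 13:35:47 [GPUGRID] Requesting new tasks for NVIDIA GPU",
    "29-Dec-2015 13:35:49 [GPUGRID] Scheduler request completed: got 0 new tasks",
    "29-Dec-2015 13:35:51 [---] Project communication failed: attempting access to reference site",
]


def suspect_requests(lines, project="GPUGRID"):
    """List (request_time, failure_time, message) for work requests that failed.

    A work request whose reply was lost is a candidate for a "phantom" task:
    the server may have assigned work that the client never heard about.
    """
    suspects = []
    pending = None  # timestamp of a work request still waiting for a reply
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match or match.group(2) != project:
            continue
        stamp, _, message = match.groups()
        if message.startswith("Requesting new tasks"):
            pending = stamp
        elif message.startswith("Scheduler request failed") and pending:
            suspects.append((pending, stamp, message))
            pending = None
        elif message.startswith("Scheduler request completed"):
            pending = None
    return suspects
```

Running `suspect_requests` over a saved log gives the times to compare against the task list on the project site.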
Still getting these errors on most of my downloads. Eventually they come through. Thu 07 Jan 2016 02:42:14 PM CST | GPUGRID | Temporarily failed download of e20s36_e16s2p1f382-GERARD_CXCL12_DIMPROTO1-0-pdb_file: transient HTTP error | |
ID: 42571 | Rating: 0 | rate: / Reply Quote | |
Now that you mention it, I am too. There are several earlier entries, this is just the most recent. I never paid any attention to it before. Whether it is a big problem or not I have no idea. | |
ID: 42572 | Rating: 0 | rate: / Reply Quote | |
Still getting these errors on most of my downloads. Eventually they come through. Same here. After the initial download times out I hit "retry" from BoincTasks transfers tab and the download resumes and finishes. Been doing this for a couple of weeks. | |
ID: 42578 | Rating: 0 | rate: / Reply Quote | |
I have lost 2 WUs while downloading on my Windows XP machine: | |
ID: 42603 | Rating: 0 | rate: / Reply Quote | |
I would like to report that I still have occasional download problems to this day (individual files get stuck). This was not a concern when there were not many WUs around, but now that the pipeline is full, it is quite annoying. | |
ID: 42776 | Rating: 0 | rate: / Reply Quote | |
I would like to report that I still have occasional download problems to this day (individual files get stuck). This was not a concern when there were not many WUs around, but now that the pipeline is full, it is quite annoying. Same here. I've asked about it several times but never got a reply. Hours wasted that could be used crunching. | |
ID: 42781 | Rating: 0 | rate: / Reply Quote | |
Yes, me too. A stuck file usually downloads after a few hours, before the running task is finished. However, twice recently a file has been stuck for over 4 hours, and this left the GPU idle for a few hours. I hate that. The crunching computer is running, using electricity and belching fire into our skies, and no work is being done. | |
ID: 42793 | Rating: 0 | rate: / Reply Quote | |
Still having the same download issue. Recently brought my XP machine back to crunch here. It has the same issue. Downloads get stuck for hours. This is the only project of the 7 I'm currently running that does this, and since it's on 2 different machines/OSs, the problem is not on my end. PLEASE FIX THIS! | |
ID: 42808 | Rating: 0 | rate: / Reply Quote | |
I can confirm this; it is not on my end! The problem arose when the project changed the network. It seems to me that the new network cannot cope with the volume of data transferred from the server to the user and vice versa. | |
ID: 42811 | Rating: 0 | rate: / Reply Quote | |
I routinely see delays of 10 to 20 minutes or so on downloads and a few uploads. I see it on both wired and wireless connections. It is annoying when I am running my GTX 750 Tis and am trying to make the 24 hour limit. | |
ID: 42822 | Rating: 0 | rate: / Reply Quote | |
I've forwarded your complaints to our IT service. Indeed delays in download/upload could be caused by the new network. I'll keep you updated! | |
ID: 42825 | Rating: 0 | rate: / Reply Quote | |
I've forwarded your complaints to our IT service. Indeed delays in download/upload could be caused by the new network. I'll keep you updated! Thanks. | |
ID: 42827 | Rating: 0 | rate: / Reply Quote | |
FYI: I'm still having network access problems: The WU file downloading problem is now happening occasionally on both my Windows XP and Windows 10 computers. See log:
2/21/2016 6:15:03 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-par_file
2/21/2016 6:15:03 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-conf_file_enc
2/21/2016 6:15:04 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-conf_file_enc
2/21/2016 6:15:04 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-metainp_file
2/21/2016 6:15:05 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-metainp_file
2/21/2016 6:15:05 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-hills_file
2/21/2016 6:15:06 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-hills_file
2/21/2016 6:15:06 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-xsc_file
2/21/2016 6:15:07 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-xsc_file
2/21/2016 6:15:07 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-prmtop_file
2/21/2016 6:15:08 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-prmtop_file
2/21/2016 6:20:02 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:20:02 PM | GPUGRID | Backing off 00:02:40 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:20:03 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:20:04 PM | | Internet access OK - project servers may be temporarily down.
2/21/2016 6:22:42 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:27:55 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:27:55 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:27:55 PM | GPUGRID | Backing off 00:04:30 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:27:56 PM | | Internet access OK - project servers may be temporarily down.
2/21/2016 6:28:51 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:06 PM | GPUGRID | Temporarily failed download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file: transient HTTP error
2/21/2016 6:29:06 PM | GPUGRID | Backing off 00:13:42 on download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:07 PM | | Project communication failed: attempting access to reference site
2/21/2016 6:29:08 PM | | BOINC can't access Internet - check network connection or proxy configuration.
2/21/2016 6:29:18 PM | GPUGRID | Started download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
2/21/2016 6:29:35 PM | GPUGRID | Finished download of e18s17_e11s8p1f343-GERARD_CXCL12_CHLCPUBCHEM_chalcone121-0-psf_file
Another trick to get the download to restart is to disconnect and reconnect the network connection, and then, in the BOINC Manager under the Transfers tab, press the "Retry now" button with the stalled file highlighted. Of course, you can wait for it to restart on its own; this merely speeds up the download. This was not happening with such frequency before the network upgrade. | |
ID: 42829 | Rating: 0 | rate: / Reply Quote | |
7 tries and 5 1/2 hours wasted trying to download 1 file, because every time the download fails the wait period for the next try gets longer. This is ridiculous. I think it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me. | |
ID: 42831 | Rating: 0 | rate: / Reply Quote | |
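The growing wait described above (and the "Backing off 00:02:40", "00:04:30", "00:13:42" entries in the earlier log) is what a randomized exponential backoff looks like from the outside. The sketch below shows the general technique only; it is not BOINC's actual code, and the base delay and cap are assumed values chosen for illustration.

```python
import random


def backoff_ceiling(failures, base=60.0, cap=4 * 3600.0):
    """Upper bound in seconds for the wait after `failures` consecutive failures."""
    return min(cap, base * 2 ** failures)


def backoff_delay(failures, base=60.0, cap=4 * 3600.0, rng=random):
    """Pick a random wait between half the ceiling and the ceiling.

    The randomness keeps many clients from retrying at the same instant.
    """
    ceiling = backoff_ceiling(failures, base, cap)
    return rng.uniform(ceiling / 2, ceiling)


def worst_case_total(tries, base=60.0, cap=4 * 3600.0):
    """Longest possible total wait across `tries` consecutive failed attempts."""
    return sum(backoff_ceiling(n, base, cap) for n in range(1, tries + 1))
```

With these assumed parameters, `worst_case_total(7)` comes to 15240 seconds, a little over four hours, which shows how a handful of failures on one file can idle a GPU for most of an afternoon. Pressing "Retry now" simply skips the remaining wait.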
7 tries and 5 1/2 hours wasted trying to download 1 file, because every time the download fails the wait period for the next try gets longer. This is ridiculous. I think it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me. Our complaints were forwarded to the IT service a week ago; however, this problem has existed since the changes in the network. I guess it's a misconfigured routing table (or more than one), which is quite hard to spot, especially when not all traffic is affected by it. A spare project could help to reduce the idle GPU time; when the network issues are fixed at GPUGrid's campus, your host will automatically stop downloading from the other (spare, 0 resource share) project. | |
ID: 42832 | Rating: 0 | rate: / Reply Quote | |
7 tries and 5 1/2 hours wasted trying to download 1 file, because every time the download fails the wait period for the next try gets longer. This is ridiculous. I think it's time to move somewhere else until this issue is resolved. 2+ months is long enough for me. Our complaints were forwarded to the IT service a week ago; however, this problem has existed since the changes in the network. I guess it's a misconfigured routing table (or more than one), which is quite hard to spot, especially when not all traffic is affected by it. A spare project could help to reduce the idle GPU time; when the network issues are fixed at GPUGrid's campus, your host will automatically stop downloading from the other (spare, 0 resource share) project. Something else blew up last night. I awoke this morning to find 4 tasks ready to report and no new tasks running. That had to be at least 12+ dead hours of no crunching. The projects tab showed the next update would not be for 12 more hours. After doing a manual update the 4 tasks reported and new tasks were requested. Below is a partial copy of the messages:
672710 GPUGRID 2/24/2016 7:24:35 AM update requested by user
672711 GPUGRID 2/24/2016 7:24:40 AM Fetching scheduler list
672712 GPUGRID 2/24/2016 7:24:43 AM Master file download succeeded
672713 GPUGRID 2/24/2016 7:24:48 AM Sending scheduler request: Requested by user.
672714 GPUGRID 2/24/2016 7:24:48 AM Reporting 4 completed tasks
672715 GPUGRID 2/24/2016 7:24:48 AM Requesting new tasks for NVIDIA GPU
672716 GPUGRID 2/24/2016 7:24:50 AM Scheduler request completed: got 1 new tasks
What would cause the master file to be needed again? I'm assuming that was a/the reason for the 12 hour delay. New tasks were received but there are 7 files stuck again. *bangs head on desk* Also, I don't think using a 0 share standby will work, because if I remember correctly BOINC will not allow new tasks from another project to download if it detects stuck downloads from the higher priority project.
FWIW if the current IT service can't get this resolved after 2 months maybe GPUGrid might consider switching to another service provider. | |
ID: 42834 | Rating: 0 | rate: / Reply Quote | |
What would cause the master file to be needed again? Ten consecutive failures to contact the scheduler. Note that's the request work/report work contact attempt, not the file download attempts this thread has mainly been about. Check the full log in stdoutdae.txt - see when the problem started/ended. Unless you've suppressed it, BOINC will try to contact a 'neutral' web host (google.com) after each failure: if google is OK but gpugrid fails, then the project server is the suspect. But if google fails as well, then your own network connection and ISP may be at fault. To test a little theory of mine - what OS is having these problems? Linux, Windows, OS X? Or all three? I'm Windows, and I see the downloads stalling sometimes - but the work is usually fully downloaded by the time I need it. [Edit - OK, we don't support OS X here. Forget that one.] | |
ID: 42835 | Rating: 0 | rate: / Reply Quote | |
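The reasoning in the reply above can be written down as two small checks. This is only a sketch of the logic as described in the post: the threshold of ten comes from the post, while the function names and the way results are recorded are assumptions, not BOINC internals.

```python
# Number of consecutive scheduler failures after which the master file is
# fetched again, as stated in the post above.
MASTER_REFETCH_THRESHOLD = 10


def needs_master_refetch(results, threshold=MASTER_REFETCH_THRESHOLD):
    """results: scheduler contact outcomes, oldest first, True meaning success.

    Only the most recent unbroken run of failures counts; any success resets it.
    """
    consecutive = 0
    for ok in results:
        consecutive = 0 if ok else consecutive + 1
    return consecutive >= threshold


def diagnose(project_ok, reference_ok):
    """Interpret a project contact attempt together with the reference-site check."""
    if project_ok:
        return "ok"
    if reference_ok:
        # Reference site answers but the project does not.
        return "project server suspected"
    # Neither answers: the fault is more likely on the volunteer's side.
    return "local network or ISP suspected"
```

For example, `diagnose(False, True)` corresponds to the "Internet access OK - project servers may be temporarily down" message seen throughout this thread.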
To test a little theory of mine - what OS is having these problems? Linux, Windows, OS X? Or all three? I'm Windows, and I see the downloads stalling sometimes - but the work is usually fully downloaded by the time I need it. I am on Win7 64-bit. It is not an actual operational problem for me at the moment. For the past four days, even with one or two backoffs, I get the downloads in less than 20 minutes, and usually about 10 minutes. With a little overlap (buffer setting of 0.01 + 0.01 days), it is working OK for my GTX 960, though the problem to some degree is still there. | |
ID: 42836 | Rating: 0 | rate: / Reply Quote | |
Both my XP and Win7 boxes are having the issue. Win7 is the most problematic because I'm running dual cards with 2 tasks each. Because of the 2 task per GPU limit I have no buffer to cover the stuck downloads. A 3 tasks per card limit would probably eliminate the issue but I doubt that will happen. | |
ID: 42837 | Rating: 0 | rate: / Reply Quote | |
To test a little theory of mine - what OS is having these problems? Windows 7 Pro here. It's a minor problem, usually, however I am not crunching at the scale some are. ____________ Team USA forum | Team USA page Join us and #crunchforcures. We are now also folding:join team ID 236370! | |
ID: 42838 | Rating: 0 | rate: / Reply Quote | |
Hi nanoprobe and the gpugrid community, | |
ID: 42856 | Rating: 0 | rate: / Reply Quote | |
I just noticed another phantom task that was assigned to me, but my BOINC client never got the scheduler's response:
02-Mar-2016 14:35:22 [GPUGRID] Sending scheduler request: Requested by project.
02-Mar-2016 14:35:22 [GPUGRID] Requesting new tasks for NVIDIA GPU
02-Mar-2016 14:40:27 [GPUGRID] Scheduler request failed: Timeout was reached
02-Mar-2016 14:40:27 [GPUGRID] Sending scheduler request: Requested by project.
02-Mar-2016 14:40:27 [GPUGRID] Requesting new tasks for NVIDIA GPU
02-Mar-2016 14:40:29 [GPUGRID] Scheduler request completed: got 1 new tasks
After the timeout, my BOINC client merrily requested once more for new tasks... I wish there was a way to cancel tasks using the project's site, e.g. a Cancel button on the task list. ____________ | |
ID: 42876 | Rating: 0 | rate: / Reply Quote | |
Intrigued by this post by Bjarke I decided to do some trace-routing for GPUGRID and my other projects (WCG and POEM). Here's the output from tracert, having appended the geographic location of each hop using http://www.ipligence.com/geolocation:
C:\Users\vagelis>tracert www.gpugrid.org
Tracing route to www.gpugrid.org [84.89.134.145] over a maximum of 30 hops:
## Skipping trace of my own ISP ##
8 77 ms 75 ms 76 ms nl-sar.nordu.net [80.249.209.203] -- NETHERLANDS
9 80 ms 80 ms 98 ms uk-hex.nordu.net [109.105.102.97] -- SWEDEN
10 96 ms 105 ms 95 ms ndn-gw.mx1.lon.uk.geant.net [109.105.102.98] -- SWEDEN
11 85 ms 98 ms 86 ms ae0.mx1.par.fr.geant.net [62.40.98.77] -- UK
12 81 ms 81 ms 81 ms 83.97.88.129 -- UK
13 104 ms * 105 ms 83.97.88.130 -- UK
14 139 ms 139 ms 120 ms TELMAD.AE4.uv.rt1.val.red.rediris.es [130.206.245.89] -- SPAIN - MADRID
15 118 ms 120 ms 121 ms anella-val1-router.red.rediris.es [130.206.211.70] -- SPAIN - MADRID
16 * * * Request timed out.
17 126 ms 126 ms 126 ms grosso.upf.edu [84.89.134.145] -- SPAIN - BARCELONA
18 216 ms 126 ms 126 ms grosso.upf.edu [84.89.134.145]
19 118 ms 117 ms 120 ms grosso.upf.edu [84.89.134.145]
Trace complete.
Note that I took traces from two locations using different ISPs to determine the entry point to GPUGRID's ISP network. The trace above is the common part. Comparing GPUGRID's route trace to my other projects, it is evident that there's a lot of hopping around across Europe: Netherlands to Sweden to the UK to finally reach Spain. In contrast, WCG's trace shows a hop in the UK and then it goes to the USA. POEM's again has a hop in the UK and then goes to Germany. Now, I'm not saying that hopping across Europe is a bad thing, even for an IP packet :D, but more hops does mean more points that can cause network problems. It would be interesting to have a route trace from before the GPUGRID ISP switch to compare... ____________ | |
ID: 42883 | Rating: 0 | rate: / Reply Quote | |
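To compare traces like the one above across several runs, the hop lines can be parsed rather than read by eye. The following is a small sketch, assuming the Windows `tracert` line layout shown in the post (hop number, three probe results, then the host); `parse_hop` and `lossy_hops` are illustrative names.

```python
import re

# A probe is either a time such as "77 ms" / "<1 ms" or "*" for no reply.
PROBE = r"(<?\d+ ms|\*)"
HOP_RE = re.compile(rf"^\s*(\d+)\s+{PROBE}\s+{PROBE}\s+{PROBE}\s+(.*)$")

SAMPLE_TRACE = [
    "Tracing route to www.gpugrid.org [84.89.134.145] over a maximum of 30 hops:",
    "12 81 ms 81 ms 81 ms 83.97.88.129 -- UK",
    "13 104 ms * 105 ms 83.97.88.130 -- UK",
    "16 * * * Request timed out.",
    "17 126 ms 126 ms 126 ms grosso.upf.edu [84.89.134.145] -- SPAIN - BARCELONA",
    "Trace complete.",
]


def parse_hop(line):
    """Return a dict describing one hop line, or None for non-hop lines."""
    match = HOP_RE.match(line)
    if not match:
        return None
    probes = match.group(2, 3, 4)
    times = [int(p.lstrip("<").split()[0]) for p in probes if p != "*"]
    return {
        "hop": int(match.group(1)),
        "times_ms": times,
        "lost": 3 - len(times),  # probes that got no answer
        "target": match.group(5).strip(),
    }


def lossy_hops(lines):
    """Hop numbers where at least one of the three probes went unanswered."""
    hops = (parse_hop(line) for line in lines)
    return [h["hop"] for h in hops if h and h["lost"] > 0]
```

An unanswered probe at an intermediate hop does not by itself prove a fault, since some routers simply deprioritize or ignore these probes. Loss that shows up at the same hop in every trace is the more interesting signal.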
A tracert from eastern Pennsylvania seems simple enough. My guess is that it is a local problem near UPF.
| |
ID: 42885 | Rating: 0 | rate: / Reply Quote | |
I wonder if it is possible for BOINC, or some ancillary program, to do a tracert on a file as it is being downloaded? That would be more useful in finding the sticking points than doing a tracert after the fact, when conditions have changed. | |
ID: 42890 | Rating: 0 | rate: / Reply Quote | |
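One way to approximate the idea above outside BOINC is a helper script that runs a route trace at the moment a download is seen to fail. A trace targets a host rather than a file, so the sketch below first extracts the host from the download URL. This is an assumption-laden illustration: BOINC offers no such hook that I know of, the function names are made up, and the script would have to be triggered by hand or by something watching the log.

```python
import platform
import subprocess
from urllib.parse import urlsplit


def host_of(url):
    """Extract the host name from a download URL."""
    return urlsplit(url).hostname


def build_trace_command(host, system=None):
    """Choose the route-tracing command for the current (or given) OS.

    Both commands are asked to skip reverse DNS lookups to keep the trace quick.
    """
    system = system or platform.system()
    if system == "Windows":
        return ["tracert", "-d", host]
    return ["traceroute", "-n", host]


def trace_now(url, timeout=120):
    """Run a trace toward the host serving `url` and return its text output."""
    command = build_trace_command(host_of(url))
    result = subprocess.run(command, capture_output=True, text=True, timeout=timeout)
    return result.stdout
```

Saving the output of `trace_now` with a timestamp each time a transfer stalls would give the "during the problem" traces the post asks for, instead of one taken after conditions have changed.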
Did I miss something? | |
ID: 43002 | Rating: 0 | rate: / Reply Quote | |
Dear all, | |
ID: 43060 | Rating: 0 | rate: / Reply Quote | |
Your 32 bit linux is a problem. There are no 32 bit linux apps listed on https://www.gpugrid.net/apps.php | |
ID: 43067 | Rating: 0 | rate: / Reply Quote | |
fractal spotted it; 3.19.0-32-generic Supported OS
Windows 32/64-bit
| |
ID: 43068 | Rating: 0 | rate: / Reply Quote | |