Advanced search

Message boards : News : Project servers up again

Author Message
Profile MJH
Project administrator
Project developer
Project scientist
Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 32212 - Posted: 24 Aug 2013 | 11:45:14 UTC

Problem with the database. Should be back online soon!

MJH

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32213 - Posted: 24 Aug 2013 | 11:49:15 UTC - in response to Message 32212.
Last modified: 24 Aug 2013 | 11:49:32 UTC

Shame - all 4 tasks aborted by project - 2 were 70%+ complete.

Oh well - need to get the DB issues sorted out.

Profile nate
Send message
Joined: 6 Jun 11
Posts: 124
Credit: 2,928,865
RAC: 0
Level
Ala
Scientific publications
watwatwatwatwat
Message 32214 - Posted: 24 Aug 2013 | 12:04:21 UTC

Yea, we're sorry guys, especially to people who had WUs cancelled. We're working on it.

Profile MJH
Project administrator
Project developer
Project scientist
Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 32215 - Posted: 24 Aug 2013 | 12:20:23 UTC - in response to Message 32214.
Last modified: 24 Aug 2013 | 12:20:38 UTC

Server is back up now. Sincere apologies to anyone who gets a long-running WU cancelled as a result.

MJH

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 6,169
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32217 - Posted: 24 Aug 2013 | 13:22:25 UTC - in response to Message 32215.

Server is back up now. Sincere apologies to anyone who gets a long-running WU cancelled as a result.

MJH

There was another short server outage.
The GPUGrid_file_deleter is still stopped.

Profile MJH
Project administrator
Project developer
Project scientist
Send message
Joined: 12 Nov 07
Posts: 696
Credit: 27,266,655
RAC: 0
Level
Val
Scientific publications
watwat
Message 32218 - Posted: 24 Aug 2013 | 13:28:34 UTC - in response to Message 32217.

The GPUGrid_file_deleter is still stopped.


Yes, that's deliberate.

MJH

Profile dskagcommunity
Avatar
Send message
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32229 - Posted: 24 Aug 2013 | 17:34:30 UTC - in response to Message 32213.

Shame - all 4 tasks aborted by project - 2 were 70%+ complete.

Oh well - need to get the DB issues sorted out.


I feel with ya, mine two that hurting where 80%+ too :/
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 32233 - Posted: 24 Aug 2013 | 19:08:21 UTC - in response to Message 32229.

see discussion on server forum.

Sorry about that.

gdf

FoldingNator
Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 32268 - Posted: 25 Aug 2013 | 15:44:14 UTC - in response to Message 32233.
Last modified: 25 Aug 2013 | 15:45:01 UTC

Wtf is happening over here?

By all my done WU it says canceled/aborted. I've seen this topic before and at that time I guessed only 2 WU's were aborted. But now I mean, its not 1 WU, all of the 38 done WU's long runs. -_-

Is this a misunderstanding from the server or what's happening? Do I need it all over again? I hope not. -_-'

Profile Retvari Zoltan
Avatar
Send message
Joined: 20 Jan 09
Posts: 2343
Credit: 16,201,255,749
RAC: 6,169
Level
Trp
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32270 - Posted: 25 Aug 2013 | 16:04:28 UTC - in response to Message 32268.
Last modified: 25 Aug 2013 | 16:04:48 UTC

Wtf is happening over here?

By all my done WU it says canceled/aborted. I've seen this topic before and at that time I guessed only 2 WU's were aborted. But now I mean, its not 1 WU, all of the 38 done WU's long runs. -_-

Is this a misunderstanding from the server or what's happening? Do I need it all over again? I hope not. -_-'

I can see only one aborted WU on your host's tasklist.

FoldingNator
Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 32283 - Posted: 26 Aug 2013 | 0:29:22 UTC - in response to Message 32270.
Last modified: 26 Aug 2013 | 0:31:38 UTC

Yes indeed, that's only @ the taskslist. But when you open a task you can see this sort of text "WU aborted", in Dutch "WU afgebroken":
http://prntscr.com/1nfi3j
http://prntscr.com/1nfif4
http://prntscr.com/1nfigd
http://prntscr.com/1nfiif
http://prntscr.com/1nfil4
http://prntscr.com/1nfine
http://prntscr.com/1nfip3
and more of that...

My host is 156899. The tasks are completed and I've get the credits of it, but it says the tasks are aborted. :?

Profile dskagcommunity
Avatar
Send message
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32284 - Posted: 26 Aug 2013 | 6:40:50 UTC - in response to Message 32218.

The GPUGrid_file_deleter is still stopped.


Yes, that's deliberate.

MJH


I think it is deliberTe too to dry up the queues and sort then things out and start the deleter then again?
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32285 - Posted: 26 Aug 2013 | 8:47:32 UTC - in response to Message 32283.
Last modified: 26 Aug 2013 | 8:47:41 UTC

Aborting a WU after it's completed doesn't sound too smart - if that's what happened. Anyway, such things are unplanned mistakes.
I only had one beta in progress (and only 32sec in) when it was Project Aborted,
- Exit status 202 (0xca) EXIT_ABORTED_BY_PROJECT
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

FoldingNator
Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 32302 - Posted: 26 Aug 2013 | 18:32:47 UTC - in response to Message 32285.
Last modified: 26 Aug 2013 | 18:46:21 UTC

Haha but I dont have nothing aborted manually! I'm not that stupid... haha lol. :P ;)

I saw only a status change of all my done and 100% completed tasks around the start of this thread. Often I'm watching my done tasks after a week or so, so last week I saw this strange status, what's very odd in my opinion.

It's only about the WU's who I've completed before wednesday 11.45UTC. So after the servers are up and running again all my WU's are okay, before 11.45UTC they say that weird status.

As an example:
Tasks from computer 116314 from Retvari Zoltan has also the same weird status: http://prntscr.com/1nka0a (I get just a task before 24 augustus 11.45UTC)

TJ
Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32309 - Posted: 26 Aug 2013 | 22:44:00 UTC - in response to Message 32302.

There is nothing weird about it, but you have to read the forum first before to understand way these WU's where cancelled.
MJH cancelled a lot of WU's as they where no longer of use to him for optimizing the app, which he did great by the way.
And GDF did apologize for the lose of many WU's in progress. As a result we will get an app with a very low error rate. They also worked over the weekend so a more friendly tone to the admins would be appropriate.
____________
Greetings from TJ

FoldingNator
Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 32318 - Posted: 27 Aug 2013 | 12:58:34 UTC
Last modified: 27 Aug 2013 | 13:14:31 UTC

What exactly wasn't friendly at my message? I asked only a question in combination with a conclusion what I saw in my tasklist. I don't know what's wrong with that. I never meant it disrespectful, if someone has interpreted it that way. ;-)

Anyway, the weird status isn't only at the MJHARVEY runs, also at the SDOERR, SANTI and NATHAN runs. Just at all short, long and beta runs before 24 august 11.45 UTC. I'm understanding the cancelling status of MJHARVEY (ACEMD beta) runs, that's out of question, but what about the status for NATHAN, SANTI and SDOERR?

I think you (TJ) did understand my message wrong. These WU where I'm talking about aren't cancelled at progress-time (that's what GDF says), not manually by myself, but after we've (all of us I think) upload it.

It isn't like what GDF says over here (only in my opinion, but my English is bad... Google translate couldn't help me either out... :P ): http://www.gpugrid.net/forum_thread.php?id=3448&nowrap=true#32232

Most of the WU's where I'm talking about aren't previously canceled by someone else, for what I can see... or does he meant something else? Help me out, you may also send me a DM/PM if that's easier. :)

TJ
Send message
Joined: 26 Jun 09
Posts: 815
Credit: 1,470,385,294
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32319 - Posted: 27 Aug 2013 | 14:04:56 UTC - in response to Message 32318.

This is what GDF state in his post:

We are only canceling results unsent but all results associated to a cancelled WU get cancelled

And this means all associated WU´s to that particular job got cancelled (but was not meant to be so). How these are associated is something they only know.

Well you started your post with this:
Wtf is happening over here?
That is not friendly, especially as they are working for about the last ten days to improve the app. Until late at night and in the weekend.

Anyhow the "weird status" is past, so happy crunching ;-)

____________
Greetings from TJ

FoldingNator
Send message
Joined: 1 Dec 12
Posts: 24
Credit: 60,122,950
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwat
Message 32331 - Posted: 27 Aug 2013 | 19:27:52 UTC
Last modified: 27 Aug 2013 | 19:31:57 UTC


Well you started your post with this:
Wtf is happening over here?
That is not friendly, especially as they are working for about the last ten days to improve the app. Until late at night and in the weekend.

That was only a reaction of my own, about the thing I saw. Nothing specific for the crew. Don't you say ever "WTF" when you are shocked about something what's happened at the moment? lol :P :?

Do I have say the next time "What a shame" or something like what Localizer said? Haha exact the same kind of reaction in my opninion, only he has more credit and maybe more respect from you(?). I can't smell or think from the Netherlands what's happening in Spain, what the crew is doing or not, or they're working in the weekend. I don't search every topic in detail and I found only this one with relatively little information. So when I'm scared out and in generally I'm yelling "wtf".. I guess it isn't a very unkown word nowadays.

But enough of this story. Like I said in my reaction above: it wasn't disrespectful meant anyway. That's not how I am.


Anyhow the "weird status" is past, so happy crunching ;-)

Hm okay, thank you for your explanation. :)

Profile skgiven
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 23 Apr 09
Posts: 3968
Credit: 1,995,359,260
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32332 - Posted: 27 Aug 2013 | 20:01:25 UTC - in response to Message 32331.

Enough of that!
____________
FAQ's

HOW TO:
- Opt out of Beta Tests
- Ask for Help

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 32404 - Posted: 28 Aug 2013 | 19:01:42 UTC - in response to Message 32331.

But enough of this story. Like I said in my reaction above: it wasn't disrespectful meant anyway. That's not how I am.

I don't think any offense was taken, let's leave it at that!

MrS
____________
Scanning for our furry friends since Jan 2002

Message boards : News : Project servers up again

//