Message boards :
Number crunching :
Server sent unit to 3 hosts in a short time
mackerel Volunteer tester
Joined: 2 Oct 08 Posts: 2652 ID: 29980 Credit: 570,442,335 RAC: 5,621
http://www.primegrid.com/workunit.php?wuid=701428386
Only noticed because it was a prime. In under 2 minutes the server sent tasks out to 3 hosts. I thought 2 was the norm, so I'm just curious: why the 3rd? Given the short time span, and that all three completed normally, this is not related to a resend. I assume it has nothing to do with being a prime, as the server wouldn't have known that at the time.
I looked at a small number of other PPSE units and they were all sent twice, as expected.
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14043 ID: 53948 Credit: 481,266,047 RAC: 509,006
How many times has this question been asked now? It seems like Jim was answering this at least once a month while he was here...
It's one of those cases where a task had an error, but one of our "special sauce" processes reanimated the dead task when it saw a valid upload file. Once the task errored out, a replacement was sent. Then the dead task was fixed. That's how you get extra tasks.
scheduler.log:2021-02-03 14:29:03.9306 [PID=14567] [CRITICAL] [HOST#1054795] [RESULT#1179978526] [WU#701428386] changed CPID: marking in-progress result llrPPSE_351375590_0 as client error!
unabandoned.log:2021-02-03 15:23:07 Workunit 701428386, result llrPPSE_351375590_0, AVN 810 was abandoned but had an upload. Now fixed.
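For anyone who wants to trace this sequence in their own notes, here is a minimal sketch that pairs an "errored" line with a later "recovered" line. The regular expressions are inferred only from the two log lines quoted above, so treat them as assumptions; the real log formats may differ, and `find_recovered` is a hypothetical helper, not part of the PrimeGrid or BOINC server code.

```python
import re

# Patterns inferred from the two quoted log lines only (assumption).
ERROR_RE = re.compile(
    r"\[RESULT#(?P<result_id>\d+)\] \[WU#(?P<wuid>\d+)\] changed CPID: "
    r"marking in-progress result (?P<name>\S+) as client error!"
)
RECOVERED_RE = re.compile(
    r"Workunit (?P<wuid>\d+), result (?P<name>[^,\s]+), AVN \d+ "
    r"was abandoned but had an upload\. Now fixed\."
)

# The two lines exactly as quoted in the post.
SAMPLE = [
    "scheduler.log:2021-02-03 14:29:03.9306 [PID=14567] [CRITICAL] "
    "[HOST#1054795] [RESULT#1179978526] [WU#701428386] changed CPID: "
    "marking in-progress result llrPPSE_351375590_0 as client error!",
    "unabandoned.log:2021-02-03 15:23:07 Workunit 701428386, result "
    "llrPPSE_351375590_0, AVN 810 was abandoned but had an upload. Now fixed.",
]


def find_recovered(lines):
    """Return the (workunit id, result name) pairs that were marked as a
    client error and also reported as recovered."""
    errored = set()
    recovered = set()
    for line in lines:
        m = ERROR_RE.search(line)
        if m:
            errored.add((m["wuid"], m["name"]))
            continue
        m = RECOVERED_RE.search(line)
        if m:
            recovered.add((m["wuid"], m["name"]))
    return errored & recovered


print(find_recovered(SAMPLE))
```

Any pair returned here is a workunit where a replacement task was probably sent in between the two events, which is how a third task can appear.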
____________
My lucky number is 75898^524288+1
mackerel Volunteer tester
Joined: 2 Oct 08 Posts: 2652 ID: 29980 Credit: 570,442,335 RAC: 5,621
Thanks for the response. I guess that's an indication of its rarity, and it's no surprise I didn't see it before in the time I've been here. I would suggest some kind of FAQ, but that probably won't get read either.
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14043 ID: 53948 Credit: 481,266,047 RAC: 509,006
Wiki.
____________
My lucky number is 75898^524288+1
Congrats on the find!
I noticed this workunit on my DC list and was curious about the 3rd task as well.
I originally felt bad about taking the DC away from the other user (at least in the Discord #prime-discoveries channel), but after more investigation, it seems we were both given credit on our profiles and on the TdP double checker leaderboard.
To help future users searching for this information, the task output was:
<primegrid_recovery>
This result was abandoned and automatically recovered on server.
</primegrid_recovery>
<stderr_txt>
</stderr_txt>
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14043 ID: 53948 Credit: 481,266,047 RAC: 509,006
Additionally, the run time and cpu times are set to 44,444 seconds as an in-your-face indicator that something is different about that task.
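As a rough illustration of how one might flag such a task when reviewing one's own results, the sketch below checks for the `<primegrid_recovery>` block and for the 44,444-second sentinel described in this thread. The function name and its inputs are hypothetical, and it assumes both times are reported as exactly 44,444 seconds; it is not an official PrimeGrid tool.

```python
import re

# Marker block as it appears in the task output quoted in this thread.
RECOVERY_TAG_RE = re.compile(
    r"<primegrid_recovery>.*?</primegrid_recovery>", re.DOTALL
)
# Sentinel value mentioned in the post (assumed to be exact).
SENTINEL_SECONDS = 44444


def looks_recovered(task_output, run_time, cpu_time):
    """Return True if a task shows either sign of server-side recovery:
    the recovery marker in its output, or both times set to the sentinel."""
    has_marker = RECOVERY_TAG_RE.search(task_output) is not None
    has_sentinel = (
        run_time == SENTINEL_SECONDS and cpu_time == SENTINEL_SECONDS
    )
    return has_marker or has_sentinel


sample_output = """<primegrid_recovery>
This result was abandoned and automatically recovered on server.
</primegrid_recovery>
<stderr_txt>
</stderr_txt>"""

print(looks_recovered(sample_output, 44444, 44444))
print(looks_recovered("<stderr_txt>\n</stderr_txt>", 120.5, 118.0))
```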
____________
My lucky number is 75898^524288+1
WezH
Joined: 9 Jun 11 Posts: 129 ID: 101605 Credit: 921,182,348 RAC: 2,997,466
I originally felt bad taking the DC away from the other user (at least in the discord #prime-discoveries channel), but after more investigation, it seems we were both given credit on our profiles and TdP double checker leaderboard.
I feel bad for myself; I lost the prime due to this error....
http://www.primegrid.com/workunit.php?wuid=701428386
How many times has this question been asked now? It seems like Jim was answering this at least once a month while he was here...
I asked about it on Discord in December. It was Linode instances for me causing this, as well.
____________
Proud member of Team Aggie the Pew
"Wir müssen wissen. Wir werden wissen."
"We must know, we shall know."
- David Hilbert, 1930