PrimeGrid
Please visit donation page to help the project cover running costs for this month

Toggle Menu

Join PrimeGrid

Returning Participants

Community

Leader Boards

Results

Other

drummers-lowrise

Advanced search

Message boards : Project Staging Area : Mystery Destination

Author Message
ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35111 - Posted: 10 Apr 2011 | 16:21:38 UTC

I am running in the 5oB check. Get a wierd one every now and then, sort of twice a day.

Usually amongst all the echos in the dos box there is:
""Hi welcome to PrimeGrid prpserver 5oB DC server"

If I see that, fine all's well.

A few times I have not seen it, yet the server at port 13000 is alledgedly giving me work - certainly the WU name indictes its a 5oB WU. When that occurs (no welcome message) despite all else being the same as a "good" running WU - it will run full term, but then fail when sent back to the server.

As the WUs get longer, it becomes a real pain -eg crunch for 4-8hrs, then fall over :)

At present when I spot a cmd box without the welcome message (or I see a WU missing from the running list) I go to the relevant directory at my end and delete the data files for it, then restart using "prpclient". If I dont delete the data files, it just resumes - wherever its resuming from ...)

The latter gets me going, and thats fine. However I am concious of deleting data files, and somewhere in PG Land a server has a zapped WU on its hands, almost "phantom" WU as I've hit the data files. Not much else I can do if I am to keep the WU running at 5oB, but somewhere a spurious WU is spinning round as a result (presumably).

Sorry a bit rambling, usual story wordy wordy to explain what is simple visually.

No idea why it occurs, no idea what I am zapping (its the later that concerns me). In the great scheme of things its been no big deal, but increasingly it is, because if not spotted (usually by the fact that the running WU count in the stats table drops one) then up to 8 hours chrunching is wasted.

Regards
Zy

gomeyerProject donor
Send message
Joined: 26 Oct 08
Posts: 80
ID: 30918
Credit: 358,409,613
RAC: 0
321 LLR Ruby: Earned 2,000,000 credits (2,006,649)Cullen LLR Amethyst: Earned 1,000,000 credits (1,049,607)ESP LLR Bronze: Earned 10,000 credits (75,053)PPS LLR Ruby: Earned 2,000,000 credits (2,027,453)PSP LLR Amethyst: Earned 1,000,000 credits (1,035,190)SoB LLR Amethyst: Earned 1,000,000 credits (1,043,750)SGS LLR Amethyst: Earned 1,000,000 credits (1,012,055)TRP LLR Amethyst: Earned 1,000,000 credits (1,049,083)Woodall LLR Amethyst: Earned 1,000,000 credits (1,003,614)321 Sieve Amethyst: Earned 1,000,000 credits (1,007,951)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,254,568)PPS Sieve Double Silver: Earned 200,000,000 credits (325,765,324)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,624,626)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,449,779)PSA Ruby: Earned 2,000,000 credits (2,004,884)
Message 35117 - Posted: 10 Apr 2011 | 18:54:33 UTC
Last modified: 10 Apr 2011 | 18:58:22 UTC

I can confirm this same behaviour.

I noticed the following dialog (in red) this morning on one of my hosts. Even though the server had returned "No available candidates are left . . " the core was indeed running a workunit. Upon checking http://pgllr.mine.nu:13000/all.html it was apparent that this machine was crunching a task that the server did not know it had downloaded.

Since the task was up to 85% I allowed it to complete which generated the message in blue.

The following is from the prpclient.log in that folder.

[2011-04-09 01:56:27 EDT] 5OB: 1*2^2935767+41693 is not prime. Residue 2BCBACC82E6A97C0.
[2011-04-09 01:56:27 EDT] Total Time: 97:41:37 Total Tests: 33 Total PRPs Found: 0
[2011-04-09 01:56:33 EDT] 5OB: Returning work to server pgllr.mine.nu at port 13000
[2011-04-09 01:56:36 EDT] 5OB: INFO: Test for 1*2^2935767+41693 was accepted
[2011-04-09 01:56:38 EDT] 296: nothing was received on socket after 2 seconds
[2011-04-09 01:56:41 EDT] 5OB: Getting work from server pgllr.mine.nu at port 13000
[2011-04-09 01:56:51 EDT] 5OB: INFO: No available candidates are left on this server.
[2011-04-09 01:56:51 EDT] 296: nothing was received on socket after 60 seconds
[2011-04-09 01:56:51 EDT] 5OB: PRPNet server is version 4.3.0
[2011-04-09 01:56:51 EDT] 296: nothing was received on socket after 1 seconds

[2011-04-09 08:12:33 EDT] 5OB: 1*2^2955160+40291 is not prime. Residue 312079B92D34E9F1.
[2011-04-09 08:12:33 EDT] Total Time:103:57:43 Total Tests: 34 Total PRPs Found: 0
[2011-04-09 08:12:34 EDT] 5OB: Returning work to server pgllr.mine.nu at port 13000
[2011-04-09 08:12:36 EDT] 5OB: INFO: Test for 1*2^2955160+40291 was ignored. Candidate and/or test was not found
[2011-04-09 08:12:36 EDT] 5OB: INFO: 0 of 1 test results were accepted

[2011-04-09 08:12:37 EDT] 5OB: Getting work from server pgllr.mine.nu at port 13000
[2011-04-09 08:12:39 EDT] 5OB: PRPNet server is version 4.3.0
etc. . .

I am running the prpclient-4.3.0beta-client-windows client.
All times are US EDT (UTC-4).

In the future if I see this type of thing soon enough I will select stopoption=2 and do a CTL-C. This will abandon the wu properly. Then reset stopoption=3 and restart that core. However in this case, since the server does not know that machine even has the wu I don't know if it really matters.

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35120 - Posted: 10 Apr 2011 | 20:26:40 UTC

Its now happening to every WU I have on 5oB - as it finishes a legit one, the above behaviour kicks in. Only now there is no resetting, and its impossible to start a legit WU.

Looking like 5oB port 13000 will come to a halt shortly if others are getting this.

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35125 - Posted: 10 Apr 2011 | 20:55:14 UTC
Last modified: 10 Apr 2011 | 20:55:58 UTC

So far 5 out of 12 Cores affected, the rest will follow over the next 3-5 hours.

No startup of a legitimate 5oB WU is possible. UnlessI post otherwise assume the rest are falling over after they have completed current WU and try to get another.

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35128 - Posted: 10 Apr 2011 | 22:13:47 UTC - in response to Message 35125.

Same behaviour, but now additional lines of text appear:

gethostbyname: No such file or directory
gethostbyname(pgllr.mine.nu> generated error 11004

As others look like they are being able to log in new WUs at the Project, according to the PRPNet stats page for 5oB Pendings, I can only assume its only me for some reason, particularly as no one else has reported this or acknowledged there is a server issue.

I've spent over two hours crawling through everything I can find this end. It looks like the server has decided I'm "persona non grata" :)

One machine is off totally now, the two others will follow shortly, probably over the next 2 hours. Oh well, time to find something else to crunch - hope you make 5 May :)

Regards
Zy

enderakProject donor
Send message
Joined: 13 Dec 08
Posts: 45
ID: 32842
Credit: 8,789,369
RAC: 0
SoB LLR Amethyst: Earned 1,000,000 credits (1,510,496)TRP LLR Silver: Earned 100,000 credits (113,075)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Gold: Earned 500,000 credits (560,005)TRP Sieve (suspended) Silver: Earned 100,000 credits (215,668)PSA Turquoise: Earned 5,000,000 credits (6,389,782)
Message 35129 - Posted: 10 Apr 2011 | 22:24:38 UTC

It's not just you, I am having the same issue as well.

The stats show that a few tests are still being handed out, but it seems to be fewer than normal.

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35130 - Posted: 10 Apr 2011 | 22:29:26 UTC - in response to Message 35129.

Thanks for that.

It was getting to the stage where I was thinking I had been canned for some reason as there had been total silence, and it looks as though my hostname directory is now either gone or inaccessible .

Hope they are able to fix it soon else John can kiss goodbye to his target date of 5 May.

Regards
Zy

gomeyerProject donor
Send message
Joined: 26 Oct 08
Posts: 80
ID: 30918
Credit: 358,409,613
RAC: 0
321 LLR Ruby: Earned 2,000,000 credits (2,006,649)Cullen LLR Amethyst: Earned 1,000,000 credits (1,049,607)ESP LLR Bronze: Earned 10,000 credits (75,053)PPS LLR Ruby: Earned 2,000,000 credits (2,027,453)PSP LLR Amethyst: Earned 1,000,000 credits (1,035,190)SoB LLR Amethyst: Earned 1,000,000 credits (1,043,750)SGS LLR Amethyst: Earned 1,000,000 credits (1,012,055)TRP LLR Amethyst: Earned 1,000,000 credits (1,049,083)Woodall LLR Amethyst: Earned 1,000,000 credits (1,003,614)321 Sieve Amethyst: Earned 1,000,000 credits (1,007,951)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,254,568)PPS Sieve Double Silver: Earned 200,000,000 credits (325,765,324)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,624,626)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,449,779)PSA Ruby: Earned 2,000,000 credits (2,004,884)
Message 35131 - Posted: 10 Apr 2011 | 22:34:06 UTC

Yup. Now have 5 of 14 cores with same problem. Will let the others finish and try again in a day or two if this can be fixed.

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35132 - Posted: 10 Apr 2011 | 22:47:35 UTC

There seems to be something going on with the server. I will need logs from Lennart to diagnose.

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35133 - Posted: 10 Apr 2011 | 22:52:09 UTC - in response to Message 35132.
Last modified: 10 Apr 2011 | 23:01:51 UTC

Just for the record for those fixing this, in case its a relevant symptom that will help diagnosis, the stats page now shows me starting a WU:

1*2^3092727+41693 Sun Apr 10 22:35:35 2011

Not me - I've checked all cmd boxes I have left on the remaining two machines - I dont have a WU running this end by that designation (I have just 5 left now out of 12)

EDIT:
Just thought ...... I hope the few that seem to log in are genuine ones, and not phantom ones like this one ... could be a bit of a nightmare.

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35134 - Posted: 10 Apr 2011 | 23:14:41 UTC

OooH - looks like I'm in - two wenthrough over last few minutes :)

If there is an anonimous benefactor out there - my thanks :)

Will post if the others dont restart

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35135 - Posted: 10 Apr 2011 | 23:20:41 UTC
Last modified: 10 Apr 2011 | 23:26:07 UTC

Nope - back to the old problem.

Edit:
It now shows error socket connect, error 10061. It fell back to SGS, and crunching that. That behaviour (fall back) not happened for a while.

Regards
Zy

Profile Lennart SM5YMTProject donor
Honorary cruncher
Avatar
Send message
Joined: 7 May 07
Posts: 1125
ID: 7989
Credit: 694,692,344
RAC: 0
Discovered the World's First base 13 Generalized Woodall prime!!!Eliminated 22 conjecture "k"s2009 Tour de Primes highest prime count2009 Tour de Primes most Mountain Stage primes2010 Tour de Primes highest prime count2010 Tour de Primes highest prime score321 LLR Turquoise: Earned 5,000,000 credits (5,097,586)Cullen LLR Amethyst: Earned 1,000,000 credits (1,101,661)PPS LLR Emerald: Earned 50,000,000 credits (80,169,933)PSP LLR Jade: Earned 10,000,000 credits (18,921,475)SoB LLR Bronze: Earned 10,000 credits (55,453)SR5 LLR Ruby: Earned 2,000,000 credits (4,407,637)SGS LLR Ruby: Earned 2,000,000 credits (4,595,742)TPS LLR (retired) Silver: Earned 100,000 credits (360,998)TRP LLR Turquoise: Earned 5,000,000 credits (5,800,792)Woodall LLR Ruby: Earned 2,000,000 credits (4,885,194)321 Sieve Amethyst: Earned 1,000,000 credits (1,345,944)Cullen/Woodall Sieve (suspended) Sapphire: Earned 20,000,000 credits (27,566,122)PPS Sieve Double Silver: Earned 200,000,000 credits (220,787,724)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Ruby: Earned 2,000,000 credits (2,694,194)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (8,986,371)AP 26/27 Amethyst: Earned 1,000,000 credits (1,780,026)GFN Jade: Earned 10,000,000 credits (18,585,003)PSA Double Silver: Earned 200,000,000 credits (287,482,568)
Message 35136 - Posted: 10 Apr 2011 | 23:27:56 UTC

Sorry I was not at home last hr's.

I have restarted the server and I think it shall work now.


To Mark. I am not sure but it could be prpamdin, I had to kill it to release the connection to the server.

Lennart

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35137 - Posted: 10 Apr 2011 | 23:38:53 UTC
Last modified: 10 Apr 2011 | 23:39:32 UTC

All cores now working normaly - one resumed an old fallback, which is fine.

I've also had a running WU just finish and transit through to starting a new one with no problem.

Connections are fast - and all appears to be back to normal

Back Murphy .... Get Back I Say .... :)

Many Thanks, looks like the server restart did the trick

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35138 - Posted: 11 Apr 2011 | 0:09:27 UTC
Last modified: 11 Apr 2011 | 0:11:03 UTC

Murphy is deaf :)

The old problem is back. I had a WU finish, failed to contact server for new work, did not fallback to my default fallback of SGS. 5oB stats page appears down as well. (It accepted the completed work as valid, but did not manage to contact server for a new one)

Just an observation in case its not you beavering away.

Regards
Zy

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35141 - Posted: 11 Apr 2011 | 0:57:05 UTC - in response to Message 35138.

As long as Murphy has taken a flying hike .... its looking ok now. Been solid for about 20 mins, new WUs coming back on board

Regards
Zy

gomeyerProject donor
Send message
Joined: 26 Oct 08
Posts: 80
ID: 30918
Credit: 358,409,613
RAC: 0
321 LLR Ruby: Earned 2,000,000 credits (2,006,649)Cullen LLR Amethyst: Earned 1,000,000 credits (1,049,607)ESP LLR Bronze: Earned 10,000 credits (75,053)PPS LLR Ruby: Earned 2,000,000 credits (2,027,453)PSP LLR Amethyst: Earned 1,000,000 credits (1,035,190)SoB LLR Amethyst: Earned 1,000,000 credits (1,043,750)SGS LLR Amethyst: Earned 1,000,000 credits (1,012,055)TRP LLR Amethyst: Earned 1,000,000 credits (1,049,083)Woodall LLR Amethyst: Earned 1,000,000 credits (1,003,614)321 Sieve Amethyst: Earned 1,000,000 credits (1,007,951)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,254,568)PPS Sieve Double Silver: Earned 200,000,000 credits (325,765,324)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,624,626)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,449,779)PSA Ruby: Earned 2,000,000 credits (2,004,884)
Message 35142 - Posted: 11 Apr 2011 | 1:22:00 UTC

@Lennart,
First of all, thanks for getting right on that.
But second, I was able to get four going but the fifth failed as before.

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35143 - Posted: 11 Apr 2011 | 2:04:38 UTC

What is the maxworkunits set on the server? I suggest setting it to 1 if it isn't already.

Profile Lennart SM5YMTProject donor
Honorary cruncher
Avatar
Send message
Joined: 7 May 07
Posts: 1125
ID: 7989
Credit: 694,692,344
RAC: 0
Discovered the World's First base 13 Generalized Woodall prime!!!Eliminated 22 conjecture "k"s2009 Tour de Primes highest prime count2009 Tour de Primes most Mountain Stage primes2010 Tour de Primes highest prime count2010 Tour de Primes highest prime score321 LLR Turquoise: Earned 5,000,000 credits (5,097,586)Cullen LLR Amethyst: Earned 1,000,000 credits (1,101,661)PPS LLR Emerald: Earned 50,000,000 credits (80,169,933)PSP LLR Jade: Earned 10,000,000 credits (18,921,475)SoB LLR Bronze: Earned 10,000 credits (55,453)SR5 LLR Ruby: Earned 2,000,000 credits (4,407,637)SGS LLR Ruby: Earned 2,000,000 credits (4,595,742)TPS LLR (retired) Silver: Earned 100,000 credits (360,998)TRP LLR Turquoise: Earned 5,000,000 credits (5,800,792)Woodall LLR Ruby: Earned 2,000,000 credits (4,885,194)321 Sieve Amethyst: Earned 1,000,000 credits (1,345,944)Cullen/Woodall Sieve (suspended) Sapphire: Earned 20,000,000 credits (27,566,122)PPS Sieve Double Silver: Earned 200,000,000 credits (220,787,724)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Ruby: Earned 2,000,000 credits (2,694,194)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (8,986,371)AP 26/27 Amethyst: Earned 1,000,000 credits (1,780,026)GFN Jade: Earned 10,000,000 credits (18,585,003)PSA Double Silver: Earned 200,000,000 credits (287,482,568)
Message 35144 - Posted: 11 Apr 2011 | 2:52:27 UTC - in response to Message 35143.

What is the maxworkunits set on the server? I suggest setting it to 1 if it isn't already.


It was 10 It is now 1

Lennart

gomeyerProject donor
Send message
Joined: 26 Oct 08
Posts: 80
ID: 30918
Credit: 358,409,613
RAC: 0
321 LLR Ruby: Earned 2,000,000 credits (2,006,649)Cullen LLR Amethyst: Earned 1,000,000 credits (1,049,607)ESP LLR Bronze: Earned 10,000 credits (75,053)PPS LLR Ruby: Earned 2,000,000 credits (2,027,453)PSP LLR Amethyst: Earned 1,000,000 credits (1,035,190)SoB LLR Amethyst: Earned 1,000,000 credits (1,043,750)SGS LLR Amethyst: Earned 1,000,000 credits (1,012,055)TRP LLR Amethyst: Earned 1,000,000 credits (1,049,083)Woodall LLR Amethyst: Earned 1,000,000 credits (1,003,614)321 Sieve Amethyst: Earned 1,000,000 credits (1,007,951)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,254,568)PPS Sieve Double Silver: Earned 200,000,000 credits (325,765,324)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,624,626)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,449,779)PSA Ruby: Earned 2,000,000 credits (2,004,884)
Message 35145 - Posted: 11 Apr 2011 | 3:28:40 UTC

Just retried that 5th core again and it dowloaded work normally, but I tried another core and it failed. Sorry guys, but from my side there is still a problem.

I'll leave these 5 cores running for the time being and see how it goes.

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35158 - Posted: 11 Apr 2011 | 12:28:56 UTC
Last modified: 11 Apr 2011 | 12:31:36 UTC

I dont know if you are bothered about Orphan WUs - but in case you are, I have two inside the 5oB pendings as a result of the recent problems:

1*2^3075976+2131 Zydor 176262 UK_BOINC_Team Sun Apr 10 17:51:05 2011 18:24

1*2^3092727+41693 Zydor 176262 UK_BOINC_Team Sun Apr 10 22:35:35 2011 13:39

Definitely not running my end, triple checked all cmd boxes. Looks like there could be probably around a dozen or so orphans running at present.

Regards
Zy

Prime Al ScreenProject donor
Send message
Joined: 1 Dec 09
Posts: 240
ID: 50942
Credit: 38,208,946
RAC: 0
321 LLR Amethyst: Earned 1,000,000 credits (1,061,666)Cullen LLR Amethyst: Earned 1,000,000 credits (1,029,850)PPS LLR Ruby: Earned 2,000,000 credits (2,647,860)PSP LLR Ruby: Earned 2,000,000 credits (2,877,696)SoB LLR Amethyst: Earned 1,000,000 credits (1,376,336)SR5 LLR Silver: Earned 100,000 credits (131,116)SGS LLR Ruby: Earned 2,000,000 credits (2,025,343)TRP LLR Amethyst: Earned 1,000,000 credits (1,112,533)Woodall LLR Amethyst: Earned 1,000,000 credits (1,118,119)321 Sieve Amethyst: Earned 1,000,000 credits (1,014,806)Cullen/Woodall Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,022,047)PPS Sieve Turquoise: Earned 5,000,000 credits (6,478,805)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (242,867)TRP Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,050,478)AP 26/27 Gold: Earned 500,000 credits (679,594)GFN Ruby: Earned 2,000,000 credits (2,278,022)PSA Jade: Earned 10,000,000 credits (12,061,816)
Message 35173 - Posted: 11 Apr 2011 | 20:21:21 UTC - in response to Message 35158.

I dont know if you are bothered about Orphan WUs - but in case you are, I have two inside the 5oB pendings as a result of the recent problems:

1*2^3075976+2131 Zydor 176262 UK_BOINC_Team Sun Apr 10 17:51:05 2011 18:24

1*2^3092727+41693 Zydor 176262 UK_BOINC_Team Sun Apr 10 22:35:35 2011 13:39

Definitely not running my end, triple checked all cmd boxes. Looks like there could be probably around a dozen or so orphans running at present.

Regards
Zy

I also have one (1*2^3087928+40291) of those that I can not see in any of the logs for that host. So I agree that those WUs are probably orphans.

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35180 - Posted: 12 Apr 2011 | 0:13:38 UTC

If you did not do a test on the orphans, then you are okay. With the 4.3.0 client and 4.3.0 server there won't be orphans. If you are running the 4.3.0 client and are getting orphaned tasks on the server, then there must be a bug.

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35181 - Posted: 12 Apr 2011 | 0:14:05 UTC - in response to Message 35180.

If you did not do a test on the orphans, then you are okay. With the 4.3.0 client and 4.3.0 server there won't be orphans. If you are running the 4.3.0 client and are getting orphaned tasks on the server, then there must be a bug.


The positive side is that your client probably never tested them, thus haven't lost credit.

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35182 - Posted: 12 Apr 2011 | 0:30:16 UTC - in response to Message 35181.
Last modified: 12 Apr 2011 | 0:39:16 UTC

Not fussed re credit - just reporting in case it helps re server housekeeping, if it doesnt, no worries ...

They were created last night as a result of the server hassles and some bad WUs created (see top of this Mystery Destination thread). The only way I could get going again was zap the temp WU data files in the directory, the standard ini options did not cover the deletion because as far as the software was conerned it was a valid WU and was running, problem was, it was going no where, previous affected ones had been declared invalid at the server.

Hence the sledgehammer to crack a nut by deleting the data files. So having caused the Orphan's, I wanted to try and make sure they didnt cause hassles. If its no hassle, great, I'll leave any furture orphans be.

EDIT
Just in case we got crossed wires here - be clear that the server had issued a test and it was running on the Client, some I noticed immediately, some not until 2 0or 3 hours or so into crunching that it would be a failed WU if run full term. Therefore there is a record somewhere that number combination had been issued for testing, because the WU was running - or I am missing something here on how the server controls them?

Regards
Zy

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35220 - Posted: 13 Apr 2011 | 0:27:30 UTC - in response to Message 35182.

Upon further review these problems appear to be a case of running two instances of the client from the same folder. Can any of you confirm that is happening?

This has been an issue before and is really the best explanation for the behavior. When running two instances of the client from the same folder, they can trample eachother's files. Barring a coding issue with the client, gomeyer's log is a perfect example of that. The client is running single-threaded, so it is not possible to see information for two different work suffixes intermingled as shown in his log.

I need to find a way to prevent users from doing that.

gomeyerProject donor
Send message
Joined: 26 Oct 08
Posts: 80
ID: 30918
Credit: 358,409,613
RAC: 0
321 LLR Ruby: Earned 2,000,000 credits (2,006,649)Cullen LLR Amethyst: Earned 1,000,000 credits (1,049,607)ESP LLR Bronze: Earned 10,000 credits (75,053)PPS LLR Ruby: Earned 2,000,000 credits (2,027,453)PSP LLR Amethyst: Earned 1,000,000 credits (1,035,190)SoB LLR Amethyst: Earned 1,000,000 credits (1,043,750)SGS LLR Amethyst: Earned 1,000,000 credits (1,012,055)TRP LLR Amethyst: Earned 1,000,000 credits (1,049,083)Woodall LLR Amethyst: Earned 1,000,000 credits (1,003,614)321 Sieve Amethyst: Earned 1,000,000 credits (1,007,951)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,254,568)PPS Sieve Double Silver: Earned 200,000,000 credits (325,765,324)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,624,626)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,449,779)PSA Ruby: Earned 2,000,000 credits (2,004,884)
Message 35222 - Posted: 13 Apr 2011 | 1:45:40 UTC - in response to Message 35220.
Last modified: 13 Apr 2011 | 2:09:28 UTC

Upon further review these problems appear to be a case of running two instances of the client from the same folder. Can any of you confirm that is happening?

Hi rogue,

I am absolutely certain that is not the case for me. I have named clientid= in the prpclient.ini file for each folder as -1, -2, etc so it is evident on the page http://pgllr.mine.nu:13000/all.html which task/folder the server is unaware of.

I'm fairly sure it is strictly communications being dropped between the client and the server based on the error message "nothing was received on socket . . ." which is always present at time of failure. Please see my first post in this thread.

I'm still running 8 cores but I have to check them frequently since this is still happening.

Best regards,
Gus

[edit] My first post in this thread has time stamps and error messages. Are there any log files on the server that could be checked for this event?[/edit]

Scott BrownProject donor
Volunteer moderator
Project administrator
Volunteer tester
Project scientist
Avatar
Send message
Joined: 17 Oct 05
Posts: 2125
ID: 1178
Credit: 8,408,131,484
RAC: 6,774,597
Discovered the World's First base 116 Generalized Cullen prime!!!Discovered 26 mega primesEliminated 7 conjecture "k"sDiscovered 1 Sophie Germain pairDiscovered 2 Fermat divisors2012 Tour de Primes highest prime count2012 Tour de Primes most Mountain Stage primes2015 Tour de Primes highest prime count2016 Tour de Primes highest prime countFound 23 primes in the 2018 Tour de PrimesFound 1 mega prime in the 2018 Tour de PrimesFound 2 primes in the 2018 Tour de Primes Mountain Stage2019 Tour de Primes highest prime countFound 22 primes in the 2019 Tour de Primes2020 Tour de Primes highest prime scoreFound 21 primes in the 2020 Tour de PrimesFound 4 mega primes in the 2020 Tour de Primes321 LLR Double Bronze: Earned 100,000,000 credits (184,732,070)Cullen LLR Double Bronze: Earned 100,000,000 credits (103,870,990)ESP LLR Double Silver: Earned 200,000,000 credits (203,249,784)Generalized Cullen/Woodall LLR Double Bronze: Earned 100,000,000 credits (109,580,172)PPS LLR Double Gold: Earned 500,000,000 credits (639,557,324)PSP LLR Double Bronze: Earned 100,000,000 credits (126,982,721)SoB LLR Double Bronze: Earned 100,000,000 credits (135,747,083)SR5 LLR Double Silver: Earned 200,000,000 credits (214,194,272)SGS LLR Double Silver: Earned 200,000,000 credits (200,485,349)TPS LLR (retired) Silver: Earned 100,000 credits (235,439)TRP LLR Double Silver: Earned 200,000,000 credits (201,215,056)Woodall LLR Double Bronze: Earned 100,000,000 credits (101,447,725)321 Sieve Double Silver: Earned 200,000,000 credits (235,451,253)Cullen/Woodall Sieve (suspended) Emerald: Earned 50,000,000 credits (83,794,448)Generalized Cullen/Woodall Sieve (suspended) Double Silver: Earned 200,000,000 credits (285,139,652)PPS Sieve Double Ruby: Earned 2,000,000,000 credits (2,664,685,363)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Double Silver: Earned 200,000,000 credits (203,523,358)TRP Sieve (suspended) Double Silver: Earned 200,000,000 credits (201,489,157)AP 26/27 Double Silver: Earned 200,000,000 credits (374,537,969)GFN Double Amethyst: Earned 1,000,000,000 credits (1,879,253,511)PSA Double Silver: Earned 200,000,000 credits (259,058,048)
Message 35223 - Posted: 13 Apr 2011 | 2:15:04 UTC - in response to Message 35220.
Last modified: 13 Apr 2011 | 2:22:18 UTC

Upon further review these problems appear to be a case of running two instances of the client from the same folder. Can any of you confirm that is happening?


I am getting this some as well, and all my clients are definitely installed in separate folders as well.


EDIT: Don't know if this is related, but I seem to get a trickle of PPSE10K or PPSE11K somehow (I guess when this is occurring) even though I have all non-5oB servers commented out and those two have a zero resource share even if the comments are not in there properly?
____________
141941*2^4299438-1 is prime!


rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35224 - Posted: 13 Apr 2011 | 2:22:41 UTC

I made a mistake. In the log messages, it sometimes gives the work suffix and sometimes the socket ID. I'll have to fix that so that the logging is correct.

John mentioned that there was a performance issue with the code on the server. Unfortunately I have no idea what that might have been. Hopefully Lennart has some logs.

JohnProject donor
Honorary cruncher
Avatar
Send message
Joined: 21 Feb 06
Posts: 2875
ID: 2449
Credit: 2,681,934
RAC: 0
321 LLR Bronze: Earned 10,000 credits (11,773)Cullen LLR Bronze: Earned 10,000 credits (14,945)ESP LLR Bronze: Earned 10,000 credits (26,855)PPS LLR Bronze: Earned 10,000 credits (84,876)PSP LLR Bronze: Earned 10,000 credits (15,311)SoB LLR Bronze: Earned 10,000 credits (21,440)SR5 LLR Bronze: Earned 10,000 credits (29,270)SGS LLR Bronze: Earned 10,000 credits (26,616)TPS LLR (retired) Bronze: Earned 10,000 credits (36,288)TRP LLR Bronze: Earned 10,000 credits (41,655)Woodall LLR Bronze: Earned 10,000 credits (15,807)321 Sieve Bronze: Earned 10,000 credits (20,014)Cullen/Woodall Sieve (suspended) Bronze: Earned 10,000 credits (23,405)PPS Sieve Bronze: Earned 10,000 credits (36,192)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Bronze: Earned 10,000 credits (20,306)TRP Sieve (suspended) Bronze: Earned 10,000 credits (21,738)GFN Bronze: Earned 10,000 credits (86,217)PSA Ruby: Earned 2,000,000 credits (2,143,756)
Message 35225 - Posted: 13 Apr 2011 | 2:49:40 UTC - in response to Message 35223.
Last modified: 13 Apr 2011 | 3:05:24 UTC

EDIT: Don't know if this is related, but I seem to get a trickle of PPSE10K or PPSE11K somehow (I guess when this is occurring) even though I have all non-5oB servers commented out and those two have a zero resource share even if the comments are not in there properly?

This is working properly. Even at 0% share, you'll get work. If you don't want any work but 5oB, then you must comment out all ports.

However, I recommend leaving them (or another port of choosing) so that your cores won't go idle. 10K/11K are the shortest WU's so the client will return to 5oB faster. This only occurs when the clients are not able to get work from 5oB.

EDIT: 0:0 shouldn't get any work regardless if port is commented out or not. I suspect you probably have 0:#.
____________

ZydorProject donor
Avatar
Send message
Joined: 27 Nov 10
Posts: 226
ID: 74718
Credit: 25,180,844
RAC: 0
PPS LLR Gold: Earned 500,000 credits (683,896)SR5 LLR Bronze: Earned 10,000 credits (65,374)SGS LLR Silver: Earned 100,000 credits (198,823)PPS Sieve Jade: Earned 10,000,000 credits (12,095,504)PSA Jade: Earned 10,000,000 credits (12,131,472)
Message 35226 - Posted: 13 Apr 2011 | 3:27:37 UTC - in response to Message 35220.
Last modified: 13 Apr 2011 | 3:47:01 UTC

Upon further review these problems appear to be a case of running two instances of the client from the same folder. Can any of you confirm that is happening?

This has been an issue before and is really the best explanation for the behavior. When running two instances of the client from the same folder, they can trample eachother's files. Barring a coding issue with the client, gomeyer's log is a perfect example of that. The client is running single-threaded, so it is not possible to see information for two different work suffixes intermingled as shown in his log.

I need to find a way to prevent users from doing that.


Definitely not run two instances on the same folder

EDIT
For the record, I've only had one instance of the problem since your sterling efforts on the server that night, and that one was pretty soon after the fix efforts were in place. So not had one for 24hrs or more. The absolute unshakable common factor that guaranteed it would fall over, strange as it may sound, is when the "welcome to 5oB DC server" line was missing from the echo'd info on screen, and/or the WU was not shown on the 5oB Pendings list (always missing from that list when this happens). As soon as I saw either was the case - which was sometimes straightaway, sometimes few hours later, depending on whether I was actively looking for it - that was the trigger to zap it and restart.

Regards
Zy

Prime Al ScreenProject donor
Send message
Joined: 1 Dec 09
Posts: 240
ID: 50942
Credit: 38,208,946
RAC: 0
321 LLR Amethyst: Earned 1,000,000 credits (1,061,666)Cullen LLR Amethyst: Earned 1,000,000 credits (1,029,850)PPS LLR Ruby: Earned 2,000,000 credits (2,647,860)PSP LLR Ruby: Earned 2,000,000 credits (2,877,696)SoB LLR Amethyst: Earned 1,000,000 credits (1,376,336)SR5 LLR Silver: Earned 100,000 credits (131,116)SGS LLR Ruby: Earned 2,000,000 credits (2,025,343)TRP LLR Amethyst: Earned 1,000,000 credits (1,112,533)Woodall LLR Amethyst: Earned 1,000,000 credits (1,118,119)321 Sieve Amethyst: Earned 1,000,000 credits (1,014,806)Cullen/Woodall Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,022,047)PPS Sieve Turquoise: Earned 5,000,000 credits (6,478,805)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (242,867)TRP Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,050,478)AP 26/27 Gold: Earned 500,000 credits (679,594)GFN Ruby: Earned 2,000,000 credits (2,278,022)PSA Jade: Earned 10,000,000 credits (12,061,816)
Message 35278 - Posted: 14 Apr 2011 | 14:56:21 UTC - in response to Message 35220.

Upon further review these problems appear to be a case of running two instances of the client from the same folder. Can any of you confirm that is happening?

This has been an issue before and is really the best explanation for the behavior. When running two instances of the client from the same folder, they can trample eachother's files. Barring a coding issue with the client, gomeyer's log is a perfect example of that. The client is running single-threaded, so it is not possible to see information for two different work suffixes intermingled as shown in his log.

I need to find a way to prevent users from doing that.

While I do understand the reasoning, it would be nice not having separate directories for each occurrence. I just run multiple tabs under konsole in each subdirectory such minimizes the need for multiple windows open.

Not really having any clue on this, this is just an observation. I am somewhat surprised that filenames like 'work_5OB.in' and 'work_5OB.out' are not unique. Surely naming these based on the candidate and app would be sufficient. Not sure what is in pfgw.ini and llr.ini that really need to be unique files (I do see what is in them but wonder why these are not command line options). There appears to be some temporary file that presumably the llr creates uniquely.

The llr app has an -wDIR Run from a different working directory option. Presumably the prpclient app can create and destroy working directories based on the candidate (type and number). So the llr app can be invoked with that directory.

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35281 - Posted: 14 Apr 2011 | 15:21:32 UTC - in response to Message 35278.

While I do understand the reasoning, it would be nice not having separate directories for each occurrence. I just run multiple tabs under konsole in each subdirectory such minimizes the need for multiple windows open.

Not really having any clue on this, this is just an observation. I am somewhat surprised that filenames like 'work_5OB.in' and 'work_5OB.out' are not unique. Surely naming these based on the candidate and app would be sufficient. Not sure what is in pfgw.ini and llr.ini that really need to be unique files (I do see what is in them but wonder why these are not command line options). There appears to be some temporary file that presumably the llr creates uniquely.

The llr app has an -wDIR Run from a different working directory option. Presumably the prpclient app can create and destroy working directories based on the candidate (type and number). So the llr app can be invoked with that directory.


The problem is that the PRPNet client has no control over some of the filenames used by the helper programs. Examples of this are the the ini files and checkpoint files.

Profile VatoProject donor
Volunteer tester
Avatar
Send message
Joined: 2 Feb 08
Posts: 760
ID: 18447
Credit: 214,742,438
RAC: 288,420
Found 1 prime in the 2020 Tour de Primes321 LLR Ruby: Earned 2,000,000 credits (2,419,655)Cullen LLR Ruby: Earned 2,000,000 credits (2,309,957)ESP LLR Ruby: Earned 2,000,000 credits (2,617,477)Generalized Cullen/Woodall LLR Ruby: Earned 2,000,000 credits (2,076,479)PPS LLR Turquoise: Earned 5,000,000 credits (8,739,711)PSP LLR Ruby: Earned 2,000,000 credits (3,564,856)SoB LLR Ruby: Earned 2,000,000 credits (3,254,352)SR5 LLR Ruby: Earned 2,000,000 credits (2,982,194)SGS LLR Ruby: Earned 2,000,000 credits (2,945,539)TPS LLR (retired) Silver: Earned 100,000 credits (103,523)TRP LLR Ruby: Earned 2,000,000 credits (3,450,469)Woodall LLR Ruby: Earned 2,000,000 credits (2,147,072)321 Sieve Sapphire: Earned 20,000,000 credits (26,923,188)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,119,699)Generalized Cullen/Woodall Sieve (suspended) Jade: Earned 10,000,000 credits (10,278,995)PPS Sieve Sapphire: Earned 20,000,000 credits (33,792,522)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Ruby: Earned 2,000,000 credits (4,080,177)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,221,054)AP 26/27 Sapphire: Earned 20,000,000 credits (20,020,744)GFN Sapphire: Earned 20,000,000 credits (39,516,868)PSA Sapphire: Earned 20,000,000 credits (34,211,039)
Message 35304 - Posted: 15 Apr 2011 | 0:23:42 UTC - in response to Message 35281.

This could be worked around by prpclient creating a directory to perform the work in, however that's more effort and state to keep between restarts, and I think the current approach is fine as-is.
____________

rogue
Volunteer developer
Avatar
Send message
Joined: 8 Sep 07
Posts: 1196
ID: 12001
Credit: 18,565,548
RAC: 0
PPS LLR Bronze: Earned 10,000 credits (31,229)PSA Jade: Earned 10,000,000 credits (18,533,435)
Message 35306 - Posted: 15 Apr 2011 | 1:07:40 UTC - in response to Message 35304.

Ultimately it would be nice to have a single U/I that could control running multiple instances of the client. I'm not referring to the PRPeditor, but something much different.

Such a tool would be time consuming to write. Were I retired (or without kids) I could probably find the time, but I don't have that luxury.

Message boards : Project Staging Area : Mystery Destination

[Return to PrimeGrid main page]
DNS Powered by DNSEXIT.COM
Copyright © 2005 - 2020 Rytis Slatkevičius (contact) and PrimeGrid community. Server load 5.11, 4.60, 3.55
Generated 31 Oct 2020 | 2:25:20 UTC