Author |
Message |
|
Ive been hitting alot of client errors lately and i was wondering if there was any possible way of limiting this, optimising against it and generally WTF is causing it?
Theres nothing worse than losing hours of compute due to unknown error |
|
|
RytisVolunteer moderator Project administrator
 Send message
Joined: 22 Jun 05 Posts: 2653 ID: 1 Credit: 95,297,257 RAC: 120,076
                     
|
This is weird, your PC is crashing on random workunits, on multiple apps. One of the cause for that might be that your computer is overheating (PrimeGrid applications are sensitive to even slightest errors in CPU calculations, and heat is the main cause for such errors). You can try cleaning up your CPU heatsink (make sure that by opening the case you don't void the warranty!), and try running subprojects that have shorter tasks, like TPS or sieves. Sieves also have higher tolerance to CPU output. You can change the projects in your account page.
____________
|
|
|
John Honorary cruncher
 Send message
Joined: 21 Feb 06 Posts: 2875 ID: 2449 Credit: 2,681,934 RAC: 0
                 
|
Also of interest would be if you are overclocking your machine. This, too, creates sensitivity issues with your CPU, RAM, and power supply. The primality program LLR does not tolerate "even the slightest of errors."
Many people conduct a "Stress Test" on their systems to rule out any hardware issues that may be causing problems. Prime95 has an option to do such that.
For more information on the test, please see the Wikipedia entry here: http://en.wikipedia.org/wiki/Prime95
You can download p95v* (Windows 32), p64v* (Windows 64) or mprime* (Linux) here: http://www.mersenne.org/freesoft.htm
The "Blend" option will stress both CPU and RAM. You decide how long you wish it to run...as it will run indefinitely if no errors are found. The Wikipedia entry mentions a good "rule of thumb" to follow.
This post is provided for informational purposes only. Please read all materials before running. If there are hardware issues, this test has a very high probability of finding them. Therefore, run at your own risk. :)"
____________
|
|
|
|
Um...in that case it could well be my power supply, i believe im substantially below the rating i truelly require....
cheers |
|
|
|
Hello all..Must appologize, my AMD laptop can not crunch anything any more, even though I had a huge cpu heat sink sitting over the cpu and a smaller cpu heat sink over the hdd and a high prf chill pad. Any way had to abort 60 plus wu's. The lap top can do anything else, run Photo shop, Starry night..ect. Prime95 just gives me cpu errors and nothing about the ram? Might change out ram anyway?
____________
My first Prime
20208428036625*2^666666-1 (SGS)
PROUD MEMBER OF TEAM CARL SAGAN |
|
|
|
Power supply or hard disk errors! May I suggest you run a hard disk diagnostic to determine if this is so? Also your O/S; if you have Windows, has a test for RAM errors, though long in duration. Also generally see if everything is plugged in well inside the computer. Don't forget to unplug the power supply first.. A loose Memory chip is nor uncommon.
Hard Drive Diagnostic Tests: http://www.tacktech.com/display.cfm?ttid=287
Peace
____________
Saving the world one spoonful at a time. |
|
|
|
I'll share this, because sometimes failures could be pretty tricky to find.
Recently I retired a mobo in one of my crunchers that started spewing out errors and occasionally completely locked up. First I thought I have a case of failing RAM. After couple of Memtest runs, errors were found, but every time they showed in different locations, which eliminated my 1st suspect. So what else could cause errors? It was by pure luck that I found the cause. While I ran Memtest, sometimes, and mostly when there were errors found, I heard a faint high-pitched sizzling sound coming from the motherboard. When I finally located its source, it turned out, that one of the capacitors near CPU socket failed and it wasn't all that bloated as they can get, which made it a bit more difficult to notice.
Well, I got a new mobo, and it's crunching on happily.
____________
|
|
|
|
Can someone tell me of a good stress test program that has AVX support, or at least link to a version of Prime95 that will stress test with AVX, as far as I can notice version 26.6 does not support it and I can't find anywhere to download newer versions. I would like to start overclocking my 3930K once the Cinco De Mayo challenge is over.
____________
|
|
|
rroonnaalldd Volunteer developer Volunteer tester
 Send message
Joined: 3 Jul 09 Posts: 1213 ID: 42893 Credit: 34,634,263 RAC: 0
                 
|
There should be a newer Prime95 version based on 27.5 and don't use any version between 26.6 and at least 27.5. These versions cause calculation errors...
[Edit]
Latest version seems to be Prime95 27.6.
____________
Best wishes. Knowledge is power. by jjwhalen
|
|
|
|
Can someone tell me of a good stress test program that has AVX support, or at least link to a version of Prime95 that will stress test with AVX, as far as I can notice version 26.6 does not support it and I can't find anywhere to download newer versions. I would like to start overclocking my 3930K once the Cinco De Mayo challenge is over.
LinX AVX Linpack. I´m not sure if I should link it here, so I don't. But it wont be hard to find through a search engine. |
|
|
|
There should be a newer Prime95 version based on 27.5 and don't use any version between 26.6 and at least 27.5. These versions cause calculation errors...
[Edit]
Latest version seems to be Prime95 27.6.
Nope, 27.7 |
|
|
|
I may be missing a simple statement somewhere. But I would like to know if PrimeGrid supports the Ubuntu operating system. Everything that down loads to the two computers running this o/s error out.
____________
|
|
|
|
I may be missing a simple statement somewhere. But I would like to know if PrimeGrid supports the Ubuntu operating system. Everything that down loads to the two computers running this o/s error out.
Hi,
the short answer is yes. I have two Ubuntu machines (12.04) crunching away without problems
The long answer however is that the boinc client (version 7.0.24) that comes standard with Ubuntu 12.04 is buggy. Your best bet is to upgrade to a more recent version of the boinc client (>7.0.24).
You are not the only one who experienced this issue on Ubuntu (and the problem with version 7.0.24 is not limited to PrimeGrid). Check these links below as it has been discussed before.
http://www.primegrid.com/forum_thread.php?id=4299&nowrap=true#53483
http://www.primegrid.com/forum_thread.php?id=4430&nowrap=true#55898
I hope this helps and feel free to report back if you have questions/issues upgrading to a more recent version of boinc.
____________
|
|
|