Since the double-length WUs arrived, I've been getting errors on two of my GPUs. Both run the older, shorter WUs without errors; only the double-length WUs fail. Both machines have 4 GB of RAM and plenty of free disk space.
The first is a GTX 260:
http://www.primegrid.com/show_host_detail.php?hostid=134752
It runs the shorter WUs with no problems, but many of the longer ones fail at 4056 seconds with this message:
<core_client_version>6.12.4</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
Sieve started: 19949512000000000 <= p < 19949518000000000
Thread 0 starting
Detected GPU 0: GeForce GTX 260
Detected compute capability: 1.3
Detected 27 multiprocessors.
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x75A622A1
The second problem GPU is on a dual GPU machine:
http://www.primegrid.com/show_host_detail.php?hostid=89885
The GTX 260 runs the longer WUs fine, but every long WU on the GT 240 fails at 3876 seconds with a message similar to the one above:
<core_client_version>6.12.11</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
Sieve started: 19469632000000000 <= p < 19469638000000000
Thread 0 starting
Detected GPU 1: GeForce GT 240
Detected compute capability: 1.2
Detected 12 multiprocessors.
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x7D61002D
The key seems to be "Maximum elapsed time exceeded".
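In case it helps anyone compare the two failures side by side, here is a minimal Python sketch that pulls the relevant fields out of a pasted task output. The function name `summarize_task_output` and the field names are my own (hypothetical), and it assumes the output looks like the excerpts above; it is not part of BOINC or PrimeGrid.

```python
import re

# Hypothetical helper: extracts a few fields from one pasted task output.
# The patterns only assume the layout seen in the excerpts in this post.
PATTERNS = {
    "client_version": r"<core_client_version>([^<]+)</core_client_version>",
    "message": r"<message>\s*(.*?)\s*</message>",
    "gpu": r"Detected GPU \d+: (.+)",
    "compute_capability": r"Detected compute capability: ([\d.]+)",
    "multiprocessors": r"Detected (\d+) multiprocessors",
    "exception": r"Reason: (.+?) at address",
}


def summarize_task_output(text):
    """Return a dict of the fields found in the text (None if a field is missing)."""
    summary = {}
    for name, pattern in PATTERNS.items():
        # The <message> body spans several lines, so '.' must match newlines there.
        flags = re.DOTALL if name == "message" else 0
        match = re.search(pattern, text, flags)
        summary[name] = match.group(1).strip() if match else None
    return summary


# Sample copied from the GT 240 output in this post.
SAMPLE = """<core_client_version>6.12.11</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
Sieve started: 19469632000000000 <= p < 19469638000000000
Thread 0 starting
Detected GPU 1: GeForce GT 240
Detected compute capability: 1.2
Detected 12 multiprocessors.
Unhandled Exception Detected...
- Unhandled Exception Record -
Reason: Breakpoint Encountered (0x80000003) at address 0x7D61002D
"""

if __name__ == "__main__":
    for field, value in summarize_task_output(SAMPLE).items():
        print(f"{field}: {value}")
```

Running it on both outputs makes it easy to see that the client versions and cards differ while the message and exception reason are the same.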
For now I've had to switch the GTX 260 box to Collatz, as it's a remote machine. I've kept the dual GTX 260 / GT 240 box limping along by aborting all the longer WUs. As I stated above, all of these cards run the shorter WUs perfectly. Are these double-length WUs a permanent change, or are they the WUs under 1P? Is it a problem with v1.38? I can't imagine that many other machines aren't also having this problem. Any well-thought-out ideas would be appreciated.