Message boards :
Generalized Fermat Prime Search :
Genefer-19 tasks invalid
All,
All of a sudden I am getting invalid results in Genefer-19. This is after many valid results. How do I research this?
____________
Thanks,
Jim
| |
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 435,681,308 RAC: 871,866
                               
|
If you look in the stderr log on the respective task pages, you'll see that both invalid tasks had a "maxerr exceeded" error, followed by trying to recover by reverting to the last checkpoint. Furthermore, both tasks were running at the same point in time.
I suspect that your computer experienced some sort of fault or glitch which altered the CPU state, cache, or main memory, corrupting both calculations. If this continues to happen, then you need to start diagnosing the computer hardware. If it doesn't happen again, I wouldn't worry about it too much, but I'd keep an eye out for future occurrences.
____________
My lucky number is 75898^524288+1 | |
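The stderr check described above can be roughly automated. This is only a sketch: it assumes the phrase "maxerr exceeded" appears literally in the saved stderr text (the exact wording of Genefer's log line is not shown in this thread), and `find_maxerr_lines` is a hypothetical helper name, not part of Genefer or BOINC.

```python
import re

# Assumption: the phrase quoted in the post appears literally in the log;
# matching is case-insensitive in case the real message is capitalized differently.
MAXERR_PATTERN = re.compile(r"maxerr exceeded", re.IGNORECASE)


def find_maxerr_lines(stderr_text: str) -> list[tuple[int, str]]:
    """Return (line_number, line) pairs for lines that mention 'maxerr exceeded'."""
    return [
        (number, line.strip())
        for number, line in enumerate(stderr_text.splitlines(), start=1)
        if MAXERR_PATTERN.search(line)
    ]
```

Pasting each invalid task's stderr into a text file and passing its contents to this function would show whether that task hit the error and roughly where in the log it happened.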
|
|
Thanks...
Jim | |
|
|
I have another "invalid", this time on GFN16. This is the STDERR data:
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<stderr_txt>
geneferocl 3.3.3-2 (Windows/OpenCL/32-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'NVIDIA CUDA', device 'NVIDIA GeForce GTX 1070 Ti', vendor 'NVIDIA Corporation', version 'OpenCL 3.0 CUDA' and driver '466.77'.
19 computeUnits @ 1683MHz, memSize=8192MB, cacheSize=912kB, cacheLineSize=128B, localMemSize=48kB, maxWorkGroupSize=1024.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/geneferocl_windows_3.3.3-2.exe -boinc -q 132841232^65536+1
Normal priority change succeeded.
Checking available transform implementations...
OCL transform is past its b limit.
OCL3 transform is past its b limit.
OCL4 transform is past its b limit.
OCL5 transform is past its b limit.
Using OCL2 transform
Starting initialization...
Initialization complete (0.103 seconds).
Testing 132841232^65536+1...
Estimated time for 132841232^65536+1 is 0:03:23
132841232^65536+1 is complete. (532371 digits) (err = 0.0000) (time = 0:03:22) 21:06:15
21:06:15 (13816): called boinc_finish(0)
</stderr_txt>
]]>
This particular iMac does not have Nvidia H/W, at least I don't think it does.
What is it telling me?
____________
Thanks,
Jim
| |
|
Michael Goetz Volunteer moderator Project administrator
|
geneferocl 3.3.3-2 (Windows/OpenCL/32-bit)
...
Running on platform 'NVIDIA CUDA', device 'NVIDIA GeForce GTX 1070 Ti',
...
This particular iMac does not have Nvidia H/W, at least I don't think it does.
What is it telling me?
It's telling you that you're either looking at the wrong task or the wrong computer, because that task ran on a Windows computer with an Nvidia 1070 Ti GPU. :)
____________
My lucky number is 75898^524288+1 | |
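One way to avoid mixing up tasks from different hosts is to pull the build and device lines out of the stderr text before reading anything else. The sketch below is written against the two line formats visible in the log quoted in this thread; `describe_host` is a hypothetical helper, and other Genefer versions may print these lines differently.

```python
import re

# Patterns modeled on the header lines shown in the stderr output above.
BUILD_RE = re.compile(r"^geneferocl\s+(\S+)\s+\(([^)]*)\)", re.MULTILINE)
DEVICE_RE = re.compile(
    r"Running on platform '([^']*)', device '([^']*)', vendor '([^']*)'"
)


def describe_host(stderr_text):
    """Extract app version, build target, and OpenCL device from a stderr log."""
    build = BUILD_RE.search(stderr_text)
    device = DEVICE_RE.search(stderr_text)
    return {
        "version": build.group(1) if build else None,
        "build": build.group(2) if build else None,
        "platform": device.group(1) if device else None,
        "device": device.group(2) if device else None,
        "vendor": device.group(3) if device else None,
    }


# Two lines copied from the log posted above.
WINDOWS_LOG = """geneferocl 3.3.3-2 (Windows/OpenCL/32-bit)
Running on platform 'NVIDIA CUDA', device 'NVIDIA GeForce GTX 1070 Ti', vendor 'NVIDIA Corporation', version 'OpenCL 3.0 CUDA' and driver '466.77'.
"""

info = describe_host(WINDOWS_LOG)
print(info["build"], "/", info["device"])
```

If the build or device printed here does not match the machine you think you are looking at, the log belongs to a different host or task.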
|
|
Oops! Sorry. Let's try again.
<core_client_version>7.16.19</core_client_version>
<![CDATA[
<stderr_txt>
geneferocl 3.3.3-2 (Apple-x86/OpenCL/64-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Command line: geneferocl_macintel_3.3.3-2 -boinc -q 132841232^65536+1 --device 0
Normal priority change failed (needs superuser privileges.
Checking available transform implementations...
OCL transform is past its b limit.
OCL3 transform is past its b limit.
OCL4 transform is past its b limit.
OCL5 transform is past its b limit.
Using OCL2 transform
Running on platform 'Apple', device 'AMD Radeon Pro 5300 Compute Engine', vendor 'AMD', version 'OpenCL 1.2 ' and driver '1.2 (Aug 30 2021 06:56:17)'.
20 computeUnits @ 1650MHz, memSize=4080MB, cacheSize=0kB, cacheLineSize=0B, localMemSize=64kB, maxWorkGroupSize=256.
Starting initialization...
Initialization complete (0.067 seconds).
Testing 132841232^65536+1...
Estimated time for 132841232^65536+1 is 0:05:17
132841232^65536+1 is complete. (532371 digits) (err = 0.0000) (time = 0:05:14) 21:34:27
21:34:27 (27524): called boinc_finish
</stderr_txt>
]]>
____________
Thanks,
Jim
| |
|
Michael Goetz Volunteer moderator Project administrator
|
The software didn't pick up any errors, which means a calculation error occurred without being detected and eventually produced the wrong result. It's a hardware error, most likely the GPU.
____________
My lucky number is 75898^524288+1 | |
|
|
Well, that's annoying! I very much appreciate the diagnosis. Guess I should go figure out how to diagnose a GPU problem. Since I have never seen this on my CPUs, I have to agree. This has happened twice on GFN-19 also. I wonder if the failure rate is proportional to the size of the GFN.
____________
Thanks,
Jim
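For a rough sense of how the task sizes compare, the digit count of b^n+1 can be estimated from the base and exponent. This is a small sketch with a hypothetical helper name, `gfn_digits`; it uses floating-point logarithms, so it could be off by one for inputs that land extremely close to a power of ten. It says nothing about failure rates, only about how large the numbers are.

```python
import math


def gfn_digits(b: int, n: int) -> int:
    """Approximate decimal digit count of b**n + 1 (b >= 2, n >= 1)."""
    return math.floor(n * math.log10(b)) + 1


# The GFN16 candidate from the logs in this thread (n = 2**16 = 65536).
print(gfn_digits(132841232, 65536))  # 532371, matching the digit count in the log
```

A GFN-19 candidate has n = 2^19 = 524288, eight times the GFN16 exponent, so for a similar base it has roughly eight times as many digits and the test runs far longer.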
| |
|
|
Check that your GPU is not overheating (this can be caused by a build-up of dust, pet hair, etc. inside the PC), that it is not overclocked or running in Turbo mode, and that the GPU fan is actually running. Any of these problems can cause the GPU to overheat and start producing errors.
____________
| |
|