Join PrimeGrid
Returning Participants
Community
Leader Boards
Results
Other
drummers-lowrise
|
Message boards :
Problems and Help :
cmdline to force OCL instead of OCL4
Author |
Message |
|
I have a 280x Tahiti and it normally runs OCL for both Genefer (genefer 21) and Genefer_WR (genefer 22) but is running OCL4 after a fan died and overheated some (TThrottle to the rescue again) but now runs OCL4 even though it tries OCL fine via this message:
Using OCL transform
Resuming 194778^2097152+1 from a checkpoint (23560318 iterations left)
Estimated time remaining for 194778^2097152+1 is 20:30:00
Successful computation progress with OCL; switching back to OCL4.
Using OCL4 transform
It usually take 15-17 hrs to run genefer 21 but it's at 35 hrs with 20 to go on OCL4. What's the cmdline to force OCL instead of OCL4 to add to app_config?
____________
Largest Primes to Date:
As Double Checker: SR5 109208*5^1816285+1 Dgts-1,269,534
As Initial Finder: SR5 243944*5^1258576-1 Dgts-879,713
| |
|
|
It's -x OCL if I'm not mistaken. | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14045 ID: 53948 Credit: 485,621,119 RAC: 673,701
                               
|
I have a 280x Tahiti and it normally runs OCL for both Genefer (genefer 21) and Genefer_WR (genefer 22) but is running OCL4 after a fan died and overheated some (TThrottle to the rescue again) but now runs OCL4 even though it tries OCL fine via this message:
Using OCL transform
Resuming 194778^2097152+1 from a checkpoint (23560318 iterations left)
Estimated time remaining for 194778^2097152+1 is 20:30:00
Successful computation progress with OCL; switching back to OCL4.
Using OCL4 transform
The parameter is "-x OCL", but that's not going to help you much.
Had you linked to the actual result, or had you included the entire stderr_out contents, I wouldn't have had to go searching for the tasks in question. I found two:
http://www.primegrid.com/result.php?resultid=862088229 and http://www.primegrid.com/result.php?resultid=861492379.
Your problem isn't that Genefer is running the wrong transform.
The problem is that the GPU is overheating and the calculations are failing. TThrottle may protect the GPU from damage, but the GPU isn't working correctly.
A larger, more informative clip from the output:
Using OCL transform
Starting initialization...
Initialization complete (108.643 seconds).
Testing 108588^4194304+1...
Estimated time for 108588^4194304+1 is 72:40:00
maxErr exceeded for 108588^4194304+1, 0.5000 > 0.4500
maxErr exceeded while using OCL; switching to OCL4.
maxErr exceeded may be caused by overclocking, overheated GPUs and other transient errors.
Using OCL4 transform
Resuming 108588^4194304+1 from a checkpoint (59507913 iterations left)
Estimated time remaining for 108588^4194304+1 is 92:40:00
maxErr exceeded for 108588^4194304+1, 1.0000 > 0.4500
maxErr exceeded while using OCL4; switching to OCL5.
maxErr exceeded may be caused by overclocking, overheated GPUs and other transient errors.
Using OCL5 transform
Resuming 108588^4194304+1 from a checkpoint (59507913 iterations left)
Estimated time remaining for 108588^4194304+1 is 110:00:00
Successful computation progress with OCL5; switching back to OCL.
Using OCL transform
Resuming 108588^4194304+1 from a checkpoint (59498842 iterations left)
Estimated time remaining for 108588^4194304+1 is 62:00:00
It starts using OCL, but it fails. It shouldn't be failing. As part of its error handling process, it tries other transforms. They work, and it goes back to OCL. But the calculation is already damaged.
I don't recall if the -x parameter will prevent it from switching in the case of an error. It's intended to bypass the benchmarks and tell Genefer which transform to start with. But if it doesn't downshift to another transform when an error occurs, its only other choice is to abort the task.
Your real problem is that the calculation is failing. "-x OCL" isn't going to solve that, however "-x OCL4" might. Forcing it to use the slower transform will probably use less power, produce less heat, and might be enough to prevent these errors. It will certainly run slower.
____________
My lucky number is 75898524288+1 | |
|
|
Wrong WU. The one you are referring to is a genefer 22 WU. This one is genefer 21. If I can't get this one running on OCL I might just abort it and start fresh but I'll give this one another try first.
This is the complete & correct stderr to the right one:
geneferocl 3.3.2-7 (Windows/OpenCL/32-bit)
Copyright 2001-2017, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2017, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'Tahiti', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (2527.7)' and driver '2527.7'.
32 computeUnits @ 955MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=32kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1
Low priority change succeeded.
Checking available transform implementations...
A benchmark is needed to determine best transform, testing available transform implementations...
Testing OCL transform...
Testing OCL2 transform...
Testing OCL3 transform...
Testing OCL4 transform...
Testing OCL5 transform...
Benchmarks completed (95.641 seconds).
Using OCL4 transform
Starting initialization...
Initialization complete (60.915 seconds).
Testing 194778^2097152+1...
Estimated time for 194778^2097152+1 is 53:30:00
maxErr exceeded for 194778^2097152+1, 1.0000 > 0.4500
maxErr exceeded while using OCL4; switching to OCL.
maxErr exceeded may be caused by overclocking, overheated GPUs and other transient errors.
Using OCL transform
Resuming 194778^2097152+1 from a checkpoint (23560318 iterations left)
Estimated time remaining for 194778^2097152+1 is 20:30:00
Successful computation progress with OCL; switching back to OCL4.
Using OCL4 transform
Resuming 194778^2097152+1 from a checkpoint (23498583 iterations left)
Estimated time remaining for 194778^2097152+1 is 24:00:00
geneferocl 3.3.2-7 (Windows/OpenCL/32-bit)
Copyright 2001-2017, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2017, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'Tahiti', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (2527.7)' and driver '2527.7'.
32 computeUnits @ 955MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=32kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1
Low priority change succeeded.
Using OCL4 transform
Resuming 194778^2097152+1 from a checkpoint (17732637 iterations left)
Estimated time remaining for 194778^2097152+1 is 15:50:00
geneferocl 3.3.2-7 (Windows/OpenCL/32-bit)
Copyright 2001-2017, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2017, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'Tahiti', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (2527.7)' and driver '2527.7'.
32 computeUnits @ 955MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=32kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1
Low priority change succeeded.
Using OCL4 transform
Resuming 194778^2097152+1 from a checkpoint (17650385 iterations left)
Estimated time remaining for 194778^2097152+1 is 16:20:00
Terminating because BOINC client requested that we should quit.
geneferocl 3.3.2-7 (Windows/OpenCL/32-bit)
Copyright 2001-2017, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2017, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'Tahiti', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (2527.7)' and driver '2527.7'.
32 computeUnits @ 955MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=32kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1
Low priority change succeeded.
Using OCL4 transform
Resuming 194778^2097152+1 from a checkpoint (17475424 iterations left)
Estimated time remaining for 194778^2097152+1 is 14:40:00
Terminating because BOINC client requested that we should quit.
geneferocl 3.3.2-7 (Windows/OpenCL/32-bit)
Copyright 2001-2017, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2017, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'Tahiti', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (2527.7)' and driver '2527.7'.
32 computeUnits @ 955MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=32kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: projects/www.primegrid.com/primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1
Low priority change succeeded.
Using OCL4 transform
Resuming 194778^2097152+1 from a checkpoint (16892736 iterations left)
Estimated time remaining for 194778^2097152+1 is 15:30:00
As you see it starts in OCL4, fails then switches to OCL then switches back to OCL4 then starts and stays there from then on. TThrottle is set to 82C now. I've run numerous genefer 21s as high as 88C successfully so heat shouldn't be the problem now. I tried -x OCL and nothing. What should the <app> to <app> contain in app_config? or other ways to force OCL?
EDIT: Even tried the following and still nothing: <cmdline>primegrid_genefer_3_3_2_3.18_windows_intelx86__openclatiGFN21.exe -boinc -q 194778^2097152+1 -x OCL</cmdline>
____________
Largest Primes to Date:
As Double Checker: SR5 109208*5^1816285+1 Dgts-1,269,534
As Initial Finder: SR5 243944*5^1258576-1 Dgts-879,713
| |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14045 ID: 53948 Credit: 485,621,119 RAC: 673,701
                               
|
<cmdline>-x OCL</cmdline>
____________
My lucky number is 75898524288+1 | |
|
|
<cmdline>-x OCL</cmdline>
It doesn't work. That was the exact command line I tried before starting this thread. I guess I'll let it finish using OCL4, but would like to know all info for app_config for the future.
<app>
................
................
................
................
................
</app>
Please fill in relevant info to make it work.
____________
Largest Primes to Date:
As Double Checker: SR5 109208*5^1816285+1 Dgts-1,269,534
As Initial Finder: SR5 243944*5^1258576-1 Dgts-879,713
| |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14045 ID: 53948 Credit: 485,621,119 RAC: 673,701
                               
|
Entire file for GFN21 on an Nvidia GPU:
<app_config>
<app>
<name>genefer</name>
<fraction_done_exact/>
<report_results_immediately/>
<max_concurrent>1</max_concurrent>
<gpu_versions>
<gpu_usage>1</gpu_usage>
<cpu_usage>.01</cpu_usage>
</gpu_versions>
</app>
<app_version>
<app_name><genefer/app_name>
<plan_class>OCLcudaGFN</plan_class>
<avg_ncpus>.01</avg_ncpus>
<ngpus>1</ngpus>
<cmdline>-x OCL</cmdline>
</app_version>
</app_config>
That file is a bit overzealous and includes several lines that may not be needed, but shouldn't hurt. The most likely reason it didn't work before is that you didn't include the correct plan_class tag.
A minimal file, that should also work, looks like this:
<app_config>
<app>
<name>genefer</name>
</app>
<app_version>
<app_name><genefer/app_name>
<plan_class>OCLcudaGFN</plan_class>
<cmdline>-x OCL</cmdline>
</app_version>
</app_config>
I haven't tested either personally, but both should work. Good luck both with the XML, and with fixing the problem with your GPU.
____________
My lucky number is 75898524288+1 | |
|
|
Thanks Mike.
____________
Largest Primes to Date:
As Double Checker: SR5 109208*5^1816285+1 Dgts-1,269,534
As Initial Finder: SR5 243944*5^1258576-1 Dgts-879,713
| |
|
Message boards :
Problems and Help :
cmdline to force OCL instead of OCL4 |