Call for wwwwcl beta testers (OpenCL)
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I have been working on an OpenCL version of wwww on and off for a few months. Progress has been slow because of a change in architecture and a lack of free time.
The original version started with the architecture of wwww. I quickly realized that the sieving code in wwww couldn't feed the GPU fast enough to keep it busy. I then turned to ppsieve. Although ppsieve has a faster sieve, it had the same problem. I then found primesieve (at code.google.com), which is the fastest sieve implementation I have found.
The end product is a framework called GPUSieve, meant for problems of this nature. By 'problem' I mean a program that sieves for primes and then, for each prime, does something with it on the GPU. It is written in C++ and is multi-threaded, which the other programs are not.
The new code is screaming fast compared to the current CPU-only code. I estimate that my ATI Radeon 5770 could complete a Wieferich range of 3.5e16 in a year, which is more than twice the total amount of work done on the search since it started in the 80's. The WallSunSun code should be able to do at least 4e15 in a year, but I haven't spent enough time testing that code.
I hope to release the source code tomorrow along with Windows and Mac builds. I only have the ATI Radeon 5770 to test on (and a really old ATI Radeon Mobility), so I am looking for people to help beta test on their hardware. Once I post the code, I will provide details on how I want you to test it.
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
I can test with several different NVidia GPUs. These projects require a 64-bit OS for the current PRPnet CPU app. For the OpenCL app, is this still true, or will 32-bit be possible?
____________
141941*2^4299438-1 is prime!
| |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
                         
|
NVIDIA GPUs also support OpenCL, so if you have an app, I am ready to test it.
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
|
My two HD5850 are ready for the testing. Impatiently waiting for the instructions.
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I can test with several different NVidia GPUs. These projects require a 64-bit OS for the current PRPnet CPU app. For the OpenCL app, is this still true, or will 32-bit be possible?
A 32-bit build is possible as there are no dependencies on 64-bit asm instructions. I haven't built or run a 32-bit version though.
| |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
I have a couple of machines with NVidia GPU in 32-bit Windows systems, so I can test it out if you want to build them.
____________
141941*2^4299438-1 is prime!
| |
|
|
Do I need DP? If not, I might be able to test on a couple of W7-64 machines; I have one each of the ATI Juniper and Redwood series.
|
BiBi Volunteer tester
Joined: 6 Mar 10 Posts: 151 ID: 56425 Credit: 34,290,031 RAC: 0
                   
|
I have a 4650 in a w7 x64, can it be used for testing? It has no DP support. | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Do I need DP? If not, I might be able to test on a couple of W7-64 machines; I have one each of the ATI Juniper and Redwood series.
No. There are no floating point operations in the code as the FPU would overflow with the numbers being tested. | |
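One way to see why the code sticks to integer arithmetic: a 64-bit double carries a 53-bit significand, so values of p near 1.3e17 (the range used in this thread) cannot be stored exactly, and p^2 is far larger still. The snippet below is an illustration in Python only, not taken from the wwwwcl source.

```python
# Illustration only (not from the wwwwcl source): doubles lose integer
# precision well below the values of p tested in this thread.
p = 133456892569421833          # a value of p taken from the sample output

as_double = float(p)            # nearest IEEE-754 double to p
round_trip = int(as_double)     # convert back to an exact integer

print(2 ** 53)                  # integers up to this bound are all exact as doubles
print(round_trip == p)          # False: p is odd, but doubles near p step by 16
print(p.bit_length())           # bits needed to hold p itself
print((p * p).bit_length())     # bits needed for p^2, the modulus in the test
```

The last line hints at why the GPU code presumably needs multi-word integer arithmetic: the modulus p^2 does not fit in a single 64-bit word either.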
|
Menipe Volunteer tester
Joined: 2 Jan 08 Posts: 235 ID: 17041 Credit: 112,898,330 RAC: 813
                       
|
I have a HD 6470 and an HD 6310 available to test.
____________
| |
|
|
I would be interested in the code. HD5770 and GTX580 | |
|
|
I have a HD5780 I'll test with | |
|
|
Some HD 5450, one old Mobility HD4330 and a couple of different Nvidia cards available for testing.
Regards Odi
____________
| |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I have posted the code (along with a Mac build) here: http://home.roadrunner.com/~mrodenkirch/wwwwcl.zip.
My estimate for 3.5e16 in a year was on an ATI Radeon HD 5870. I have a Windows box with an ATI Radeon 6450, but the program hangs on that. I suspect it is due to mixing a 32-bit exe with the 64-bit OpenCL library. I'll try to fix that in the coming days.
Since I can't provide a Windows build yet, there is a VS2008 solution included. If you have VS2008 (or newer) and want to build it, you just need to install the AMD APP OpenCL development toolkit.
I don't have access to any Linux systems, so I cannot build the code for Linux. I have also been lazy in that I didn't write a makefile. I built on the Mac with this command line:
gcc *.c *.cpp opencl/*.cpp wwww/*.cpp sieve/*.cpp -m64 -framework OpenCL -lstdc++ -O2 -o wwwwcl
I assume that on Linux you just need to replace the "-framework OpenCL" with the appropriate -I and -l options to include the OpenCL headers and libraries on your system.
Execution is fairly simple:
./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
where -t tells the program how many threads to execute and -b is the number of blocks per thread. Here is what to expect for output:
wwwwcl v2.0
Compiled Apr 29 2012 with GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.1.00)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456892569421833, 9.560M p/sec, 1.07 CPU cores, 92.6% done. ETA 29 Apr 18:52
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 265.35 sec. (1.85 init + 263.37 sieve) at 9557118 p/sec.
Processor time: 282.66 sec. (1.86 init + 280.80 sieve).
Seconds spent in CPU and GPU: 1066.99 (cpu), 124.43 (gpu)
Percent of time spent in CPU vs. GPU: 0.90 (cpu), 0.10 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.02 (devices)
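The -p and -P bounds in the command above are written as a mantissa followed by a power of ten, and the "Sieve started" line shows them expanded. A small sketch of that expansion, using exact integers rather than floats; parse_bound is a hypothetical helper for illustration, and how wwwwcl itself parses the switches is not shown in this thread.

```python
import re

def parse_bound(text: str) -> int:
    """Convert a bound such as '1334568e11' to an exact integer.

    Hypothetical helper: it mirrors the notation used on the command
    line in this thread, not the actual wwwwcl parser.
    """
    match = re.fullmatch(r"(\d+)(?:[eE](\d+))?", text)
    if match is None:
        raise ValueError(f"unsupported bound: {text!r}")
    mantissa = int(match.group(1))
    exponent = int(match.group(2) or 0)
    return mantissa * 10 ** exponent   # integer arithmetic, no rounding

p_min = parse_bound("1334568e11")
p_max = parse_bound("1334569e11")
print(p_min, p_max, p_max - p_min)
```

The printed bounds should match the "Sieve started" line, and the difference is the width of the range (1e11 here).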
You will see more detail with the -v switch. I recommend starting with a smaller number of threads and blocks, then varying them to find the optimal settings for your GPU.
As part of the testing you should run ranges that have known Wieferich/WallSunSun special instances and ranges without them and thus verify the output.
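For anyone who wants an independent check when verifying output: a Wieferich prime is a prime p with 2^(p-1) ≡ 1 (mod p^2), and only 1093 and 3511 are known. The following is a plain CPU reference in Python, a sketch for cross-checking small cases and not the method used inside wwwwcl.

```python
def is_wieferich(p: int) -> bool:
    """True if the prime p satisfies 2^(p-1) ≡ 1 (mod p^2).

    Reference check only; the caller is expected to pass a prime.
    """
    return pow(2, p - 1, p * p) == 1

known = [p for p in (1093, 3511) if is_wieferich(p)]
others = [p for p in (3, 5, 7, 11, 13) if is_wieferich(p)]
print(known)    # the two known Wieferich primes pass the test
print(others)   # these small primes do not
```

Python's three-argument pow does modular exponentiation on arbitrary-size integers, so the same function works for primes in the 1e17 range, only far more slowly than the GPU code.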
I think it very likely that this program will either not run or will crash for some of you and might hang the system of a few others. (I experienced a few hangs during my initial development, so you aren't alone.) In all seriousness I strongly suggest starting with small ranges, 1e8 or so, and small numbers of threads and blocks until you get more comfortable running the program and feel that it will run safely on your hardware.
What I'm looking for are problems compiling and linking, problems at runtime (i.e. OpenCL errors output by the program), and any other serious issues.
This code will NOT work with PRPNet, at least not yet. I have to make some changes to both the client and server to support this program and I really don't want to support it until the major kinks are worked out. | |
|
|
I downloaded the file from the given link and unpacked it. But I didn't find any executable files. Is it a Windows build?
Thank you. | |
|
|
here is the output from my iMac using an ATI Radeon HD 5750:
iMac-i7:wwwwcl philipp$ ./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Compiled Apr 29 2012 with GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.1.00)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456876215312659, 10.61M p/sec, 1.07 CPU cores, 76.2% done. ETA 01 May 00:23
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 241.49 sec. (2.27 init + 238.80 sieve) at 10501169 p/sec.
Processor time: 259.53 sec. (2.28 init + 257.25 sieve).
Seconds spent in CPU and GPU: 247.93 (cpu), 750.71 (gpu)
Percent of time spent in CPU vs. GPU: 0.25 (cpu), 0.75 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.61 (devices)
iMac-i7:wwwwcl philipp$
____________
| |
|
|
Download-32bit
Download-64bit
This is a win build. I don't know if it works because I have no ATI card.
|
|
I tested the compiled win versions. After an error because of a missing MSVCP100.dll I installed the Microsoft Visual C++ 2010 Redistributable Package and then the application started.
Test machine was an 8-core server with 3 HD5450, 6 CPU cores running but no other GPU work. I tested the same range as rogue:
x86:
C:\prpnet\prpclient-gpu1>wwwwcl_x86.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456803694770223, 1.542M p/sec, 2.34 CPU cores, 3.7% done. ETA 01 May 12:33
[...]
p=133456896430784287, 1.544M p/sec, 2.35 CPU cores, 96.4% done. ETA 01 May 12:33
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 1653.19 sec. (7.24 init + 1645.72 sieve) at 1533977 p/sec.
Processor time: 3835.47 sec. (10.75 init + 3824.72 sieve).
Seconds spent in CPU and GPU: 1663.88 (cpu), 6831.04 (gpu)
Percent of time spent in CPU vs. GPU: 0.20 (cpu), 0.80 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.82 (devices)
x64:
C:\prpnet\prpclient-gpu1>wwwwcl_x64.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456803664459411, 1.537M p/sec, 2.29 CPU cores, 3.7% done. ETA 01 May 13:02
[...]
p=133456899903374813, 1.541M p/sec, 2.34 CPU cores, 99.9% done. ETA 01 May 13:02
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 1654.47 sec. (6.33 init + 1647.78 sieve) at 1532791 p/sec.
Processor time: 3811.70 sec. (10.03 init + 3801.67 sieve).
Seconds spent in CPU and GPU: 1664.52 (cpu), 6872.51 (gpu)
Percent of time spent in CPU vs. GPU: 0.19 (cpu), 0.81 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.83 (devices)
Regards Odi
____________
| |
|
|
When I try to run the 64 bit Win build with
wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
I get an error message saying something like
The application could not be started (0xc000007b). Click "OK" to close the application.
Tried to run under Windows using an ATI Mobility Radeon HD 5470.
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Download-32bit
Download-64bit
This is a win build. I don't know if it works because I have no ATI card.
Thanks.
You should also be able to run the code on an NVIDIA card, although I don't know if you need the NVIDIA OpenCL toolkit for that.
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
If you use the -l option, wwwwcl will list the available platforms and devices on your system. The default is to use platform 0, device 0. You can override with the -p and -d options. You will need this if you have multiple GPUs on your system.
I ask you to test other ranges, not just the example I gave. I suggest that you look at special instances on the PRPNet server to get some ranges to test. You will need to use the -s option to change the threshold for special instances. It defaults to 100.
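The thread does not spell out what makes a prime a "special instance". The sketch below assumes the convention used in published near-Wieferich searches: write 2^((p-1)/2) ≡ ±1 + A·p (mod p^2) and report p when |A| is at most the threshold. Both function names are hypothetical, and whether wwwwcl computes A exactly this way is an assumption.

```python
def near_wieferich(p: int) -> tuple[int, int]:
    """Return (sign, A) with 2^((p-1)/2) ≡ sign + A*p (mod p^2), |A| <= p/2.

    Assumed convention, not confirmed by the wwwwcl source. p must be an
    odd prime.
    """
    r = pow(2, (p - 1) // 2, p * p)
    sign = 1 if r % p == 1 else -1      # Euler's criterion: r ≡ ±1 (mod p)
    a = ((r - sign) // p) % p           # r - sign is a multiple of p
    if a > p // 2:                      # centre A around zero
        a -= p
    return sign, a

def is_special_instance(p: int, threshold: int = 100) -> bool:
    """Hypothetical reading of -s: report p when |A| <= threshold."""
    return abs(near_wieferich(p)[1]) <= threshold

print(near_wieferich(1093), near_wieferich(3511))
```

Under this reading a true Wieferich prime has A = 0, and raising the threshold with -s simply lets more near misses through, which is useful for confirming that a range was actually processed.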
You should also vary the values for -t and -b so that you can determine the optimal settings for your GPU.
On some systems, the GPU is so fast that the GPU is starved for work. You can run multiple instances of wwwwcl concurrently to keep the GPU fed. To reach the 3.5e16 yearly throughput, I could run 3 instances of wwwwcl.
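As a rough sanity check on the three-instance figure, the numbers from the sample run posted earlier can be plugged into a short calculation. It assumes the prime density near 1.3e17 and the single-instance rate of that run are representative, which will not hold exactly for other ranges or cards.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600

# Figures copied from the sample run posted earlier in the thread.
primes_in_range = 2_535_955_284      # "primes tested"
range_width = 10 ** 11               # 1334568e11 .. 1334569e11
rate_per_instance = 9_557_118        # p/sec reported for one instance

# Assumption: prime density stays close to this value over the yearly range.
density = primes_in_range / range_width
yearly_range = 3.5e16
required_rate = yearly_range * density / SECONDS_PER_YEAR
instances = required_rate / rate_per_instance

print(f"required: {required_rate / 1e6:.1f}M p/sec, instances: {instances:.2f}")
```

The result comes out at roughly 28M p/sec, a little under three times the single-instance rate, which is consistent with running 3 instances.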
I can speed up the GPU code more, but it was more important for me to get this beta out first. I don't know when I'll get around to further speed-ups. | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
When I try to run the 64 bit Win build with
wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
I get an error message saying something like
The application could not be started (0xc000007b). Click "OK" to close the application.
Tried to run under Windows using an ATI Mobility Radeon HD 5470.
Do you have ZoneAlarm installed? I googled this message and the first hit implied that ZoneAlarm was the cause. | |
|
|
When I try to run the 64 bit Win build with
wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
I get an error message saying something like
The application could not be started (0xc000007b). Click "OK" to close the application.
Tried to run under Windows using an ATI Mobility Radeon HD 5470.
Do you have ZoneAlarm installed? I googled this message and the first hit implied that ZoneAlarm was the cause.
No I haven't. I also googled a bit and found this. I will check whether that does something. | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
Neither windows build will run for me. I get an immediate program "stopped working" message without any output or a log file. This happens when I run with just the -l option.
i7-860 (12GB RAM)
GPU #1 - GT 530
GPU #2 - 9600 GSO
Win7 Enterprise x64
| |
|
Menipe Volunteer tester
Joined: 2 Jan 08 Posts: 235 ID: 17041 Credit: 112,898,330 RAC: 813
                       
|
I tested the compiled win versions.
Test machine was an i7-2720QM with HD6470M running Win 7 x64 Enterprise, I tested the same range as rogue:
x86:
D:\Download\Primegrid\wwwwcl_x86>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456808338228321, 3.505M p/sec, 0.46 CPU cores, 8.3% done. ETA 01 May 10:4
[...]
p=133456893155059109, 3.525M p/sec, 0.48 CPU cores, 93.2% done. ETA 01 May 10:49
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 722.24 sec. (2.62 init + 719.31 sieve) at 3511230 p/sec.
Processor time: 342.64 sec. (3.42 init + 339.22 sieve).
Seconds spent in CPU and GPU: 733.97 (cpu), 3141.34 (gpu)
Percent of time spent in CPU vs. GPU: 0.19 (cpu), 0.81 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.86 (devices)
x64:
D:\Download\Primegrid\wwwwclx64>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456808459423041, 3.521M p/sec, 0.29 CPU cores, 8.5% done. ETA 01 May 10:2
[...]
p=133456892841961327, 3.495M p/sec, 0.44 CPU cores, 92.8% done. ETA 01 May 10:27
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 725.67 sec. (1.75 init + 723.56 sieve) at 3494643 p/sec.
Processor time: 303.55 sec. (2.61 init + 300.94 sieve).
Seconds spent in CPU and GPU: 736.97 (cpu), 3192.85 (gpu)
Percent of time spent in CPU vs. GPU: 0.19 (cpu), 0.81 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.87 (devices)
Regards Menipe
____________
| |
|
|
Hmm, when checking wwwwcl.exe with Dependency Walker it says there is a file missing called sysntfy.dll | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Hmm, when checking wwwwcl.exe with Dependency Walker it says there is a file missing called sysntfy.dll
A Google search revealed this description:
Windows Notifications Dynamic Link Library
I have no idea what would be referencing it. | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Neither windows build will run for me. I get an immediate program "stopped working" message without any output or a log file. This happens when I run with just the -l option.
i7-860 (12GB RAM)
GPU #1 - GT 530
GPU #2 - 9600 GSO
Win7 Enterprise x64
That isn't very helpful. If any of the OpenCL calls fail, then there should be a message echoed to the window before the program stops. I probably need to add more diagnostic messages when using the -v option.
Does anyone here have Visual Studio installed AND an NVIDIA GPU? It would be a great help to me if you can investigate this.
| |
|
|
Hmm, when checking wwwwcl.exe with Dependency Walker it says there is a file missing called sysntfy.dll
A Google search revealed this description:
Windows Notifications Dynamic Link Library
I have no idea what would be referencing it.
http://msoos.wordpress.com/2010/09/11/0xc000007b-error-solution/ says the error 0xc000007b is caused by a 32bit application that tries to load a 64bit dll. | |
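If a 32-bit/64-bit mismatch is suspected, the architecture of an .exe or .dll can be read straight from its PE header: the 4-byte offset at 0x3C points to the "PE\0\0" signature, and the 2-byte Machine field follows it (0x014C for x86, 0x8664 for x64). A minimal Python sketch; the file names in the usage note are examples only and may differ on your system.

```python
import struct

MACHINE_NAMES = {0x014C: "x86 (32-bit)", 0x8664: "x64 (64-bit)"}

def pe_machine(data: bytes) -> str:
    """Return the target architecture recorded in a PE file's COFF header."""
    if data[:2] != b"MZ":
        raise ValueError("not a PE file (missing MZ signature)")
    (pe_offset,) = struct.unpack_from("<I", data, 0x3C)   # e_lfanew
    if data[pe_offset:pe_offset + 4] != b"PE\0\0":
        raise ValueError("PE signature not found")
    (machine,) = struct.unpack_from("<H", data, pe_offset + 4)
    return MACHINE_NAMES.get(machine, hex(machine))

def pe_machine_of_file(path: str) -> str:
    """Read the start of a file and report its architecture."""
    with open(path, "rb") as handle:
        return pe_machine(handle.read(4096))
```

Comparing, say, pe_machine_of_file("wwwwcl.exe") with the result for the OpenCL.dll that Windows would load should show quickly whether the two match.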
|
|
Neither windows build will run for me. I get an immediate program "stopped working" message without any output or a log file. This happens when I run with just the -l option.
i7-860 (12GB RAM)
GPU #1 - GT 530
GPU #2 - 9600 GSO
Win7 Enterprise x64
That isn't very helpful. If any of the OpenCL calls fail, then there should be a message echoed to the window before the program stops. I probably need to add more diagnostic messages when using the -v option.
Does anyone here have Visual Studio installed AND an NVIDIA GPU? It would be a great help to me if you can investigate this.
I am trying to compile the NVIDIA app but got an error. I need more time to investigate what causes this issue.
|
samuel7 Volunteer tester
Joined: 1 May 09 Posts: 89 ID: 39425 Credit: 257,425,010 RAC: 0
                    
|
Rebirther, your 64-bit app works fine on my NVIDIA card:
Win 7 Ultimate (64-bit)
i7 875K @2.93GHz (HT disabled, no other CPU tasks running)
9800 GT
Visual C++ 2010 Redistributable Package (x64) installed.
Examples:
d:\man\wwww>wwwwcl.exe -p5423711e9 -P5423715e9 -TWieferich -t2 -b3000 -s1000
wwwwcl v2.0
Sieve started: 5423711000000000 <= p < 5423715000000000
5423711968552697 is a special instance (+1 -488 p)
Sieve complete: 5423711000000001 <= p < 5423715000000000 110405720 primes tested
Elapsed time: 17.61 sec. (0.22 init + 17.00 sieve) at 6268625 p/sec.
Processor time: 24.60 sec. (0.22 init + 24.38 sieve).
Seconds spent in CPU and GPU: 17.60 (cpu), 25.24 (gpu)
Percent of time spent in CPU vs. GPU: 0.41 (cpu), 0.59 (gpu)
CPU/GPU utilization: 0.51 (cores), 0.73 (devices)
d:\man\wwww>wwwwcl.exe -p14490200e9 -P14490204e9 -TWieferich -t2 -b3000 -s1000
wwwwcl v2.0
Sieve started: 14490200000000000 <= p < 14490204000000000
Sieve complete: 14490200000000001 <= p < 14490204000000000 107488122 primes tested
Elapsed time: 17.51 sec. (0.36 init + 16.82 sieve) at 6139807 p/sec.
Processor time: 24.76 sec. (0.36 init + 24.40 sieve).
Seconds spent in CPU and GPU: 17.53 (cpu), 25.68 (gpu)
Percent of time spent in CPU vs. GPU: 0.41 (cpu), 0.59 (gpu)
CPU/GPU utilization: 0.51 (cores), 0.75 (devices)
No errors after installing the libraries from MS.
CPU use was 33 % (of 4 cores) for -t values 2 and above.
GPU-Z showed 96 to 98 % GPU usage so this old card can be fed by just one instance of the program. With just one thread, GPU usage was between 60 and 70 %.
Tried different -b values from 500 to 5000 and the differences were minimal but 3000 appeared to be best.
Will post results on a GTX 560 Ti later.
____________
| |
|
|
The win64 bit app does not work at all on my win7 64bit with an nvidia card (gtx 570)... 32bit did work and this is what the log says:
.\wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 960000 = 2000 * 32 * 15 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456879344454091, 16.76M p/sec, 1.41 CPU cores, 79.3% done. ETA 01 May 18:30
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 153.78 sec. (1.39 init + 152.37 sieve) at 16490501 p/sec.
Processor time: 216.31 sec. (1.39 init + 214.92 sieve).
Seconds spent in CPU and GPU: 620.32 (cpu), 58.36 (gpu)
Percent of time spent in CPU vs. GPU: 0.91 (cpu), 0.09 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.02 (devices)
So it works it seems. Top marks! Though I am slightly worried about the cpu usage. It almost used two full cores. Is that worth switching over to GPU for?
Edit: System: Win 7 64bit, i5 2500k, gtx 570, nothing else eating away at the cpu.
____________
PrimeGrid Challenge Overall standings --- Last update: From Pi to Paddy (2016)
| |
|
|
@all: The win app is only for ATI cards; on NVIDIA cards you will get an error about a missing ATI driver file and the app will crash.
From reading this thread, some people got it running on NVIDIA cards. I don't know how or why.
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
So it works it seems. Top marks! Though I am slightly worried about the cpu usage. It almost used two full cores. Is that worth switching over to GPU for?
Edit: System: Win 7 64bit, i5 2500k, gtx 570, nothing else eating away at the cpu.
Note that the Wieferich GPU code on your box is probably 10x (or more) faster than the CPU version of the same code, so even if it takes more than 1 CPU core, it is still better to use a little extra CPU to feed the GPU.
Try running with -TWallSunSun. That code relies more heavily on the GPU than the Wieferich code.
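For reference, a Wall-Sun-Sun prime is a prime p for which p^2 divides the Fibonacci number F(p - (p|5)), where (p|5) is the Legendre symbol; none is known. The sketch below is a plain CPU check in Python using fast doubling, intended only for cross-checking small cases. It is not the GPU kernel, and the function names are illustrative.

```python
def fib_pair(n: int, m: int) -> tuple[int, int]:
    """Return (F(n) mod m, F(n+1) mod m) using fast doubling."""
    if n == 0:
        return (0, 1)
    a, b = fib_pair(n // 2, m)             # a = F(k), b = F(k+1), k = n // 2
    c = (a * ((2 * b - a) % m)) % m        # F(2k)   = F(k) * (2F(k+1) - F(k))
    d = (a * a + b * b) % m                # F(2k+1) = F(k)^2 + F(k+1)^2
    if n % 2 == 0:
        return (c, d)
    return (d, (c + d) % m)

def legendre_5(p: int) -> int:
    """Legendre symbol (p|5): 1 if p ≡ ±1 (mod 5), -1 if p ≡ ±2, 0 if p = 5."""
    r = p % 5
    if r == 0:
        return 0
    return 1 if r in (1, 4) else -1

def is_wall_sun_sun(p: int) -> bool:
    """True if p^2 divides F(p - (p|5)); the caller should pass a prime."""
    index = p - legendre_5(p)
    return fib_pair(index, p * p)[0] == 0

print([p for p in (7, 11, 13, 17, 19) if is_wall_sun_sun(p)])
```

Each prime needs a chain of about log2(p) multiplications modulo p^2, which is the kind of per-prime work the framework hands to the GPU.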
| |
|
samuel7 Volunteer tester
Joined: 1 May 09 Posts: 89 ID: 39425 Credit: 257,425,010 RAC: 0
                    
|
@all: The win app is only for ATI cards, you will get an error with a missing ati driver file and the app is crashing on nvidia cards.
Through reading here, some got it running on nvidia cards. I dont know how and why???
I needed only to make sure the libraries in the system matched the app (32 vs 64-bit). | |
|
|
@all: The win app is only for ATI cards, you will get an error with a missing ati driver file and the app is crashing on nvidia cards.
Through reading here, some got it running on nvidia cards. I dont know how and why???
I needed only to make sure the libraries in the system matched the app (32 vs 64-bit).
Can you make a list of libs and your driver version pls? | |
|
Menipe Volunteer tester
Joined: 2 Jan 08 Posts: 235 ID: 17041 Credit: 112,898,330 RAC: 813
                       
|
I tested the compiled win versions.
Test machine was an AMD E-350 with HD6310 running Win 7 x64 Ultimate, I tested the same range as rogue:
x86:
C:\PrimeGrid\wwwwcl_x86>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456803008172223, 1.256M p/sec, 0.39 CPU cores, 3.0% done. ETA 01 May 12:3
[...]
p=133456899464309017, 717.9K p/sec, 0.23 CPU cores, 99.5% done. ETA 01 May 12:37
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 2291.52 sec. (15.80 init + 2275.06 sieve) at 1106670 p/sec.
Processor time: 814.23 sec. (23.07 init + 791.16 sieve).
Seconds spent in CPU and GPU: 2312.30 (cpu), 9633.53 (gpu)
Percent of time spent in CPU vs. GPU: 0.19 (cpu), 0.81 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.84 (devices)
x64:
C:\PrimeGrid\wwwwcl>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456802942585581, 1.239M p/sec, 0.38 CPU cores, 2.9% done. ETA 01 May 11:2
[...]
p=133456898591230813, 1.241M p/sec, 0.35 CPU cores, 98.6% done. ETA 01 May 11:21
Sieve interrupted: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 2054.40 sec. (13.42 init + 2040.64 sieve) at 1234403 p/sec.
Processor time: 785.34 sec. (21.81 init + 763.53 sieve).
Seconds spent in CPU and GPU: 2074.24 (cpu), 8616.28 (gpu)
Percent of time spent in CPU vs. GPU: 0.19 (cpu), 0.81 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.84 (devices)
Regards Menipe
____________
| |
|
samuel7 Volunteer tester
Joined: 1 May 09 Posts: 89 ID: 39425 Credit: 257,425,010 RAC: 0
                    
|
@all: The win app is only for ATI cards, you will get an error with a missing ati driver file and the app is crashing on nvidia cards.
Through reading here, some got it running on nvidia cards. I dont know how and why???
I needed only to make sure the libraries in the system matched the app (32 vs 64-bit).
Can you make a list of libs and your driver version pls?
Well, I installed the package from MS and it worked immediately. Links: x64 and x86
You may need to copy the files msvcp100.dll and msvcr100.dll from 'Windows\system32' to the folder of the executable. That Dependency Walker utility (available via the link Toshio posted) could be useful.
NVIDIA driver is 290.53 | |
|
|
@all: The win app is only for ATI cards, you will get an error with a missing ati driver file and the app is crashing on nvidia cards.
Through reading here, some got it running on nvidia cards. I dont know how and why???
I needed only to make sure the libraries in the system matched the app (32 vs 64-bit).
Can you make a list of libs and your driver version pls?
Well, I installed the package from MS and it worked immediately. Links: x64 and x86
You may need to copy the files msvcp100.dll and msvcr100.dll from 'Windows\system32' to the folder of the executable. That Dependency Walker utility (available via the link Toshio posted) could be useful.
NVIDIA driver is 290.53
I already have both files and it's not working. How did you run it? I have a batch file + exe, that's all.
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
@all: The win app is only for ATI cards, you will get an error with a missing ati driver file and the app is crashing on nvidia cards.
Through reading here, some got it running on nvidia cards. I dont know how and why???
I needed only to make sure the libraries in the system matched the app (32 vs 64-bit).
Can you make a list of libs and your driver version pls?
Well, I installed the package from MS and it worked immediately. Links: x64 and x86
You may need to copy the files msvcp100.dll and msvcr100.dll from 'Windows\system32' to the folder of the executable. That Dependency Walker utility (available via the link Toshio posted) could be useful.
NVIDIA driver is 290.53
I have already both files and its not working. How did you run it? I have a batch file + exe, thats all.
x64 version was already installed on mine with it not working (ran from command line).
____________
141941*2^4299438-1 is prime!
| |
|
|
x64 version was already installed on mine with it not working (ran from command line).
Do you get a popup about the ATI driver?
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
Do you got a popup with ati driver?
Nope...nothing but the program stopped working warning from win7.
Heading home soon, so I'll try it on a different box and GPU.
| |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
                         
|
Intel i5-2500k
Nvidia 560Ti ( beta 301.xx drivers)
windows 7 64 bit
64 bit app wwwwcl 2.0
GPU usage is about 31 %
and uses over one core (35%) on a quad-core processor.
wwwwcl -p1334568e11 -P1334569e11 -TW
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456886350927309, 12.00M p/sec, 1.39 CPU cores, 86.4%
Sieve complete: 133456800000000001 <= p < 133456900000000000
Elapsed time: 214.60 sec. (2.41 init + 212.16 sieve) at 1181
Processor time: 297.20 sec. (2.87 init + 294.33 sieve).
Seconds spent in CPU and GPU: 441.44 (cpu), 83.89 (gpu)
Percent of time spent in CPU vs. GPU: 0.84 (cpu), 0.16 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.04 (devices)
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
|
Do you got a popup with ati driver?
Nope...nothing but the program stopped working warning from win7.
Heading home soon, so I'll try it on a different box and GPU.
I found the problem. The AMD SDK had to be uninstalled. Now it's running.
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
If your elapsed time is much greater than processor time, then you should reduce the number of threads. | |
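The rule of thumb above can be applied to a saved log with a few lines of Python. This is a sketch that relies only on the "Elapsed time" and "Processor time" summary lines shown in this thread; the cut-off of 2.0 for "much greater" is an arbitrary assumption, not a value from rogue.

```python
import re

def time_ratio(log_text: str) -> float:
    """Return elapsed time divided by processor time from a wwwwcl summary."""
    elapsed = re.search(r"Elapsed time:\s*([\d.]+) sec", log_text)
    processor = re.search(r"Processor time:\s*([\d.]+) sec", log_text)
    if elapsed is None or processor is None:
        raise ValueError("summary lines not found")
    return float(elapsed.group(1)) / float(processor.group(1))

def should_reduce_threads(log_text: str, threshold: float = 2.0) -> bool:
    """Assumed cut-off: suggest fewer threads when the ratio exceeds threshold."""
    return time_ratio(log_text) > threshold

sample = """Elapsed time: 2709.43 sec. (8.74 init + 2700.19 sieve) at 935975 p/sec.
Processor time: 560.51 sec. (14.35 init + 546.16 sieve)."""
print(round(time_ratio(sample), 2), should_reduce_threads(sample))
```

The sample is the Mobility Radeon 5470 run from this thread, where elapsed time is several times the processor time; runs where processor time exceeds elapsed time fall well under the cut-off.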
|
|
x64
Win7 x64 q6600@2,4 ATI 5970 (725/500 core/mem)
GPU Usage ~40%
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456879950227677, 11.39M p/sec, 1.03 CPU cores, 80.0% done. ETA 01 May 21:11
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 283.68 sec. (55.58 init + 227.99 sieve) at 8939350 p/sec.
Processor time: 273.98 sec. (43.01 init + 230.98 sieve).
Seconds spent in CPU and GPU: 970.07 (cpu), 98.38 (gpu)
Percent of time spent in CPU vs. GPU: 0.91 (cpu), 0.09 (gpu)
CPU/GPU utilization: 0.21 (cores), 0.02 (devices) | |
|
|
Win7 X64
CPU: i7 920@4.0GHz + MSI 5870 @ Default
D:\>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456847243740217, 19.66M p/sec, 1.05 CPU cores, 47.2% done. ETA 01 May 21:
p=133456894588431563, 19.68M p/sec, 1.04 CPU cores, 94.6% done. ETA 01 May 21:29
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 130.21 sec. (1.34 init + 128.79 sieve) at 19476269 p/sec.
Processor time: 136.80 sec. (1.98 init + 134.82 sieve).
Seconds spent in CPU and GPU: 524.04 (cpu), 70.41 (gpu)
Percent of time spent in CPU vs. GPU: 0.88 (cpu), 0.12 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.03 (devices)
GPU Usage ~53% | |
|
|
Win7 x64 ATI Mobility Radeon 5470
GPU Load ~98%
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456899383557213, 936.7K p/sec, 0.20 CPU cores, 99.4% done. ETA 01 May 22:33
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 2709.43 sec. (8.74 init + 2700.19 sieve) at 935975 p/sec.
Processor time: 560.51 sec. (14.35 init + 546.16 sieve).
Seconds spent in CPU and GPU: 2728.36 (cpu), 12439.80 (gpu)
Percent of time spent in CPU vs. GPU: 0.18 (cpu), 0.82 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.91 (devices) | |
|
|
Win7 x64 Radeon 7970@1080 mhz load 19%
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456826811639179, 11.21M p/sec, 1.04 CPU cores, 26.8% done. ETA 01 May 22:
p=133456853946666931, 11.23M p/sec, 1.04 CPU cores, 53.9% done. ETA 01 May 22:
p=133456880919407209, 11.24M p/sec, 1.04 CPU cores, 80.9% done. ETA 01 May 22:36
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 228.27 sec. (2.64 init + 225.53 sieve) at 11109501 p/sec.
Processor time: 237.76 sec. (3.68 init + 234.08 sieve).
Seconds spent in CPU and GPU: 917.40 (cpu), 48.97 (gpu)
Percent of time spent in CPU vs. GPU: 0.95 (cpu), 0.05 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.01 (devices)
| |
|
|
Win 7 Pro 64bit / i7-2700K stock speed / GTX570 stock speed (more details at the end if you're interested)
E:\PrimeGRID\PRPNet>wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456891421117711, 19.13M p/sec, 1.48 CPU cores, 91.4% done. ETA 01 May 22:35
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 133.96 sec. (1.21 init + 132.72 sieve) at 18931021 p/sec.
Processor time: 198.81 sec. (1.62 init + 197.19 sieve).
Seconds spent in CPU and GPU: 530.88 (cpu), 61.13 (gpu)
Percent of time spent in CPU vs. GPU: 0.90 (cpu), 0.10 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.02 (devices)
Good luck rogue,
Peter
NVIDIA System Information report created on: 05/01/2012 22:37:51
System name: FT03B
[Display]
Operating System: Windows 7 Professional, 64-bit (Service Pack 1)
DirectX version: 11.0
GPU processor: GeForce GTX 570
Driver version: 285.62
DirectX support: 11
CUDA Cores: 480
Core clock: 750 MHz
Shader clock: 1500 MHz
Memory clock: 1950 MHz (3900 MHz data rate)
Memory interface: 320-bit
Total available graphics memory: 4095 MB
Dedicated video memory: 1280 MB GDDR5
System video memory: 0 MB
Shared system memory: 2815 MB
Video BIOS version: 70.10.55.00.01
IRQ: 16
Bus: PCI Express x16 Gen2 | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
|
GT440 (OEM version - 144 shaders)
i7920 @ stock (HT on)
Vista Home Premium x64
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456809592339903, 3.993M p/sec, 0.60 CPU cores, 9.6% done. ETA 01 May 18:4
p=133456818669893687, 3.774M p/sec, 0.58 CPU cores, 18.7% done. ETA 01 May 18:
p=133456827687293959, 3.748M p/sec, 0.57 CPU cores, 27.7% done. ETA 01 May 18:
p=133456835803974653, 3.375M p/sec, 0.51 CPU cores, 35.8% done. ETA 01 May 18:
p=133456844041252763, 3.424M p/sec, 0.52 CPU cores, 44.0% done. ETA 01 May 18:
p=133456851983485423, 3.303M p/sec, 0.50 CPU cores, 52.0% done. ETA 01 May 18:
p=133456860984828163, 3.720M p/sec, 0.57 CPU cores, 61.0% done. ETA 01 May 18:
p=133456867162916653, 2.583M p/sec, 0.39 CPU cores, 67.2% done. ETA 01 May 18:
p=133456874294725907, 2.964M p/sec, 0.45 CPU cores, 74.3% done. ETA 01 May 18:
p=133456881547960319, 3.017M p/sec, 0.46 CPU cores, 81.5% done. ETA 01 May 18:
p=133456888854346549, 3.037M p/sec, 0.46 CPU cores, 88.9% done. ETA 01 May 18:
p=133456896509054939, 3.183M p/sec, 0.48 CPU cores, 96.5% done. ETA 01 May 18:50
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 792.87 sec. (33.07 init + 759.75 sieve) at 3198436 p/sec.
Processor time: 414.32 sec. (29.81 init + 384.51 sieve).
Seconds spent in CPU and GPU: 810.69 (cpu), 742.99 (gpu)
Percent of time spent in CPU vs. GPU: 0.52 (cpu), 0.48 (gpu)
CPU/GPU utilization: 0.21 (cores), 0.19 (devices)
Running PFGW64 doing SR5 tasks on all 8 threads while testing.
| |
|
|
I've made some code changes to increase GPU load: I changed the single-threaded sieve to a multi-threaded one. Here is what I got.
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t6 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456849813392709, 41.95M p/sec, 5.65 CPU cores, 49.8% done. ETA 02 May 01:21
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 65.63 sec. (4.77 init + 60.76 sieve) at 38640587 p/sec.
Processor time: 364.20 sec. (23.12 init + 341.08 sieve).
Seconds spent in CPU and GPU: 270.71 (cpu), 49.33 (gpu)
Percent of time spent in CPU vs. GPU: 0.85 (cpu), 0.15 (gpu)
CPU/GPU utilization: 0.17 (cores), 0.03 (devices)
If anyone wants to test, the binary is here (32-bit Windows): http://dl.dropbox.com/u/1452459/wwwwcl.exe | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
I've made some code changes to increase GPU load: I changed the single-threaded sieve to a multi-threaded one. Here is what I got.
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t6 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456849813392709, 41.95M p/sec, 5.65 CPU cores, 49.8% done. ETA 02 May 01:21
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 65.63 sec. (4.77 init + 60.76 sieve) at 38640587 p/sec.
Processor time: 364.20 sec. (23.12 init + 341.08 sieve).
Seconds spent in CPU and GPU: 270.71 (cpu), 49.33 (gpu)
Percent of time spent in CPU vs. GPU: 0.85 (cpu), 0.15 (gpu)
CPU/GPU utilization: 0.17 (cores), 0.03 (devices)
If anyone wants to test, the binary is here (32-bit Windows): http://dl.dropbox.com/u/1452459/wwwwcl.exe
Please do not post exes without code. I maintain the code base and the official builds (with help from rebirther). If you don't mind, could you send me the code change you made? I prefer e-mail rather than the forum or PM. My e-mail is rogue @ wi.rr.com. | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
I think I know what you changed. I had removed ParallelPrimeSieve from the sieve folder since I wasn't using it. I can re-add it then have App.cpp instantiate it instead of PrimeSieve. Nevertheless you can send me your code changes just in case you did something differently. The biggest PITB might be dependencies within ParallelPrimeSieve for other libraries. | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
|
If your elapsed time is much greater than processor time, then you should reduce the number of threads.
This might not hold true for the Fermi NVidia cards. Using -t4 or -t2 slows things down slightly on my GT440 OEM card compared to -t5. Increasing to -t8 (and using -b3000), however, almost doubles performance (and -t16 crashes the system).
____________
141941*2^4299438-1 is prime!
| |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
If your elapsed time is much greater than processor time, then you should reduce the number of threads.
This might not hold true for the Fermi NVidia cards. Using -t4 or -t2 slows things down slightly on my GT440 OEM card compared to -t5. Increasing to -t8 (and using -b3000), however, almost doubles performance (and -t16 crashes the system).
I suspect that -t16 might be running out of GPU memory although there is code to check for that condition. | |
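The rule of thumb quoted above (compare elapsed time with processor time) can be checked mechanically from the two summary lines. The sketch below is only an illustration: the helper names and the 1.5 cut-off for "much greater" are assumptions, not anything defined by wwwwcl, and as noted above the rule may not hold on Fermi cards. The sample text is the GT440 run posted earlier in the thread.

```python
import re

# Two summary lines copied from the GT440 run earlier in the thread.
SUMMARY = """\
Elapsed time: 792.87 sec. (33.07 init + 759.75 sieve) at 3198436 p/sec.
Processor time: 414.32 sec. (29.81 init + 384.51 sieve).
"""


def total_seconds(label, text):
    """Extract the total seconds that follow 'label:' in a summary block."""
    pattern = rf"^{re.escape(label)}:\s+([\d.]+) sec\."
    match = re.search(pattern, text, re.MULTILINE)
    if match is None:
        raise ValueError(f"no '{label}' line found")
    return float(match.group(1))


def elapsed_to_processor_ratio(text):
    """Ratio of wall-clock time to CPU time for one run."""
    return total_seconds("Elapsed time", text) / total_seconds("Processor time", text)


ratio = elapsed_to_processor_ratio(SUMMARY)
MUCH_GREATER = 1.5  # hypothetical threshold, chosen only for this example

if ratio > MUCH_GREATER:
    print(f"elapsed/processor = {ratio:.2f}: try different -t values and compare")
else:
    print(f"elapsed/processor = {ratio:.2f}: thread count looks reasonable")
```

A high ratio is only a hint to experiment; the final judge is still the p/sec figure for each -t/-b combination.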
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
I've made some code changes to increase GPU load: I changed the single-threaded sieve to a multi-threaded one. Here is what I got.
Please do not post exes without code. I maintain the code base and the official builds (with help from rebirther). If you don't mind, could you send me the code change you made? I prefer e-mail rather than the forum or PM. My e-mail is rogue @ wi.rr.com.
BTW, I'm sorry if I came across a little harsh. It is very important to me (as an open source software developer) that all source code on projects I support is readily available. It is also important to me that I sanction builds because I have little control over the development environment that everyone is using. It isn't that big a deal for wwwwcl, but it is for programs like pfgw. pfgw can be messed up easily if someone is not careful as it has so many dependencies on third party libraries. When that happens I'm put on the hook for someone else's code (and mistakes) when I have no idea what they changed. | |
|
|
This is Rebirther's 64-bit app on 64-bit Win7 with an ATI 5800 series card.
C:\Users\Me\Downloads\wwwwcl>wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456823257959577, 9.814M p/sec, 1.04 CPU cores, 23.3% done. ETA 01 May 21:
p=133456846880334179, 9.794M p/sec, 1.04 CPU cores, 46.9% done. ETA 01 May 21:
p=133456872227676061, 10.56M p/sec, 1.04 CPU cores, 72.2% done. ETA 01 May 21:
p=133456898212588381, 10.78M p/sec, 1.04 CPU cores, 98.2% done. ETA 01 May 21:03
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 250.35 sec. (3.03 init + 247.23 sieve) at 10129465 p/sec.
Processor time: 261.27 sec. (4.43 init + 256.84 sieve).
Seconds spent in CPU and GPU: 1006.04 (cpu), 93.97 (gpu)
Percent of time spent in CPU vs. GPU: 0.91 (cpu), 0.09 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.02 (devices)
| |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
|
Rebirther's 64bit app on 64bit Win7, GTX580 with driver version 266.58 and CUDAToolkit 3.2
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
The 64-bit version creates an empty wwww.log.
The 32-bit version does nothing at all (no output, no errors).
After updating the driver to 301.32, the 64-bit version runs fine.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456879465595863, 16.55M p/sec, 1.28 CPU cores, 79.5% done. ETA 02 May 10:19
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 154.42 sec. (1.23 init + 153.16 sieve) at 16422507 p/sec.
Processor time: 198.17 sec. (1.73 init + 196.44 sieve).
Seconds spent in CPU and GPU: 464.30 (cpu), 42.15 (gpu)
Percent of time spent in CPU vs. GPU: 0.92 (cpu), 0.08 (gpu)
CPU/GPU utilization: 0.25 (cores), 0.02 (devices)
EDIT: Sesef's version doesn't work; perhaps a 64-bit version would.
____________
My stats | |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
|
Sesef version
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t6 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456892074820551, 13.19M p/sec, 1.41 CPU cores, 92.1% done. ETA 02 May 12:42
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 240.01 sec. (22.85 init + 217.09 sieve) at 10566198 p/sec.
Processor time: 311.66 sec. (24.01 init + 287.65 sieve).
Seconds spent in CPU and GPU: 384.62 (cpu), 658.20 (gpu)
Percent of time spent in CPU vs. GPU: 0.37 (cpu), 0.63 (gpu)
CPU/GPU utilization: 0.18 (cores), 0.30 (devices)
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t8 -b3000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456882799494283, 31.45M p/sec, 2.83 CPU cores, 82.8% done. ETA 02 May 12:44
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 93.85 sec. (6.44 init + 87.38 sieve) at 27021435 p/sec.
Processor time: 246.70 sec. (8.99 init + 237.71 sieve).
Seconds spent in CPU and GPU: 257.82 (cpu), 321.73 (gpu)
Percent of time spent in CPU vs. GPU: 0.44 (cpu), 0.56 (gpu)
CPU/GPU utilization: 0.13 (cores), 0.16 (devices)
Rogue 64 bit
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t6 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456895638206683, 19.89M p/sec, 1.50 CPU cores, 95.6% done. ETA 02 May 12:54
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 128.09 sec. (0.59 init + 127.48 sieve) at 19797948 p/sec.
Processor time: 191.58 sec. (0.59 init + 190.99 sieve).
Seconds spent in CPU and GPU: 575.66 (cpu), 0.05 (gpu)
Percent of time spent in CPU vs. GPU: 1.00 (cpu), 0.00 (gpu)
CPU/GPU utilization: 0.17 (cores), 0.00 (devices)
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t8 -b3000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456894820528821, 19.82M p/sec, 1.40 CPU cores, 94.8% done. ETA 02 May 12:56
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 128.47 sec. (0.58 init + 127.86 sieve) at 19740249 p/sec.
Processor time: 179.96 sec. (0.58 init + 179.39 sieve).
Seconds spent in CPU and GPU: 785.99 (cpu), 51.53 (gpu)
Percent of time spent in CPU vs. GPU: 0.94 (cpu), 0.06 (gpu)
CPU/GPU utilization: 0.13 (cores), 0.01 (devices)
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
If you use too many threads, I think that some of the additional GPU time is spent waiting to execute on the GPU. It would be interesting if you varied -t and -b more. For example, try -t3 -b10000.
The error with clGetKernelWorkGroupInfo is due to the requirement for OpenCL 1.1. The code in wwwwcl is not supported by OpenCL 1.0. Can anyone tell me the minimum driver needed to support OpenCL 1.1? I'll add that information to the readme. | |
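A driver's OpenCL level can be read from the version string it reports (the value returned for CL_DEVICE_VERSION or CL_PLATFORM_VERSION), which the OpenCL specification formats as "OpenCL major.minor vendor-specific-text". The sketch below only parses such a string and compares it against 1.1; the function names and the sample strings are illustrative assumptions, and obtaining the string from a real device is left to whatever OpenCL binding is at hand.

```python
import re


def parse_opencl_version(version_string):
    """Return (major, minor) from a string such as 'OpenCL 1.1 <vendor text>'."""
    match = re.match(r"OpenCL (\d+)\.(\d+)", version_string)
    if match is None:
        raise ValueError(f"not an OpenCL version string: {version_string!r}")
    return int(match.group(1)), int(match.group(2))


def supports_opencl_1_1(version_string):
    """True if the reported version is at least OpenCL 1.1."""
    return parse_opencl_version(version_string) >= (1, 1)


# Sample strings made up for illustration; real drivers append their own text.
for sample in ("OpenCL 1.0 CUDA 3.2.1", "OpenCL 1.1 CUDA 4.2.1"):
    verdict = "ok" if supports_opencl_1_1(sample) else "too old for wwwwcl"
    print(f"{sample}: {verdict}")
```

Comparing the parsed tuple rather than the raw text avoids string-ordering surprises once two-digit minor versions appear.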
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
|
Can anyone tell me the minimum driver needed to support OpenCL 1.1? I'll add that information to the readme.
For NVidia, it looks like you need something above the 280.13 (Linux) or 280.19 (Windows) drivers.
| |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
|
As you requested
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456895194058081, 19.96M p/sec, 1.58 CPU cores, 95.2% done. ETA 02 May 13:52
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 127.83 sec. (0.61 init + 127.14 sieve) at 19838761 p/sec.
Processor time: 201.29 sec. (0.61 init + 200.68 sieve).
Seconds spent in CPU and GPU: 260.22 (cpu), 69.95 (gpu)
Percent of time spent in CPU vs. GPU: 0.79 (cpu), 0.21 (gpu)
CPU/GPU utilization: 0.33 (cores), 0.09 (devices)
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t2 -b15000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456895395957667, 19.96M p/sec, 1.56 CPU cores, 95.4% done. ETA 02 May 13:55
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 127.77 sec. (0.61 init + 127.04 sieve) at 19848544 p/sec.
Processor time: 198.70 sec. (0.61 init + 198.09 sieve).
Seconds spent in CPU and GPU: 132.52 (cpu), 66.87 (gpu)
Percent of time spent in CPU vs. GPU: 0.66 (cpu), 0.34 (gpu)
CPU/GPU utilization: 0.50 (cores), 0.25 (devices)
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t1 -b25000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456889338901581, 12.41M p/sec, 0.98 CPU cores, 89.3% done. ETA 02 May 13:59
Sieve interrupted: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 205.08 sec. (0.61 init + 204.28 sieve) at 12365704 p/sec.
Processor time: 200.82 sec. (0.61 init + 200.21 sieve).
Seconds spent in CPU and GPU: 209.72 (cpu), 69.48 (gpu)
Percent of time spent in CPU vs. GPU: 0.75 (cpu), 0.25 (gpu)
CPU/GPU utilization: 1.00 (cores), 0.33 (devices)
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
|
For NVidia, it looks like you need something above the 280.13 (Linux) or 280.19 (Windows) drivers.
Yes, I came across the same information.
____________
My stats | |
|
|
The error with clGetKernelWorkGroupInfo is due to the requirement for OpenCL 1.1. The code in wwwwcl is not supported by OpenCL 1.0. Can anyone tell me the minimum driver needed to support OpenCL 1.1? I'll add that information to the readme.
According to the NVIDIA developer forum, OpenCL 1.1 has been supported since the 280.13/280.19 beta drivers, and the first official build was 280.26 from 2011-08-09. Source: http://forums.nvidia.com/index.php?showtopic=206951
Edit: too late ;)
Regards Odi
____________
| |
|
|
It would be interesting if you vary -t and -b more.
I also tested the x86 Windows app on an NVIDIA GT440 with different parameters:
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
Elapsed time: 655.17 sec. (18.86 init + 636.19 sieve) at 3870698 p/sec.
Processor time: 616.72 sec. (19.22 init + 597.50 sieve).
Seconds spent in CPU and GPU: 664.37 (cpu), 2071.85 (gpu)
Percent of time spent in CPU vs. GPU: 0.24 (cpu), 0.76 (gpu)
CPU/GPU utilization: 0.21 (cores), 0.64 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b4000
Elapsed time: 590.40 sec. (3.79 init + 586.40 sieve) at 4295290 p/sec.
Processor time: 612.65 sec. (3.65 init + 609.00 sieve).
Seconds spent in CPU and GPU: 600.78 (cpu), 2140.41 (gpu)
Percent of time spent in CPU vs. GPU: 0.22 (cpu), 0.78 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.72 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b8000
Elapsed time: 623.09 sec. (3.08 init + 619.79 sieve) at 4069956 p/sec.
Processor time: 627.64 sec. (3.01 init + 624.63 sieve).
Seconds spent in CPU and GPU: 634.60 (cpu), 2315.38 (gpu)
Percent of time spent in CPU vs. GPU: 0.22 (cpu), 0.78 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.73 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b16000
Elapsed time: 622.91 sec. (3.54 init + 618.97 sieve) at 4071112 p/sec.
Processor time: 648.48 sec. (3.39 init + 645.10 sieve).
Seconds spent in CPU and GPU: 632.87 (cpu), 2383.03 (gpu)
Percent of time spent in CPU vs. GPU: 0.21 (cpu), 0.79 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.76 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t2 -b4000
Elapsed time: 877.02 sec. (2.77 init + 874.16 sieve) at 2891575 p/sec.
Processor time: 834.92 sec. (2.65 init + 832.27 sieve).
Seconds spent in CPU and GPU: 886.77 (cpu), 966.86 (gpu)
Percent of time spent in CPU vs. GPU: 0.48 (cpu), 0.52 (gpu)
CPU/GPU utilization: 0.50 (cores), 0.55 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t8 -b4000
Elapsed time: 596.75 sec. (4.50 init + 591.96 sieve) at 4249589 p/sec.
Processor time: 607.87 sec. (4.23 init + 603.65 sieve).
Seconds spent in CPU and GPU: 607.30 (cpu), 3813.20 (gpu)
Percent of time spent in CPU vs. GPU: 0.14 (cpu), 0.86 (gpu)
CPU/GPU utilization: 0.13 (cores), 0.79 (devices)
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t10 -b4000
Elapsed time: 614.21 sec. (6.13 init + 607.78 sieve) at 4128787 p/sec.
Processor time: 619.71 sec. (5.76 init + 613.96 sieve).
Seconds spent in CPU and GPU: 624.95 (cpu), 4768.02 (gpu)
Percent of time spent in CPU vs. GPU: 0.12 (cpu), 0.88 (gpu)
CPU/GPU utilization: 0.10 (cores), 0.77 (devices)
@rogue: Is it possible to add a function which runs some short tests with different settings for -t and -b and reports which setting gives the best throughput?
Regards Odi
____________
| |
|
|
Rebirther's 64bit app on 64bit Win7, GTX580 with driver version 266.58 and CUDAToolkit 3.2
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
The 64-bit version creates an empty wwww.log.
The 32-bit version does nothing at all (no output, no errors).
After updating the driver to 301.32, the 64-bit version runs fine.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456879465595863, 16.55M p/sec, 1.28 CPU cores, 79.5% done. ETA 02 May 10:19
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 154.42 sec. (1.23 init + 153.16 sieve) at 16422507 p/sec.
Processor time: 198.17 sec. (1.73 init + 196.44 sieve).
Seconds spent in CPU and GPU: 464.30 (cpu), 42.15 (gpu)
Percent of time spent in CPU vs. GPU: 0.92 (cpu), 0.08 (gpu)
CPU/GPU utilization: 0.25 (cores), 0.02 (devices)
EDIT: Sesef's version doesn't work; perhaps a 64-bit version would.
I got the same error with toolkits 3.2 and 4.2 (compiled app), driver version 285.62. I am not sure why. The app should run with 3.2, but 4.2 needs a new driver.
@Honza: Did you build the app yourself and run it, or did you only install the toolkit? | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
@rogue: Is it possible to add a function which runs some short tests with different settings for -t and -b and reports which setting gives the best throughput?
Regards Odi
Probably. Let me think about it. I hope to release an updated build with sesef's changes later today.
BTW, since writing such a function will take time and is not high priority, I recommend that someone write a script to run wwwwcl on a range of 1e10 (which will take less time) and vary -t and -b. Users could then use the script to run wwwwcl to see which values work best for their hardware. I think that running wwwwcl on 20 to 50 distinct -t/-b values would be more than enough for a user to determine which settings are best for them and should run in less than an hour for most users. | |
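A script along the lines suggested above could look like the following sketch. Several things in it are assumptions rather than documented wwwwcl behaviour: that the summary is printed to standard output, that the executable is named wwwwcl.exe and sits in the current directory, and that the upper bound 13345681e10 (the start of the range plus 1e10) is accepted in the same notation as the bounds used in this thread. The -t and -b lists are arbitrary starting points; very high -t values are left out because -t16 was reported to crash one system.

```python
import itertools
import re
import subprocess

P_MIN = "1334568e11"    # start of the range used throughout this thread
P_MAX = "13345681e10"   # assumed notation for start + 1e10 (a shorter test range)

# Matches the summary line, e.g. "Elapsed time: 65.63 sec. (...) at 38640587 p/sec."
RATE_RE = re.compile(r"Elapsed time: .* at (\d+) p/sec\.")


def build_command(threads, blocks, exe="wwwwcl.exe"):
    """Build the wwwwcl command line for one -t/-b combination."""
    return [exe, f"-p{P_MIN}", f"-P{P_MAX}", "-TWieferich",
            f"-t{threads}", f"-b{blocks}"]


def parse_rate(output):
    """Return the p/sec figure from the summary, or None if the run failed."""
    match = RATE_RE.search(output)
    return int(match.group(1)) if match else None


def run_wwwwcl(command):
    """Run one test and return its output ('' if the program cannot be started)."""
    try:
        completed = subprocess.run(command, capture_output=True, text=True)
    except OSError:
        return ""
    return completed.stdout


def sweep(thread_counts, block_counts, run=run_wwwwcl):
    """Try every combination and return (rate, threads, blocks), fastest first."""
    results = []
    for threads, blocks in itertools.product(thread_counts, block_counts):
        rate = parse_rate(run(build_command(threads, blocks)))
        if rate is not None:
            results.append((rate, threads, blocks))
    return sorted(results, reverse=True)


if __name__ == "__main__":
    # 6 x 4 = 24 combinations, within the 20 to 50 suggested above.
    for rate, threads, blocks in sweep([2, 3, 4, 5, 6, 8], [2000, 4000, 8000, 16000]):
        print(f"-t{threads} -b{blocks}: {rate} p/sec")
```

The `run` parameter exists so the sweep logic can be exercised with a stand-in function before pointing it at real hardware; runs that end in an OpenCL error simply produce no summary line and are skipped.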
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
|
9600 GSO
AMD Phenom 9100e
Win7 enterprise x64
wwwwcl_x64 -p1334568e11 -P1334569e11 -TWieferich -t8 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456804239794551, 1.762M p/sec, 1.00 CPU cores, 4.2% done. ETA 02 May 10:0
p=133456808328132893, 1.700M p/sec, 1.01 CPU cores, 8.3% done. ETA 02 May 10:0
p=133456812143784247, 1.590M p/sec, 0.94 CPU cores, 12.1% done. ETA 02 May 10:
p=133456815837861233, 1.539M p/sec, 0.92 CPU cores, 15.8% done. ETA 02 May 10:
p=133456819472420039, 1.511M p/sec, 0.89 CPU cores, 19.5% done. ETA 02 May 10:
p=133456823257959577, 1.568M p/sec, 0.93 CPU cores, 23.3% done. ETA 02 May 10:
p=133456827013476193, 1.560M p/sec, 0.93 CPU cores, 27.0% done. ETA 02 May 10:
p=133456830890223829, 1.616M p/sec, 0.96 CPU cores, 30.9% done. ETA 02 May 10:
p=133456834857635983, 1.655M p/sec, 0.99 CPU cores, 34.9% done. ETA 02 May 10:
p=133456838855188133, 1.645M p/sec, 0.98 CPU cores, 38.9% done. ETA 02 May 10:
p=133456842701278469, 1.615M p/sec, 0.96 CPU cores, 42.7% done. ETA 02 May 10:
p=133456846335127751, 1.511M p/sec, 0.90 CPU cores, 46.3% done. ETA 02 May 10:
p=133456849848390161, 1.457M p/sec, 0.87 CPU cores, 49.8% done. ETA 02 May 10:
p=133456853785169027, 1.641M p/sec, 0.98 CPU cores, 53.8% done. ETA 02 May 10:
p=133456857510022477, 1.545M p/sec, 0.91 CPU cores, 57.5% done. ETA 02 May 10:
p=133456861204323599, 1.539M p/sec, 0.91 CPU cores, 61.2% done. ETA 02 May 10:
p=133456864959497221, 1.559M p/sec, 0.92 CPU cores, 65.0% done. ETA 02 May 10:
p=133456868896704299, 1.638M p/sec, 0.97 CPU cores, 68.9% done. ETA 02 May 10:
p=133456872591092447, 1.538M p/sec, 0.91 CPU cores, 72.6% done. ETA 02 May 10:
p=133456876467723569, 1.607M p/sec, 0.95 CPU cores, 76.5% done. ETA 02 May 10:
p=133456880525830633, 1.690M p/sec, 1.00 CPU cores, 80.5% done. ETA 02 May 10:
p=133456884281315341, 1.561M p/sec, 0.93 CPU cores, 84.3% done. ETA 02 May 10:
p=133456888067010931, 1.570M p/sec, 0.93 CPU cores, 88.1% done. ETA 02 May 10:
p=133456891973819041, 1.626M p/sec, 0.96 CPU cores, 92.0% done. ETA 02 May 10:
p=133456895941186891, 1.645M p/sec, 0.97 CPU cores, 95.9% done. ETA 02 May 10:
p=133456899333091903, 1.411M p/sec, 0.84 CPU cores, 99.3% done. ETA 02 May 10:09
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 1612.46 sec. (16.24 init + 1595.81 sieve) at 1572726 p/sec.
Processor time: 1516.36 sec. (14.02 init + 1502.34 sieve).
Seconds spent in CPU and GPU: 9758.07 (cpu), 658.80 (gpu)
Percent of time spent in CPU vs. GPU: 0.94 (cpu), 0.06 (gpu)
CPU/GPU utilization: 0.13 (cores), 0.01 (devices)
SR5 running on all four cores while testing. Also note that, while a bit slower, changes in -t from 2 to 8 have only minor effect.
____________
141941*2^4299438-1 is prime!
| |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
|
I got the same error with toolkits 3.2 and 4.2 (compiled app), driver version 285.62. I am not sure why. The app should run with 3.2, but 4.2 needs a new driver.
@Honza: Did you build the app yourself and run it, or did you only install the toolkit?
I just took your app; after updating the driver to 301.32, the 64-bit version runs fine.
Newer drivers have OpenCL support; the 260.x version was too old and had no OpenCL.
____________
My stats | |
|
|
I got the same error with toolkits 3.2 and 4.2 (compiled app), driver version 285.62. I am not sure why. The app should run with 3.2, but 4.2 needs a new driver.
@Honza: Did you build the app yourself and run it, or did you only install the toolkit?
I just took your app; after updating the driver to 301.32, the 64-bit version runs fine.
Newer drivers have OpenCL support; the 260.x version was too old and had no OpenCL.
Do you have a GTX 6xx? The v301.32 driver is only for the 6xx series. | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
|
One of the slowest possible NVidia cards...
8400M GS
Core2 T8100
Vista Home Premium 32-bit
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456801019519399, 421.2K p/sec, 0.07 CPU cores, 1.0% done. ETA 02 May 18:5
[...]
p=133456899247284661, 427.4K p/sec, 0.08 CPU cores, 99.2% done. ETA 02 May 18:54
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 6358.68 sec. (329.94 init + 6028.05 sieve) at 398818 p/sec.
Processor time: 771.83 sec. (304.76 init + 467.07 sieve).
Seconds spent in CPU and GPU: 6362.78 (cpu), 28962.63 (gpu)
Percent of time spent in CPU vs. GPU: 0.18 (cpu), 0.82 (gpu)
CPU/GPU utilization: 0.21 (cores), 0.96 (devices)
| |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
|
I use the 301.xx series on my 560Ti and it works perfectly.
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
|
@all: Please test these versions (preferably on NVIDIA cards) with driver versions 296.xx and 301.xx. The program was built with cudatoolkit 3.2/4.1/4.2.
Download
I need to know whether the program starts or gives an error. | |
|
Crun-chi Volunteer tester
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 73,965
|
Nvidia 560TI beta 301.xx drivers Win 7 64 bit
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4 * 650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! | |
|
|
Thx Crun-chi, I think I need the dev driver for compiling. Will test it tomorrow. | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
Nvidia 560TI beta 301.xx drivers Win 7 64 bit
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000
wwwwcl v2.0
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
This is most likely due to the driver not supporting OpenCL 1.1. I could change the code to remove this dependency, but it could hurt performance. | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
|
Nvidia 560TI beta 301.xx drivers Win 7 64 bit
This is most likely due to the driver not supporting OpenCL 1.1. I could change the code to remove this dependency, but it could hurt performance.
Rebirther's previous x64 version ran fine on the 301.32 driver.
The latest x64 one gives the same errors that Crun-chi reported; the 32-bit one has no output.
____________
My stats | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
|
I've updated the source (and Mac build) to include the changes provided by sesef. No other changes were made. I'll let rebirther build the Win32 and Win64 exes as I still can't get them to build and run correctly.
Make sure that you test ranges with known special instances.
Also, please test some WallSunSun ranges (-TWallSunSun). They are a little slower, but the software can verify that the code on the GPU was executed correctly whereas it can't do that for Wieferich.
I hope to test the PRPNet changes this weekend so that we can start using wwwwcl in production next week. | |
|
|
Updated versions with latest code:
32bit
64bit
There are some issues with the 64-bit build, and sometimes a black screen. Please report anomalies.
@rogue: I have had no luck with cudatoolkit; all the drivers support OpenCL 1.1, but I always get the same error. | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
|
Win 2008R2 SP2 x64, ATI 7950
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
Gives around 14.25M p/sec with GPU usage ~70%.
The 32-bit version does 10.6M p/sec with GPU usage ~50%.
I wasn't able to get more GPU usage using different -t and -b switches. According to Task Manager, the process still uses only around 1 CPU core.
-TWallSunSun crashes the app.
____________
My stats | |
|
|
Win 7 Pro 64bit / i7-2700K stock speed / GTX570 stock speed
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456850878083389, 21.26M p/sec, 1.52 CPU cores, 50.9% done. ETA 05 May 12:21
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 120.16 sec. (0.59 init + 119.55 sieve) at 21104316 p/sec.
Processor time: 182.63 sec. (0.61 init + 182.02 sieve).
Seconds spent in CPU and GPU: 485.87 (cpu), 58.17 (gpu)
Percent of time spent in CPU vs. GPU: 0.89 (cpu), 0.11 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.02 (devices)
____________
35 x 2^3587843+1 is prime! | |
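A quick sanity check on the numbers in these logs: the bounds -p1334568e11 and -P1334569e11 expand to the integers shown on the "Sieve started" line, and the count of primes tested is close to what the prime number theorem predicts for a range of width 1e11 near 1.33e17. The sketch below is only an illustration in Python; the helper names are hypothetical, and how wwwwcl itself parses its -p/-P arguments is not stated in this thread.

```python
import math
from decimal import Decimal


def parse_bound(text: str) -> int:
    """Convert a bound such as '1334568e11' to an exact integer.

    Decimal is used instead of float so that large bounds are not rounded.
    """
    value = Decimal(text)
    if value != value.to_integral_value():
        raise ValueError(f"bound is not an integer: {text!r}")
    return int(value)


def estimated_prime_count(lo: int, hi: int) -> float:
    """Estimate the number of primes in [lo, hi) as (hi - lo) / ln(midpoint)."""
    midpoint = (lo + hi) // 2
    return (hi - lo) / math.log(midpoint)


if __name__ == "__main__":
    lo = parse_bound("1334568e11")
    hi = parse_bound("1334569e11")
    print(lo, hi)
    print(round(estimated_prime_count(lo, hi)))
```

The estimate lands within a small fraction of a percent of the 2535955284 primes reported in the logs, which is a cheap way to confirm that a run covered the whole range.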
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Win 2008R2 SP2 x64, ATI 7950
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
Gives around 14.25M p/sec with GPU usage ~70%.
The 32-bit version does 10.6M p/sec with GPU usage ~50%.
I wasn't able to get more GPU usage using different -t and -b switches. According to Task Manager, the process still uses only around 1 CPU core.
-TWallSunSun crashes the app.
That's not good. Rebirther, can you debug? | |
|
|
Win 2008R2 SP2 x64, ATI 7950
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
Gives around 14.25M p/sec with GPU usage ~70%.
The 32-bit version does 10.6M p/sec with GPU usage ~50%.
I wasn't able to get more GPU usage using different -t and -b switches. According to Task Manager, the process still uses only around 1 CPU core.
-TWallSunSun crashes the app.
That's not good. Rebirther, can you debug?
What do you need? | |
|
|
I tested with a larger -b value and got one error; smaller -b values gave no errors.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t4 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
OpenCL Error: Invalid command queue
in call to clEnqueueReadBuffer
argument: prime
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t3 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456806271375781, 2.595M p/sec, 0.91 CPU cores, 6.3% done. ETA 05 May 18:0
p=133456812895713067, 2.735M p/sec, 0.79 CPU cores, 12.9% done. ETA 05 May 18:
p=133456819432047091, 2.694M p/sec, 0.76 CPU cores, 19.4% done. ETA 05 May 18:
p=133456826057022431, 2.712M p/sec, 0.81 CPU cores, 26.1% done. ETA 05 May 18:
p=133456832946931057, 2.822M p/sec, 0.84 CPU cores, 32.9% done. ETA 05 May 18:
p=133456839660126407, 2.722M p/sec, 0.81 CPU cores, 39.7% done. ETA 05 May 18:
p=133456846461369947, 2.828M p/sec, 0.86 CPU cores, 46.5% done. ETA 05 May 18:
p=133456853616091429, 2.967M p/sec, 1.13 CPU cores, 53.6% done. ETA 05 May 18:
p=133456860328717157, 2.795M p/sec, 0.82 CPU cores, 60.3% done. ETA 05 May 18:
p=133456867218467207, 2.870M p/sec, 0.99 CPU cores, 67.2% done. ETA 05 May 18:
p=133456872517883227, 2.190M p/sec, 0.67 CPU cores, 72.5% done. ETA 05 May 18:
p=133456879142626301, 2.719M p/sec, 0.67 CPU cores, 79.1% done. ETA 05 May 18:
p=133456886032975711, 2.844M p/sec, 0.94 CPU cores, 86.0% done. ETA 05 May 18:
p=133456893011162137, 2.900M p/sec, 1.09 CPU cores, 93.0% done. ETA 05 May 18:
p=133456899900858451, 2.694M p/sec, 0.99 CPU cores, 99.9% done. ETA 05 May 18:05
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 927.48 sec. (0.58 init + 925.05 sieve) at 2734239 p/sec.
Processor time: 809.02 sec. (0.56 init + 808.46 sieve).
Seconds spent in CPU and GPU: 941.12 (cpu), 2464.17 (gpu)
Percent of time spent in CPU vs. GPU: 0.28 (cpu), 0.72 (gpu)
CPU/GPU utilization: 0.33 (cores), 0.87 (devices)
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456826763655327, 11.26M p/sec, 0.80 CPU cores, 26.8% done. ETA 05 May 18:
p=133456864126726373, 15.58M p/sec, 1.15 CPU cores, 64.1% done. ETA 05 May 18:14
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 162.69 sec. (0.60 init + 161.94 sieve) at 15587631 p/sec.
Processor time: 192.27 sec. (0.59 init + 191.68 sieve).
Seconds spent in CPU and GPU: 266.82 (cpu), 208.85 (gpu)
Percent of time spent in CPU vs. GPU: 0.56 (cpu), 0.44 (gpu)
CPU/GPU utilization: 0.25 (cores), 0.20 (devices)
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t3 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456833388680577, 13.89M p/sec, 1.00 CPU cores, 33.4% done. ETA 05 May 18:
p=133456866688543933, 13.87M p/sec, 0.98 CPU cores, 66.7% done. ETA 05 May 18:10
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 185.32 sec. (6.04 init + 179.13 sieve) at 13684177 p/sec.
Processor time: 187.03 sec. (5.93 init + 181.10 sieve).
Seconds spent in CPU and GPU: 189.58 (cpu), 237.01 (gpu)
Percent of time spent in CPU vs. GPU: 0.44 (cpu), 0.56 (gpu)
CPU/GPU utilization: 0.34 (cores), 0.43 (devices) | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
c:\temp>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t2 -b1000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
...and crash on ATI HD 7950.
____________
My stats | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Win7 x64 on i5-2500, GTX 580 with 301.xx driver.
GPU usage ~35%.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456887865123811, 18.28M p/sec, 1.28 CPU cores, 87.9% done. ETA 05 May 19:16
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 139.46 sec. (0.68 init + 138.75 sieve) at 18184118 p/sec.
Processor time: 178.73 sec. (0.67 init + 178.06 sieve).
Seconds spent in CPU and GPU: 418.73 (cpu), 39.25 (gpu)
Percent of time spent in CPU vs. GPU: 0.91 (cpu), 0.09 (gpu)
CPU/GPU utilization: 0.25 (cores), 0.02 (devices)
Now that is interesting. GPU usage ~99% with -TWallSunSun.
c:\_PG\wwwwcl>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t4 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456887582475801, 7.304M p/sec, 1.36 CPU cores, 87.6% done. ETA 05 May 19:23
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 349.03 sec. (1.20 init + 347.47 sieve) at 7265742 p/sec.
Processor time: 474.77 sec. (1.73 init + 473.04 sieve).
Seconds spent in CPU and GPU: 353.97 (cpu), 1173.71 (gpu)
Percent of time spent in CPU vs. GPU: 0.23 (cpu), 0.77 (gpu)
CPU/GPU utilization: 0.25 (cores), 0.83 (devices)
____________
My stats | |
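To compare runs like the two above without reading the numbers by eye, the summary lines can be parsed mechanically. The following Python sketch is only an illustration: the regular expressions are written against the log lines as they appear in this thread, the function name is hypothetical, and the GPU share is computed as gpu / (cpu + gpu), which matches the "Percent of time spent" figures printed here to two decimals.

```python
import re

ELAPSED_RE = re.compile(r"Elapsed time:\s*([\d.]+) sec\..*? at (\d+) p/sec")
SECONDS_RE = re.compile(
    r"Seconds spent in CPU and GPU:\s*([\d.]+) \(cpu\), ([\d.]+) \(gpu\)"
)

# Summary lines copied from the GTX 580 runs in the post above.
WIEFERICH_LOG = """\
Elapsed time: 139.46 sec. (0.68 init + 138.75 sieve) at 18184118 p/sec.
Seconds spent in CPU and GPU: 418.73 (cpu), 39.25 (gpu)
"""
WALLSUNSUN_LOG = """\
Elapsed time: 349.03 sec. (1.20 init + 347.47 sieve) at 7265742 p/sec.
Seconds spent in CPU and GPU: 353.97 (cpu), 1173.71 (gpu)
"""


def summarize(log: str) -> dict:
    """Extract elapsed time, throughput and GPU time share from a run summary."""
    elapsed = ELAPSED_RE.search(log)
    seconds = SECONDS_RE.search(log)
    if elapsed is None or seconds is None:
        raise ValueError("summary lines not found in log")
    cpu = float(seconds.group(1))
    gpu = float(seconds.group(2))
    return {
        "elapsed_sec": float(elapsed.group(1)),
        "primes_per_sec": int(elapsed.group(2)),
        "gpu_share": gpu / (cpu + gpu),
    }


if __name__ == "__main__":
    w = summarize(WIEFERICH_LOG)
    s = summarize(WALLSUNSUN_LOG)
    print("Wieferich :", w)
    print("WallSunSun:", s)
    print("throughput ratio:", w["primes_per_sec"] / s["primes_per_sec"])
```

On these two runs the Wieferich test is roughly 2.5 times faster in primes per second, while WallSunSun spends most of its time on the GPU, consistent with the ~35% versus ~99% GPU usage reported.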
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I tested with a larger -b value and got one error; smaller -b values gave no errors.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t4 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
OpenCL Error: Invalid command queue
in call to clEnqueueReadBuffer
argument: prime
Interesting. The most likely possibility is that there isn't enough memory on the GPU to handle that many threads for that large a work group. WallSunSun does require more GPU memory than Wieferich, but not significantly so. A memory leak is possible, so I'll look into it.
WallSunSun will utilize the GPU much more than Wieferich, so I would not recommend using -t or -b values that are that large.
This doesn't explain Honza's issue though, although he seems to be having more issues than most other users. | |
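The -v output makes the effect of -b easy to see: the reported workGroupSize is the product of the -b value, the device's work-group size multiple, and its number of compute units. The Python sketch below just reproduces that arithmetic for the values printed in this thread; the function name is hypothetical, and how much GPU memory each work item needs is not stated here, so no memory figure is derived.

```python
def work_group_size(blocks: int, size_multiple: int, compute_units: int) -> int:
    """Total work-group size as printed by wwwwcl -v:
    blocks * workGroupSizeMultiple * deviceComputeUnits."""
    return blocks * size_multiple * compute_units


if __name__ == "__main__":
    # Device from the failing run: multiple 32, 7 compute units.
    for blocks in (2000, 10000):
        print(f"-b{blocks}:", work_group_size(blocks, 32, 7))
    # HD 7950 (Tahiti) values reported later in the thread: multiple 64, 28 units.
    print("-b2000 on Tahiti:", work_group_size(2000, 64, 28))
```

Since the size grows linearly with -b, going from -b2000 to -b10000 multiplies the number of work items per launch by five on the same card, which is why a large -b is the first thing to reduce when a run fails.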
|
|
I tested with a larger -b value and got one error; smaller -b values gave no errors.
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -t4 -b10000 -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 2240000 = 10000 * 32 * 7 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
OpenCL Error: Invalid command queue
in call to clEnqueueReadBuffer
argument: prime
Interesting. The most likely possibility is that there isn't enough memory on the GPU to handle that many threads for that large a work group. WallSunSun does require more GPU memory than Wieferich, but not significantly so. A memory leak is possible, so I'll look into it.
WallSunSun will utilize the GPU much more than Wieferich, so I would not recommend using -t or -b values that are that large.
This doesn't explain Honza's issue though, although he seems to be having more issues than most other users.
Yes, I have seen around 3xxMB of GPU memory usage, but have around 600MB free. | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Yes, I have seen around 3xxMB of GPU memory usage, but have around 600MB free.
I'm able to run 3 instances to get higher GPU usage on HD 7950.
____________
My stats | |
|
|
Yes, I have seen around 3xxMB of GPU memory usage, but have around 600MB free.
I'm able to run 3 instances to get higher GPU usage on HD 7950.
GPU usage of my GTX460:
TWallSunSun=99%
TWieferich=~50%
But always too much cpu usage. | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
This doesn't explain Honza's issue though, although he seems to be having more issues than most other users.
Updated to latest 12.4 driver (from 12.2), still no luck.
____________
My stats | |
|
|
This doesn't explain Honza's issue though, although he seems to be having more issues than most other users.
Updated to latest 12.4 driver (from 12.2), still no luck.
Do you have AMD SDK installed? | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Do you have AMD SDK installed?
Yes, streamsdk_2-6-RC3_win764
____________
My stats | |
|
|
Do you have AMD SDK installed?
Yes, streamsdk_2-6-RC3_win764
This can cause a driver conflict. I had it installed alongside an NVIDIA card, and wwwwcl gave an error on start. After uninstalling it, wwwwcl works. | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
This can cause a driver conflict. I had it installed alongside an NVIDIA card, and wwwwcl gave an error on start. After uninstalling it, wwwwcl works.
Well, I uninstalled all the Catalyst software, restarted, and... wwwwcl behaves as before. That means the 32-bit version is still crashing, and the 64-bit version is still working for Wieferich but crashing for WallSunSun.
Windows still holds the 12.2 driver, according to Device Manager.
____________
My stats | |
|
|
This can cause a driver conflict. I had it installed alongside an NVIDIA card, and wwwwcl gave an error on start. After uninstalling it, wwwwcl works.
Well, I uninstalled all the Catalyst software, restarted, and... wwwwcl behaves as before. That means the 32-bit version is still crashing, and the 64-bit version is still working for Wieferich but crashing for WallSunSun.
What dependencies are there?
None. | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Well, I have now actually been able to install and use the 12.4 driver.
The 32-bit version is now working.
That leaves only the WallSunSun tests crashing (both 32-bit and 64-bit versions).
____________
My stats | |
|
|
Well, I have now actually been able to install and use the 12.4 driver.
The 32-bit version is now working.
That leaves only the WallSunSun tests crashing (both 32-bit and 64-bit versions).
I have rebuilt both versions with static linking. Please try this one:
stl-version | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
I have rebuilt both versions with static linking. Please try this one:
stl-version
There is no visible change in behaviour.
WallSunSun still crashes in both the 32-bit and 64-bit versions.
(Wieferich is fine)
____________
My stats | |
|
|
I have rebuilt both versions with static linking. Please try this one:
stl-version
There is no visible change in behaviour.
WallSunSun still crashes in both the 32-bit and 64-bit versions.
(Wieferich is fine)
Do you get an error output (-v)? | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Do you get an error output (-v)?
This is also same as before:
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
...and immediate crash.
This is from EventsManager:
Faulting application name: wwwwcl64.exe, version: 0.0.0.0, time stamp: 0x4fa588d8
Faulting module name: aticaldd64.dll, version: 6.14.10.1720, time stamp: 0x4f7e470f
Exception code: 0xc0000005
Fault offset: 0x000000000035b937
Faulting process id: 0x8b0
Faulting application start time: 0x01cd2b8946359767
Faulting application path: c:\temp\wwwwcl\wwwwcl64.exe
Faulting module path: C:\Windows\system32\aticaldd64.dll
Report Id: 84a27781-977c-11e1-952e-00241dce8b9c
____________
My stats | |
|
|
Do you get an error output (-v)?
This is also same as before:
wwwwcl64.exe -p1334568e11 -P1334569e11 -TWallSunSun -v
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
...and immediate crash.
This is from EventsManager:
Faulting application name: wwwwcl64.exe, version: 0.0.0.0, time stamp: 0x4fa588d8
Faulting module name: aticaldd64.dll, version: 6.14.10.1720, time stamp: 0x4f7e470f
Exception code: 0xc0000005
Fault offset: 0x000000000035b937
Faulting process id: 0x8b0
Faulting application start time: 0x01cd2b8946359767
Faulting application path: c:\temp\wwwwcl\wwwwcl64.exe
Faulting module path: C:\Windows\system32\aticaldd64.dll
Report Id: 84a27781-977c-11e1-952e-00241dce8b9c
Ok, better for debugging. It seems to be a driver issue. The card is very new. Can you check this with another HD6xxx or HD5xxx?
@rogue: I have no ATI card to debug the program. I hope you can find out what causes this problem.
@all: Anyone else have the same problem with a HD7xxx card?
| |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
@rogue: I have no ATI card to debug the program. I hope you can find out what causes this problem.
I have an ATI card, but my Windows build freezes the computer. Could you e-mail me your zipped up solution folder (minus the .ncb and other temp files) and let me know which OpenCL SDK you are using? | |
|
|
@rogue: I have no ATI card to debug the program. I hope you can find out what causes this problem.
I have an ATI card, but my Windows build freezes the computer. Could you e-mail me your zipped up solution folder (minus the .ncb and other temp files) and let me know which OpenCL SDK you are using?
Done, streamsdk_2-6-RC3_win764 (32bit and 64bit included) | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Ok, better for debugging. It seems to be a driver issue. The card is very new. Can you check this with another HD6xxx or HD5xxx?
Unfortunately not, at least not now.
I might be able to get access to a low-end HD5xxx on Windows 7 x64 tomorrow.
But that would need a driver update, which I'm a bit hesitant about...
____________
My stats | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I still can't get a build to run for me on Windows. It just hangs on a call to copy data from the GPU back to the CPU.
I did find a problem, but it is minor. I don't know if it is causing Honza's issue or not. I've sent the updated source to Rebirther for him to build.
Honza, can you install Visual Studio 2010 Express Edition? If so, could you assist in debugging the crash (assuming the patch doesn't work)? | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Honza, can you install Visual Studio 2010 Express Edition? If so, could you assist in debugging the crash (assuming the patch doesn't work)?
Yes, I can install VS 2010 and assist if needed.
____________
My stats | |
|
|
I still can't get a build to run for me on Windows. It just hangs on a call to copy data from the GPU back to the CPU.
I did find a problem, but it is minor. I don't know if it is causing Honza's issue or not. I've sent the updated source to Rebirther for him to build.
Honza, can you install Visual Studio 2010 Express Edition? If so, could you assist in debugging the crash (assuming the patch doesn't work)?
I have updated all versions.
@rogue: mail sent | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
I have updated all versions.
Still the same, crashes for WallSunSun.
Some more information from the latest app.
>wwwwcl64.exe -l
wwwwcl v2.0
List of available platforms and devices
Platform 0 is a Advanced Micro Devices, Inc. AMD Accelerated Parallel Processing, version OpenCL 1.1 AMD-APP (844.5)
Device 0 is a Advanced Micro Devices, Inc. Tahiti
>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -v -b2000
wwwwcl v2.0
Defaulting to platform 0
Defaulting to device 0
workGroupSize = 3584000 = 2000 * 64 * 28 (blocks * workGroupSizeMultiple * deviceComputeUnits)
Sieve started: 133456800000000000 <= p < 133456900000000000
btw, can we introduce version numbering for each new build? This may reduce some confusion later on in the process.
____________
My stats | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Honza, can you install Visual Studio 2010 Express Edition? If so, could you assist in debugging the crash (assuming the patch doesn't work)?
Got VS2010Express1.iso downloaded.
What should I install and do? You can send instructions via PM or mail.
I found that the MS Visual C++ 2005, 2008 and 2010 redistributables, both x86 and x64, are already installed on my host.
____________
My stats | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
btw, can we introduce version numbering for each new build? This may reduce some confusion later on in the process.
Sorry about that. I've been lazy.
I don't think you have the right file. Go here: http://www.microsoft.com/visualstudio/en-us/products/2010-editions/visual-cpp-express and click on the download button. Make sure you choose VS2010, not VS2011 from the dialog it opens. It will download an exe. Double click the exe to install VS2010, which it will then download. Use the default install options. You will also need the AMD APP SDK.
A VS2008 solution is included in my zip file. You will have to upgrade it when you open it with VS2010, but that shouldn't be any issue. Use the "Build" option to build the solution. You can e-mail me with questions. Unless you are familiar with VS, you will need a little guidance with the debugging, but hopefully not too much. | |
|
|
btw, can we introduce version numbering for each new build? This may reduce some confusion later on in the process.
Sorry about that. I've been lazy.
I don't think you have the right file. Go here: http://www.microsoft.com/visualstudio/en-us/products/2010-editions/visual-cpp-express and click on the download button. Make sure you choose VS2010, not VS2011 from the dialog it opens. It will download an exe. Double click the exe to install VS2010, which it will then download. Use the default install options. You will also need the AMD APP SDK.
A VS2008 solution is included in my zip file. You will have to upgrade it when you open it with VS2010, but that shouldn't be any issue. Use the "Build" option to build the solution. You can e-mail me with questions. Unless you are familiar with VS, you will need a little guidance with the debugging, but hopefully not too much.
I have sent him a link for the Pro trial version. The Express version doesn't have 64-bit support. | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I have sent him a link for the Pro trial version. The Express version doesn't have 64-bit support.
I see. There should be little speed difference between 32-bit and 64-bit builds. | |
|
Honza Volunteer moderator Volunteer tester Project scientist Send message
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
VS C++ installed.
Now, what are you interested in?
Unhandled exception at 0x6935b937 in wwwwcl64.exe: 0xC0000005: Access violation reading location 0x0000000000000000.
Unhandled exception at 0x692d4477 in wwwwcl.exe: 0xC0000005: Access violation reading location 0x00000000.
____________
My stats | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
VS C++ installed.
Now, what are you interested in?
Unhandled exception at 0x6935b937 in wwwwcl64.exe: 0xC0000005: Access violation reading location 0x0000000000000000.
Unhandled exception at 0x692d4477 in wwwwcl.exe: 0xC0000005: Access violation reading location 0x00000000.
Build the Debug project (it is probably Release). It might squawk that it can't find inttypes.h. If so, you have to right click on the project and select "Properties". From C/C++ -> General is an option called "Additional Include Directories". It should have $(SolutionDir)\visualstudio in the list.
On the Properties dialog there is another selection called "Debugging". On that page is an option called "Command Arguments". Use that to specify the same arguments that you use on the command line.
Once built correctly, hit F5 to run. It will stop when it hits the bug causing the crash. I need the stack trace from that. | |
|
|
I'm not sure if I'm using the latest code since I downloaded the original a week or so ago, compiled it this weekend and only just got around to testing it, but here's the output of a couple of tests done on a Mac Pro 1,1 with a HD5770 card installed.
ME:wwwwcl ME$ ./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t6 -b2000
wwwwcl v2.0
Compiled May 6 2012 with GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.1.00)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456896759005609, 10.09M p/sec, 1.07 CPU cores, 96.8% done. ETA 09 May 04:22
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 253.60 sec. (2.37 init + 251.13 sieve) at 9999877 p/sec.
Processor time: 272.21 sec. (2.39 init + 269.82 sieve).
Seconds spent in CPU and GPU: 1098.58 (cpu), 182.30 (gpu)
Percent of time spent in CPU vs. GPU: 0.86 (cpu), 0.14 (gpu)
CPU/GPU utilization: 0.17 (cores), 0.03 (devices)
ME:wwwwcl ME$ ./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t2 -b2000
wwwwcl v2.0
Compiled May 6 2012 with GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.1.00)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456897213205119, 10.14M p/sec, 1.07 CPU cores, 97.2% done. ETA 09 May 04:29
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 253.26 sec. (2.38 init + 250.76 sieve) at 10013441 p/sec.
Processor time: 270.62 sec. (2.38 init + 268.24 sieve).
Seconds spent in CPU and GPU: 260.43 (cpu), 183.00 (gpu)
Percent of time spent in CPU vs. GPU: 0.59 (cpu), 0.41 (gpu)
CPU/GPU utilization: 0.50 (cores), 0.35 (devices)
ME:wwwwcl ME$ ./wwwwcl -p1334568e11 -P1334569e11 -TWallSunSun -t2 -b2000
wwwwcl v2.0
Compiled May 6 2012 with GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.1.00)
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456898374142177, 1.949M p/sec, 0.25 CPU cores, 98.4% done. ETA 09 May 05:13
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 1307.93 sec. (3.87 init + 1303.52 sieve) at 1938902 p/sec.
Processor time: 322.26 sec. (3.88 init + 318.39 sieve).
Seconds spent in CPU and GPU: 1314.90 (cpu), 2219.65 (gpu)
Percent of time spent in CPU vs. GPU: 0.37 (cpu), 0.63 (gpu)
CPU/GPU utilization: 0.50 (cores), 0.85 (devices)
If there's something else I should test (values, ranges), just let me know.
____________
147*2^1392930+1 was my first prime number found, others have followed :) | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I'm sorry, but I haven't tested the next release of PRPNet yet. I have the changes and I had intended to test on Wednesday, but I didn't have my thumb drive with me.
I also have a few changes to wwwwcl, mainly a few stats and use of an ini file. No bug fixes. Honza's issue is still not resolved. | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I'm sorry, but I haven't tested the next release of PRPNet yet. I have the changes and I had intended to test on Wednesday, but I didn't have my thumb drive with me.
I also have a few changes to wwwwcl, mainly a few stats and use of an ini file. No bug fixes. Honza's issue is still not resolved.
I've posted wwwwcl 2.1.0 at home.roadrunner.com/~mrodenkirch/wwwwcl_2.1.0.zip. The changes are:
Added more detail when using -v, such as the amount of GPU memory allocated
Added -q option to not print stats
Added wwww.ini file along with description of supported settings
Re-factored SharedMemoryItem
This will work with PRPNet 5.0.8, which I just released, but you won't be able to use it with either project until Lennart upgrades the server.
Windows users will need a build from rebirther.
An OS X build is included. AFAIK, nobody with Linux has built and tested the code. I compile with the following:
gcc *.c *.cpp opencl/*.cpp wwww/*.cpp sieve/*.cpp -m64 -framework OpenCL -fopenmp -lstdc++ -O2 -o wwwwcl
on OS X. I expect a similar command for building on Linux. | |
|
|
I'm sorry, but I haven't tested the next release of PRPNet yet. I have the changes and I had intended to test on Wednesday, but I didn't have my thumb drive with me.
I also have a few changes to wwwwcl, mainly a few stats and use of an ini file. No bug fixes. Honza's issue is still not resolved.
I've posted wwwwcl 2.1.0 at home.roadrunner.com/~mrodenkirch/wwwwcl_2.1.0.zip. The changes are:
Added more detail when using -v, such as the amount of GPU memory allocated
Added -q option to not print stats
Added wwww.ini file along with description of supported settings
Re-factored SharedMemoryItem
This will work with PRPNet 5.0.8, which I just released, but you won't be able to use it with either project until Lennart upgrades the server.
Windows users will need a build from rebirther.
An OS X build is included. AFAIK, nobody with Linux has built and tested the code. I compile with the following:
gcc *.c *.cpp opencl/*.cpp wwww/*.cpp sieve/*.cpp -m64 -framework OpenCL -fopenmp -lstdc++ -O2 -o wwwwcl
on OS X. I expect a similar command for building on Linux.
Still many compiling errors on windows. Need to wait a bit longer :/ | |
|
|
Still many compiling errors on windows. Need to wait a bit longer :/
Remember to set /openmp for Multi-Threaded build.
I've posted wwwwcl 2.1.0 at home.roadrunner.com/~mrodenkirch/wwwwcl_2.1.0.zip. The changes are:
Did you consider using http://gitorious.org/, SVN or something else? I think it would be a much better way to share source than uploading zip files. | |
|
|
At last, tried a linux build (ubuntu 11.04, gcc 4.5.2, nvidia driver 270.41.06). This was with your original release from several days ago. Finally got it to build after adding some #include and #define in main.h and App.h, and a couple of others... I can send along diffs after I pound on this some more. Still, with the following build command:
gcc *.c *.cpp opencl/*.cpp wwww/*.cpp sieve/*.cpp -O2 -I/usr/local/cuda/include -L/usr/lib/nvidia-current -lOpenCL -lpthread -lstdc++ -o wwwwcl -m64
the compilation still has lots of warnings like this:
App.cpp: In member function ‘void App::WriteCheckpoint()’:
App.cpp:516:50: warning: format ‘%llu’ expects type ‘long long unsigned int’, but argument 3 has type ‘uint64_t’
These were not present when I tried "-m32" instead of "-m64".
Still, I have a runnable, with all of the dynamic libraries resolved, so give it a whirl. Nope, no joy:
gary:~/Downloads/wwwwcl$ ./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
Compiled May 12 2012 with GCC 4.5.2
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
gary:~/Downloads/wwwwcl$
Not sure where to go from here, except to bed. Back at it later. I'll try your latest release (from a few hours ago).
--Gary | |
|
rogueVolunteer developer
 Send message
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
At last, tried a linux build (ubuntu 11.04, gcc 4.5.2, nvidia driver 270.41.06). This was with your original release from several days ago. Finally got it to build after adding some #include and #define in main.h and App.h, and a couple of others... I can send along diffs after I pound on this some more. Still, with the following build command:
gcc *.c *.cpp opencl/*.cpp wwww/*.cpp sieve/*.cpp -O2 -I/usr/local/cuda/include -L/usr/lib/nvidia-current -lOpenCL -lpthread -lstdc++ -o wwwwcl -m64
the compilation still has lots of warnings like this:
App.cpp: In member function ‘void App::WriteCheckpoint()’:
App.cpp:516:50: warning: format ‘%llu’ expects type ‘long long unsigned int’, but argument 3 has type ‘uint64_t’
These were not present when I tried "-m32" instead of "-m64".
Still, I have a runnable, with all of the dynamic libraries resolved, so give it a whirl. Nope, no joy:
gary:~/Downloads/wwwwcl$ ./wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.0
Compiled May 12 2012 with GCC 4.5.2
OpenCL Error: Invalid value
in call to clGetKernelWorkGroupInfo
kernelName: wieferich_kernel
gary:~/Downloads/wwwwcl$
Not sure where to go from here, except to bed. Back at it later. I'll try your latest release (from a few hours ago).
--Gary
I wouldn't worry about the compiler warnings right now, but please e-mail me your diffs.
Thanks. Is that OpenCL 1.0 or OpenCL 1.1? I will investigate a change to address that problem.
I'll try to fix the Windows project later today, but I have a really busy day, so I can't make any promises.
As for sesef's suggestion, I hadn't considered it. I won't make any decisions on it at this time. | |
|
|
Rogue,
Just shot you an email (your rr.com address posted earlier in this thread) with source diffs, compile command, compiler output, and a run attempt using your new 2.1.0 version. Same results as before.
Yes, it looks like an OpenCL 1.0 library. I'll scrounge up 1.1 somewhere, I'm sure, and give that a try.
--Gary
____________
"I am he as you are he as you are me and we are all together"
87*2^3496188+1 is prime! (1052460 digits)
4 is not prime! (1 digit) | |
|
|
Just tested Rebirther build version x64
D:\PRPnet\prpclient-5>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.0
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456816968635053, 7.065M p/sec, 1.68 CPU cores, 17.0% done. ETA 13 May 09:
p=133456833928760093, 7.053M p/sec, 1.75 CPU cores, 33.9% done. ETA 13 May 09:
p=133456850762002741, 6.998M p/sec, 1.84 CPU cores, 50.8% done. ETA 13 May 09:
p=133456867569279161, 6.988M p/sec, 1.85 CPU cores, 67.6% done. ETA 13 May 09:
p=133456884518593219, 7.046M p/sec, 1.87 CPU cores, 84.5% done. ETA 13 May 09:18
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 361.62 sec. (0.99 init + 360.58 sieve) at 7012808 p/sec.
Processor time: 652.99 sec. (0.98 init + 652.01 sieve).
Seconds spent in CPU and GPU: 372.43 (cpu), 1105.09 (gpu)
Percent of time spent in CPU vs. GPU: 0.25 (cpu), 0.75 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.60 (devices)
Windows 7 x64 NVIDIA GTX 540M
wwww.log is empty
____________
| |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I will need someone with VS 2010 Pro to build the production release for Windows as rebirther's version can't build with openmp. | |
|
|
I will need someone with VS 2010 Pro to build the production release for Windows as rebirther's version can't build with openmp.
Now it's working as it should ;) | |
|
|
v2.1.2
32bit
64bit
It's possible that the app crashes with an MSVCR100.dll error. In that case you need to install the
Microsoft Visual C++ 2010 Service Pack 1 Redistributable Package.
At the moment the program crashes every time for me and I cannot find the problem. All packages are installed. | |
|
|
I tried it too. Mine crashed as well until I ran it without the -v switch, so try giving it a go without -v.
____________
147*2^1392930+1 was my first prime number found, others have followed :) | |
|
|
D:\PRPnet\prpclient-5>wwwwcl64-21.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
The application was unable to start correctly (0xc000007b). etc.
Windows 7 x64
____________
| |
|
|
For Rebirther's 64 bit (2.1.2 version) on trying to run it on a dual GPU I can't get it to recognise the second GPU properly, and I also can't get it to read the whole of the ini file correctly (except for the last 4 lines). I might be keying in the format of the values incorrectly into the ini file, anyone know the desired format?
It doesn't like the -v switch for me.
It crashed a few times when I had other cores busy doing other things, after a re-boot with nothing else running I've completed about 20 different tests without it crashing.
It does crash when I have additional cores busy.
Screen outputs available if required.
K
____________
147*2^1392930+1 was my first prime number found, others have followed :) | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Does it crash immediately with -v or when the range is done?
For a dual GPU, use -l to list the platforms and devices, then use -f and -d to switch to the second GPU.
I can't speak for the ini file issue (yet). It's possible that I have a bug in the code. Make sure the lines don't have a '//' in the first two characters. | |
|
|
V2.1.2 on Win 7 Pro 64bit / i7-2700K stock speed / GTX570 stock speed
E:\PrimeGRID\PRPNet>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.1.2, a GPU program to search for Wieferich and WallSunSun primes
Sieve started: 133456800000000000 <= p < 133456900000000000
First time:
Windows message saying app stopped responding, details:
Problem signature:
Problem Event Name: APPCRASH
Application Name: wwwwcl64.exe
Application Version: 0.0.0.0
Application Timestamp: 4fb07d8f
Fault Module Name: ntdll.dll
Fault Module Version: 6.1.7601.17725
Fault Module Timestamp: 4ec4aa8e
Exception Code: c0000264
Exception Offset: 00000000000cd7d8
OS Version: 6.1.7601.2.1.0.256.48
Locale ID: 2057
Additional Information 1: b87d
Additional Information 2: b87dcb7ec2077235b3b7aeeb1b688a50
Additional Information 3: 2e05
Additional Information 4: 2e0539e2e801d223d577c17e3118a6c7
Second time:
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 69.08 sec. (0.72 init + 59.65 sieve) at 41980137 p/sec.
Processor time: 223.47 sec. (3.53 init + 219.95 sieve).
Seconds spent in CPU and GPU: 8.67 (cpu), 208.80 (gpu)
Percent of time spent in CPU vs. GPU: 0.04 (cpu), 0.96 (gpu)
CPU/GPU utilization: 0.22 (cores), 5.25 (devices)
Log file: Primes tested I64u. Checksum 0000000097279b54. Time 0.000000 seconds
| |
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 433,230,186 RAC: 922,880
                               
|
C:\Temp>wwwwcl64.exe -h
wwwwcl v2.1.2, a GPU program to search for Wieferich and WallSunSun primes
-h --help Print this help
-v --verbose Verbose output (memory usage)
-q --quiet Quiet output (no banner or stats)
-p --pmin=P0 Sieve start: 3 <= P0 <= p (default P0=3)
-P --pmax=P1 Sieve end: p < P1 <= 2^62 (default P1=P0+10^9)
-t --nthreads=N Start N threads (default N=1)
-l --list List available plagorms and devices
-b --blocks=B Force B blocks per device
-f --platform=F Use platform F instead of 0
-d --device=D Use device D instead of 0
-T --type=T <Wieferich><WallSunSun>
-s --special=S Limit to use for determining special instances (default 100)
Rebirther's 64 bit win build, but the typo probably is in all the builds.
____________
My lucky number is 75898524288+1 | |
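The help text above gives the limits for the sieve range (3 <= P0, p < P1 <= 2^62), and the test commands in this thread pass values such as 1334568e11. How wwwwcl itself parses these is not shown here, so the following is only a minimal sketch of one way to turn that notation into a 64-bit integer and check it against the documented limits. The names parse_bound and range_is_valid are made up for illustration.

```c
#include <errno.h>
#include <stdint.h>
#include <stdlib.h>

/* Upper limit quoted in the help text: P1 <= 2^62. */
#define PMAX_LIMIT ((uint64_t)1 << 62)

/*
 * Hypothetical helper (not from the wwwwcl source): parse "1000" or
 * "1334568e11" into a uint64_t. Returns 0 on success, -1 on malformed
 * input or if the value does not fit in 64 bits.
 */
int parse_bound(const char *text, uint64_t *out)
{
    char *end;
    unsigned long long mantissa;
    unsigned long exponent = 0;
    uint64_t value;

    if (*text < '0' || *text > '9')
        return -1;
    errno = 0;
    mantissa = strtoull(text, &end, 10);
    if (errno != 0)
        return -1;

    if (*end == 'e' || *end == 'E') {
        const char *exp_text = end + 1;
        if (*exp_text < '0' || *exp_text > '9')
            return -1;
        errno = 0;
        exponent = strtoul(exp_text, &end, 10);
        if (errno != 0)
            return -1;
    }
    if (*end != '\0')
        return -1;

    value = mantissa;
    if (value == 0)
        exponent = 0;           /* 0eN is still 0, skip the loop */
    while (exponent-- > 0) {
        if (value > UINT64_MAX / 10)
            return -1;          /* multiplying by 10 would overflow */
        value *= 10;
    }
    *out = value;
    return 0;
}

/* Check the limits stated in the help text: 3 <= P0 < P1 <= 2^62. */
int range_is_valid(uint64_t p0, uint64_t p1)
{
    return p0 >= 3 && p0 < p1 && p1 <= PMAX_LIMIT;
}
```

With this sketch, parse_bound("1334568e11", &p0) stores 133456800000000000, which matches the "Sieve started" line that the testers are seeing for -p1334568e11.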
|
|
K Lewis is right: without the -v switch mine is running. The first time, I got a bluescreen when I closed the window immediately.
The speed is around 5x higher than the old version, but it uses more CPU:
F:\x>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t4 -b2000
wwwwcl v2.1.2, a GPU program to search for Wieferich and WallSunSun primes
Sieve started: 133456800000000000 <= p < 133456900000000000
p=133456876642618943, 25.57M p/sec, 1.67 CPU cores, 76.6% done. ETA 14 May 16:30
Sieve complete: 133456800000000001 <= p < 133456900000000000 2535955284 primes tested
Elapsed time: 121.22 sec. (0.58 init + 101.39 sieve) at 24862272 p/sec.
Processor time: 175.17 sec. (2.22 init + 172.96 sieve).
Seconds spent in CPU and GPU: 19.22 (cpu), 230.71 (gpu)
Percent of time spent in CPU vs. GPU: 0.08 (cpu), 0.92 (gpu)
CPU/GPU utilization: 0.26 (cores), 3.09 (devices)
The 32-bit build crashed in ntdll in the middle of the calculation. | |
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 433,230,186 RAC: 922,880
                               
|
-v also causes the program to crash on my machine (win 7 x64, 64 bit wwww build.)
____________
My lucky number is 75898524288+1 | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I'll try to investigate the -v option tonight to see why it is crashing. Most likely it is something stupid on my part, such as not passing enough parameters on a printf().
With OpenMP, wwwwcl will use a lot more CPU for the sieving. Wieferich will need more threads than WallSunSun to maximize GPU throughput. | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Just out of curiosity, I gave 2.1.2 a try.
HD7950 crashing on Wieferich, possibly on msvcr100.dll with -v option.
Without the -v option, the 32-bit build runs fine. It uses up to 3 cores and reaches ~20M p/sec with nice GPU usage of ~95-98%.
>wwwwcl.exe -p1334568e11 -P1334569e11 -TWieferich -b2000 -t3
With -b2000, CPU usage goes down in final stages of computation (and so does Mp/sec)
But -b200 or -b500 crashes on first progress output.
EDIT:
64-bit version is similar in speed but lower CPU usage.
-b1000 looks optimal in terms of speed, CPU usage and screen responsiveness.
____________
My stats | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Running from very low p's, there should be something else printed instead of I64u:
c:\temp\wwwwcl>wwwwcl64.exe -p1000 -P100e11 -TWieferich -b2000 -t4
wwwwcl v2.1.2, a GPU program to search for Wieferich and WallSunSun primes
Sieve started: 1000 <= p < 10000000000000
I64u is a special instance (+19226819 -1 p)
I64u is a special instance (+21840613 -1 p)
I64u is a special instance (+22084519 +1 p)
I64u is a special instance (+22278433 +1 p)
I64u is a special instance (+22730017 +1 p)
I64u is a special instance (+25762181 -1 p)
I64u is a special instance (+1013 -1 p)
I64u is a special instance (+1069 -1 p)
I64u is a Wieferich prime
I64u is a special instance (+1123 -1 p)
____________
My stats | |
|
|
I tested the new one on my GT440.
wwwwcl 2.0:
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b4000
Elapsed time: 590.40 sec. (3.79 init + 586.40 sieve) at 4295290 p/sec.
Processor time: 612.65 sec. (3.65 init + 609.00 sieve).
Seconds spent in CPU and GPU: 600.78 (cpu), 2140.41 (gpu)
Percent of time spent in CPU vs. GPU: 0.22 (cpu), 0.78 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.72 (devices)
wwwwcl 2.1.2:
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b4000
Elapsed time: 744.01 sec. (3.71 init + 728.88 sieve) at 3461158 p/sec.
Processor time: 350.16 sec. (2.67 init + 347.49 sieve).
Seconds spent in CPU and GPU: 11.32 (cpu), 1353.72 (gpu)
Percent of time spent in CPU vs. GPU: 0.01 (cpu), 0.99 (gpu)
CPU/GPU utilization: 0.30 (cores), 35.56 (devices)
The throughput fell sharply, but the GPU is used better than before...
Regards Odi
____________
| |
|
|
I got it working too. 2.1.2 Windows 7 x64 gtx 540M
____________
| |
|
|
Sorry for the delay. I see the -v switch issue is being dealt with. It crashes just before starting, right after reporting that it is running with x threads.
The mac version (2.1.0) worked fine with the -v switch.
I've got the dual GPU switches working now, thanks, but it still crashes if too many cores are busy with other tasks for some reason.
____________
147*2^1392930+1 was my first prime number found, others have followed :) | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I tested the new one on my GT440.
wwwwcl 2.0:
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b4000
Elapsed time: 590.40 sec. (3.79 init + 586.40 sieve) at 4295290 p/sec.
Processor time: 612.65 sec. (3.65 init + 609.00 sieve).
Seconds spent in CPU and GPU: 600.78 (cpu), 2140.41 (gpu)
Percent of time spent in CPU vs. GPU: 0.22 (cpu), 0.78 (gpu)
CPU/GPU utilization: 0.20 (cores), 0.72 (devices)
wwwwcl 2.1.2:
C:\prpnet\prpclient-gpu>wwwwcl -p1334568e11 -P1334569e11 -TWieferich -t5 -b4000
Elapsed time: 744.01 sec. (3.71 init + 728.88 sieve) at 3461158 p/sec.
Processor time: 350.16 sec. (2.67 init + 347.49 sieve).
Seconds spent in CPU and GPU: 11.32 (cpu), 1353.72 (gpu)
Percent of time spent in CPU vs. GPU: 0.01 (cpu), 0.99 (gpu)
CPU/GPU utilization: 0.30 (cores), 35.56 (devices)
The throughput fell sharply, but the GPU is used better than before...
Regards Odi
I suggest playing around with the -t and -b switches to find the optimal settings.
I found the problem with -v. It was due to a switch to use some macros which were used to address compiler warnings. I'll try to post it tonight. | |
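The "%llu expects long long unsigned int" warnings from the 64-bit Linux build and the literal "I64u" showing up in the Windows output are both consistent with 64-bit format strings being handled differently per platform. The actual macros used in wwwwcl are not shown in this thread, so the following is only a sketch of one portable option: the C99 PRIu64 macro from <inttypes.h>. The function name format_result is made up for illustration, and older MSVC releases may not ship <inttypes.h>, in which case a project-local fallback definition would be needed.

```c
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical helper (not from the wwwwcl source): format one result
 * line for the prime p into buf and return what snprintf returns.
 */
int format_result(char *buf, size_t size, uint64_t p)
{
    /*
     * PRIu64 expands to the length modifier plus the conversion letter
     * for uint64_t on the current platform. It does NOT include the
     * leading '%', which has to be written in the literal in front of
     * it. Leaving the '%' out prints the modifier text itself.
     */
    return snprintf(buf, size, "%" PRIu64 " is a Wieferich prime", p);
}
```

Called with p = 1093 (a known Wieferich prime), this writes "1093 is a Wieferich prime" into buf, with no format warning on either 32-bit or 64-bit builds.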
|
|
Hi,
Long time reader, first time poster.. wanted to help test this new program out. Unfortunately not working for me.
Core i5 2500k @ 4.5ghz with 2x 4770HD in Crossfire. Using the latest 12.4 Catalyst drivers. This is the output from the 64bit version, I get exactly the same with the 32bit version.
MSVC redist installed.
I:\>wwwwcl64.exe -p1334568e11 -P1334569e11 -TWieferich -t5 -b2000
wwwwcl v2.1.2, a GPU program to search for Wieferich and WallSunSun primes
OpenCL Error: Program build failure
in call to clBuildProgram
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 129: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = 0;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 132: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = w_rem;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 133: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
quot[gid] = w_quot;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 141: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = 1;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 142: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
quot[gid] = sp_quot;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 146: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = -1;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 147: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
quot[gid] = sp_quot+1;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 155: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = 1;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 156: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
quot[gid] = -sp_quot;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 160: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
rem[gid] = -1;
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 161: error: write to <
32 bits via pointer not allowed unless cl_khr_byte_addressable_store
is enabled
quot[gid] = -(sp_quot-1);
^
"C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl", line 9: warning: variable
"cc" was declared but never referenced
ulong cc, magicNumber, magicShift;
^
11 errors detected in the compilation of "C:\Users\Phil\AppData\Local\Temp\OCLB5D6.tmp.cl".
Internal error: clc compiler invocation failed.
This means nothing to me, but I hope it's useful in helping to test this out!
Phil
| |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
According to this link:
http://groups.google.com/group/reconstructme/browse_thread/thread/66248fa2cd6b818d
it appears that your card is not supported.
BTW, I've posted 2.1.3 here: home.roadrunner.com/~mrodenkirch/wwwwcl_2.1.3.zip. Windows users will need to wait for rebirther to do a build.
The changes from 2.1.2 are to fix the -v crash. | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,658,470,305 RAC: 6,956,219
                                                
|
According to this link:
http://groups.google.com/group/reconstructme/browse_thread/thread/66248fa2cd6b818d
it appears that your card is not supported.
That link appears to refer to the Fire Pro 7740 series. gribzy appears to have two crossfired HD4770 cards. I think those cards are supported.
The crossfire setup is probably the culprit (as has been SLI on NVidia setups for other GPU apps). I would suggest testing with crossfire disabled.
| |
|
|
According to this link:
http://groups.google.com/group/reconstructme/browse_thread/thread/66248fa2cd6b818d
it appears that your card is not supported.
BTW, I've posted 2.1.3 here: home.roadrunner.com/~mrodenkirch/wwwwcl_2.1.3.zip. Windows users will need to wait for rebirther to do a build.
The changes from 2.1.2 are to fix the -v crash.
This link is not working:(
____________
| |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
I named the file incorrectly. The link should be fixed now. | |
|
|
v2.1.3
32bit
64bit
64bit is working
32bit crashed in the middle of the calculation with ntdll. | |
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
Can we have some output if GPU is not supported or driver is not supported?
Tested 2.1.3 with HD46xx and driver from 2011/05 and wasn't able to get any output at all (nor error). -?, -h, -v does nothing.
Same for GTX 580 with latest 300.xx drivers - no output at all nor error when trying to compute.
____________
My stats | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
Can we have some output if GPU is not supported or driver is not supported?
Tested 2.1.3 with HD46xx and driver from 2011/05 and wasn't able to get any output at all (nor error). -?, -h, -v does nothing.
Same for GTX 580 with latest 300.xx drivers - no output at all nor error when trying to compute.
I don't know how to make such a check in code. There is plenty of error checking for all of the OpenCL calls, so if one of those calls fails you should get a message. I'm open to suggestions.
-l will list the valid platforms/devices. | |
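One possible check, as a suggestion: the build errors posted earlier name the cl_khr_byte_addressable_store extension, and a device reports its extensions as a space-separated string through clGetDeviceInfo with CL_DEVICE_EXTENSIONS. The sketch below only covers the string test, so it can be compiled without the OpenCL headers. The name has_extension is made up, and whether wwwwcl depends on any other extensions is not known from this thread.

```c
#include <string.h>

/*
 * Hypothetical helper (not from the wwwwcl source): return 1 if `name`
 * appears as a whole token in the space-separated `extensions` string
 * (the format reported for CL_DEVICE_EXTENSIONS), 0 otherwise.
 */
int has_extension(const char *extensions, const char *name)
{
    size_t name_len = strlen(name);
    const char *p = extensions;

    if (name_len == 0)
        return 0;

    while ((p = strstr(p, name)) != NULL) {
        /* Accept only whole tokens, not prefixes of longer names. */
        int starts_token = (p == extensions) || (p[-1] == ' ');
        int ends_token = (p[name_len] == ' ') || (p[name_len] == '\0');
        if (starts_token && ends_token)
            return 1;
        p += name_len;
    }
    return 0;
}
```

At startup the program could query the extension string for the selected device and, if has_extension(ext, "cl_khr_byte_addressable_store") returns 0, print a plain message such as "device lacks cl_khr_byte_addressable_store" instead of the raw compiler output.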
|
Honza Volunteer moderator Volunteer tester Project scientist
Joined: 15 Aug 05 Posts: 1957 ID: 352 Credit: 6,140,716,592 RAC: 2,274,372
                                      
|
-l will list the valid platforms/devices.
Well, that's the point - it does nothing, no output to screen or log file at all.
I would expect something like "HDxxx not supported, get a proper card." or "Driver version not supported, get yourself updated to 12.2+"
____________
My stats | |
|
rogue Volunteer developer
Joined: 8 Sep 07 Posts: 1256 ID: 12001 Credit: 18,565,548 RAC: 0
 
|
-l will list the valid platforms/devices.
Well, that's the point - it does nothing, no output to screen or log file at all.
I would expect something like "HDxxx not supported, get a proper card." or "Driver version not supported, get yourself updated to 12.2+"
That's very odd. When I run rebirther's 2.1.3 Windows build on a laptop that doesn't support OpenCL, I get this (when using -l):
List of available platforms and devices
Platform 0 is a Intel(R) Corporation Intel(R) OpenCL, version OpenCL 1.1
No devices
Platform 1 is a Advanced Micro Devices, Inc. AMD Accelerated Parallel Processing, version OpenCL 1.1 AMD-APP (851.4)
No devices
Fatal Error:
No devices were found that can run this code
If you use -l alone with no other parameters, do you still get nothing? I wonder if there i |
|