Join PrimeGrid
Returning Participants
Community
Leader Boards
Results
Other
drummers-lowrise
|
Message boards :
Generalized Fermat Prime Search :
how long does it take to compute for ____?
Author |
Message |
kiskaVolunteer tester Send message
Joined: 13 Apr 12 Posts: 47 ID: 138397 Credit: 242,087,376 RAC: 97,756
                     
|
How long to run short and long Genfer on C2D P8400 and Geforce 9300M?
I am already running 2 short for Intel so what is the time every one else took? | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
How long to run short and long Genfer on C2D P8400 and Geforce 9300M?
I am already running 2 short for Intel so what is the time every one else took?
You can't run Genefer on a 9300M. The 9300M is an older, single-precision-only GPU and Generfer requires double precision. Therefore your only option is to run on the CPU.
Your P8400 is a little slower than my C2Q Q6600, which takes about 54 hours to run the short WUs. I would guess it would take about 60 hours on your computer.
The long WUs are not available for CPUs due to the extreme length of the calculation. (A "long" World Record WU would probably take 3 to 4 months to compute on your CPU. The deadline is only three weeks. On the fastest of GPUs -- a GTX 580 -- this requires about 5 days.)
____________
My lucky number is 75898524288+1 | |
|
kiskaVolunteer tester Send message
Joined: 13 Apr 12 Posts: 47 ID: 138397 Credit: 242,087,376 RAC: 97,756
                     
|
Ok I was wondering at the estimate it gave me. 3000 hrs :s ahhhhh. Or how about an i3 or i5 or i7. How does it compare with those? | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
Ok I was wondering at the estimate it gave me. 3000 hrs :s ahhhhh. Or how about an i3 or i5 or i7. How does it compare with those?
The estimate BOINC gives you is usually worthless.
The estimate Genefer gives you is ridiculously accurate. You can see that estimate by looking in the file "stdout.txt" in the appropriate "slot" directory in your BOINC data directory.
Depending on the exact CPU, a newer i7, with hyperthreading turned off, might run twice as fast as my C2D.
The real way to run Genefer quickly is to put a GTX 580 (NOT a 680) into a desktop computer and use the GPU to do the computations. A GTX 580 is about 100 times faster than my CPU.
____________
My lucky number is 75898524288+1 | |
|
kiskaVolunteer tester Send message
Joined: 13 Apr 12 Posts: 47 ID: 138397 Credit: 242,087,376 RAC: 97,756
                     
|
So I got 57 hrs is that accurate? | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
So I got 57 hrs is that accurate?
If that's the number inside stderr.txt, then yes, it's accurate, PROVIDED the following rather obvious conditions are true:
1) Your computer isn't very busy during the next 57 hours. Obviously, if it's doing something else and Genefer can't get a full core to run on, it will take longer than 57 hours.
2) Your computer is on. If it's off, or sleeping, Genefer can't run.
3) You have BOINC set to crunch 100% of the time. Any of the following can cause BOINC to not crunch at 100%:
- "While Computer is in Use" is not checked
- "While computer is on batteries" is not checked
- "While processor usage is less than $$ percent" is greater than 0
- "Every day between the hours of..." has different hours
- "Use at most ###% CPU time" is less than 100
Also note that if you're on a laptop and you're running on batteries the CPU will slow down, so Genefer will take longer to run. Of course, crunching on batteries is never a great idea anyway since it will run down the battery exceptionally quickly.
Essentially the estimate is accurate as long as Genefer is allowed to run at full speed.
____________
My lucky number is 75898524288+1
| |
|
|
I have an Nvidia GT640 and started a WR on the 13th, running 24x7. It is now the 18th and it's less than 20%. At this rate it will take 25+ days to complete. The units only have a 21 day deadline. I know that if it finished it will be OK and get credit, but why a newer video card is taking longer than the deadline? Has anyone else run on a GT640 and what are the results?
Thanks.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
|
The 6xx series are seriously crippled in the Double Precision computing performance, which is exactly what Genefer needs (unfortunately). As is pointed out here even a GTX 680 is a lot slower than the GTX 580. So unfortunately, newer is not better in this case.
____________
PrimeGrid Challenge Overall standings --- Last update: From Pi to Paddy (2016)
| |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
I have an Nvidia GT640 and started a WR on the 13th, running 24x7. It is now the 18th and it's less than 20%. At this rate it will take 25+ days to complete. The units only have a 21 day deadline. I know that if it finished it will be OK and get credit, but why a newer video card is taking longer than the deadline? Has anyone else run on a GT640 and what are the results?
Thanks.
If there isn't something wrong with your setup and if it really takes 25 days then there probably isn't time for anyone to have completed a WU, but I'll see if I can look through the database and try to find some results from another 640.
Assuming 25 days is reasonable, there's rwo probable reasons why it's slow:
1) The "4" in your model number designates the card as a moderately low-end video card in Nvidia's video card range. The GT 640 is, as of right now, the slowest of all the Kepler series video cards that I'm aware of. It's got fewer shaders than the other cards. It's a new, slow, video card. I'm not certain, but it's likely a 540 or 440 would also have trouble with the deadline.
2) Nvidia's 600 series video cards are poorly suited to GeneferCUDA and other apps that use double precision math because they reduced the number of double precision floating point processors. Although the 600 series cards are generally faster than the 500 series, there's only a third as many DP processors as compared with the 500 series cards. That greatly reduces performance on GeneferCUDA and LLR-CUDA. (Sieves don't use double precision and are not affected.) Nvidia's flagship (and very expensive) GTX680 GPU is therefore significantly SLOWER than the older and less expensive GTX580.
____________
My lucky number is 75898524288+1 | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
... but I'll see if I can look through the database and try to find some results from another 640.
Ok, this is a GT 640M (a laptop GPU), which is likely much slower than than the desktop GT 640:
4386^4194304+1 is complete. (15275944 digits) (err = 0.0000) (time = 858:39:04) 18:14:32
That's almost 36 days. But that's a mobile 640, so it's not a good comparison.
I don't see any completed WR work units from a 640, but that's not surprising considering the card came out recently. Some WU's have failed, however, and therefore the results are in the database. Here's the time estimates from those:
Clock=901 MHz
Estimated total run time for 3396^4194304+1 is 664:18:15
Clock=901 MHz
Estimated total run time for 4824^4194304+1 is 612:23:40
Clock=901 MHz
Estimated total run time for 6396^4194304+1 is 730:07:38
Clock=901 MHz
Estimated total run time for 6214^4194304+1 is 706:28:40
The estimates there run from 25 to 30 days, so it looks like 25 days is a reasonable estimate for what it will take for your computer to do the WR tasks.
Yes, you're likely but not quaranteed to get credit when you return the WU shortly after the deadline. However, my recommendation is that if you have a computer that, even under ideal conditions, is unable to meet the deadline that you may want to consider running the short WUs instead. Or crunch the sieves. It's unlikely but still possible that you could crunch for 25 days and get no credit because the WU is purged before you return it.
I hate to say this (because I'm definitely biased toward the GFN tasks), but the 600 series GPUs are better suited for sieving.
EDIT: The mobile 640M that took 36 days to crunch that WU did receive credit for it.
____________
My lucky number is 75898524288+1 | |
|
|
According to WIKI a 640 runs at 1/5 the speed of a 680. A 580 takes 100 hours to run one of the current units. A 680 is around 20% slower than that so 120 hours. That means the 640 will take around 600 hours. | |
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
 Send message
Joined: 17 Oct 05 Posts: 2416 ID: 1178 Credit: 19,977,164,341 RAC: 19,277,716
                                                
|
I have two 640s running the GFN 262144 units on PRPnet. They run those units in about 4hrs 3mins (factory OC'ed one) to 4hrs 20mins (stock clocked card). Perhaps Mike can extrapolate from those numbers?
Where the 640s really shine are on the Wall-Sun-Sun prime search on PRPnet. On that project, they were as good or better than a GTS450, GTX550 Ti, and Quadro 4000.
EDIT (Remembered that I had replied to a PM about the 640s with some relevant info that I will copy to here):
Well, I have tried it out on the GFN prime searches, CW sieve, PPS sieve, and GFN sieve with mixed results compared to the GTX 550 Ti. On the GFN prime, it is much slower (on the 262144 work, the 550 is a bit more than twice as fact as the 640). On the GFN sieve, the 640 does better, but is still much slower than the 550 (20P/day vs 32P/day). Things are similar on the CW sieve (about 700 seconds on the 550 vs. 1100 seconds on the 640). Things are much better, however, on the PPS sieve. The 640 is just a bit faster than the 550 (sorry, I have those times at work in a spreadsheet and cannot remember them at the moment).
So why the mixed results? My guess is that the programs have not yet caught up to Kepler. Fermi was different than the previous architecture, but it built on the strengths of the previous GPUs. Kepler is fundamentally different (for example, it can do 3+3+3 faster than it can multiply 3x3...MUCH faster!). When the code catches up to this, the 640 will be an excellent card. Overall at PG, at least for now, the 550 is a bit better.
That said, the 550 does use about twice as much power, so the production per Watt is similar even on GFN prime searching, and much better for the 640 at PPS sieve.
____________
141941*2^4299438-1 is prime!
| |
|
|
Thanks everyone for the info. Since the unit is not going to be purged before I finish, I will try and finish it, and see if it at least makes the grade and gets credit.
I will move it to something else afterward.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
|
For Information my GT630M on a solar powered laptop (it thinks it is mains) with core = 475MHz, shaders = 950MHz, memory = 900MHZ with 1833MB, according to CPU-Z I am taking about 15hrs for the short tasks. It is off when it travels to work for half an hour every few days.
I tried a long task but it would have missed the deadline and gone into the last challenge so I aborted it.
On the GT6xx series, particularly with portables, I would advise the short tasks. The credit per hour is about the same.
____________
Member team AUSTRALIA
My lucky number is 9291*2^1085585+1 | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
On the GT6xx series, particularly with portables, I would advise the short tasks. The credit per hour is about the same.
No matter what hardware you use, the credit per hour should be the same for both the short and long GFN tasks.
____________
My lucky number is 75898524288+1 | |
|
|
Hi Kevin, et al,
I will reply to your PM for everyone to see as I told you in my email.
CPU-Z is a hardware monitoring system in freeware available from
http://www.cpuid.com/softwares/cpu-z.html
I find it very useful.
____________
Member team AUSTRALIA
My lucky number is 9291*2^1085585+1 | |
|
|
Since the unit is not going to be purged before I finish, I will try and finish it, and see if it at least makes the grade and gets credit.
I am over 60% and still some time to go. No one has finished the unit I am working on, so even though I will be probably over a week late, I may still be the first to finish it. Wouldn't it ironic if it was a GFN and I am the first to finish it?
These WR units really are something.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
I am over 60% and still some time to go. No one has finished the unit I am working on, so even though I will be probably over a week late, I may still be the first to finish it. Wouldn't it ironic if it was a GFN and I am the first to finish it?
It would be cause for great celebration!
____________
My lucky number is 75898524288+1 | |
|
|
Well the WR record unit that was late on my machine came in second because another machine who was a lot later than mine dropped their unit off a day before me. That one took 32 days on a lot faster card than mine, and mine ran 24x7 and took about 24 days. Nice to know I have a strong card, too bad I cannot make the deadlines. Sure a hefty payout.
Well good luck to all those running this project.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
too bad I cannot make the deadlines. Sure a hefty payout.
The credits/hour on the short WUs are the same as the long WR work units. You could run the short GFNs and get the same credit over those 24 days as running one long GFN. And it's a safe bet that you'll have to wait far less time for your wingmen with the short WUs!
You're also more likely to find a prime with the short WUs. Just because they're "short" doesn't mean they're small. Anything found with the short WUs will still be the largest prime number ever found at PrimeGrid. That's already happened three times this year. :)
The short WUs are, for those and other reasons, more popular than the long WUs.
____________
My lucky number is 75898524288+1 | |
|
|
I have two 640s running the GFN 262144 units on PRPnet. They run those units in about 4hrs 3mins (factory OC'ed one) to 4hrs 20mins (stock clocked card). Perhaps Mike can extrapolate from those numbers?
Where the 640s really shine are on the Wall-Sun-Sun prime search on PRPnet. On that project, they were as good or better than a GTS450, GTX550 Ti, and Quadro 4000.
EDIT (Remembered that I had replied to a PM about the 640s with some relevant info that I will copy to here):
Well, I have tried it out on the GFN prime searches, CW sieve, PPS sieve, and GFN sieve with mixed results compared to the GTX 550 Ti. On the GFN prime, it is much slower (on the 262144 work, the 550 is a bit more than twice as fact as the 640). On the GFN sieve, the 640 does better, but is still much slower than the 550 (20P/day vs 32P/day). Things are similar on the CW sieve (about 700 seconds on the 550 vs. 1100 seconds on the 640). Things are much better, however, on the PPS sieve. The 640 is just a bit faster than the 550 (sorry, I have those times at work in a spreadsheet and cannot remember them at the moment).
So why the mixed results? My guess is that the programs have not yet caught up to Kepler. Fermi was different than the previous architecture, but it built on the strengths of the previous GPUs. Kepler is fundamentally different (for example, it can do 3+3+3 faster than it can multiply 3x3...MUCH faster!). When the code catches up to this, the 640 will be an excellent card. Overall at PG, at least for now, the 550 is a bit better.
That said, the 550 does use about twice as much power, so the production per Watt is similar even on GFN prime searching, and much better for the 640 at PPS sieve.
The new gtx 650 is a 640 with higher gpu clocks and GDDR5 (hence higher memory clocks and much higher memory bandwidth). Has anyone tried it with GFN (or any of the sieves)?
____________
676754^262144+1 is prime | |
|
kiskaVolunteer tester Send message
Joined: 13 Apr 12 Posts: 47 ID: 138397 Credit: 242,087,376 RAC: 97,756
                     
|
Well I can see the new short workunits have to compute on the cpu for 227 hours on a 2nd gen Core i5 processor. What the hell is going on with that? The Short workunits are getting bigger and bigger? Or is the client getting slower and slower? Before it was 57 hours on a C2D P8400 and now it takes 243 hours? Oh and even the GPU 525M needs 196 hours? What the Hell?
____________
| |
|
|
Well I can see the new short workunits have to compute on the cpu for 227 hours on a 2nd gen Core i5 processor. What the hell is going on with that? The Short workunits are getting bigger and bigger? Or is the client getting slower and slower? Before it was 57 hours on a C2D P8400 and now it takes 243 hours? Oh and even the GPU 525M needs 196 hours? What the Hell?
The shorties are getting bigger. | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
Then N=524288 search range was exhausted as of November and the 'short' tasks now have N=1048596. These take about 3.5 times as long to process as the 524288 tasks.
____________
My lucky number is 75898524288+1 | |
|
RogerVolunteer developer Volunteer tester
 Send message
Joined: 27 Nov 11 Posts: 1138 ID: 120786 Credit: 268,668,824 RAC: 0
                    
|
Genefer80 (CPU only) for n=15 on PRPNet only takes about 40 minutes for an average CPU. Maybe this could be ported over to BOINC and be the "Short", and n=20 renamed Medium?
____________
| |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14036 ID: 53948 Credit: 475,998,733 RAC: 222,579
                               
|
Genefer80 (CPU only) for n=15 on PRPNet only takes about 40 minutes for an average CPU. Maybe this could be ported over to BOINC and be the "Short", and n=20 renamed Medium?
There's limitations as to how many projects we can run on BOINC, which is why we're not running all of the ranges simultaneously on the BOINC server. Since there's a lot more computing power running on the BOINC side, that's where we want the largest tasks to be run.
There's faster CPU apps in the pipeline, especially for AVX CPUs. I expect CPU computation times to drop by about 25% for non-AVX CPUs and a whopping 60% for AVX. We're not going to think about moving projects around until after those new apps are in place.
____________
My lucky number is 75898524288+1 | |
|
|
Sandy Bridge AVX perfomance is ~8*freq per real core [linpack], SSE2 at least half of its. | |
|
RogerVolunteer developer Volunteer tester
 Send message
Joined: 27 Nov 11 Posts: 1138 ID: 120786 Credit: 268,668,824 RAC: 0
                    
|
There's limitations as to how many projects we can run on BOINC, which is why we're not running all of the ranges simultaneously on the BOINC server. Since there's a lot more computing power running on the BOINC side, that's where we want the largest tasks to be run.
This all makes sense. If more sub-projects can't be added to BOINC GFN without compromising things then it's not worth it.
There's faster CPU apps in the pipeline, especially for AVX CPUs. I expect CPU computation times to drop by about 25% for non-AVX CPUs and a whopping 60% for AVX. We're not going to think about moving projects around until after those new apps are in place.
With faster CPU apps coming it would be best to wait and see. New apps might have different B limits, potentially opening up fresh ranges. An OpenCL version would also allow AMD GPUs, although thats further back in the pipeline.
____________
| |
|
Message boards :
Generalized Fermat Prime Search :
how long does it take to compute for ____? |