Join PrimeGrid
Returning Participants
Community
Leader Boards
Results
Other
drummers-lowrise
|
Message boards :
Cullen/Woodall prime search :
Woo LLR wu's
Author |
Message |
|
After a challenge I normally set up my systems to get a few of the next challenge wu's to test. So after the last challenge I did that. I got 4 wu's (wooLLR) and let them go. It has been some days so I went to check them. At around 55% done they have been running 137 hours. Calculating it out, it is going to take more than 8 days, which is the length of the next challenge.
Wondering if there is something I can do to speed them up?
They are running on somewhat older systems. Both q6600 cpu's. Each has a different motherboard and ram, but are running same in wu speed.
My main concern, even if they are long wu's, is finishing them in time during a challenge.
Thanks
Rick
Edit. I did want to add on almost all other wu types, my system will be close the the avg time listed in the sub-projects. But not these. | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 433,215,444 RAC: 942,255
                               
|
After a challenge I normally set up my systems to get a few of the next challenge wu's to test. So after the last challenge I did that. I got 4 wu's (wooLLR) and let them go. It has been some days so I went to check them. At around 55% done they have been running 137 hours. Calculating it out, it is going to take more than 8 days, which is the length of the next challenge.
Wondering if there is something I can do to speed them up?
They are running on somewhat older systems. Both q6600 cpu's. Each has a different motherboard and ram, but are running same in wu speed.
My main concern, even if they are long wu's, is finishing them in time during a challenge.
Thanks
Rick
Edit. I did want to add on almost all other wu types, my system will be close the the avg time listed in the sub-projects. But not these.
You should see what happens when you run 3 tasks instead of 4. *ALL* prime searches are searching larger numbers than they were searching only a couple of years ago. Bigger numbers mean more digits; more digits mean more memory is used.
The more memory you use, the larger the percentage of memory accesses that are going to not be fulfilled by the CPU's cache. Since the cache is MUCH faster than regular memory, cache misses can significantly slow down the program.
Running multiple copies of LLR uses even more memory, and increases the percentage of cache misses, which causes LLR to run slower.
When I started crunching here in 2010, multi-core CPUs were first becoming common, especially quad core CPUs, and people were starting to notice an inexplicable slowdown when running multiple copies of LLR. It's now pretty obvious that the cause is cache misses.
While the amount of memory used has increased over the years, the amount of cache in your hardware is exactly the same, so running multiple LLR's on your Q6600s is going to have a greater penalty than before. My computer is a Q6600, and it certainly seems that running 4 LLR tasks slows down more now than it used to years ago.
____________
My lucky number is 75898524288+1 | |
|
|
You should see what happens when you run 3 tasks instead of 4. *ALL* prime searches are searching larger numbers than they were searching only a couple of years ago. Bigger numbers mean more digits; more digits mean more memory is used.
The more memory you use, the larger the percentage of memory accesses that are going to not be fulfilled by the CPU's cache. Since the cache is MUCH faster than regular memory, cache misses can significantly slow down the program.
Running multiple copies of LLR uses even more memory, and increases the percentage of cache misses, which causes LLR to run slower.
When I started crunching here in 2010, multi-core CPUs were first becoming common, especially quad core CPUs, and people were starting to notice an inexplicable slowdown when running multiple copies of LLR. It's now pretty obvious that the cause is cache misses.
While the amount of memory used has increased over the years, the amount of cache in your hardware is exactly the same, so running multiple LLR's on your Q6600s is going to have a greater penalty than before. My computer is a Q6600, and it certainly seems that running 4 LLR tasks slows down more now than it used to years ago.
Thanks for that Micheal. I will give it anothor test only running 3 or maybe even 2 and see what happens. I do only have 4gb of ram in each system.
Thanks
Rick
| |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 433,215,444 RAC: 942,255
                               
|
Thanks for that Micheal. I will give it anothor test only running 3 or maybe even 2 and see what happens. I do only have 4gb of ram in each system.
Thanks
Rick
The amount of RAM you have doesn't matter that much (4 GB is way more than you need.) It's the amount of cache memory that's built into the CPU that's important.
I'm running a TRP-LLR task right now and it's using less than 25 MB. For main memory, that's nothing. For cache, it can be a lot.
____________
My lucky number is 75898524288+1 | |
|
|
I am using a i5-2450M at 3.1Ghz and even with 3 tasks I find I cannot get one task finished it the eight days of the challenge.
The pair of older desktops are OK with under a hundred hours running 2 tasks at a time.
CPUID says I have 2 CPU on the i5 so perhaps I need to find out how to stop them doing double duty.
A lot of people are going to have trouble getting any work done in the challenge period. I make it that since the last WOO/CUL challenge the challenge has increased from 5 to 8 days but the tasks take about 3 times as long, but I am not certain I am actually using data from the last WOO/CUL challenge or an earlier one. The effect is the same -- some computers take longer than the challenge period to do any tasks.
We may have to have month long challenges before long.
____________
Member team AUSTRALIA
My lucky number is 9291*2^1085585+1 | |
|
|
I am using a i5-2450M at 3.1Ghz and even with 3 tasks I find I cannot get one task finished it the eight days of the challenge.
The HT on these boxes causes issues. Just run two and they will run a lot faster than 3. You should be able to get 4 done in the time you do 3 now with only running two. LLR wreaks havoc on HT machines. Give it a try.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
|
Finally started on a new batch of them and limited it to 2 wu's at once. Running much faster. Right around the 100 hour mark by the time it ends. About 1% and hour. Will test the other machine with 1 @ a time next.
Rick | |
|
|
I'm also noticing very long expected work times.
On a Core 2 Quad Q9450, running 1 task, it's estimating another 141 hours
On a Core i7 930, that was running 5 tasks, estimated 178 hours.
My other two crunching systems (a Core i5 660 and an AMD FX-8120) running 2 and 4 tasks, are showing just under 70 hours remaining.
Thinking about HT, I just cut the i7 down to 3 tasks. We'll see what happens as it continues to run today.
____________
"Apollo was astonished, Dionysus thought me mad. But they heard my story further and they wondered and were sad."
| |
|
|
I'm also noticing very long expected work times.
On a Core 2 Quad Q9450, running 1 task, it's estimating another 141 hours
On a Core i7 930, that was running 5 tasks, estimated 178 hours.
Don't trust the BOINC time estimates, they are often excessive.
I spent 2 weeks playing with this problem on my systems and now run 2 on each unit. The core 2 systems will do about 3.8 or 4 if I am lucky, the i5 will do 4 and two halves in the 8 day challenge. For many there will be a low WU return for this challenge, so every one for the team will be valuable.
It is far more challenging than many of the challenges have been recently. Good luck to all.
____________
Member team AUSTRALIA
My lucky number is 9291*2^1085585+1 | |
|
|
Manually calculating how much time I estimate (Time remaining = (TimeDone/%complete) - TimeDone) they should finish during the challenge, but still around 100 total hours (I aborted two WUs on the i7 to free up some cache and what not). Still seems like a long time on processors that aren't super old (the Q9450 blows away anyone that's using a Q6600 and a first gen i3/5, and the i7 isn't too shabby either). I think I tracked down my estimate problem though - the two systems that are reporting extra long times are running the nightly boinc from git (have to compile from source on Fedora for GPU computing to work), where as the other two are still 7.0.x (one is a Win box, the other I built quite a while ago).
____________
"Apollo was astonished, Dionysus thought me mad. But they heard my story further and they wondered and were sad."
| |
|
|
Still seems like a long time on processors that aren't super old.
These are super large numbers. I have an I7 that will only finish 1 set before the end of the challenge. I have an I7 with AVX that should finish 5 or 6 sets. So the processor type does make a difference. My Q6600 is almost keeping up with my oldest I7.
So you are not seeing anything wrong with your systems. The only thing is that HT can cause a lot of slowdown for LLR tasks. Cache misses are enormous with these big numbers with HT on.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
Message boards :
Cullen/Woodall prime search :
Woo LLR wu's |