Message boards :
Problems and Help :
Getting way too many WUs to handle in time
Hello everyone,
I'm running PrimeGrid (PPS Sieve) on an NVIDIA GeForce 8400 GS only (no CPU work). I know that it doesn't have a lot of power, but I had it lying around and had a rig without a GPU, so I popped it in.
Unfortunately, I always get lots of tasks, and my GPU can't finish them all before the deadline. It takes about 20 hours to complete a task with this GPU. When BOINC requests new tasks, it gets a batch of about 20-30 WUs with a deadline of 6 days. At first the estimated run time was only 1 hour per WU, so I waited until it adjusted to a realistic estimate and aborted the tasks that wouldn't have finished in time. I thought that this would be a one-time occurrence and would now be fixed by the new estimates.
But now, I got a large batch of tasks again (about 20), even though BOINC should know better now (runtime estimates are about 20 hours which is realistic). I could handle about 6 or 7 before the deadline, but not 20.
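The arithmetic behind "6 or 7 before the deadline" can be sketched as follows. This is only an illustration using the figures from this post (6-day deadline, about 20 hours per task, a batch of 20); the function name is hypothetical, and it assumes the GPU crunches around the clock with one task at a time.

```python
import math


def tasks_completable(deadline_days: float, hours_per_task: float) -> int:
    """Whole tasks a single device can finish before the deadline.

    Assumes the device runs 24/7 and processes one task at a time.
    """
    available_hours = deadline_days * 24
    return math.floor(available_hours / hours_per_task)


# Figures taken from the post above.
deadline_days = 6
hours_per_task = 20
batch_size = 20

can_finish = tasks_completable(deadline_days, hours_per_task)
excess = max(0, batch_size - can_finish)

print(f"Can finish {can_finish} of {batch_size} tasks; {excess} would miss the deadline.")
```

In practice the usable number is a little lower, because tasks are not all started the moment they are downloaded.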
There is no way I can finish them. And because BOINC notices this, the WUs run at high priority and BOINC wastes a CPU core (which would otherwise go to another project I crunch for) by reserving it for the GPU task alone, leading to lots of CPU idle time.
Is there anything I can do? Do I harm the project if ~90% of my WUs time out?
Thanks!
David
| |
|
Michael Goetz (Volunteer moderator, Project administrator)
Joined: 21 Jan 10 Posts: 13618 ID: 53948 Credit: 265,089,491 RAC: 306,560
|
Not really sure why that is. Once the BOINC client completes one of those tasks, the next time it downloads work it should grab a reasonable amount. Or, perhaps, the time after that. You could try aborting the extra tasks and see whether it still gives you too many.
____________
My lucky number is 75898^524288+1 | |
|
|
I have the same issue using an old ATI board on PPS Sieve. I originally set the DCF (duration correction factor) to 1, and the tasks were estimated at about 8.5 minutes each. Naturally, this old board takes about 25 times that long, so the DCF has corrected to match. However, the client always downloads 30 or so tasks, and with only several days to complete them, it goes into high priority. I just have to calculate how many it can really complete and abort the rest. It does this every time, even with the DCF adjusted: once it completes the last task (usually in high priority), it downloads another large batch. So I will probably just have to get used to it.
On a side note, I haven't noticed a CPU being wasted with a GPU task in high priority. The PPS task isn't taking a full CPU in this state, or even close. Maybe it's an NVIDIA thing; I'm running an ATI card under Linux, so other variables may be in play. | |
|
|
I have tried to let the WUs time out (that's why it took so long to report back). I tried aborting the tasks before, but that didn't work.
Unfortunately, I just got 71 (!) new tasks. That's more than 60 days of work for my system!
Only one of my two cores is utilized (one GPU task running in high priority, one CPU task from another project running, one CPU task waiting to run). My personal priority is the other project, and PrimeGrid only runs to fill the otherwise unused GPU, so I would really like to get BOINC to use both CPU cores again.
Any ideas? | |
|
mikey
Joined: 17 Mar 09 Posts: 1261 ID: 37043 Credit: 525,300,866 RAC: 21,049
|
Lower the percentage under your account's PrimeGrid preferences to, say, half of what the other project has. The percentage is the top number, and it roughly determines how much work each project does relative to the other projects on your PCs. That means if you have two projects and both are at 100%, both try to do the same amount of work, but if you change one project to 50%, it will only try to do half as much work as the other project.
Now BOINC is a funny bird: it takes TIME, sometimes LOTS of time, to settle down and do things the way YOU want. But make a BIG change to the numbers and see what happens over the next week or so.
In the meantime, set PG to "no new tasks" and abort the ones you can't finish in time; the rest of us will get those units in our caches and all will be fine. After the existing units are all gone, allow new tasks again and see how many you get. It should be MUCH less, as BOINC will now know to run PG only half as often as the other project. If you have more than two projects it works the same way, but the numbers are not as black and white to BOINC; as I said, it is a funny bird sometimes. | |
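To make the "half as much work" point concrete, here is a small sketch of how the numbers relate. It assumes the percentage mentioned above is BOINC's resource share setting, and that each project's long-term portion is its share divided by the sum of all shares. The project name "OtherProject" and the function name are placeholders, and the real client only approaches these ratios over time.

```python
def share_fractions(resource_shares: dict[str, float]) -> dict[str, float]:
    """Return each project's fraction of the total resource share."""
    total = sum(resource_shares.values())
    if total <= 0:
        raise ValueError("at least one project needs a positive share")
    return {name: share / total for name, share in resource_shares.items()}


# PrimeGrid set to half of the other project's share, as suggested above.
fractions = share_fractions({"PrimeGrid": 50, "OtherProject": 100})

for name, fraction in fractions.items():
    print(f"{name}: {fraction:.1%} of the work")
```

With 50 against 100, PrimeGrid ends up with one third of the total and the other project with two thirds, so PrimeGrid does half as much as the other project.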
|
|
Thanks for your input, I tried something different again.
The "Waiting to run" status wasn't caused by PrimeGrid's high priority, but only by my machine needing more RAM. Tweaking the memory settings has already helped: all three tasks are running now, and I will try to get a bit more physical RAM for my machine.
Only one question remains: Would I harm the project if I just let all these WUs time out and only finish some of them? I don't want to check this box every day :)
If it's fine I'll just let it run like this. If it's not I will try the suggestion in the last reply.
| |
|
mikey
Joined: 17 Mar 09 Posts: 1261 ID: 37043 Credit: 525,300,866 RAC: 21,049
|
Yes, you can just let them time out or you can abort them now; it makes no difference in the big scheme of things. They will all get resent to someone else to crunch either way. | |
|
|
That's what I wanted to know. I'll just let it run unsupervised now.
Thank you. | |
|
mikey
Joined: 17 Mar 09 Posts: 1261 ID: 37043 Credit: 525,300,866 RAC: 21,049
|
No problem, if you have any more questions just ask. | |
|
|
I'm also getting a ridiculous amount of PrimeGrid work units lately and have tried everything to change that, but nothing works. And it is doing so on all of my client machines. I've let work units time out, I've aborted them multiple times in a row, I've reset the project a dozen times, I've even removed and reinstalled the project. Even changing my account preferences to NO GPU work does not stop these downloads of an impossible amount of GPU work!
The PG scheduler is still sending me forty or more 12-hour tasks for my single old ATI GPU, all with a deadline within the next two days. Obviously, the problem is on the PrimeGrid scheduler end. The scheduler is no longer paying any attention to preferences or benchmarks or common sense. The main problem with all of this is that, because of the deadlines, the client is prioritizing all the PrimeGrid work and none of my other projects are getting work done.
So I will simply suspend the PrimeGrid account until there is notification that this has been repaired. | |
|
mikey
Joined: 17 Mar 09 Posts: 1261 ID: 37043 Credit: 525,300,866 RAC: 21,049
|
Every time you reset the project, you reset the settings to the default, meaning the project unlearns all your old data. Why you are still getting GPU work after changing your settings to no GPU work is unknown to me, though. | |
|
|
On the Preferences web page, hopefully you tried to stop GPU work by un-checking the "Use ATI GPU" box near the top of the page, not by simply making all of the GPU subproject options un-checked (which for ATI there is only 1). Also, make sure you're working with the right "venue" for the machine: home, work, school, or "---" (which is the default venue).
As for the general "too much work" issue, many people have this problem. I agree with mikey that resetting/detaching probably just moves you back to square one. Just abort the excess work units; there's no stigma attached to that. Or just let them time out and they'll get recycled fairly quickly. BOINC will "learn" eventually and provide a more reasonable work cache.
--Gary | |
|
|
I just loaded Windows 7 to do some testing and I'm getting the same results.
I have my buffers set to 0.00 Max and 0.00 Min.
It just never stops downloading. Very strange. I've not seen this before. I know it does not happen on Linux.
Edit: I was running the beta of BOINC 7.2.5. I downgraded to the recommended version and it stopped downloading.
8/31/2013 9:37:43 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:37:43 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:37:44 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:37:54 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:37:54 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:37:55 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:05 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:05 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:06 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:16 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:16 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:17 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:27 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:27 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:28 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:38 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:38 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:39 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:49 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:49 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:51 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:01 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:01 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:02 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:12 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:12 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:13 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:23 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:23 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:24 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:34 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:34 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:35 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:45 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:45 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:46 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:39:56 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:39:56 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:39:57 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:40:07 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:40:07 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:40:09 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:40:19 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:40:19 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:40:21 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:40:31 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:40:31 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:40:32 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:40:36 PM | PrimeGrid | work fetch suspended by user
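For anyone who wants to quantify a fetch loop like the one in this log, here is a rough sketch that counts the tasks received and measures the gap between scheduler requests. It assumes the event log has been copied out as plain text in exactly the "date time | project | message" layout shown above; the function name and the embedded three-request sample are for illustration only.

```python
import re
from datetime import datetime

# Matches lines such as:
# 8/31/2013 9:37:44 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
LOG_LINE = re.compile(
    r"^(?P<ts>\d{1,2}/\d{1,2}/\d{4} \d{1,2}:\d{2}:\d{2} [AP]M)"
    r" \| (?P<project>[^|]+?) \| (?P<msg>.*)$"
)
GOT_TASKS = re.compile(r"got (\d+) new tasks")


def summarize(lines):
    """Return (total tasks received, seconds between consecutive requests)."""
    total = 0
    request_times = []
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if not match:
            continue  # skip anything that is not an event log line
        msg = match.group("msg")
        if msg.startswith("Sending scheduler request"):
            stamp = datetime.strptime(match.group("ts"), "%m/%d/%Y %I:%M:%S %p")
            request_times.append(stamp)
        got = GOT_TASKS.search(msg)
        if got:
            total += int(got.group(1))
    gaps = [(b - a).total_seconds() for a, b in zip(request_times, request_times[1:])]
    return total, gaps


# First three request cycles copied from the log above.
sample = """\
8/31/2013 9:37:43 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:37:43 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:37:44 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:37:54 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:37:54 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:37:55 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
8/31/2013 9:38:05 PM | PrimeGrid | Sending scheduler request: To fetch work.
8/31/2013 9:38:05 PM | PrimeGrid | Requesting new tasks for ATI
8/31/2013 9:38:06 PM | PrimeGrid | Scheduler request completed: got 1 new tasks
""".splitlines()

total_tasks, gaps = summarize(sample)
print(f"{total_tasks} tasks received; gaps between requests: {gaps} seconds")
```

A steady gap of about 11 seconds with one task per request, as in this sample, is the pattern described in this post: the client asks again almost as soon as the previous request completes.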
| |
|
mikey
Joined: 17 Mar 09 Posts: 1261 ID: 37043 Credit: 525,300,866 RAC: 21,049
|
Sounds like a bug! Put some numbers in the fields for the min and max and see if it ONLY downloads that much work, roughly. | |
|