Join PrimeGrid
Returning Participants
Community
Leader Boards
Results
Other
drummers-lowrise
|
Message boards :
Problems and Help :
Way WAY WAYYYY to much work
Author |
Message |
|
BOINC/Primegrid is grossly overestimating my CPU's ability to complete tasks. I've been watching my account over the last couple of months I've found that I either have to abort the tasks or BOINC will eventually do it.
The problem is the BOINC/Primegrid insists on sending more work with very short TAT and I'm aborting more tasks.
I've got my system to have 0.25 days of extra work which shouldn't allow that much.
I aborted several tasks as they wouldn't be started by the deadline. BOINC immediately sent me 34 more tasks that expired on december 1. These tasks wont be started until the 29th of this month. I've done the best case scenario. thats 275 hours of work over 7 cores. thats about 39 1/4 hours of work in 48 hours. That only works if I don't play any video games (which I do) during that time. BOINC should see that it only has so many hours a day to run tasks and not attempt to fill every single moment with a WU.
A simple easy solution is to move back the TAT date by a few days. Thats for every project on Primegrid. I'm getting a bit tired of having so many abandoned WU's when its just so unnecessary. You guys are clearly wasting so much bandwidth as I've seen so many WU's just sit. I can't believe that everyone is just quitting the project and not completing the work. This appears to me to be a problem with attempts to overfeed CPU's and wasting both of our time. | |
|
RafaelVolunteer tester
 Send message
Joined: 22 Oct 14 Posts: 905 ID: 370496 Credit: 459,403,918 RAC: 159,669
                   
|
BOINC/Primegrid is grossly overestimating my CPU's ability to complete tasks. I've been watching my account over the last couple of months I've found that I either have to abort the tasks or BOINC will eventually do it.
The problem is the BOINC/Primegrid insists on sending more work with very short TAT and I'm aborting more tasks.
I've got my system to have 0.25 days of extra work which shouldn't allow that much.
I aborted several tasks as they wouldn't be started by the deadline. BOINC immediately sent me 34 more tasks that expired on december 1. These tasks wont be started until the 29th of this month. I've done the best case scenario. thats 275 hours of work over 7 cores. thats about 39 1/4 hours of work in 48 hours. That only works if I don't play any video games (which I do) during that time. BOINC should see that it only has so many hours a day to run tasks and not attempt to fill every single moment with a WU.
A simple easy solution is to move back the TAT date by a few days. Thats for every project on Primegrid. I'm getting a bit tired of having so many abandoned WU's when its just so unnecessary. You guys are clearly wasting so much bandwidth as I've seen so many WU's just sit. I can't believe that everyone is just quitting the project and not completing the work. This appears to me to be a problem with attempts to overfeed CPU's and wasting both of our time.
Note #1: If your PC has started a task and it reports continuous back to the server, the deadline will be extended. So while downloading too much work is never a good thing, don't abort work that you've started already, but won't meet the deadline.
Note #2: If 0.25 days of extra work is too much... well, then simply lower that number. In fact, most of us here usually go with 0 buffer at all, and that's the recommended way to go about it. Though if you can't (say, PC is not often connected to the internet), then I guess you could make do with 0.1 days of work. As for wrong estimates, don't worry, that happens to everyone, and there's no real way to fix it either.
Note #3: FX CPUs are bad at LLR work and tasks take forever (when compared to Intel CPUs). Now, that's not to say that you can't crunch whatever you want, but I recommend avoiding the heavy hitters such as SoB, CUL or WOO. That's just a suggestion, though.
Note #4: you have 2 inconclusive tasks on your PC, which is a bad sign. This could be caused by a number of things, but the 2 big ones would be: A- a not so stable OC; B- heat is building up. Try giving your PC a good ol' de-dust section and monitor your temps, this could be a sign of a problem that's easily fixed by dust. | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 13780 ID: 53948 Credit: 343,945,598 RAC: 11,355
                              
|
A simple easy solution is to move back the TAT date by a few days. Thats for every project on Primegrid. I'm getting a bit tired of having so many abandoned WU's when its just so unnecessary. You guys are clearly wasting so much bandwidth as I've seen so many WU's just sit. I can't believe that everyone is just quitting the project and not completing the work. This appears to me to be a problem with attempts to overfeed CPU's and wasting both of our time.
BOINC is notoriously terrible at scheduling. We've got it running a lot better than it used to run, but it still is awful. Just somewhat less awful than it used to be. We'd love for it to work as people would expect it to work (and by that I mean "rationally"), but we can't do that without making a major change to our server that would be very disruptive -- and we're not even a tiny bit certain it would work.
You have two options:
1) Ignore the nonsense and just let BOINC do its thing. It seems very wasteful, but what it's wasting is actually inconsequential: a tiny bit of data transfer between your host and the server. The amount of bandwidth is insignificant. The BOINC client on your host automatically aborts tasks when they haven't started by the deadline. Tasks that are already started will be completed. So your computer isn't wasting any computing -- everything it does, will count. The tasks it aborts don't hurt anything (except for us humans' sensibilities, of course.)
2) Understand the conditions that are causing this to happen, and change the way you use BOINC so that those conditions are avoided. BOINC gets REALLY confused when you have more than one type of task running on your computer. So pick one sub project, and run just that one. BOINC will learn how long those tasks take, and should eventually start making more rational decisions. Also, as Rafael said, running a cache of 0 will completely solve this problem (with the understanding that this isn't a good solution for some people.) There's another reason to run with 0 cache: it gives you the best chance of being the person who gets credit for discovering a prime as opposed to being the double checker.
____________
My lucky number is 75898524288+1 | |
|
|
while its nice to know that the system will eventually report back the work. However, This prevents me from running other projects that have long TAT. Primegrid/BOINC will ignore those tasks and fill(overfill) my Que to the point that once again I am forced to abort tasks.
I was mistaken. I actually received 49 tasks. The second grouping of tasks is slated for 2 days after the first grouping. The huge problem once again is the limited time to return tasks and the quantity of tasks.
your point 3. is not relevant. BOINC knows after so many runs how long a task takes to run. Primegrid WU's always seem to have the incorrect time assigned to them when I get them. Some either complete well before time others complete several hours after. My CPU isn't in control of how BOINC and Primegrid check times.
I don't OC and Don't have a heat problem. 45 C on water cooling is generally considered good. You assume that the problem is with my system and not the other computer involved. | |
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 13780 ID: 53948 Credit: 343,945,598 RAC: 11,355
                              
|
I'm sorry my earlier response (or Rafael's) wasn't satisfactory. Unfortunately, there's nothing further I can do to assist you.
____________
My lucky number is 75898524288+1 | |
|
|
My PrimeGrid cache ceased to be a problem once I set BOINC manager to store '0' days of work and '0' days of additional work. The only thing that works consistently (EDIT: for me) is BOINC automatically downloading new work only as needed. This setting worked when I used my old AMD FX-8350 as well as my current Intel cpu.
I would only maintain a cache of work if I had crappy or intermittent internet access. | |
|
|
Since you are running multiple applications at PG, and BOINC gives a score (duration correction factor) only to a single project, not each application, the problem is BOINC not being able to provide an accurate number for your machine. You also run both CPU and GPU projects, which skews that number even more. BOINC is notorious for having issues with projects that have multiple applications, especially if they are of different styles, as PG has, with Sieve, LLR, GFN and AP.
Many people have asked the BOINC people to fix this. but they have not found a good way to provide that sort of ability. So, since each project gets one DFC, it's hard for it to estimate how much work you need.
____________
My lucky numbers are 121*2^4553899-1 and 3756801695685*2^666669±1
My movie https://vimeo.com/manage/videos/502242 | |
|
Message boards :
Problems and Help :
Way WAY WAYYYY to much work |