Author |
Message |
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Current project status
Project goal has been reached! 09 March 2020 first GFN-16 MEGA prime was found by Robish. See this post for details.
Π‘leanup of remaining workunits finished 05 May 2020. Project closed.
Total candidates tested: up to bmin+310000
Timeline:
2019-03-14: Project started
2020-03-09: Two GFN-16 MEGA primes discovered
2020-04-15: Started project cleanup before shutdown
2020-05-05: Cleanup finished, project closed.
Project description
We would like to invite you to small and fun side project, the search for first GFN-16 Mega Prime!
As JeppeSN noticed, first mega prime (>= 1'000'000 digits) is known for GFN-17 and above, but not for GFN-16 and below. GFN-15 and below requires very high b to reach 1M digits, above all software limits. For GFN-16, this goal is possible.
How it works
We will search for a prime starting from bmin=1814570322693370 (i.e. 181457032269337065536+1, lowest candidate with 1'000'000 digits) up to bmin+1000000. This b is above Genefer limit, so we'll use standard LLR software, which all of you should be familiar with from PrimeGrid projects. There are about 156'000 candidates to test.
The prime is a true mega prime and will be reported to T5K.
It may be smallest mega prime ever found.
Schedule
The search will be started at March, 14, 2019, 12:00 UTC (plus-minus few minutes, daemons on this server are slow and lazy beasts, they need a minute or two to scan database and fill buffers with tasks).
This search is planned to be stopped when prime is found and cleanup completes (to be sure that no primes below are missed), but this decision is not final. It's expected to have 3-4 primes in the search range, and they could be anywhere. We may find a prime on the second day of the search, lacking all fun. In this case, search could be expanded until next prime is found or even for full test range.
How to participate
The search is held on my experimental GFN server. To participate as new user, please register at this Boinc server at http://boincvm.proxyma.ru:30080/test4vm/
- Go to URL above;
- Click on "Your account" on left panel;
- Click on the "or create an account" link in the bottom left part of the page;
- In the "Invitation Code" field, enter PrimeGrid
- Fill other fields in usual way. It's preferred to use same email address as on PrimeGrid.
- After registration, connect your Boinc client to the project using URL above. It's recommended to use "weak account key" in place of of true password. Your can get your key on "Your account" page, click on "Account keys"
After registration, please change your computing preferences. By default server will send you GFN-13 (if you have GPU) and a mix of GFN-14 and GFN-16 MEGA tasks. Go to "Your account" -> "Computing Preferences" and select GFN-16 only or GFN-16 + GFN-13 if you want to participate in second experimental GPU-only project.
The server is compatible with PrimeGrid and supports extra venues (Sun...Pluto). Note: this is not a PrimeGrid server. Although all discussion and support happens here on PrimeGrid forum, the server is running on my own resources.
Optimizing performance
Tasks will took 4-8 hours on modern processors with FMA3 (for a single core). LLR multithreading will help to use CPU cache better and improve overall throughput.
LLR multithreading was discussed may times on PG forums and I'll not repeat here everything. Application name for app_config.xml is gfn16_mega, and minimal app_config.xml for 4 threads may look like this:
<app_config>
<app_version>
<app_name>gfn16_mega</app_name>
<cmdline>-t 4</cmdline>
<avg_ncpus>4</avg_ncpus>
</app_version>
</app_config>
For these new users who would also like to try GFN-13 or GFN-14, GPU optimization details for these projects are described here.
Running PrimeGrid as backup project
It's a good idea to crunch something even if project server temporary goes offline. Go to PrimeGrid Preferences page and set "Resource Share" to 0. If GFN server becomes out work or unreachable, your Boinc client will automatically switch to PrimeGrid tasks until GFN server recovers.
Credit
Credit is assigned according to PrimeGrid rules. But this is not a PrimeGrid server, so credit stays here. Detailed statistic is available (all tasks have same size and give same amount of credit).
Edit: fixed closing tag in app_config |
|
|
SpearVolunteer tester
 Send message
Joined: 29 Mar 09 Posts: 148 ID: 37675 Credit: 1,344,985,510 RAC: 148
                        
|
Small typo in the app_config, the / is missing from the closing app_config. |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
I'm in :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
Dad Send message
Joined: 28 Feb 18 Posts: 284 ID: 984171 Credit: 182,080,291 RAC: 0
                 
|
Me too
Go team Storm!!!
____________
Tonight's lucky numbers are
555*2^3563328+1 (PPS-MEGA)
and
58523466^131072+1 (GFN-17 MEGA) |
|
|
Vato Volunteer tester
 Send message
Joined: 2 Feb 08 Posts: 850 ID: 18447 Credit: 707,599,869 RAC: 1,598,367
                           
|
Do you have a plan for submitting to T5K?
Or would the lucky finder have to do it themselves?
Not that it's hard, but getting the Proof code right for person, project and software is important, and we wouldn't want someone else submitting it!
Forewarned is forearmed :-)
____________
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Do you have a plan for submitting to T5K?
Or would the lucky finder have to do it themselves?
Not that it's hard, but getting the Proof code right for person, project and software is important, and we wouldn't want someone else submitting it!
Forewarned is forearmed :-)
Exact answer on this question will depend on who'll find this prime. Many primes finders here (including myself) already have accounts created and managed by PG.
|
|
|
|
I am getting only this:
Th 14 Mar 2019 07:27:14 CET | PRIVATE GFN SERVER | No tasks are available for GFN-16 MEGA Prime Search
____________
DeleteNull |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
I am getting only this:
Th 14 Mar 2019 07:27:14 CET | PRIVATE GFN SERVER | No tasks are available for GFN-16 MEGA Prime Search
It's not due to start till 12 uct so your good to go DeleteNull. :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 430,792,981 RAC: 1,088,994
                               
|
Do you have a plan for submitting to T5K?
Or would the lucky finder have to do it themselves?
Not that it's hard, but getting the Proof code right for person, project and software is important, and we wouldn't want someone else submitting it!
Forewarned is forearmed :-)
Exact answer on this question will depend on who'll find this prime. Many primes finders here (including myself) already have accounts created and managed by PG.
To report a prime to T5K, you need (1) a user account and (2) a proof coce. The proof code identifies the person who found the prime, one or more programs that were used to find the prime, and zero or more organizations that were involved in finding the prime.
Code L4341, for example, has:
User: Goetz
Programs: GFNSvCUDA, GeneFer, AthGFNSieve, LLR
Organizations: PrimeGrid
If I were to find a GFN16 mega prime on your project, my account code could certainly be used, but the existing proof code would contain incorrect information for the organization and for some of the programs.
____________
My lucky number is 75898524288+1 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Wow, first result already returned at 13:52 UTC, it was less then two hours! Two more were returned few minutes later. Alas, they weren't primes... :)
|
|
|
Crun-chi Volunteer tester
 Send message
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 125,986
                         
|
To report a prime to T5K, you need (1) a user account and (2) a proof coce
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4β*β650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! |
|
|
|
<app_config>
<app_version>
<app_name>gfn16_mega</app_name>
<cmdline>-t 4</cmdline>
<avg_ncpus>4</avg_ncpus>
</app_version>
</app_config>
Well, even if there was a typo there which I fixed before even trying first, this "minimal" xml doesn't work (for me) - I get a syntax error nevertheless and lost my wus several times now.
As far as I could see in the project folder the app name is NOT gfn16_mega but cllr64.3.8.21.exe, so I tried it with a more detailed one like this:
<app_config>
<app_version>
<app_name>gfn16_mega</app_name>
<user_friendly_name>GFN-16 MEGA Prime Search</user_friendly_name>
<version_num>801</version_num>
<file_ref>
<file_name>cllr64.3.8.21.exe</file_name>
<main_program/>
</file_ref>
<cmdline>-t 6</cmdline>
<avg_ncpus>6</avg_ncpus>
</app_version>
</app_config>
But it still doesn't work - I don't get where the mistake is...
____________
Life is Science, and Science rules. To the universe and beyond
Proud member of BOINC@Heidelberg
|
|
|
|
Hello Doctor,
I am using (8x8 threads, cc_config with 8 ncpus):
<app_config>
<app>
<name>gfn16_mega</name>
<fraction_done_exact/>
<max_concurrent>8</max_concurrent>
</app>
<app_version>
<app_name>gfn16_mega</app_name>
<cmdline>-t 8</cmdline>
<avg_ncpus>1</avg_ncpus>
<max_ncpus>1</max_ncpus>
</app_version>
</app_config>
This works fine for me.
____________
DeleteNull |
|
|
SpearVolunteer tester
 Send message
Joined: 29 Mar 09 Posts: 148 ID: 37675 Credit: 1,344,985,510 RAC: 148
                        
|
I've only had the smallest amount of contact with configuring multithreaded apps, but is there a limit on how scalable this one is? I can't get it to use more than about 65% of the CPU power despite setting it to use 9 of the 10 cores available. |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
<app_config>
<app_version>
<app_name>gfn16_mega</app_name>
<cmdline>-t 4</cmdline>
<avg_ncpus>4</avg_ncpus>
</app_version>
</app_config>
Well, even if there was a typo there which I fixed before even trying first, this "minimal" xml doesn't work (for me) - I get a syntax error nevertheless and lost my wus several times now.
As far as I could see in the project folder the app name is NOT gfn16_mega but cllr64.3.8.21.exe, so I tried it with a more detailed one like this:
<app_config>
<app_version>
<app_name>gfn16_mega</app_name>
<user_friendly_name>GFN-16 MEGA Prime Search</user_friendly_name>
<version_num>801</version_num>
<file_ref>
<file_name>cllr64.3.8.21.exe</file_name>
<main_program/>
</file_ref>
<cmdline>-t 6</cmdline>
<avg_ncpus>6</avg_ncpus>
</app_version>
</app_config>
But it still doesn't work - I don't get where the mistake is...
This "detailed" text is a incorrect mix of app_info.xml and app_config.xml directives. They're two different files with completely different purpose and directives. The "minimal" one must be saved as app_config.xml |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
I've only had the smallest amount of contact with configuring multithreaded apps, but is there a limit on how scalable this one is? I can't get it to use more than about 65% of the CPU power despite setting it to use 9 of the 10 cores available.
First of all, be careful when editing number of cores in app_config and rereading configuration on the fly. You must restart running tasks to apply new settings. Just pause them, Boinc will immediately run something else with new settings. Don't forget to unpause.
As for problem with 9-10 cores, I don't know. Required number of threads simply passed to LLR, the rest is up to LLR. All what you can do is to check that command line was correct. It can be done in Windows task manager. I don't remember exact details on Win10 (I'm on Win7 now) but there must something like "Details" tab, on this tab "Select columns" option is accessible somewhere (may be via right click, may be via top menu), where you can choose "Command line". Then look for command line of primegrid_llr process.
|
|
|
|
The "minimal" one must be saved as app_config.xml
app_config? Oh I see, overlooked that... *doh*
Seems to work now! :-)
@DeleteNull:
Hi "Stoffel" - the nickname's still DoctorNow, not only Doctor. That's "elemantary". ;-)
____________
Life is Science, and Science rules. To the universe and beyond
Proud member of BOINC@Heidelberg
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Stream. Any metrics on 16mega wus? Number sent out, returned etc? Just to guage the interest. Cheers Rob
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Stream. Any metrics on 16mega wus? Number sent out, returned etc? Just to guage the interest. Cheers Rob
After 24 hours:
Returned with success: 223
Returned with error: 0 (not including manually aborted tasks)
In progress: 494
I'll not post number of sent tasks, it's just a sum of all above.
Highest candidate returned (leading edge): bmin+6770
Validation is not enabled yet, everybody will receive only primary-checker tasks for some time. |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Stream. Any metrics on 16mega wus? Number sent out, returned etc? Just to guage the interest. Cheers Rob
After 24 hours:
Returned with success: 223
Returned with error: 0 (not including manually aborted tasks)
In progress: 494
I'll not post number of sent tasks, it's just a sum of all above.
Highest candidate returned (leading edge): bmin+6770
Validation is not enabled yet, everybody will receive only primary-checker tasks for some time.
Thanks Stream. π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
Will finders be rewarded with an "M" badge? |
|
|
Michael Goetz Volunteer moderator Project administrator
 Send message
Joined: 21 Jan 10 Posts: 14011 ID: 53948 Credit: 430,792,981 RAC: 1,088,994
                               
|
Will finders be rewarded with an "M" badge?
This is not a PrimeGrid project, so no.
____________
My lucky number is 75898524288+1 |
|
|
|
Will finders be rewarded with an "M" badge?
This is not a PrimeGrid project, so no.
Gotcha, thanks. |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Hi Stream
ΠΡΠ±ΠΎΠ΅ ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΠ΅ ΠΎ ΠΠ΅ΡΠ²ΠΎΠΌ ΠΏΠΎΠΈΡΠΊΠ΅ MEGA - ΡΠΊΠΎΡΠΎ :)
ΠΡ Π½Π°ΡΠ»ΠΈ ΡΡΠΎ-Π½ΠΈΠ±ΡΠ΄Ρ Π΅ΡΠ΅?
Rob.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
dukebgVolunteer tester
 Send message
Joined: 21 Nov 17 Posts: 242 ID: 950482 Credit: 23,670,125 RAC: 0
                  
|
Rob, you intended to ask "did we find anything yet?", but your question reads as "is there anything else we found?". I appreciate the attempt, though! I'm always in favor of people learning languages. |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Rob, you intended to ask "did we find anything yet?", but your question reads as "is there anything else we found?". I appreciate the attempt, though! I'm always in favor of people learning languages.
Oops, ah well , I tried :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
ΠΡΠ±ΠΎΠ΅ ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΠ΅ ΠΎ ΠΠ΅ΡΠ²ΠΎΠΌ ΠΏΠΎΠΈΡΠΊΠ΅ MEGA - ΡΠΊΠΎΡΠΎ :)
:)))
OK, here is an update.
After 4 days:
------------------------------------------
Returned with success: 1018
Returned with error: 25 (not including manually aborted tasks)
In progress: 588
Leading edge: bmin+10482
No primes yet.
------------------------------------------
Leading edge is an estimated value. This version of server software is sending tasks in random order, selecting them from intermediate buffer of 450 tasks. It can be any number of tasks below leading edge which even wasn't sent to first host. The buffer itself is filled from database sequentially, ordered by 'b'
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
ΠΡΠ±ΠΎΠ΅ ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΠ΅ ΠΎ ΠΠ΅ΡΠ²ΠΎΠΌ ΠΏΠΎΠΈΡΠΊΠ΅ MEGA - ΡΠΊΠΎΡΠΎ :)
:)))
OK, here is an update.
After 4 days:
------------------------------------------
Returned with success: 1018
Returned with error: 25 (not including manually aborted tasks)
In progress: 588
Leading edge: bmin+10482
No primes yet.
------------------------------------------
Leading edge is an estimated value. This version of server software is sending tasks in random order, selecting them from intermediate buffer of 450 tasks. It can be any number of tasks below leading edge which even wasn't sent to first host. The buffer itself is filled from database sequentially, ordered by 'b'
That's great Stream, Π‘ΠΏΠ°ΡΠΈΠ±ΠΎ :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
After 10 days:
Completed successfully: 2932
Completed with error: 29
In progress: 376
Estimated leading edge: bmin+23136
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
After 10 days:
Completed successfully: 2932
Completed with error: 29
In progress: 376
Estimated leading edge: bmin+23136
Thanks Stream.
At the current participation rate it's looking like about a 2 or 3 year project.
That is until we find a prime, which would probably see more people coming on board.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
At the current participation rate it's looking like about a 2 or 3 year project.
That is until we find a prime, which would probably see more people coming on board.
No, it's too pessimistic. The goal of project is to find first mega prime for GFN-16. There is no need to test whole range, other primes will not have scientific meaning. We can try to find 1-2 other primes just for fun if we find first prime too fast (2-3 months).
How long could it take to find first prime? First of all, it's a lottery, we may find it tomorrow. Talking about averages and probabilities, JeppeSN mentioned that prediction is 3.89 primes in the interval of length 800'000, which give us that we should test (in average) interval of ~206K per prime. Let's take even more pessimistic value of 300K. Then, using our rate after 10 days, we have following rough estimation:
leading edge 23136 - 10 days
leading edge 300'000 - 130 days
Total: it should take no more then 260 days (130*2 due to double-checking)
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
At the current participation rate it's looking like about a 2 or 3 year project.
That is until we find a prime, which would probably see more people coming on board.
No, it's too pessimistic. The goal of project is to find first mega prime for GFN-16. There is no need to test whole range, other primes will not have scientific meaning. We can try to find 1-2 other primes just for fun if we find first prime too fast (2-3 months).
How long could it take to find first prime? First of all, it's a lottery, we may find it tomorrow. Talking about averages and probabilities, JeppeSN mentioned that prediction is 3.89 primes in the interval of length 800'000, which give us that we should test (in average) interval of ~206K per prime. Let's take even more pessimistic value of 300K. Then, using our rate after 10 days, we have following rough estimation:
leading edge 23136 - 10 days
leading edge 300'000 - 130 days
Total: it should take no more then 260 days (130*2 due to double-checking)
Ahhh right. I thought we were testing the whole range, That looking much more promising :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
I have done 20 WUs so far, all but three are solo, no wingman, and the three timed out before I even joined.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
I have done 20 WUs so far, all but three are solo, no wingman, and the three timed out before I even joined.
Solo is OK for a while, I'm still making small tweaks and improvements to validator program so it's not running full-time on all workunits.
Timed out tasks were sent to somebody else but server didn't get any reply from this user. So a second copy of tasks was created and sent to other user (you). It happens very often.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
After 10 days:
Completed successfully: 2932
Completed with error: 29
In progress: 376
Estimated leading edge: bmin+23136
After 20 days Stream?
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
After 20 days Stream?
Completed successfully: 5285
In progress: 247
Estimated leading edge: bmin+38302
No primes yet.
As usual, estimated leading edge is indeed estimated. It's better to use number of successfully completed tasks to describe project progress.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
After 20 days Stream?
Completed successfully: 5285
In progress: 247
Estimated leading edge: bmin+38302
No primes yet.
As usual, estimated leading edge is indeed estimated. It's better to use number of successfully completed tasks to describe project progress.
Thanks Stream, progressing along nicely :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
Validation is not enabled yet, everybody will receive only primary-checker tasks for some time.
What are the plans for turning on double-checking? It can be helpful for the owners of the individual hosts to detect if some computer is not crunching properly. /JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Validation is not enabled yet, everybody will receive only primary-checker tasks for some time.
What are the plans for turning on double-checking? It can be helpful for the owners of the individual hosts to detect if some computer is not crunching properly. /JeppeSN
1-2 weeks to be sure that no other ideas about new validator improvements came into my mind (for example, just implemented JimB's idea to track LLR version and avoid buggy ones).
Few tasks which had LLR errors during crunching were sent to double-checkers manually. Lucky, all of them were validated.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
After 1 month:
Completed successfully: 7640
In progress: 205
Estimated leading edge: bmin+51918
No primes yet.
There are total of 150'000 candidates and 4 possible primes in the whole 1M range, so we may expect to find one prime per 37500 candidates (average), or within bmin+250000. Thus, we made quite good progress in this month, completing (in one pass) more then 20% of these numbers.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
I've made a simple scoreboard for this project: http://boincvm.proxyma.ru:30080/test4vm/user_profile/gfn16_hunt_status.html
|
|
|
|
Optimizing performance
Tasks will took 4-8 hours on modern processors with FMA3 (for a single core). LLR multithreading will help to use CPU cache better and improve overall throughput.
Just a quick question for understanding this since I'm currently a bit wondering about my runtimes in comparison to others sometimes:
Is this app ONLY using FMA3 or does it also use others like AVX etc.?
____________
Life is Science, and Science rules. To the universe and beyond
Proud member of BOINC@Heidelberg
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Optimizing performance
Tasks will took 4-8 hours on modern processors with FMA3 (for a single core). LLR multithreading will help to use CPU cache better and improve overall throughput.
Just a quick question for understanding this since I'm currently a bit wondering about my runtimes in comparison to others sometimes:
Is this app ONLY using FMA3 or does it also use others like AVX etc.?
This is standard LLR program, exactly the same as used for many PrimeGrid projects. It can use all possible processor features - SSEx, AVX, etc, but best performance is currently achieved on Intel processors supporting AVX2/FMA3.
Comparing runtimes with other users often have no sense because many peoples are using LLR in multi-threaded mode; you must know their exact setup. Multi-threading is recommended for this project; there are enough information on PG forums how to write necessary app_config.xml
|
|
|
Vato Volunteer tester
 Send message
Joined: 2 Feb 08 Posts: 850 ID: 18447 Credit: 707,599,869 RAC: 1,598,367
                           
|
Wow - Robish is hammering it!
____________
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
anyone else seeing hanging tasks on this project? my machine seems to be unable to report or get new tasks. I know the 'job cache full' is because the machine grabbed tasks from PG, but currently have four tasks waiting to report results on the machine.
18045 PRIVATE GFN SERVER 4/23/2019 7:41:41 AM update requested by user
18046 PRIVATE GFN SERVER 4/23/2019 7:41:44 AM Sending scheduler request: Requested by user.
18047 PRIVATE GFN SERVER 4/23/2019 7:41:44 AM Reporting 4 completed tasks
18048 PRIVATE GFN SERVER 4/23/2019 7:41:44 AM Not requesting tasks: don't need (job cache full)
18049 PRIVATE GFN SERVER 4/23/2019 7:42:16 AM Scheduler request failed: Error 408
18050 PRIVATE GFN SERVER 4/23/2019 7:43:26 AM Fetching scheduler list
18051 PRIVATE GFN SERVER 4/23/2019 7:43:27 AM Master file download succeeded
18052 PRIVATE GFN SERVER 4/23/2019 7:43:32 AM Sending scheduler request: To report completed tasks.
18053 PRIVATE GFN SERVER 4/23/2019 7:43:32 AM Reporting 4 completed tasks
18054 PRIVATE GFN SERVER 4/23/2019 7:43:32 AM Not requesting tasks: don't need (job cache full)
18055 PRIVATE GFN SERVER 4/23/2019 7:44:03 AM Scheduler request failed: Error 408
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
anyone else seeing hanging tasks on this project? my machine seems to be unable to report or get new tasks. I know the 'job cache full' is because the machine grabbed tasks from PG, but currently have four tasks waiting to report results on the machine.
I see unusually high number of request timeouts in Apache logs today (~100, with almost zero for all last week). I remember we had something similar during GFN-13 crunching. May be something wrong with network or my ISP today. Don't worry, your tasks will be reported on the following retries.
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
stream: still seeing the same issue from my side, wondering if it's just me or everyone.
looking at netstat, it seems the connection is either left in a "Closing" state, or "Last Ack" state after the connection times out and I get the "Scheduler request failed: error 408" message in the boinc log. maybe some kind of network issue, but hard to tell if it's me or not -- the same machine has no issues contacting primegrid.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
stream: still seeing the same issue from my side, wondering if it's just me or everyone.
This error seems to happen only with few IP addresses. In most cases they could recover, but look like not in yours. I think it's not your problem, it's something misbehaving in all these routers and NATs between some users and me. When your clients sends a POST request to scheduler, my Apache reports an error
(70007)The timeout specified has expired: [client *****] AH01224: Timeout during reading request entity data
I.e. it cannot receive complete POST request from you. Rest of site is accessed without problems (may be because GET requests are short).
It may be related to my ISP because number of errors greatly increased exactly after last loss of connection. May be I'm connected to some buggy routers now. I made some changes to Apache configuration, if it'll not help - I'll reconnect once again and hope that new pool will be better (I prefer to avoid this, each reconnection changes my public IP. Although DNS renewal period is set to 1 hour, some systems are caching DNS replies too long, causing problems for some users for 4-8 hours).
|
|
|
|
When your clients sends a POST request to scheduler, my Apache reports an error
(70007)The timeout specified has expired: [client *****] AH01224: Timeout during reading request entity data
I.e. it cannot receive complete POST request from you.
cc_config.xml
<max_tasks_reported>N</max_tasks_reported>
Report at most N tasks per scheduler RPC. Try N=1000 if your computer has lots of tasks to report and is having trouble completing a scheduler RPC.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
When your clients sends a POST request to scheduler, my Apache reports an error
(70007)The timeout specified has expired: [client *****] AH01224: Timeout during reading request entity data
I.e. it cannot receive complete POST request from you.
cc_config.xml
<max_tasks_reported>N</max_tasks_reported>
Report at most N tasks per scheduler RPC. Try N=1000 if your computer has lots of tasks to report and is having trouble completing a scheduler RPC.
According to source, it just cuts request after N tasks, so it makes sense to try N=1 to make request shorter. N=1000 will not do anything because default value is unlimited and there are only 10-20 tasks pending.
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
I tried limiting reports to 1 as directed -- it didnt make a difference (except that it caused the machine to get ahead of the short SGS PG tasks it's crunching) so, changed it back.
I did a little bit of packet capture (and understand, I'm no network engineer) and I see my machine talking to the private server, but when it gets to the 'scheduler request' packets, I don't see any response, so it retries several times with no response. |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
I did a little bit of packet capture (and understand, I'm no network engineer) and I see my machine talking to the private server, but when it gets to the 'scheduler request' packets, I don't see any response, so it retries several times with no response.
Tried to setup packet capture too (on the router) and my router rebooted... So it reconnected and may be the problem with go away as soon as DNS recovers. At least I will have packet capture on my side.
|
|
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
 Send message
Joined: 17 Oct 05 Posts: 2392 ID: 1178 Credit: 18,631,505,153 RAC: 7,088,033
                                                
|
@TimT Have you tried to flush the DNS and reconnect? On windows, enter the following on the command line:
ipconfig /flushdns
I don't remember the command in Linux if that is the OS on the machine having trouble.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
@TimT Have you tried to flush the DNS and reconnect? On windows, enter the following on the command line:
ipconfig /flushdns
I don't remember the command in Linux if that is the OS on the machine having trouble.
No, it's not related. I see incoming connections from his PC and even some data of HTTP request arrives (Apache can understand that it's POST request and get URL), but Apache cannot read remaining POST request body.
|
|
|
dukebgVolunteer tester
 Send message
Joined: 21 Nov 17 Posts: 242 ID: 950482 Credit: 23,670,125 RAC: 0
                  
|
The POST requests are generally often split by various libraries into 2+ separate TCP packets β the header and the body (the body is often multiple packets itself when the request is big). I've found that out myself at work about few months ago when the task I was doing required maximizing throughput of a high amount of parallel stream messaging (done with POSTs). The messages were very short, just like 20 bytes of data posted, the headers were much bigger themselves.
This is also highly connected to implementations of the Nagle's algorithm (see wikipedia if needed). If I recall correctly, in my case throughput was much better when disabling nagle algorithm β while it was on (by default in dotnet framework implementation of httpclient) each POST was two packets as described, with it off it was a single packet.
This of course is very unlikely to help the problem since nagling is about throughput, not the ability to send packets at all. But it's an insight about how there's probably this separation of packets into headers and bodies.
P.S. Yeah, and the reason why throughput in my case was higher with single packets was because the client would wait for the web server to answer to the first packet while the webserver waits some tiny amount of time expecting the continuation before sending the ACK to the first (this is also related to Expect100Continue stuff) β that tiny amount is negligible when you're sending a single request, but not when trying to optimize throughput. |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Got a packet capture, seems to be low-level TCP/IP problem (not related to applications). First packet of request body never reaches input interface on my router. May be it's MTU/MSS problem somewhere in the middle (this packet should be of maximum length), may be problem is in these damn ISPs which have lot of traffic ananyzers and ALG's these days which may trigger on some data inside packet where they should not.
TL;DR: try to set
<http_1_0>0</http_1_0>
in cc_config.xml, it may change data and sequence of packets.
No. Time Protocol Info
1 0.000000 TCP 54426 > 30080 [SYN] Seq=0 Win=29200 Len=0 MSS=1460 TSV=1091435396 TSER=0 WS=7
2 0.001194 TCP 30080 > 54426 [SYN, ACK] Seq=0 Ack=1 Win=28960 Len=0 MSS=1460 TSV=934037421 TSER=1091435396 WS=7
3 0.143876 TCP 54426 > 30080 [ACK] Seq=1 Ack=1 Win=29312 Len=0 TSV=1091435539 TSER=934037421
*** session established with MSS=1460 (maximum segment size)
4 0.143911 TCP 54426 > 30080 [PSH, ACK] Seq=1 Ack=1 Win=29312 Len=283 TSV=1091435540 TSER=934037421
*** request header
POST /test4vm_cgi/cgi HTTP/1.1
.......
Content-Length:
9535
Expect: 100-continue
5 0.144702 TCP 30080 > 54426 [ACK] Seq=1 Ack=284 Win=30080 Len=0 TSV=934037457 TSER=1091435540
*** (received)
6 0.146646 TCP 30080 > 54426 [PSH, ACK] Seq=1 Ack=284 Win=30080 Len=25 TSV=934037457 TSER=1091435540
*** server said to continue, as requested
HTTP/1.1 100 Continue
7 0.289698 TCP 54426 > 30080 [ACK] Seq=284 Ack=26 Win=29312 Len=0 TSV=1091435685 TSER=934037457
*** (received)
8 0.291714 TCP [TCP Previous segment lost] 54426 > 30080 [PSH, ACK] Seq=8972 Ack=26 Win=29312 Len=847 TSV=1091435686 TSER=934037457
*** As said above, first segment is lost. This is a second (last) part of request body. Text is split in the middle. First part was never received.
K, Pass1=448, Pass2=768, 4 threads..... (and so on)
0370 74 68 65 72 5f 72 65 73 75 6c 74 73 3e 0a 3c 2f ther_results>.</
0380 73 63 68 65 64 75 6c 65 72 5f 72 65 71 75 65 73 scheduler_reques
0390 74 3e 0a t>.
9 0.292683 TCP [TCP Window Update] 30080 > 54426 [ACK] Seq=26 Ack=284 Win=31744 Len=0 TSV=934037494 TSER=1091435685 SLE=8972 SRE=9819
*** server asked to resend, but resent packets seem to be lost in same way
10 300.244378 TCP 30080 > 54426 [PSH, ACK] Seq=26 Ack=284 Win=31744 Len=494 TSV=934112482 TSER=1091435685 SLE=8972 SRE=9819
*** after 300 seconds, server sent a error.
HTTP/1.1 408 Request Timeout
......
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
MSS=1460 seems to be wrong. I'm on L2TP so it must be 1420. I found that my router is working in Path MTU Discovery (PMTUD) mode. According to Cisco doc, "a disadvantage of this mode is that it does not work reliable on the Internet" :) Also see https://en.wikipedia.org/wiki/Black_hole_(networking)#PMTUD_black_holes Which seems to be your case - some router on your side, or between me and you fails to support PMTUD, probably blocks ICMP. (on your side because it works for everybody else).
I'll think what can be done here.
|
|
|
Vato Volunteer tester
 Send message
Joined: 2 Feb 08 Posts: 850 ID: 18447 Credit: 707,599,869 RAC: 1,598,367
                           
|
You may be able to reduce MSS on your server socket to something sufficiently low.
However, it may still be a no-op, since many other connections are working fine.
____________
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
Interesting enough, I tried the cc_config change Http1.0 -- my config was already set to 1 for that setting, so I gave 0 a shot -- seems to be no difference, or didn't fix the issue anyway.
I tried some random setting I found that attempted to add a static route to your server with a specific MSS value, that didn't seem to help either. read a bit about the tunnel mss issue and the PMTUD stuff, and it really seems strange that we would use a protocol that relies on ICMP delivery across the internet -- it seems rare to even be able to traceroute very far these days. either way, something somewhere must have changed in the last few days just to break my machine :(
for what it's worth, maybe the iptables stuff in this article could help?
https://www.zeitgeist.se/2013/11/26/mtu-woes-in-ipsec-tunnels-how-to-fix/
if you are interested, I'm happy to send you the cap file from my side if you PM me with an email address.
edit: (and just for kicks, here's what my traceroute looks like -- I'm guessing ICMP has a snowball's chance of making it here)
[timt@Node1 boinc]$ traceroute 128.73.162.179
traceroute to 128.73.162.179 (128.73.162.179), 30 hops max, 60 byte packets
1 gateway (10.10.0.1) 3.355 ms 3.556 ms 3.541 ms
2 * * *
3 B3309.RCMDVA-LCR-21.verizon-gni.net (130.81.30.208) 12.996 ms B3309.RCMDVA-LCR-22.verizon-gni.net (130.81.30.210) 13.773 ms 13.510 ms
4 * * *
5 * * *
6 0.ae17.GW11.IAD8.ALTER.NET (140.222.225.135) 20.924 ms 8.457 ms 0.ae18.GW11.IAD8.ALTER.NET (140.222.225.137) 10.465 ms
7 vodafone-gw.customer.alter.net (204.148.79.74) 11.114 ms 9.723 ms 11.074 ms
8 et-10-1-0-xcr1.ltw.cw.net (195.2.24.246) 98.732 ms 95.086 ms 94.728 ms
9 ae27.0-xcr1.hex.cw.net (195.2.28.149) 94.723 ms 95.151 ms 95.191 ms
10 ae21-xcr1.att.cw.net (195.2.28.166) 89.468 ms 112.579 ms 88.254 ms
11 ae10-xcr2.fri.cw.net (195.2.8.189) 110.989 ms 108.476 ms 106.877 ms
12 217.161.65.146 (217.161.65.146) 105.983 ms 105.036 ms 98.284 ms
13 pe01.Tula.gldn.net (79.104.234.111) 137.345 ms pe01.Tula.gldn.net (79.104.234.109) 148.946 ms pe02.Tula.gldn.net (79.104.234.103) 159.807 ms
14 194.154.95.7 (194.154.95.7) 140.276 ms 79.104.58.126 (79.104.58.126) 130.927 ms 134.810 ms
15 * * *
16 * * *
17 * * *
18 * * *
19 * * *
20 * * *
21 * * *
22 * * *
23 * * *
24 * * *
25 * * *
26 * * *
27 * * *
28 * * *
29 * * *
30 * * *
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
...and Viola!
I've again stumbled upon the brilliance of someone else and put together a command that at least makes things work..
ip route add 128.73.162.179/32 via 10.10.0.1 dev eno2 advmss 1400 mtu 1400
apparently, this applies a specific MTU to connections to your server only and I'll probably have to manually redo it when your IP changes... anyhow, working now! |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
edit: (and just for kicks, here's what my traceroute looks like -- I'm guessing ICMP has a snowball's chance of making it here)
13 pe01.Tula.gldn.net (79.104.234.111) 137.345 ms pe01.Tula.gldn.net (79.104.234.109) 148.946 ms pe02.Tula.gldn.net (79.104.234.103) 159.807 ms
14 194.154.95.7 (194.154.95.7) 140.276 ms 79.104.58.126 (79.104.58.126) 130.927 ms 134.810 ms
15 * * *
16 * * *
Yes, here is point where ICMP breaks. After last host in your trace should be one more intermediate host and final destination. Here is a traceroute from my home (uses another ISP, so path is not shortcut'ed:
7 45 ms 45 ms 44 ms 79.133.94.226
8 * * *
9 47 ms 46 ms 47 ms 194.154.95.7
10 46 ms 47 ms 46 ms bras255-te0-2-0.tul.corbina.net [213.234.199.145]
11 53 ms 60 ms 56 ms 128-73-162-179.broadband.corbina.ru [128.73.162.179]
Trace complete
ip route add 128.73.162.179/32 via 10.10.0.1 dev eno2 advmss 1400 mtu 1400
I'm not a big expert here but I think advmss should be enough, changing mtu is excessive and just wrong (because mss = mtu-40 for IP4)
Also I disabled PMTUD on my router and switched to good old MSS clamp (similar to your method, but automatically done by router for all connections). Not sure was this setting applied on the fly or not; do not want to reconnect without good reason (I see no changes in capture, but capture may happen before clamp). At least I know where to look if this problem appears again.
|
|
|
TimT  Send message
Joined: 2 Dec 11 Posts: 504 ID: 121414 Credit: 2,577,540,524 RAC: 1,682,208
                            
|
I first tried creating a route with just the advmss option, and it appeared to only adjust the mss that my host suggested in the conversation -- looks like my host said 'hey, lets talk in 4000 byte chunks', then your host replied with 'naaah, lets do 1460 byte chunks' -- to which my host replied 'ehh, ok then' --
after changing both advmss AND MTU (and this just affects traffic to your server) it appears that now, my machine's response is something like 'sorry, I can only talk in 4k chunks max' and that worked around the issue...
I guess time will tell, next time your server changes IP address, we'll know if the fix on your end worked :) -- thanks a lot for digging into it! |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Roughly 1/3rd of the way there! :)
Edit: but it could be tomorrow ;)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
Roughly 1/3rd of the way there! :)
Edit: but it could be tomorrow ;)
Why, to me it looks like we are much less than one third of the way?
http://boincvm.proxyma.ru:30080/test4vm/user_profile/gfn16_hunt_status.html estimates leading edge as bmin+86248, and if we want to go to bmin+1000000 i necessary (which it should not be!), then we have done only 8.6%. And if we consider double-checking also, we are not even that far.
/JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Roughly 1/3rd of the way there! :)
Edit: but it could be tomorrow ;)
Why, to me it looks like we are much less than one third of the way?
http://boincvm.proxyma.ru:30080/test4vm/user_profile/gfn16_hunt_status.html estimates leading edge as bmin+86248, and if we want to go to bmin+1000000 i necessary (which it should not be!), then we have done only 8.6%. And if we consider double-checking also, we are not even that far.
1/3 is based on assumption that there are 4 primes in range (according to GFN prime destiny calculator), i.e average target of 250'000 was selected. Today (at 86'362) we are at 34% by leading edge and 17% by overall progress with doublechecking. The real prime could be anywhere (sooner or later).
The doublechecking is ready and will be fully enabled at 100K (just a arbitrary round number). Leading edge will not advance for some time but number of tasks pending validation will go down.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Roughly 1/3rd of the way there! :)
Edit: but it could be tomorrow ;)
Why, to me it looks like we are much less than one third of the way?
http://boincvm.proxyma.ru:30080/test4vm/user_profile/gfn16_hunt_status.html estimates leading edge as bmin+86248, and if we want to go to bmin+1000000 i necessary (which it should not be!), then we have done only 8.6%. And if we consider double-checking also, we are not even that far.
/JeppeSN
Sorry JeppeSN, I just saw this now, yes I based it on 1 prime out of 37500 candidates
http://www.primegrid.com/forum_thread.php?id=8504&nowrap=true#128769
and we've almost 13000 done.
Cheers
Rob.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
I understand now! Thanks to both of you. But since there is no "memory" in this if we model it as a stochastic process, if we say
"1 prime out of 37500 candidates"
we will always be 37500 candidates away from the next prime, on average. So just because we did away with β
of 37500 candidate, it does not imply our expected (average) "distance" to the next prime is any different. That distance is still 37500.
/JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
After 2 months, we did 14473 tasks reaching estimated leading edge of 97624. The progress is a bit less then in first month but very close, so I could say that we're working at stable rate. Scaling to average distance between primes (250'000), our current progress is 39% by leading edge or at 19,5% overall (considering doublechecking).
As I've mentioned early, full automatic double-checking will be turned on when leading edge reaches 100'000. It should happens in 2-3 days - only 800 tasks left.
|
|
|
|
After 2 months, we did 14473 tasks reaching estimated leading edge of 97624. The progress is a bit less then in first month but very close, so I could say that we're working at stable rate. Scaling to average distance between primes (250'000), our current progress is 39% by leading edge or at 19,5% overall (considering doublechecking).
As I've mentioned early, full automatic double-checking will be turned on when leading edge reaches 100'000. It should happens in 2-3 days - only 800 tasks left.
If we were to do the entire range, with double checking, we would not finish before 2021. /JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
After 2 months, we did 14473 tasks reaching estimated leading edge of 97624. The progress is a bit less then in first month but very close, so I could say that we're working at stable rate. Scaling to average distance between primes (250'000), our current progress is 39% by leading edge or at 19,5% overall (considering doublechecking).
As I've mentioned early, full automatic double-checking will be turned on when leading edge reaches 100'000. It should happens in 2-3 days - only 800 tasks left.
If we were to do the entire range, with double checking, we would not finish before 2021. /JeppeSN
Main goal of the project is to find only one - first mega-prime (out of 4 predicted), and I hope it will happen faster. But it's a lottery.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
After 2 months, we did 14473 tasks reaching estimated leading edge of 97624. The progress is a bit less then in first month but very close, so I could say that we're working at stable rate. Scaling to average distance between primes (250'000), our current progress is 39% by leading edge or at 19,5% overall (considering doublechecking).
As I've mentioned early, full automatic double-checking will be turned on when leading edge reaches 100'000. It should happens in 2-3 days - only 800 tasks left.
If we were to do the entire range, with double checking, we would not finish before 2021. /JeppeSN
Main goal of the project is to find only one - first mega-prime (out of 4 predicted), and I hope it will happen faster. But it's a lottery.
If we find one (and its not me π) im willing to keep going on my own. Its a cool project. π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
dukebgVolunteer tester
 Send message
Joined: 21 Nov 17 Posts: 242 ID: 950482 Credit: 23,670,125 RAC: 0
                  
|
Is the server down? I'm getting connection timeouts since about 15:01 EEST (12:01 UTC) and some online tools show it down too. |
|
|
JimB Honorary cruncher Send message
Joined: 4 Aug 11 Posts: 920 ID: 107307 Credit: 989,219,538 RAC: 267,986
                     
|
Is the server down? I'm getting connection timeouts since about 15:01 EEST (12:01 UTC) and some online tools show it down too.
I've been working on the server for the past six hours without any problems. No timeouts and my ssh sessions have been fine.
EDIT: I'm talking about PrimeGrid's server, guess I was reading too fast. I haven't grabbed work from stream's server in a very long time so no idea about that. |
|
|
dukebgVolunteer tester
 Send message
Joined: 21 Nov 17 Posts: 242 ID: 950482 Credit: 23,670,125 RAC: 0
                  
|
Is the server down? I'm getting connection timeouts since about 15:01 EEST (12:01 UTC) and some online tools show it down too.
I've been working on the server for the past six hours without any problems. No timeouts and my ssh sessions have been fine.
I was talking about the Private GFN Server. |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Is the server down? I'm getting connection timeouts since about 15:01 EEST (12:01 UTC) and some online tools show it down too.
I've been working on the server for the past six hours without any problems. No timeouts and my ssh sessions have been fine.
I was talking about the Private GFN Server.
I cant reach it from Ireland either.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
It's not clear what happened there. The outer perimeter is working but some office servers, including GFN server, are unreachable. Since it's a Sunday evening here, please wait for few hours, when I'll be back in the office in the Monday morning.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
It's not clear what happened there. The outer perimeter is working but some office servers, including GFN server, are unreachable. Since it's a Sunday evening here, please wait for few hours, when I'll be back in the office in the Monday morning.
π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Hey Stream,
I dont get it, on gfn16 mega, the leading edge hasn't moved in sometime, yet the total tasks number is increasing.
Just an estimate?
22k done, 15k to go fingers crossed :)
Two months maybe?
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
dukebgVolunteer tester
 Send message
Joined: 21 Nov 17 Posts: 242 ID: 950482 Credit: 23,670,125 RAC: 0
                  
|
Hey Stream,
I dont get it, on gfn16 mega, the leading edge hasn't moved in sometime, yet the total tasks number is increasing.
Just an estimate?
22k done, 15k to go fingers crossed :)
Two months maybe?
In the beginning we've been running first tasks only without double checks. Now we're running pretty much mostly double checks for those first tasks without new first tasks. That's why the leading edge is not increasing. |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
22k done, 15k to go fingers crossed :)
22K done, 7.7K to go (we need to double-check "Pending" tasks). ~ 1 month left. Then, new tasks will be available and leading edge will go up again.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Ok thanks Stream and Dukebg.
I also note tasks in progress is way up! We might finish quicker :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
In the stats page, if a user has an invalid task, what will it look like? Is it removed entirely from his totals? It seems that "Total Tasks" = "(Validated)" + "(Pending)" for everyone, so the total column may omit invalid results? /JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
In the stats page, if a user has an invalid task, what will it look like? Is it removed entirely from his totals? It seems that "Total Tasks" = "(Validated)" + "(Pending)" for everyone, so the total column may omit invalid results? /JeppeSN
Yes, it's removed entirely. (Really, it's removed from pending, and indeed total = validated + pending). This is due to technical reasons - when completed workunits are purged from database, only valid results are kept for future processing, everything else (timed out, invalid, etc) are thrown away.
Lucky, we had only three hosts with invalid results, one of them must be disabled now and two other sent a single occasional error. Few results failed validation because they had a zero length result file (may be some network glitch or unlucky reboot).
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Getting close now Stream.
Hopefully we'll get some more back active when the dc is over. Realistically >10 days I'm guessing. What was the estimate again, 1 in 37k was it?
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Hopefully we'll get some more back active when the dc is over. Realistically >10 days I'm guessing
Yes, it's very close. You can use number of pending tasks as reference. The system will load fresh batch automatically when it'll be less then 400 tasks in this queue. Also we have about 150 workunits (300 tasks) unprocessed - they were not sent yet when I've enabled double-check. All of them will be new and have a chance to find a prime. They could be randomly sent to clients even if stats show more then 400 pending tasks.
Please be aware that you, personally, may be temporary out of work at the end of double-check because you still have more pending tasks to validate then sum of everybody else's tasks. This may or may not happens (depends on how many tasks will be completed in next few days and which tasks will be buffered in server queue), just be aware that it may happens to you.
. What was the estimate again, 1 in 37k was it?
Sorry, I don't understand which estimate you mean. If you mean chance of the prime, average distance between primes in the search range is 250K, which corresponds to 36700 candidates. Current leading edge is 100K.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Please be aware that you, personally, may be temporary out of work at the end of double-check because you still have more pending tasks to validate then sum of everybody else's tasks. This may or may not happens (depends on how many tasks will be completed in next few days and which tasks will be buffered in server queue), just be aware that it may happens to you.
Ah I figured that might happen. Thats ok.
Sorry, I don't understand which estimate you mean. If you mean chance of the prime, average distance between primes in the search range is 250K, which corresponds to 36700 candidates. Current leading edge is 100K.
Sorry yes number of candidates 36700. Hopefully we get a result in the range :) Fingers crossed.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Please be aware that you, personally, may be temporary out of work at the end of double-check because you still have more pending tasks to validate then sum of everybody else's tasks. This may or may not happens (depends on how many tasks will be completed in next few days and which tasks will be buffered in server queue), just be aware that it may happens to you.
Ah I figured that might happen. Thats ok.
Sorry, I don't understand which estimate you mean. If you mean chance of the prime, average distance between primes in the search range is 250K, which corresponds to 36700 candidates. Current leading edge is 100K.
Sorry yes number of candidates 36700. Hopefully we get a result in the range :) Fingers crossed.
Ok so confused again I'm afraid. I always get mixed up between candidates and tasks, just to clarify, is it 36700 candidates = 73400 tasks because of double check? 1 candidate = 2 tasks?
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Ok so confused again I'm afraid. I always get mixed up between candidates and tasks, just to clarify, is it 36700 candidates = 73400 tasks because of double check? 1 candidate = 2 tasks?
Yes.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Ok so confused again I'm afraid. I always get mixed up between candidates and tasks, just to clarify, is it 36700 candidates = 73400 tasks because of double check? 1 candidate = 2 tasks?
Yes.
Great so we're almost half way. I had been thinking there were only 7k left. So be it, keep looking :)
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
When we go over b_min + 100000, will testing and double-checking be simultaneous, just like in PrimeGrid's BOINC projects?
Once we find a prime, we can be certain it is really the smallest GFN-16 mega.
/JeppeSN |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
When we go over b_min + 100000, will testing and double-checking be simultaneous, just like in PrimeGrid's BOINC projects?
May be not really simultaneous, like in PG (I still prefer that peoples with slower computers had a chance to find a prime, avoiding additional competition with a wingman). Of course this approach have some drawbacks. Let's see. It's just a one-line changes in few scripts and SQL statements. At least there will be no more so long DC-only periods (this one happened because I had no time to write validator quickly, combined with my hope to find a prime in one-pass "Zerg rush").
Once we find a prime, we can be certain it is really the smallest GFN-16 mega.
It will be smallest Mega prime in the world known so far.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
When we go over b_min + 100000, will testing and double-checking be simultaneous, just like in PrimeGrid's BOINC projects?
May be not really simultaneous, like in PG (I still prefer that peoples with slower computers had a chance to find a prime, avoiding additional competition with a wingman). Of course this approach have some drawbacks. Let's see. It's just a one-line changes in few scripts and SQL statements. At least there will be no more so long DC-only periods (this one happened because I had no time to write validator quickly, combined with my hope to find a prime in one-pass "Zerg rush").
Once we find a prime, we can be certain it is really the smallest GFN-16 mega.
It will be smallest Mega prime in the world known so far.
How many digits? Smallest Mega on PG is 1,000,075. Is it going to be bang on 1,000,000?
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
How many digits? Smallest Mega on PG is 1,000,075. Is it going to be bang on 1,000,000?
Yes, everything up to bmin + 10^10 has exactly 1,000,000 digits.
A.
Update:
bmin^65536 + 1 is 1.000000000002196... Γ 10^999999
and (bmin + 10^6)^65536 is still 1.0000361... Γ 10^999999
according to Yves Gallots estimator there should be more than 50,000 GFN-16 primes with exactly 1,000,000 digits :-) |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
How many digits? Smallest Mega on PG is 1,000,075. Is it going to be bang on 1,000,000?
Yes, everything up to bmin + 10^10 has exactly 1,000,000 digits.
A.
Update:
bmin^65536 + 1 is 1.000000000002196... Γ 10^999999
and (bmin + 10^6)^65536 is still 1.0000361... Γ 10^999999
according to Yves Gallots estimator there should be more than 50,000 GFN-16 primes with exactly 1,000,000 digits :-)
Thanks for that theAy, cool estimator too :) Nice one Yves!
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
How many digits? Smallest Mega on PG is 1,000,075. Is it going to be bang on 1,000,000?
Exactly 1,000,000.
There are only 3 primes with 1000000 digits known at T5K. Smallest one is 191273β*β2^3321908-1.
log10(191273*2^3321908) = 999999.23248
log10((bmin+1000000)^65536) = 999999.0000156852
Even if we find a prime at the end of our search range (bmin+1000000), it still will be smallest known mega prime. |
|
|
|
This was also discussed (by myself) in the thread linked in the first post above.
It will be by far the smallest proven megaprime. But if we go on (after the completion of this project) to also find the first GFN-15 megaprime, that one will be much closer still to the threshold value 10^999999.
/JeppeSN |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
This was also discussed (by myself) in the thread linked in the first post above.
It will be by far the smallest proven megaprime. But if we go on (after the completion of this project) to also find the first GFN-15 megaprime, that one will be much closer still to the threshold value 10^999999.
/JeppeSN
Is that possible? I thought there were limits on the software that it couldn't be done. Maybe i misread something.
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
This was also discussed (by myself) in the thread linked in the first post above.
It will be by far the smallest proven megaprime. But if we go on (after the completion of this project) to also find the first GFN-15 megaprime, that one will be much closer still to the threshold value 10^999999.
For GFN-15 we need b of 3292665455999520712131951278258 > 2^64
LLR seems to work with this value (well, it's calculates something, must look at source to be sure that there are no hidden truncations somewhere), but I don't know any existing sieving software which can handle numbers above 2^64.
|
|
|
|
Is that possible? I thought there were limits on the software that it couldn't be done. Maybe i misread something.
I do not know if GFN-15 mega is possible. The first GFN-15 megaprime would be so close to 10^999999 that you would need to read more than twenty decimal figures (from the left, most significant digits) before you could tell the difference between it and 10^999999. Its log10 as reported in Caldwell's Prime Pages would be 999999.000000000 when given as a double-precision binary floating-point number.
At least the bases b needed for GFN-15 mega would still be quite easy to factor. So it should be no problem to give a deterministic primality proof if a GFN-15 mega is ever located.
But let us find a GFN-16 mega before we worry about GFN-15 mega!
/JeppeSN |
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Is that possible? I thought there were limits on the software that it couldn't be done. Maybe i misread something.
I do not know if GFN-15 mega is possible. The first GFN-15 megaprime would be so close to 10^999999 that you would need to read more than twenty decimal figures (from the left, most significant digits) before you could tell the difference between it and 10^999999. Its log10 as reported in Caldwell's Prime Pages would be 999999.000000000 when given as a double-precision binary floating-point number.
At least the bases b needed for GFN-15 mega would still be quite easy to factor. So it should be no problem to give a deterministic primality proof if a GFN-15 mega is ever located.
But let us find a GFN-16 mega before we worry about GFN-15 mega!
/JeppeSN
Sounds good π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Fresh tasks loaded
Long double-checking period is almost complete! New batch of tasks was loaded recently to the database.
Although there are still some DC tasks in the queue, you already have a chance to get new task and discover the prime - a prime number with two unique properties:
- First known GFN-16 Mega Prime (1,000,000 digits exactly)
- Smallest Mega prime known so far.
Join the search (see first post in thread) and test your luck! |
|
|
|
By the way, the smallest megaprime of any kind is almost certainly 10^999999+593499 (but no deterministic test can be run on it, realistically). Out of a million digits, only seven are nonzero (the leading one (1) and the last six (593499)). Read more about that PRP number by clicking on its rank number in the list. /JeppeSN |
|
|
Crun-chi Volunteer tester
 Send message
Joined: 25 Nov 09 Posts: 3233 ID: 50683 Credit: 151,443,349 RAC: 125,986
                         
|
By the way, the smallest megaprime of any kind is almost certainly 10^999999+593499 (but no deterministic test can be run on it, realistically). Out of a million digits, only seven are nonzero (the leading one (1) and the last six (593499)). Read more about that PRP number by clicking on its rank number in the list. /JeppeSN
And what is with lets say 3*10^999999+1?
It also have exactly milion digits
____________
92*10^1585996-1 NEAR-REPDIGIT PRIME :) :) :)
4β*β650^498101-1 CRUS PRIME
2022202116^131072+1 GENERALIZED FERMAT
Proud member of team Aggie The Pew. Go Aggie! |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
By the way, the smallest megaprime of any kind is almost certainly 10^999999+593499 (but no deterministic test can be run on it, realistically). Out of a million digits, only seven are nonzero (the leading one (1) and the last six (593499)). Read more about that PRP number by clicking on its rank number in the list. /JeppeSN
And what is with lets say 3*10^999999+1?
It also have exactly milion digits
This number is too big. Get log10() of it and compare. We're talking about million-digit primes with smallest numerical value.
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Changes in validation policy
Since we're searching for a single unique prime, I've made some changes to make this game more intriguing and fair. I've already mentioned most of these changes on Discord channel and now writing them here to have a sort of formal description.
1. Only one task for each candidate will exist in each given moment of time. This is good for you and mean that you don't have to complete with your wingman. Even you have a slow computer, you may win. You only need to be lucky to get this task from the server and complete it in reasonable time. To help this, I've also implemented extended deadlines like on PG. As soon as your task continues to crunch, your deadline is automatically extended to at least 7 days forward, but no more then 30 days in total.
2. You will not get any information are you a first person working on this candidate or double-checking it. This is necessary to prevent unfair players from aborting double-check tasks in attempt to increase their chances to be a primary finder. This includes two changes:
2a. You'll not see full information about incomplete (unverfied) workunits. This page will be available but only your own tasks and tasks which returned with errors (they don't affect anything) will be shown. (Errored-out tasks were included to help troubleshooting and avoid situation when somebody asking for a help on the forum, but rest of community cannot get detailed info about problematic task because this WU is not complete yet).
2b. Task names will be obfuscated, it'll be not possible to find out is it first task or not. Please don't try to guess this from other fields like 'b' of candidate - tasks are sent from big in-memory pool in completely random order. It's OK for server to send b=200000 before b=150000. (Right now server queue still contains enough tasks created using old rules, but eventually they'll be completed and only obfuscated tasks names will exist).
|
|
|
|
Changes in validation policy
[β¦]
That is cool. /JeppeSN |
|
|
|
The changes sound fairer to me.
____________
|
|
|
|
What is the minimum GPU required for this project?
I cannot get tasks to download for a Gigabyte GTX750ti 2047MB. |
|
|
|
What is the minimum GPU required for this project?
I cannot get tasks to download for a Gigabyte GTX750ti 2047MB.
Hi Vaughan,
Are you trying to get GFN-13 or GFN-14 tasks? If so, it is likely that you just have too many pending and the server won't send you more. I have found that the server doesn't send you more after you reach ~5,000 pending for either of the GFN-13 or GFN-14 subprojects. I just switched my main GFN-13/14 computer over to GFN-13 only (as my pending had dropped significantly, so I think that is likely what you are after) and brought a second one online. If you begin receiving GFN-13 tasks soon, then it is likely that you just got out too far ahead and had too many pending. If that is the case, I will resolve your issue shortly :)
On a side note for Stream; would it be possible to have the same, or similar, validation policy applied for the GFN-13 and 14 subprojects as well so that each participant has precisely a 50% chance of being first, no matter how much processing power is thrown at the project?
Regards,
Kellen |
|
|
dthonon Volunteer tester
 Send message
Joined: 6 Dec 17 Posts: 435 ID: 957147 Credit: 1,748,804,574 RAC: 348,038
                                 
|
I don't run GFN-16 MEGA, but my understanding was that it is a CPU only sub-project, using LLR. Did I miss something ? |
|
|
|
I don't run GFN-16 MEGA, but my understanding was that it is a CPU only sub-project, using LLR. Did I miss something ?
You are definitely correct with that but I failed to mention it as I got excited/distracted at the prospect of someone else doing a huge number of GFN-13 tasks, lol. GFN-16 MEGA is a CPU-only LLR sub-project. |
|
|
|
Thanks Kellen. So it seems to be a bad idea to turn the battle fleet of GPUs on this project then? I'm referring to the tiny GFN13 tasks that finish in less than 10 seconds each. I set cache to 0 so as not to download too many for each computer but it likes to grab a few each request.
dthonon - no I'm not running the slow LLR tasks they take too long. :)
stream - a request please - could you put a ranking number next to each line for the scoring table?
http://boincvm.proxyma.ru:30080/test4vm/user_profile/gfnxx_all_status.html |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
What is the minimum GPU required for this project?
I cannot get tasks to download for a Gigabyte GTX750ti 2047MB.
Kellen is correct, your performance is great and server cannot find a doublechecker for you. I see 7300 GFN-13 and 4400 GFN-14 tasks pending validation. Server queue is 5000 tasks for all projects, where 4500 are for GFN13+GFN14 and 500 for GFN-16. Sometimes you could have more pending tasks because too old WUs could be retired, or feeder could load another set of WUs after restart, but think of 4500 as a general limit.
Unfortunately, nothing could be done here. It was a problem since beginning. Things were better when we had an interesting sub-project/target and many GPU farms joined. Now, with a standard sequential prime search, participation is not so big, anyone with few good GPUs could easily over-perform all other users combined. See "Running PrimeGrid as backup project" in top post of thread (of course you could use any project as backup, not only PG).
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
On a side note for Stream; would it be possible to have the same, or similar, validation policy applied for the GFN-13 and 14 subprojects as well so that each participant has precisely a 50% chance of being first, no matter how much processing power is thrown at the project?
Personally, I didn't care much about there 10-seconds tasks where you can find tens and hundreds of primes. I'll think of it but it's not as simple as for GFN-16. Problem is in "Only one task will exist" rule. It's not supported by server code and I've implemented it as a hack, using periodical external SQL query. First, this query does not scales well (and GFN-16 does ~200 tasks per day, GFN-13 best value was 100'000 tasks per day). Second, it could make lack of doublecheckers problem worse. Probably new logic must be included directly into server code; this is doable but definitely not a 5-minute change.
|
|
|
|
I'll put some GPUs on 13 and 14. It may take a few days, depending on which project is picked for the current Formula BOINC sprint.
____________
Reno, NV
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
The server will be down for a short maintenance tomorrow, Aug 30, starting from 12:00 UTC. Your good feedback on GFN-13/14 problem caused that free disk space is being quickly eaten by MySQL :) |
|
|
|
On a side note for Stream; would it be possible to have the same, or similar, validation policy applied for the GFN-13 and 14 subprojects as well so that each participant has precisely a 50% chance of being first, no matter how much processing power is thrown at the project?
Personally, I didn't care much about there 10-seconds tasks where you can find tens and hundreds of primes. I'll think of it but it's not as simple as for GFN-16. Problem is in "Only one task will exist" rule. It's not supported by server code and I've implemented it as a hack, using periodical external SQL query. First, this query does not scales well (and GFN-16 does ~200 tasks per day, GFN-13 best value was 100'000 tasks per day). Second, it could make lack of doublecheckers problem worse. Probably new logic must be included directly into server code; this is doable but definitely not a 5-minute change.
Thank you for the explanation about the implementation. I did notice that the revision numbers for the GFN-13 tasks are randomized (or at least appear random), so that must be related to this hack. While it would be nice to get the exact 50/50 odds of PF/DC for each task, it sounds like it is much more effort to implement than it would be worth.
The server will be down for a short maintenance tomorrow, Aug 30, starting from 12:00 UTC. Your good feedback on GFN-13/14 problem caused that free disk space is being quickly eaten by MySQL :)
Thanks also for the heads up (and sorry for the issues)! I will set a large buffer tomorrow morning to make it through the downtime as much as possible.
I'll put some GPUs on 13 and 14. It may take a few days, depending on which project is picked for the current Formula BOINC sprint.
Welcome back to the party Zombie67!
Thanks Kellen. So it seems to be a bad idea to turn the battle fleet of GPUs on this project then? I'm referring to the tiny GFN13 tasks that finish in less than 10 seconds each. I set cache to 0 so as not to download too many for each computer but it likes to grab a few each request.
I wouldn't call it a bad idea, just maybe one that didn't get the desired result, lol. I can't see your pending tasks, but I just clicked through a bunch of random workunits and it looks like your computers are getting tasks on or about the same time as your partner, so I think that participation has increased enough that you are now getting double-checkers quickly enough to keep your machines fed. Either that or you pulled some machines off of GFN-13. Either way; we've advanced the leading edge by almost 1,000,000 in the past 24 hours! |
|
|
|
I actually put more machines on. They seem to be getting tasks OK now. The GTX750ti still cannot get tasks. |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
I actually put more machines on. They seem to be getting tasks OK now. The GTX750ti still cannot get tasks.
Assuming you mean host #564
2019-08-30 09:53:31.7871 [PID=18590] [version] Checking plan class 'opencl_nvidia_gfn_101'
2019-08-30 09:53:31.7872 [PID=18590] [version] GPU doesn't support OpenCL
Windows update strikes again?
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Incidentally Stream, although I really like the "found" feature you use, I hope you have the number displayed part muted for 16.
Nice to know it is found but, has to be a secret until reported to T5K. I'm sure you thought of that, but just in case :)
Easily overlooked. π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Incidentally Stream, although I really like the "found" feature you use, I hope you have the number displayed part muted for 16.
Nice to know it is found but, has to be a secret until reported to T5K. I'm sure you thought of that, but just in case :)
Easily overlooked. π
You probably refer to GFN-13/14 stats, they're for GFN-13/14 only. GFN-16 uses another script which prints only number of tasks per user, nothing else. Of course the discovery (and the winner) must be a secret until T5K report.
|
|
|
robish Volunteer moderator Volunteer tester
 Send message
Joined: 7 Jan 12 Posts: 2209 ID: 126266 Credit: 7,505,119,736 RAC: 3,500,578
                               
|
Incidentally Stream, although I really like the "found" feature you use, I hope you have the number displayed part muted for 16.
Nice to know it is found but, has to be a secret until reported to T5K. I'm sure you thought of that, but just in case :)
Easily overlooked. π
You probably refer to GFN-13/14 stats, they're for GFN-13/14 only. GFN-16 uses another script which prints only number of tasks per user, nothing else. Of course the discovery (and the winner) must be a secret until T5K report.
Ahh good, I thought you may have cut and pasted, (like I would) the code π€£
Good to see a good few people crunching 16 now! Renewed interest π
____________
My lucky numbers 10590941048576+1 and 224584605939537911+81292139*23#*n for n=0..26 |
|
|
|
Server appears overloaded today, really hard to get it to accept completed tasks. I have to baby-sit each computer and click upload repeatedly to clear backlog of finished tasks. |
|
|
|
Server appears overloaded today, really hard to get it to accept completed tasks. I have to baby-sit each computer and click upload repeatedly to clear backlog of finished tasks.
It seems to choke when trying to report a large number of completed tasks. Try adding this to your cc_config.xml file:
<max_tasks_reported>N</max_tasks_reported>
I set "N" to "100". 100 seems to work smoothly.
____________
Reno, NV
|
|
|
streamVolunteer moderator Project administrator Volunteer developer Volunteer tester Send message
Joined: 1 Mar 14 Posts: 1033 ID: 301928 Credit: 543,608,970 RAC: 7,830
                         
|
Server appears overloaded today, really hard to get it to accept completed tasks. I have to baby-sit each computer and click upload repeate |