PrimeGrid

Message boards : 321 Prime Search : Multi-threading: Max # of threads for each task Setting?

tt2012tt
Send message
Joined: 22 Jul 20
Posts: 1
ID: 1283679
Credit: 5,383,149
RAC: 48
321 LLR Bronze: Earned 10,000 credits (26,621)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (35,221)321 Sieve (suspended) Bronze: Earned 10,000 credits (14,637)PPS Sieve Bronze: Earned 10,000 credits (13,484)AP 26/27 Silver: Earned 100,000 credits (396,214)GFN Ruby: Earned 2,000,000 credits (4,895,295)
Message 146935 - Posted: 22 Dec 2020 | 4:22:46 UTC

What should I set this at? I have Windows 10 on an i5-3470 (4c/4t) with 24 GB of RAM. I also have a Ryzen 3900XT (12c/24t). How do you gauge the thread count for this project and for different computers? Any insight would be appreciated.

Kellen
Send message
Joined: 10 Jan 18
Posts: 484
ID: 967938
Credit: 1,600,003,090
RAC: 0
Discovered 2 mega primes321 LLR Sapphire: Earned 20,000,000 credits (20,008,344)Cullen LLR Sapphire: Earned 20,000,000 credits (20,000,917)ESP LLR Sapphire: Earned 20,000,000 credits (20,001,144)Generalized Cullen/Woodall LLR Sapphire: Earned 20,000,000 credits (20,011,942)PPS LLR Sapphire: Earned 20,000,000 credits (20,000,023)PSP LLR Sapphire: Earned 20,000,000 credits (20,008,714)SoB LLR Sapphire: Earned 20,000,000 credits (20,035,621)SR5 LLR Sapphire: Earned 20,000,000 credits (20,004,459)SGS LLR Sapphire: Earned 20,000,000 credits (20,000,002)TRP LLR Sapphire: Earned 20,000,000 credits (20,000,236)Woodall LLR Sapphire: Earned 20,000,000 credits (20,017,137)321 Sieve (suspended) Sapphire: Earned 20,000,000 credits (20,000,569)Generalized Cullen/Woodall Sieve (suspended) Sapphire: Earned 20,000,000 credits (20,006,424)PPS Sieve Sapphire: Earned 20,000,000 credits (49,900,913)AP 26/27 Sapphire: Earned 20,000,000 credits (45,002,633)GFN Double Silver: Earned 200,000,000 credits (200,000,012)WW Sapphire: Earned 20,000,000 credits (45,004,000)PSA Double Amethyst: Earned 1,000,000,000 credits (1,000,000,000)
Message 146936 - Posted: 22 Dec 2020 | 4:38:24 UTC - in response to Message 146935.

For the i5-3470 I would run 1 task on 4 cores and for the 3900XT I would run 4 tasks on 3 cores each. These tasks are presently using about 7MB of CPU cache each, so the 3470 will be a bit slow, but the 3900XT should run them pretty quick with that setting.

You will want to make sure that the tasks are run on the same CCX (a CCX is a set of 3 adjacent cores for the 3900XT) for the Ryzen CPU though to maximize efficiency. You can use Process Lasso (free, here https://bitsum.com/) or, if you hop on the Discord server, someone may have a link to another program that the same person who made the LLR2 software wrote. Ryzen 3rd gen CPUs are much more productive and efficient if you can keep a task on the same CCX.
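As a rough illustration of what "keeping a task on one CCX" means in terms of CPU affinity, here is a minimal Python sketch that computes one affinity bitmask per CCX. The function name is made up for this example, and it assumes that logical CPUs are numbered consecutively, core by core and CCX by CCX, with two hardware threads per core. That numbering is an assumption and should be checked on the actual machine before using the masks in an affinity tool.

```python
def ccx_affinity_masks(ccx_count, cores_per_ccx, threads_per_core=2):
    """Return one affinity bitmask per CCX.

    Assumption: logical CPUs are numbered consecutively, so CCX 0 owns the
    first cores_per_ccx * threads_per_core logical CPUs, CCX 1 the next, etc.
    """
    cpus_per_ccx = cores_per_ccx * threads_per_core
    block = (1 << cpus_per_ccx) - 1  # cpus_per_ccx low bits set
    return [block << (i * cpus_per_ccx) for i in range(ccx_count)]


# Ryzen 3900XT as described in the post: 4 CCXs with 3 cores each.
masks = ccx_affinity_masks(ccx_count=4, cores_per_ccx=3)
for i, mask in enumerate(masks):
    first = i * 6
    print(f"CCX {i}: logical CPUs {first}-{first + 5}, mask {mask:#x}")
```

Each mask could then be assigned to one task so that its threads stay within a single CCX.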

As far as optimal configurations go, that is a function of FFT length and cache size. If you look at the output of a task after you run it (by clicking on the task on your tasks summary page) you can see the FFT size. Multiply the FFT size in K by 8 and you get the KB of CPU cache used during the test; divide by 1024 for MB. For 321 right now the FFT size is 864K, so each task uses about 7 MB of cache (864 × 8 = 6912 KB). The i5-3470 has 6 MB and each CCX of the 3900XT has 16 MB.
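The rule of thumb above can be written down as a small Python sketch. The function name is hypothetical, and the factor of 8 is taken from the post (presumably 8 bytes per FFT element), so treat the result as an estimate rather than an exact measurement.

```python
def fft_cache_mb(fft_len_k):
    """Estimate the cache footprint in MB for an FFT length given in K.

    Rule of thumb from the post: FFT length (in K) times 8 gives KB.
    """
    return fft_len_k * 8 / 1024


for name, fft_len_k in [("PPSE", 120), ("PPS-Mega", 256), ("321", 864)]:
    print(f"{name}: FFT {fft_len_k}K -> about {fft_cache_mb(fft_len_k):.2f} MB")
```

For the 864K FFT this gives 6.75 MB, which is the "about 7 MB" quoted for 321 tasks.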

Some examples: Looking at PPSE tasks, with an FFT length of 120K (~1MB cache usage), the most efficient configuration would be to run 1 task on each core (4 tasks total for the 3470 and 12 tasks total for the 3900XT). For PPS-Mega the FFT size right now is 256K, or about 2MB per task. You would probably want to run 2 tasks with 2 threads each on the 3470 and 12 tasks with 1 thread each for the 3900XT for optimal throughput, to stay within the cache limits for each CPU.
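One way to turn these examples into a heuristic is sketched below in Python. It is only an assumption about how the suggestions were derived, not an official PrimeGrid rule: per cache domain (the whole CPU on the i5-3470, one CCX on the 3900XT), pick the largest number of tasks that divides the core count evenly and still fits in the cache, and fall back to a single task on all cores if nothing fits. The function and parameter names are made up for this sketch.

```python
def plan(cores_per_domain, cache_mb_per_domain, fft_len_k, domains=1):
    """Return (total_tasks, threads_per_task) for a simple cache-fit heuristic."""
    footprint_mb = fft_len_k * 8 / 1024  # rule of thumb: FFT length in K times 8 = KB
    tasks = 1  # fallback: one task spread over all cores of the domain
    for candidate in range(1, cores_per_domain + 1):
        fits = candidate * footprint_mb <= cache_mb_per_domain
        if cores_per_domain % candidate == 0 and fits:
            tasks = candidate
    return tasks * domains, cores_per_domain // tasks


for name, fft_len_k in [("PPSE", 120), ("PPS-Mega", 256), ("321", 864)]:
    i5 = plan(cores_per_domain=4, cache_mb_per_domain=6, fft_len_k=fft_len_k)
    ryzen = plan(cores_per_domain=3, cache_mb_per_domain=16,
                 fft_len_k=fft_len_k, domains=4)
    print(f"{name}: i5-3470 {i5[0]} x {i5[1]}t, 3900XT {ryzen[0]} x {ryzen[1]}t")
```

With the numbers from this post, the sketch reproduces the configurations suggested above. Actual throughput still depends on memory speed and clock behaviour, so it is worth benchmarking before settling on a setting.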

Hope this helps, and good luck!

Profile JeppeSNProject donor
Avatar
Send message
Joined: 5 Apr 14
Posts: 1806
ID: 306875
Credit: 49,103,157
RAC: 13,892
Found 1 prime in the 2020 Tour de Primes321 LLR Gold: Earned 500,000 credits (593,283)Cullen LLR Gold: Earned 500,000 credits (611,298)ESP LLR Silver: Earned 100,000 credits (174,818)Generalized Cullen/Woodall LLR Silver: Earned 100,000 credits (112,799)PPS LLR Jade: Earned 10,000,000 credits (19,066,317)PSP LLR Silver: Earned 100,000 credits (428,457)SoB LLR Silver: Earned 100,000 credits (466,812)SR5 LLR Silver: Earned 100,000 credits (210,142)SGS LLR Silver: Earned 100,000 credits (136,265)TRP LLR Silver: Earned 100,000 credits (476,246)Woodall LLR Silver: Earned 100,000 credits (281,400)321 Sieve (suspended) Silver: Earned 100,000 credits (175,037)PPS Sieve Bronze: Earned 10,000 credits (10,113)AP 26/27 Bronze: Earned 10,000 credits (12,129)GFN Ruby: Earned 2,000,000 credits (4,977,751)WW Jade: Earned 10,000,000 credits (13,756,000)PSA Turquoise: Earned 5,000,000 credits (7,614,290)
Message 146940 - Posted: 22 Dec 2020 | 9:03:26 UTC - in response to Message 146936.

or, if you hop on the Discord server, someone may have a link to another program that the same person who made the LLR2 software wrote. Ryzen 3rd gen CPUs are much more productive and efficient if you can keep a task on the same CCX.


I think you are referring to AffinityWatcher by Pavel Atnashev (user 914937). /JeppeSN

Profile Jordan Romaidis
Avatar
Send message
Joined: 11 May 17
Posts: 274
ID: 880615
Credit: 823,947,522
RAC: 30,635
Discovered 5 mega primesEliminated 1 conjecture "k"Discovered 1 AP26Found 1 prime in the 2018 Tour de PrimesFound 2 primes in the 2019 Tour de PrimesFound 2 primes in the 2020 Tour de PrimesFound 1 mega prime in the 2020 Tour de PrimesFound 1 prime in the 2020 Tour de Primes Mountain StageFound 1 mega prime in the 2020 Tour de Primes Mountain StageFound 1 prime in the 2021 Tour de PrimesFound 2 primes in the 2022 Tour de PrimesFound 1 mega prime in the 2022 Tour de PrimesFound 1 prime in the 2023 Tour de Primes321 LLR Turquoise: Earned 5,000,000 credits (5,014,730)Cullen LLR Ruby: Earned 2,000,000 credits (2,080,460)ESP LLR Gold: Earned 500,000 credits (502,799)Generalized Cullen/Woodall LLR Turquoise: Earned 5,000,000 credits (6,000,054)PPS LLR Double Bronze: Earned 100,000,000 credits (103,292,338)PSP LLR Amethyst: Earned 1,000,000 credits (1,092,773)SoB LLR Sapphire: Earned 20,000,000 credits (41,382,619)SR5 LLR Sapphire: Earned 20,000,000 credits (21,787,728)SGS LLR Sapphire: Earned 20,000,000 credits (20,284,625)TRP LLR Gold: Earned 500,000 credits (821,174)Woodall LLR Jade: Earned 10,000,000 credits (15,028,246)321 Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,084,376)Generalized Cullen/Woodall Sieve (suspended) Sapphire: Earned 20,000,000 credits (20,606,749)PPS Sieve Sapphire: Earned 20,000,000 credits (30,065,949)AP 26/27 Double Silver: Earned 200,000,000 credits (257,818,067)GFN Double Bronze: Earned 100,000,000 credits (185,033,172)WW Emerald: Earned 50,000,000 credits (59,040,000)PSA Emerald: Earned 50,000,000 credits (53,011,665)
Message 146958 - Posted: 22 Dec 2020 | 21:01:51 UTC - in response to Message 146936.


Hi, are you talking L2 or L3 CPU cache? I ask because my CPU has only 2MB of L2 but 45MB L3. Should I be calculating for L2 or L3?

Profile GrebulonerProject donor
Volunteer tester
Avatar
Send message
Joined: 2 Nov 09
Posts: 490
ID: 49572
Credit: 2,943,462,374
RAC: 4,052,586
Discovered 5 mega primesFound 2 primes in the 2018 Tour de PrimesFound 4 primes in the 2019 Tour de PrimesFound 3 primes in the 2020 Tour de PrimesFound 1 mega prime in the 2020 Tour de PrimesFound 4 primes in the 2021 Tour de PrimesFound 7 primes in the 2022 Tour de PrimesFound 1 mega prime in the 2022 Tour de PrimesFound 10 primes in the 2023 Tour de PrimesFound 2 mega primes in the 2023 Tour de Primes321 LLR Emerald: Earned 50,000,000 credits (50,304,710)Cullen LLR Sapphire: Earned 20,000,000 credits (22,096,858)ESP LLR Sapphire: Earned 20,000,000 credits (20,453,074)Generalized Cullen/Woodall LLR Emerald: Earned 50,000,000 credits (50,327,156)PPS LLR Emerald: Earned 50,000,000 credits (75,487,921)PSP LLR Sapphire: Earned 20,000,000 credits (20,571,669)SoB LLR Emerald: Earned 50,000,000 credits (50,873,282)SR5 LLR Sapphire: Earned 20,000,000 credits (24,982,219)SGS LLR Sapphire: Earned 20,000,000 credits (31,592,082)TRP LLR Sapphire: Earned 20,000,000 credits (24,211,899)Woodall LLR Sapphire: Earned 20,000,000 credits (24,238,438)321 Sieve (suspended) Emerald: Earned 50,000,000 credits (55,630,889)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,178,073)Generalized Cullen/Woodall Sieve (suspended) Emerald: Earned 50,000,000 credits (56,046,594)PPS Sieve Double Gold: Earned 500,000,000 credits (521,014,891)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,468,384)TRP Sieve (suspended) Jade: Earned 10,000,000 credits (10,076,645)AP 26/27 Double Gold: Earned 500,000,000 credits (553,962,809)GFN Double Silver: Earned 200,000,000 credits (380,866,271)WW Double Gold: Earned 500,000,000 credits (830,928,000)PSA Double Bronze: Earned 100,000,000 credits (126,200,096)
Message 146971 - Posted: 23 Dec 2020 | 5:22:07 UTC - in response to Message 146958.


Generally speaking, it's L3, but there is an "it depends" based on cache inclusiveness. AMD and consumer Intel chips store a copy of L2 in L3, so only L3 matters (and in Ryzen 3k/5k, it's L3 per CCX). Skylake-X/SP and up (HEDT/server) have a mostly non-inclusive hierarchy, so it's a little less than L2+L3.
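The distinction can be sketched in a few lines of Python. The function name and the example inputs are illustrative assumptions. For the non-inclusive case the result is an upper bound, since the post says the usable amount is "a little less than L2+L3".

```python
def usable_cache_mb(l3_mb, l2_per_core_mb, active_cores, inclusive):
    """Rough cache budget for sizing tasks.

    Inclusive hierarchy: L2 contents are duplicated in L3, so only L3 counts.
    Non-inclusive hierarchy: upper bound of L3 plus the L2 of the cores in use.
    """
    if inclusive:
        return l3_mb
    return l3_mb + l2_per_core_mb * active_cores


# Illustrative inputs only: 45 MB L3, 0.25 MB L2 per core, 18 active cores.
print(usable_cache_mb(45, 0.25, 18, inclusive=True))
print(usable_cache_mb(45, 0.25, 18, inclusive=False))
```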

What CPU are you using that mixes itsy bitsy L2 with massive L3?
____________
Eating more cheese on Thursdays.

Profile Jordan Romaidis
Message 146996 - Posted: 23 Dec 2020 | 21:42:55 UTC - in response to Message 146971.


So I'm guessing 6 cores per task is optimal for me? The CPU is an E5-2686 v3.

Profile Jordan Romaidis
Message 147009 - Posted: 24 Dec 2020 | 1:09:27 UTC - in response to Message 146996.

Hmm, according to Task Manager my system has the following available:

L1 2.2MB
L2 9MB
L3 90MB

Profile GrebulonerProject donor
Message 147016 - Posted: 24 Dec 2020 | 5:03:56 UTC - in response to Message 147009.


Might you be running a two socket system? 36 cores total across 2 CPUs? (your computers are hidden)

You can fit 6 tasks in L3 with room to spare, so I'd suggest 3 threads/task.

Not fully comparable, of course, but I found much better throughput doing 321 with 6x3t instead of 3x6t on my 18c 10980XE.
____________
Eating more cheese on Thursdays.

Profile roadrunner_gsProject donor
Volunteer developer
Send message
Joined: 11 Sep 08
Posts: 600
ID: 28785
Credit: 331,699,243
RAC: 8
321 LLR Silver: Earned 100,000 credits (372,062)Cullen LLR Gold: Earned 500,000 credits (581,999)ESP LLR Gold: Earned 500,000 credits (816,852)Generalized Cullen/Woodall LLR Amethyst: Earned 1,000,000 credits (1,889,912)PPS LLR Emerald: Earned 50,000,000 credits (54,675,475)PSP LLR Ruby: Earned 2,000,000 credits (4,224,875)SoB LLR Ruby: Earned 2,000,000 credits (2,553,982)SGS LLR Gold: Earned 500,000 credits (655,953)TRP LLR Silver: Earned 100,000 credits (292,869)Woodall LLR Gold: Earned 500,000 credits (546,071)321 Sieve (suspended) Gold: Earned 500,000 credits (958,303)Cullen/Woodall Sieve (suspended) Silver: Earned 100,000 credits (283,632)PPS Sieve Jade: Earned 10,000,000 credits (10,362,884)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (339,564)TRP Sieve (suspended) Silver: Earned 100,000 credits (310,404)AP 26/27 Double Silver: Earned 200,000,000 credits (223,705,778)GFN Jade: Earned 10,000,000 credits (18,041,996)WW Ruby: Earned 2,000,000 credits (4,080,000)PSA Turquoise: Earned 5,000,000 credits (6,999,200)
Message 147325 - Posted: 3 Jan 2021 | 14:03:02 UTC
Last modified: 3 Jan 2021 | 14:05:03 UTC

I don't know.
I have not done extensive testing yet, since the runtime of the WUs forbids it, but I currently have the following 4 WUs: two running with 2 threads per WU (the upper two) and two running with 1 thread per WU (the lower two).



This is a Xeon E5-2690 with 8 cores, 256 kB L2 cache per core and 20 MB shared L3 cache for the CPU, so yes this would be above the

One can see that the 2-threaded WUs are slower than the single-threaded ones.

When those WUs are finished I will give the last found 321 prime (3*2^1832496+1) a shot (since I know the outcome, though it runs even longer) and run it 8 times in parallel versus 1 time 8-threaded, 2 times 4-threaded, and 4 times 2-threaded.

I've got a Xeon Gold 6254 with 1 MB L2 cache per core and 24.75 MB L3 cache per CPU and will run it on that one, too. A single-threaded baseline is fired up for now; it seems to be running at 3.4 GHz.
With 4 processes in parallel it should need 30 MB of cache. Since the L3 cache is a non-inclusive victim cache (same as in my E5-2690 above), it should be able to use 28.75 MB of cache, and per the Specification Update it should still run at 3.4 GHz.
I bind it to the second node to avoid cache copies between nodes as well as remote-node RAM access.
Hyperthreading is not deactivated, but you have to use what you have got.
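The arithmetic behind the 30 MB and 28.75 MB figures can be checked with a short Python sketch. It assumes the 960K FFT length reported by the test run, the "FFT length in K times 8 = KB" rule of thumb from earlier in the thread, and that each of the 4 processes occupies one core's L2. The variable names are made up for this example.

```python
FFT_LEN_K = 960          # FFT length reported by the test run
PROCESSES = 4            # single-threaded tests running in parallel
L3_MB = 24.75            # shared L3 of the Xeon Gold 6254
L2_PER_CORE_MB = 1.0     # private L2 per core

# Cache the four tests would like to have, per the rule of thumb.
demand_mb = PROCESSES * FFT_LEN_K * 8 / 1024

# Non-inclusive hierarchy: L3 plus the L2 of the four cores in use.
capacity_mb = L3_MB + PROCESSES * L2_PER_CORE_MB

print(f"demand {demand_mb} MB, capacity {capacity_mb} MB, "
      f"shortfall {demand_mb - capacity_mb} MB")
```

Under these assumptions the four tests exceed the available cache by a small margin, so some traffic to main memory is to be expected.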

$ numactl -N 1 ./llr64 -d -q"3*2^16408818+1" -a1 & disown
$ Starting Proth prime test of 3*2^16408818+1
Using all-complex FMA3 FFT length 960K, Pass1=384, Pass2=2560, a = 5
3*2^16408818+1, bit: 30000 / 16408819 [0.18%]. Time per bit: 3.825 ms.
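A progress line like the last one is enough to estimate the total runtime. The Python sketch below parses it and multiplies the bit count by the time per bit. The regular expression is written against the exact line shown here and is an assumption about the output format in general; the helper name is made up.

```python
import re

# Matches e.g. "bit: 30000 / 16408819 [0.18%]. Time per bit: 3.825 ms."
PROGRESS = re.compile(
    r"bit: (\d+) / (\d+) \[[\d.]+%\]\. Time per bit: ([\d.]+) ms"
)


def estimate_seconds(line):
    """Return (estimated_total_s, estimated_remaining_s) from a progress line."""
    match = PROGRESS.search(line)
    if match is None:
        raise ValueError("no progress information found")
    done = int(match.group(1))
    total = int(match.group(2))
    ms_per_bit = float(match.group(3))
    total_s = total * ms_per_bit / 1000
    remaining_s = (total - done) * ms_per_bit / 1000
    return total_s, remaining_s


line = "3*2^16408818+1, bit: 30000 / 16408819 [0.18%]. Time per bit: 3.825 ms."
total_s, remaining_s = estimate_seconds(line)
print(f"estimated total: {total_s / 3600:.1f} h, "
      f"remaining: {remaining_s / 3600:.1f} h")
```

The estimate assumes the time per bit stays constant for the whole test, which may not hold if clocks or background load change.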

Ravi Fernando
Project administrator
Volunteer tester
Project scientist
Send message
Joined: 21 Mar 19
Posts: 211
ID: 1108183
Credit: 13,391,841
RAC: 6,188
321 LLR Amethyst: Earned 1,000,000 credits (1,061,204)Cullen LLR Silver: Earned 100,000 credits (111,878)ESP LLR Bronze: Earned 10,000 credits (16,570)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (68,801)PPS LLR Ruby: Earned 2,000,000 credits (4,652,378)PSP LLR Silver: Earned 100,000 credits (106,263)SoB LLR Silver: Earned 100,000 credits (258,849)SR5 LLR Bronze: Earned 10,000 credits (76,537)SGS LLR Silver: Earned 100,000 credits (220,642)TRP LLR Silver: Earned 100,000 credits (254,161)Woodall LLR Bronze: Earned 10,000 credits (97,444)321 Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,001,667)AP 26/27 Bronze: Earned 10,000 credits (72,774)GFN Amethyst: Earned 1,000,000 credits (1,374,310)WW Bronze: Earned 10,000 credits (12,000)
Message 147330 - Posted: 3 Jan 2021 | 17:26:33 UTC - in response to Message 147325.

When those WUs are finished i will give the last found 321-prime ( 3*2^1832496+1) a shot

Do you mean 3*2^16408818+1?

Profile roadrunner_gsProject donor
Message 147331 - Posted: 3 Jan 2021 | 17:31:00 UTC - in response to Message 147330.

When those WUs are finished i will give the last found 321-prime ( 3*2^1832496+1) a shot

Do you mean 3*2^16408818+1?


Yes, my bad, I pasted from the wrong buffer ([ctrl]-[v] vs middle mouse button).
The code has the right one, obviously.

Profile roadrunner_gsProject donor
Message 147375 - Posted: 5 Jan 2021 | 9:31:15 UTC

So here goes:
Xeon Gold 6254, 1 WU@1 Thread:
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 62610.163 sec.

Xeon Gold 6254, 4 WU@1 Thread:
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 63368.355 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 63656.892 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 63385.569 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 63500.971 sec.

Xeon Gold 6254, 1 WU@4 Threads:
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 18658.265 sec.

Xeon E5-2690, 4 WU@1 Thread:
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 103037.590 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 103037.493 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 103038.159 sec.
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 103036.720 sec.

Xeon E5-2690, 1 WU@2 Threads:
3*2^16408818+1 is prime! (4939547 decimal digits) Time : 26465.674 sec.

Throughput/day (4WU@1T vs 1WU@4T):
Xeon Gold 6254 => 5.24 vs 4.63
Xeon E5-2690 => 3.35 vs 3.25
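For anyone who wants to redo the throughput calculation, here is a minimal Python sketch. It assumes throughput is the sum of 86400 / runtime over all tasks that ran concurrently; the function name and dictionary labels are made up for this example, and the runtimes are copied from the results above.

```python
SECONDS_PER_DAY = 86_400


def tasks_per_day(runtimes_s):
    """Sum the per-task completion rates of tasks that ran at the same time."""
    return sum(SECONDS_PER_DAY / t for t in runtimes_s)


runs = {
    "Gold 6254, 4 WU @ 1 thread": [63368.355, 63656.892, 63385.569, 63500.971],
    "Gold 6254, 1 WU @ 4 threads": [18658.265],
    "E5-2690, 4 WU @ 1 thread": [103037.590, 103037.493, 103038.159, 103036.720],
    "E5-2690, 1 WU multi-threaded": [26465.674],
}

for label, runtimes in runs.items():
    print(f"{label}: {tasks_per_day(runtimes):.2f} tasks/day")
```

Recomputed this way, the Gold 6254 with 4 WU @ 1 thread comes out at about 5.44 tasks/day rather than 5.24, so one of the posted figures may contain a typo. The other values agree with the post to within rounding.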

I will go for 8WUs@1T vs 1WU@8T now.

Profile roadrunner_gsProject donor
Message 147381 - Posted: 5 Jan 2021 | 14:19:56 UTC

I was able to get hold of the following additional CPUs:
Xeon Gold 6128 (6 cores @ 3.6 GHz Turbo; 19.25 MB L3)
Xeon Gold 5217 (8 cores @ 3.0 GHz Turbo; 11 MB L3)
Xeon E7-4880 v2 (15 cores @ ? Turbo; 37.5 MB L3)

Let's see...

Yves GallotProject donor
Volunteer developer
Project scientist
Send message
Joined: 19 Aug 12
Posts: 803
ID: 164101
Credit: 305,702,054
RAC: 5,416
GFN Double Silver: Earned 200,000,000 credits (305,702,054)
Message 147383 - Posted: 5 Jan 2021 | 16:02:39 UTC - in response to Message 146971.

Hi, are you talking L2 or L3 CPU cache? I ask because my CPU has only 2MB of L2 but 45MB L3. Should I be calculating for L2 or L3?

Generally speaking, it's L3, but there is an "it depends" based on cache inclusiveness. AMD and consumer Intel chips store a copy of L2 in L3, so only L3 matters (and in Ryzen 3k/5k, it's L3 per CCX).

Zen's L3 cache is a victim cache, so the effective size is L2 + L3... no?

Profile roadrunner_gsProject donor
Message 147385 - Posted: 5 Jan 2021 | 16:37:52 UTC - in response to Message 147383.

Hi, are you talking L2 or L3 CPU cache? I ask because my CPU has only 2MB of L2 but 45MB L3. Should I be calculating for L2 or L3?

Generally speaking, it's L3, but there is an "it depends" based on cache inclusiveness. AMD and consumer Intel chips store a copy of L2 in L3, so only L3 matters (and in Ryzen 3k/5k, it's L3 per CCX).

Zen's L3 cache is a victim cache, so the effective size is L2 + L3... no?


For long-running "heavy" tasks on a non-inclusive cache, then almost, yes.
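
To make the inclusive vs. victim-cache point concrete, here is a minimal sketch. The function name and its parameters are hypothetical, and whether a given CPU's L3 is inclusive is something you have to look up for your own chip; the 2 MB / 45 MB figures are the ones from the quoted question.

```python
def effective_cache_mb(l2_total_mb, l3_mb, l3_inclusive):
    """Rough upper bound on the data L2 and L3 can hold together.

    Assumption: an inclusive L3 keeps a copy of everything in L2, so L2 adds
    no extra capacity; a non-inclusive (victim) L3 holds different lines,
    so the two sizes add up.
    """
    if l3_inclusive:
        return l3_mb
    return l2_total_mb + l3_mb


# Figures from the quoted question: 2 MB of L2 and 45 MB of L3.
print(effective_cache_mb(2, 45, l3_inclusive=True))   # 45
print(effective_cache_mb(2, 45, l3_inclusive=False))  # 47
```

On Ryzen 3k/5k the L3 value to pass in would be the L3 per CCX, as noted in the quoted reply, and the "almost" above still applies: treat the sum as an upper bound, not a guarantee.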

roadrunner_gs
Project donor
Volunteer developer
Joined: 11 Sep 08
Posts: 600
ID: 28785
Credit: 331,699,243
RAC: 8
321 LLR Silver: Earned 100,000 credits (372,062)Cullen LLR Gold: Earned 500,000 credits (581,999)ESP LLR Gold: Earned 500,000 credits (816,852)Generalized Cullen/Woodall LLR Amethyst: Earned 1,000,000 credits (1,889,912)PPS LLR Emerald: Earned 50,000,000 credits (54,675,475)PSP LLR Ruby: Earned 2,000,000 credits (4,224,875)SoB LLR Ruby: Earned 2,000,000 credits (2,553,982)SGS LLR Gold: Earned 500,000 credits (655,953)TRP LLR Silver: Earned 100,000 credits (292,869)Woodall LLR Gold: Earned 500,000 credits (546,071)321 Sieve (suspended) Gold: Earned 500,000 credits (958,303)Cullen/Woodall Sieve (suspended) Silver: Earned 100,000 credits (283,632)PPS Sieve Jade: Earned 10,000,000 credits (10,362,884)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (339,564)TRP Sieve (suspended) Silver: Earned 100,000 credits (310,404)AP 26/27 Double Silver: Earned 200,000,000 credits (223,705,778)GFN Jade: Earned 10,000,000 credits (18,041,996)WW Ruby: Earned 2,000,000 credits (4,080,000)PSA Turquoise: Earned 5,000,000 credits (6,999,200)
Message 147395 - Posted: 5 Jan 2021 | 23:22:05 UTC

Yeah, the Xeon Gold 5217 (8 cores @ 3.0 GHz Turbo; 11 MB L3) is currently working on 8 WUs in parallel with a "Time per it: 16.570 ms". That is a lot slower than the E5-2690 with 9.317 ms and a lot slower than the Xeon Gold 6254.
It is running at 3.0 GHz, but only drawing 68 W as per RAPL, whereas it was drawing 85 W (max TDP) when doing one WU with 8 threads.
Projected finishing time is around 3 days, yielding a throughput of 2.54/day; throughput was 7.22/day with 1 WU @ 8 threads and 4.52/day with 1 WU @ 4 threads.
But Fujitsu was cheap, fitting only one 32 GB memory module instead of populating all six channels.

Will post back when finished
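
For anyone who wants to redo the arithmetic, here is a small sketch of how the throughput figures relate to run time and power. The helper names are made up for illustration; the throughput and wattage numbers are the ones quoted above, and the power is package power only, as reported by RAPL.

```python
SECONDS_PER_DAY = 86400.0


def seconds_per_task(concurrent_tasks, tasks_per_day):
    """Wall-clock time of one task when `concurrent_tasks` run side by side."""
    return concurrent_tasks * SECONDS_PER_DAY / tasks_per_day


def throughput_per_day(concurrent_tasks, task_seconds):
    """Inverse of seconds_per_task: finished tasks per day."""
    return concurrent_tasks * SECONDS_PER_DAY / task_seconds


def watt_hours_per_task(package_watts, tasks_per_day):
    """Energy per finished task, assuming constant package power."""
    return package_watts * 24.0 / tasks_per_day


# (label, concurrent tasks, tasks per day) as reported for the Gold 5217
runs = [
    ("8 WUs x 1 thread", 8, 2.54),
    ("1 WU x 8 threads", 1, 7.22),
    ("1 WU x 4 threads", 1, 4.52),
]
for label, concurrent, per_day in runs:
    hours = seconds_per_task(concurrent, per_day) / 3600.0
    print(f"{label}: {hours:.1f} h per WU")

# Power was only reported for two of the runs (68 W and 85 W).
print(f"8 WUs x 1 thread: {watt_hours_per_task(68, 2.54):.0f} Wh per WU")
print(f"1 WU x 8 threads: {watt_hours_per_task(85, 7.22):.0f} Wh per WU")
```

The first line works out to a bit over 3 days per WU, which matches the projected finishing time.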

Grebuloner
Project donor
Volunteer tester
Joined: 2 Nov 09
Posts: 490
ID: 49572
Credit: 2,943,462,374
RAC: 4,052,586
Discovered 5 mega primesFound 2 primes in the 2018 Tour de PrimesFound 4 primes in the 2019 Tour de PrimesFound 3 primes in the 2020 Tour de PrimesFound 1 mega prime in the 2020 Tour de PrimesFound 4 primes in the 2021 Tour de PrimesFound 7 primes in the 2022 Tour de PrimesFound 1 mega prime in the 2022 Tour de PrimesFound 10 primes in the 2023 Tour de PrimesFound 2 mega primes in the 2023 Tour de Primes321 LLR Emerald: Earned 50,000,000 credits (50,304,710)Cullen LLR Sapphire: Earned 20,000,000 credits (22,096,858)ESP LLR Sapphire: Earned 20,000,000 credits (20,453,074)Generalized Cullen/Woodall LLR Emerald: Earned 50,000,000 credits (50,327,156)PPS LLR Emerald: Earned 50,000,000 credits (75,487,921)PSP LLR Sapphire: Earned 20,000,000 credits (20,571,669)SoB LLR Emerald: Earned 50,000,000 credits (50,873,282)SR5 LLR Sapphire: Earned 20,000,000 credits (24,982,219)SGS LLR Sapphire: Earned 20,000,000 credits (31,592,082)TRP LLR Sapphire: Earned 20,000,000 credits (24,211,899)Woodall LLR Sapphire: Earned 20,000,000 credits (24,238,438)321 Sieve (suspended) Emerald: Earned 50,000,000 credits (55,630,889)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,178,073)Generalized Cullen/Woodall Sieve (suspended) Emerald: Earned 50,000,000 credits (56,046,594)PPS Sieve Double Gold: Earned 500,000,000 credits (521,014,891)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,468,384)TRP Sieve (suspended) Jade: Earned 10,000,000 credits (10,076,645)AP 26/27 Double Gold: Earned 500,000,000 credits (553,962,809)GFN Double Silver: Earned 200,000,000 credits (380,866,271)WW Double Gold: Earned 500,000,000 credits (830,928,000)PSA Double Bronze: Earned 100,000,000 credits (126,200,096)
Message 147396 - Posted: 5 Jan 2021 | 23:45:34 UTC - in response to Message 147395.

Yeah, the Xeon Gold 5217 (8 cores @ 3.0 GHz Turbo; 11 MB L3) is currently working on 8 WUs in parallel with a "Time per it: 16.570 ms". That is a lot slower than the E5-2690 with 9.317 ms and a lot slower than the Xeon Gold 6254.
It is running at 3.0 GHz, but only drawing 68 W as per RAPL, whereas it was drawing 85 W (max TDP) when doing one WU with 8 threads.
Projected finishing time is around 3 days, yielding a throughput of 2.54/day; throughput was 7.22/day with 1 WU @ 8 threads and 4.52/day with 1 WU @ 4 threads.
But Fujitsu was cheap, fitting only one 32 GB memory module instead of populating all six channels.

Will post back when finished


One memory channel? Oy Vey! That leaves quite a lot of performance potential on the table.

Also, the Gold 5000 series only has a single AVX512 unit, which when used for PG makes it slower than using the AVX2/FMA3 optimization. You might want to do an additional run and see what the difference is.
____________
Eating more cheese on Thursdays.

roadrunner_gs
Project donor
Volunteer developer
Joined: 11 Sep 08
Posts: 600
ID: 28785
Credit: 331,699,243
RAC: 8
321 LLR Silver: Earned 100,000 credits (372,062)Cullen LLR Gold: Earned 500,000 credits (581,999)ESP LLR Gold: Earned 500,000 credits (816,852)Generalized Cullen/Woodall LLR Amethyst: Earned 1,000,000 credits (1,889,912)PPS LLR Emerald: Earned 50,000,000 credits (54,675,475)PSP LLR Ruby: Earned 2,000,000 credits (4,224,875)SoB LLR Ruby: Earned 2,000,000 credits (2,553,982)SGS LLR Gold: Earned 500,000 credits (655,953)TRP LLR Silver: Earned 100,000 credits (292,869)Woodall LLR Gold: Earned 500,000 credits (546,071)321 Sieve (suspended) Gold: Earned 500,000 credits (958,303)Cullen/Woodall Sieve (suspended) Silver: Earned 100,000 credits (283,632)PPS Sieve Jade: Earned 10,000,000 credits (10,362,884)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (339,564)TRP Sieve (suspended) Silver: Earned 100,000 credits (310,404)AP 26/27 Double Silver: Earned 200,000,000 credits (223,705,778)GFN Jade: Earned 10,000,000 credits (18,041,996)WW Ruby: Earned 2,000,000 credits (4,080,000)PSA Turquoise: Earned 5,000,000 credits (6,999,200)
Message 147438 - Posted: 8 Jan 2021 | 16:24:31 UTC

Does LLR make use of AVX-512?
I only see "using FMA3" or the like in the output.

Grebuloner
Project donor
Volunteer tester
Joined: 2 Nov 09
Posts: 490
ID: 49572
Credit: 2,943,462,374
RAC: 4,052,586
Discovered 5 mega primesFound 2 primes in the 2018 Tour de PrimesFound 4 primes in the 2019 Tour de PrimesFound 3 primes in the 2020 Tour de PrimesFound 1 mega prime in the 2020 Tour de PrimesFound 4 primes in the 2021 Tour de PrimesFound 7 primes in the 2022 Tour de PrimesFound 1 mega prime in the 2022 Tour de PrimesFound 10 primes in the 2023 Tour de PrimesFound 2 mega primes in the 2023 Tour de Primes321 LLR Emerald: Earned 50,000,000 credits (50,304,710)Cullen LLR Sapphire: Earned 20,000,000 credits (22,096,858)ESP LLR Sapphire: Earned 20,000,000 credits (20,453,074)Generalized Cullen/Woodall LLR Emerald: Earned 50,000,000 credits (50,327,156)PPS LLR Emerald: Earned 50,000,000 credits (75,487,921)PSP LLR Sapphire: Earned 20,000,000 credits (20,571,669)SoB LLR Emerald: Earned 50,000,000 credits (50,873,282)SR5 LLR Sapphire: Earned 20,000,000 credits (24,982,219)SGS LLR Sapphire: Earned 20,000,000 credits (31,592,082)TRP LLR Sapphire: Earned 20,000,000 credits (24,211,899)Woodall LLR Sapphire: Earned 20,000,000 credits (24,238,438)321 Sieve (suspended) Emerald: Earned 50,000,000 credits (55,630,889)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,178,073)Generalized Cullen/Woodall Sieve (suspended) Emerald: Earned 50,000,000 credits (56,046,594)PPS Sieve Double Gold: Earned 500,000,000 credits (521,014,891)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Turquoise: Earned 5,000,000 credits (9,468,384)TRP Sieve (suspended) Jade: Earned 10,000,000 credits (10,076,645)AP 26/27 Double Gold: Earned 500,000,000 credits (553,962,809)GFN Double Silver: Earned 200,000,000 credits (380,866,271)WW Double Gold: Earned 500,000,000 credits (830,928,000)PSA Double Bronze: Earned 100,000,000 credits (126,200,096)
Message 147442 - Posted: 8 Jan 2021 | 17:12:03 UTC - in response to Message 147438.

Does LLR make use of AVX-512?
I only see "using FMA3" or the like in the output.


It does. The performance hit to single AVX512 unit CPUs has been well-documented, so I wonder if bypassing it and using faster FMA3 on affected chips (like that Xeon) was baked into the code?

(copied from an stderr output on my Cascade Lake):

LLR Program - Version 3.8.23, using Gwnum Library Version 29.8
LLR command line: primegrid_cllr.exe -d -oDiskWriteTime=1 -oThreadsPerTest=1 llr.in
Using zero-padded AVX-512 FFT length 128K, Pass1=128, Pass2=1K, clm=1
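
The FFT length in that last line is also the number to hold against your cache size. A rough sketch of pulling it out of the stderr text follows; the function names are made up, and both the 8 bytes per element (one double per FFT element) and K = 1024 are assumptions. The real working set is somewhat larger than this estimate.

```python
import re

BYTES_PER_ELEMENT = 8  # assumption: one double-precision value per FFT element


def fft_length_elements(stderr_text):
    """Extract the FFT length from LLR stderr text, e.g. '128K' -> 131072."""
    match = re.search(r"FFT length (\d+)([KM])?", stderr_text)
    if match is None:
        raise ValueError("no FFT length found in the given text")
    count = int(match.group(1))
    suffix = match.group(2)
    if suffix == "K":
        count *= 1024          # assumption: K means 1024 elements
    elif suffix == "M":
        count *= 1024 * 1024   # assumption: M means 1024 * 1024 elements
    return count


def fft_data_bytes(stderr_text):
    """Estimated size of the FFT data alone, without any helper tables."""
    return fft_length_elements(stderr_text) * BYTES_PER_ELEMENT


line = "Using zero-padded AVX-512 FFT length 128K, Pass1=128, Pass2=1K, clm=1"
print(fft_length_elements(line))      # 131072
print(fft_data_bytes(line) / 2**20)   # 1.0 (MiB)
```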

____________
Eating more cheese on Thursdays.

roadrunner_gs
Project donor
Volunteer developer
Joined: 11 Sep 08
Posts: 600
ID: 28785
Credit: 331,699,243
RAC: 8
321 LLR Silver: Earned 100,000 credits (372,062)Cullen LLR Gold: Earned 500,000 credits (581,999)ESP LLR Gold: Earned 500,000 credits (816,852)Generalized Cullen/Woodall LLR Amethyst: Earned 1,000,000 credits (1,889,912)PPS LLR Emerald: Earned 50,000,000 credits (54,675,475)PSP LLR Ruby: Earned 2,000,000 credits (4,224,875)SoB LLR Ruby: Earned 2,000,000 credits (2,553,982)SGS LLR Gold: Earned 500,000 credits (655,953)TRP LLR Silver: Earned 100,000 credits (292,869)Woodall LLR Gold: Earned 500,000 credits (546,071)321 Sieve (suspended) Gold: Earned 500,000 credits (958,303)Cullen/Woodall Sieve (suspended) Silver: Earned 100,000 credits (283,632)PPS Sieve Jade: Earned 10,000,000 credits (10,362,884)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (339,564)TRP Sieve (suspended) Silver: Earned 100,000 credits (310,404)AP 26/27 Double Silver: Earned 200,000,000 credits (223,705,778)GFN Jade: Earned 10,000,000 credits (18,041,996)WW Ruby: Earned 2,000,000 credits (4,080,000)PSA Turquoise: Earned 5,000,000 credits (6,999,200)
Message 147444 - Posted: 8 Jan 2021 | 18:04:09 UTC
Last modified: 8 Jan 2021 | 18:42:21 UTC

Ah, no problem. 3.8.24 seems to be from July 2020.
I have a USB stick for my test runs, and on it is version 3.8.21 (see my post below or above, depending on sort order).
Since I have a fair amount of test data for "697*2^530150+1" on different CPUs, I would not change that for the time being (also because I have by now done days' worth of runs with that version).

Initial quick run with 3.8.24 on the Gold 6254:

1WU@18 threads decidedly slower @ 0.692 ms per bit (16408817)
1WU@4 threads decidedly faster @ 0.930 ms per bit

roadrunner_gs
Project donor
Volunteer developer
Joined: 11 Sep 08
Posts: 600
ID: 28785
Credit: 331,699,243
RAC: 8
321 LLR Silver: Earned 100,000 credits (372,062)Cullen LLR Gold: Earned 500,000 credits (581,999)ESP LLR Gold: Earned 500,000 credits (816,852)Generalized Cullen/Woodall LLR Amethyst: Earned 1,000,000 credits (1,889,912)PPS LLR Emerald: Earned 50,000,000 credits (54,675,475)PSP LLR Ruby: Earned 2,000,000 credits (4,224,875)SoB LLR Ruby: Earned 2,000,000 credits (2,553,982)SGS LLR Gold: Earned 500,000 credits (655,953)TRP LLR Silver: Earned 100,000 credits (292,869)Woodall LLR Gold: Earned 500,000 credits (546,071)321 Sieve (suspended) Gold: Earned 500,000 credits (958,303)Cullen/Woodall Sieve (suspended) Silver: Earned 100,000 credits (283,632)PPS Sieve Jade: Earned 10,000,000 credits (10,362,884)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (339,564)TRP Sieve (suspended) Silver: Earned 100,000 credits (310,404)AP 26/27 Double Silver: Earned 200,000,000 credits (223,705,778)GFN Jade: Earned 10,000,000 credits (18,041,996)WW Ruby: Earned 2,000,000 credits (4,080,000)PSA Turquoise: Earned 5,000,000 credits (6,999,200)
Message 147570 - Posted: 14 Jan 2021 | 23:04:10 UTC

So far (seconds per WU, Throughput, Speedup compared to 1 WU@1Thread).



The Gold 52xx is severely hampered by its cache (or the lack thereof).

Copyright © 2005 - 2023 Rytis Slatkevičius (contact) and PrimeGrid community. Server load 2.06, 1.83, 1.95
Generated 1 Apr 2023 | 18:10:47 UTC