PrimeGrid
Message boards : Generalized Fermat Prime Search : High CPU usage again: Genefer 21 3.19 GPU (OCLcudaGFN)

Hazel
Joined: 2 Oct 17
Posts: 18
ID: 930140
Credit: 440,689,569
RAC: 0
Message 120918 - Posted: 5 Oct 2018 | 1:07:11 UTC

The GFN GPU tasks on my Linux machines still use 100% CPU (100% of a single thread), even though, according to Mike Goetz, this is supposed to be fixed.

The systems involved are (driver versions saved here for reference):
crystal:
http://www.primegrid.com/show_host_detail.php?hostid=936874
Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz [Family 6 Model 158 Stepping 10] (12 processors)
NVIDIA GeForce GTX 1060 (4095MB) driver: 396.54, INTEL Intel(R) UHD Graphics Coffee Lake Halo GT2 (4096MB)
Debian GNU/Linux testing (buster) [4.18.0-1-amd64|libc 2.27 (Debian GLIBC 2.27-6)]
Output from this machine:

geneferocl 3.3.3-2 (Linux/OpenCL/64-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'NVIDIA CUDA', device 'GeForce GTX 1060', vendor 'NVIDIA Corporation', version 'OpenCL 1.2 CUDA' and driver '396.54'.
10 computeUnits @ 1733MHz, memSize=6078MB, cacheSize=160kB, cacheLineSize=128B, localMemSize=48kB, maxWorkGroupSize=1024.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: ../../projects/www.primegrid.com/primegrid_genefer_3_3_3_3.19_x86_64-pc-linux-gnu__OCLcudaGFN15 -boinc -q 101950630^32768+1 --device 0
Normal priority change failed (needs superuser privileges.
Checking available transform implementations...
OCL transform is past its b limit.
OCL3 transform is past its b limit.
OCL4 transform is past its b limit.
OCL5 transform is past its b limit.
Using OCL2 transform
Starting initialization...
Initialization complete (0.054 seconds).
Testing 101950630^32768+1...
Estimated time for 101950630^32768+1 is 0:03:26
101950630^32768+1 is complete. (262419 digits) (err = 0.0000) (time = 0:03:30)
15:20:24 15:20:24 (4685): called boinc_finish

(The GPU on this system is locked into a power-saving mode, so it runs at a fraction of the speed you would expect from a normal 1060, which means the CPU thread should have to communicate with it even less often. Although the output says 10 compute units at 1733MHz, it's actually running at 607MHz at best. The ETA for GFN 21 was about 7 days...)

buttercup:
GenuineIntel
Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz [Family 6 Model 58 Stepping 9] (8 processors)
NVIDIA GeForce GTX 970 (4041MB) driver: 390.67
Debian GNU/Linux 9 (stretch) 4.17.0-0.bpo.1-amd64
http://www.primegrid.com/show_host_detail.php?hostid=910164

Output from this machine:
geneferocl 3.3.3-2 (Linux/OpenCL/64-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'NVIDIA CUDA', device 'GeForce GTX 970', vendor 'NVIDIA Corporation', version 'OpenCL 1.2 CUDA' and driver '390.67'.
13 computeUnits @ 1177MHz, memSize=4041MB, cacheSize=208kB, cacheLineSize=128B, localMemSize=48kB, maxWorkGroupSize=1024.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: ../../projects/www.primegrid.com/primegrid_genefer_3_3_3_3.19_x86_64-pc-linux-gnu__OCLcudaGFN -boinc -q 266884^2097152+1 --device 0
Normal priority change failed (needs superuser privileges.
Checking available transform implementations...
A benchmark is needed to determine best transform, testing available transform implementations...
Testing OCL transform...
Testing OCL2 transform...
Testing OCL3 transform...
Testing OCL4 transform...
Testing OCL5 transform...
Benchmarks completed (20.951 seconds).
Using OCL4 transform
Starting initialization...
Initialization complete (12.814 seconds).
Testing 266884^2097152+1...
Estimated time for 266884^2097152+1 is 21:30:00


I tested both GFN-15 and GFN-21. This is all just via BOINC.

Since this interferes with my CPU tasks, I have switched to AP27 for now.

I have confirmed BOINC is using the latest GPU version: primegrid_genefer_3_3_3_3.19_x86_64-pc-linux-gnu__OCLcudaGFN

Both systems use the NVIDIA GPU for X11.

I will try the LD_PRELOAD hack later or this weekend if I have time.

____________
Crunched with love,
Hazel

Hazel
Joined: 2 Oct 17
Posts: 18
ID: 930140
Credit: 440,689,569
RAC: 0
Message 120929 - Posted: 5 Oct 2018 | 20:50:27 UTC - in response to Message 120918.

I have confirmed the LD_PRELOAD hack still fixes it. This seems to affect all my machines (but I set up all of my machines very similarly, they're all running close to the same versions of everything.)
____________
Crunched with love,
Hazel

Iain Bethune
Project donor
Honorary cruncher
Joined: 28 Jan 09
Posts: 1588
ID: 34775
Credit: 194,056,043
RAC: 0
Message 120956 - Posted: 7 Oct 2018 | 7:34:04 UTC - in response to Message 120929.

I have confirmed the LD_PRELOAD hack still fixes it. This seems to affect all my machines (but I set up all of my machines very similarly, they're all running close to the same versions of everything.)


Can you let me know specifically what libraries you linked via LD_PRELOAD to resolve this?

Cheers

- Iain

____________
Twitter: IainBethune
Proud member of team "Aggie The Pew". Go Aggie!
3073428256125*2^1290000-1 is Prime!

composite
Volunteer tester
Joined: 16 Feb 10
Posts: 817
ID: 55391
Credit: 720,782,068
RAC: 326,236
Message 120961 - Posted: 7 Oct 2018 | 18:39:46 UTC - in response to Message 120956.
Last modified: 7 Oct 2018 | 18:41:12 UTC

This message has the complete solution.
The compile and run instructions are in the C source code and message.
The library you create overrides the sched_yield() function.
The path to the library (shared object file) which you create is what you put in LD_PRELOAD.
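
For readers who cannot follow the link, here is a minimal sketch of what such an override can look like. It is not the libsleep.c from the linked message, only an illustration of the idea: the 1 ms sleep interval is an assumption chosen for the example, and the real file may use a different value or a different sleep call.

```c
/* Sketch of an LD_PRELOAD shim that replaces sched_yield().
 * Assumption: sleeping briefly instead of yielding stops the
 * busy-wait from pinning a CPU thread. The interval below is a
 * hypothetical value, not taken from the original libsleep.c. */
#include <sched.h>
#include <time.h>

int sched_yield(void)
{
    /* 1 ms; illustrative only, tune for your own system. */
    struct timespec ts = { .tv_sec = 0, .tv_nsec = 1000000L };

    /* Block for a short time instead of returning immediately. */
    nanosleep(&ts, NULL);

    /* sched_yield() reports success with 0. */
    return 0;
}
```

Assuming the file is saved as libsleep.c, something like `gcc -O2 -fPIC -shared -o libsleep.so libsleep.c` should build the shared object, and LD_PRELOAD is then set to the full path of libsleep.so in the environment of the process that launches the GPU task. A longer sleep lowers CPU usage further but may slow the GPU task, so the interval is a trade-off.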

Hazel
Joined: 2 Oct 17
Posts: 18
ID: 930140
Credit: 440,689,569
RAC: 0
Message 120988 - Posted: 9 Oct 2018 | 14:29:14 UTC - in response to Message 120956.

I have confirmed the LD_PRELOAD hack still fixes it. This seems to affect all my machines (but I set up all of my machines very similarly, they're all running close to the same versions of everything.)


Can you let me know specifically what libraries you linked via LD_PRELOAD to resolve this?

Cheers

- Iain

That tiny libsleep.c one that overrides sched_yield.
____________
Crunched with love,
Hazel

