Author |
Message |
|
For some reason I get an error when trying to run GFN-15 through GFN-19.
GFN-20, which has the same requirements, runs with no problems.
GFN-21 and GFN-22, which have the same requirements plus an additional double-precision requirement, also run with no problems.
Why can't I run the smaller tasks when the big ones run fine?
The error that GFN-15 through GFN-19 give me is as follows:
<core_client_version>7.16.11</core_client_version>
< Send message
Joined: 21 Jan 10 Posts: 14045 ID: 53948 Credit: 485,740,737 RAC: 668,833
                               
|
Let's try to narrow down the circumstances under which the error is happening.
You're both reporting that GFN-15 through GFN-19 are failing, while GFN-20 and higher are working. I can't verify this, so I'll take you at your word. Between the two of you there's only a single failed task currently in the database.
GFN-15 through GFN-19 use a different transform (OCL2) than the larger GFNs. It appears, therefore, that the problem is limited to OCL2.
Both of you have similar, recent vintage AMD GPUs, and are running Windows 10.
I do not know what the problem is, but I can tell you that there are currently 1835 successfully completed and validated GFN-16 Windows AMD tasks. Whatever this problem is, it's not affecting all AMD GPUs.
Narrowing the search down to AMD 6600 and 6700 GPUs, there are 6 computers with one of those GPUs that have completed at least one task in the relevant GFN apps. Two of them have successfully completed and validated those tasks, so we know it *is* possible for your GPU models to run these tasks. Both of those computers are also running Windows 10.
Beyond this I'm afraid I can't really help you. I've never owned an AMD GPU and I'm not familiar with them. But since there are examples of your GPU models successfully completing these tasks, it's likely that it's possible to get yours working too.
____________
My lucky number is 75898524288+1 |
|
|
|
@Michael Goetz
Yes, I admit it is some time since I tested this; I thought it would solve itself with the next driver and Windows updates.
Every time I test this I unfortunately have to abort a TON of those GFN-15 to GFN-19 tasks, because I can't download just one task. As I said, they run for 1-2 minutes, then wait 10 minutes, then start over and over again for hours until they finally give up on one task and move on to the next. After a driver update yesterday I attempted to run a GFN-16 block again, which also failed, and I had to abort the next 100 tasks my computer had downloaded just to retest one GFN-16 task ;'(... You can view the task result here: https://www.primegrid.com/result.php?resultid=1262968959
I am sure they ran fine in the very beginning when I first got my RX6700XT. But after a driver update they suddenly began failing.
So, as you say, "let's try to narrow down the circumstances":
I seem to remember they ran fine when I first got my card, but then something happened.
I updated my drivers and activated SAM (Smart Access Memory), and/or maybe made some other changes in my Radeon Software.
After the first failed runs I also reinstalled Windows fresh with the newest GFX drivers, but with the same outcome: they still fail.
So if nobody can tell me what settings I need to change, then I guess there are only two options:
Either I stop trying to run GFN-15 to GFN-19 tasks altogether, or I go down the long trial-and-error road and enable and disable options in my Radeon Software until I finally get a pass :) |
|
|
|
@SeanHsu your old GTX 1650 is an NVIDIA card running NVIDIA drivers, so it is not affected by this AMD RX series problem.
My daughter is also running an MSI 1650 OC GFX card and it crunches all the tasks fine :) |
|
|
|
I am sure they ran fine in the [very] beginning when i first got my RX6700XT. But after a driver update they suddenly began failing.
Sounds like the later display driver caused the problem.
The RX6700XT was first introduced in March 2021, and its first driver package was "Radeon™ Software Adrenalin 2020 Edition 21.3.1"
https://www.amd.com/en/support/kb/release-notes/rn-rad-win-21-3-1
Later driver packages include:
One of these later releases may be the one where OCL2 started to fail.
|
|
|
|
Just upgraded the Radeon drivers to the latest, 21.10.2 (release date 10/5/2021), and retested, with the same result:
https://www.primegrid.com/result.php?resultid=1262986948
Normal priority change succeeded.
Checking available transform implementations...
OCL transform is past its b limit.
OCL3 transform is past its b limit.
OCL4 transform is past its b limit.
OCL5 transform is past its b limit.
Using OCL2 transform
Starting initialization...
Initialization complete (0.081 seconds).
Testing 134334752^65536+1...
Estimated time for 134334752^65536+1 is 0:01:19
maxErr exceeded for 134334752^65536+1, 1.0000 > 0.4500
Errors occurred for all available transform implementations
Waiting 10 minutes before attempting to continue from last checkpoint...
geneferocl 3.3.3-2 (Windows/OpenCL/32-bit)
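For anyone who wants to script around this instead of eyeballing the log, here is a rough sketch of pulling the failure out of the stderr text. The function name and the regular expression are my own assumptions, modeled only on the log lines quoted above; nothing here is part of genefer itself.

```python
import re

# Pattern modeled on the stderr line quoted above, e.g.
# "maxErr exceeded for 134334752^65536+1, 1.0000 > 0.4500"
MAXERR_RE = re.compile(
    r"maxErr exceeded for (?P<candidate>\d+\^\d+\+1), "
    r"(?P<err>\d+\.\d+) > (?P<limit>\d+\.\d+)"
)

def find_maxerr_failures(stderr_text):
    """Return (candidate, err, limit) tuples for every maxErr line found."""
    return [
        (m.group("candidate"), float(m.group("err")), float(m.group("limit")))
        for m in MAXERR_RE.finditer(stderr_text)
    ]

sample = """Using OCL2 transform
Testing 134334752^65536+1...
maxErr exceeded for 134334752^65536+1, 1.0000 > 0.4500
Errors occurred for all available transform implementations
"""
failures = find_maxerr_failures(sample)
print(failures)
```

An empty list would mean no maxErr line was found in the text, which by itself does not prove the task succeeded.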
@SeanHsu this problem started 3+ months ago. I refuse to roll back to drivers that are many months out of date. If you really want to test outdated drivers, I don't see why you shouldn't be able to run any of the RX6000 series drivers with your RX6600XT? |
|
|
|
If you would like to test earlier drivers, a few notes:
1. You'll need Display Driver Uninstaller (DDU) from Guru3D.com to remove the current driver before you install an older version.
2. In the PrimeGrid preferences, limit the max number of simultaneous tasks to 1 or a few, so that you download only one task at a time.
3. Quit and restart BOINC Manager every time you reinstall a driver; a reboot is unnecessary.
4. Alternatively, if you're familiar with the command prompt, you can test from the command line directly. For example, "cd" to C:\BOINC\projects\www.primegrid.com (or wherever your BOINC data directory is), and run
.\geneferocl_windows_3.3.3-2.exe -boinc -q 134379876^65536+1
while monitoring "stderr.txt" in the project folder.
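If you end up repeating step 4 after every driver swap, something like the following might save typing. It is only a sketch: the folder and executable name are the ones from this post, the `-boinc -q` flags are copied as-is, and the helper names (`build_command`, `run_once`, `read_stderr_log`) are made up for illustration. What the program's exit code means is not covered in this thread, so check stderr.txt rather than relying on it.

```python
import subprocess
from pathlib import Path, PureWindowsPath

# Assumed locations, taken from the example in step 4; adjust to your setup.
PROJECT_DIR = PureWindowsPath(r"C:\BOINC\projects\www.primegrid.com")
EXE_NAME = "geneferocl_windows_3.3.3-2.exe"

def build_command(candidate, project_dir=PROJECT_DIR):
    """Build the same command line as in step 4 (flags copied from the post)."""
    return [str(project_dir / EXE_NAME), "-boinc", "-q", candidate]

def run_once(candidate, project_dir=PROJECT_DIR):
    """Run one test from inside the project folder and return the exit code."""
    completed = subprocess.run(
        build_command(candidate, project_dir), cwd=str(project_dir)
    )
    return completed.returncode

def read_stderr_log(folder):
    """Return the contents of stderr.txt in the given folder, or '' if absent."""
    log = Path(folder) / "stderr.txt"
    return log.read_text(errors="replace") if log.exists() else ""

# Only prints the command; call run_once(...) yourself on the test machine.
print(build_command("134379876^65536+1"))
```

After `run_once("134379876^65536+1")` finishes, `read_stderr_log(PROJECT_DIR)` gives you the text to inspect.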
Edit: I could not install earlier versions myself because they do not detect my GPU as a valid model...
Naively extracting amdocl.dll and amdocl64.dll from earlier versions and replacing the current ones does not work either. See these two failed tasks: 1262157145 1262075752
Testing OCL transform...
Error: OpenCL error detected: CL_OUT_OF_HOST_MEMORY.
Testing OCL2 transform...
0x5E799AA6 (0x00000000 0x0A5DFD94 0x5E6F2598 0x00000001), clGetPipeInfo() + 0xA7996 bytes(s)
0x5E79A9CB (0x00000001 0x0B064C98 0x0AE850D0 0x5E79CB04), clGetPipeInfo() + 0xA88BB bytes(s)
(...stack traces...) |
|
|
|
A computer with an RX 6700 XT reported a successful Genefer 18 result.
http://www.primegrid.com/result.php?resultid=1258942975 (It was erased the moment I posted, glad I found it before it was gone.)
Running on platform 'AMD Accelerated Parallel Processing', device 'gfx1031', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (3224.5)' and driver '3224.5 (PAL,LC)'.
Backup screenshot:
https://imgur.com/M8dQypv
AMD OpenCL version 3224.5 comes from the March 21.3.1 driver package.
https://imgur.com/2eqbGWz
This confirms that the earliest driver for 6700XT worked. |
|
|
|
If you look at this host : https://www.primegrid.com/results.php?hostid=1096086
GFN 15: 3/3
GFN 16: 1/1
GFN 17 MEGA: 1/1
GFN 19: 1/1
These are all Errored (logs suppressed for now).
Their one valid result (GFN 22) https://www.primegrid.com/result.php?resultid=1261441201 reported the driver was "OpenCL 1.2 AMD-APP (3276.6)",
which matches the amdocl.dll from as early as version 21.6.1.
Also, this host (6900XT, 3276.6). Why would they want to run cpuGFN19, rather than OpenCL? Probably because the OpenCL one was broken for them too. (Look at those 2 suppressed errors.)
Here is a list of the known OpenCL driver versions:
3110.7 (RX 5700) -- works
intermediate versions -- should always work
3224.5 from Radeon Software 21.3.1 and 21.3.2 -- works
3240.6 from Radeon Software 21.4.1, 21.5.1, and 21.5.2 -- unknown status
3276.6 from Radeon Software 21.6.1 through 21.8.2 -- fails
3302.6 from Radeon Software 21.9.1, 21.9.2, and 21.10.1 -- fails
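The list above can also be kept as a small lookup table, which makes it easy to check the version string from any task's stderr. This is a sketch only: the statuses are transcribed from the list as it stands in this post, and the names `OCL2_STATUS`, `opencl_version` and `ocl2_status` are hypothetical.

```python
import re

# Status per AMD OpenCL driver version, transcribed from the list above
# (OCL2 transform, as known at the time of this post).
OCL2_STATUS = {
    "3110.7": "works",    # reported on an RX 5700
    "3224.5": "works",    # Radeon Software 21.3.1 and 21.3.2
    "3240.6": "unknown",  # Radeon Software 21.4.1, 21.5.1 and 21.5.2
    "3276.6": "fails",    # Radeon Software 21.6.1 through 21.8.2
    "3302.6": "fails",    # Radeon Software 21.9.1, 21.9.2 and 21.10.1
}

def opencl_version(version_string):
    """Extract '3276.6' from a string like 'OpenCL 1.2 AMD-APP (3276.6)'."""
    m = re.search(r"AMD-APP \((\d+\.\d+)\)", version_string)
    return m.group(1) if m else None

def ocl2_status(version_string):
    """Look up the status for a version string; 'not listed' if unknown to us."""
    return OCL2_STATUS.get(opencl_version(version_string), "not listed")

print(ocl2_status("OpenCL 1.2 AMD-APP (3276.6)"))
```

Update the table as more reports come in; a version that is missing from it simply comes back as "not listed".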
@Dennis Menace, or anyone with an RX 6000 series GPU:
Please help test Radeon Software 21.5.2 to see whether its OpenCL version breaks the OCL2 transform. Thanks!
|
|
|
|
OpenCL 1.2 AMD-APP (3276.6) returned good results on other GPUs:
https://www.primegrid.com/result.php?resultid=1262643841 RX 5700 XT (3276.6 (PAL,LC))
https://www.primegrid.com/result.php?resultid=1262763778 RX 590 (3276.6 (GSL))
https://www.primegrid.com/result.php?resultid=1261446620 RX Vega (3276.6 (PAL,HSAIL))
https://www.primegrid.com/result.php?resultid=1263259145 Radeon VII (3276.6 (PAL,HSAIL))
OpenCL 1.2 AMD-APP (3240.6) also worked on the R9 290X and the R9 Fury:
https://www.primegrid.com/result.php?resultid=1262867574 R9 290X (3240.6)
https://www.primegrid.com/result.php?resultid=1263101266 R9 Fury (3240.6)
So the problem is now narrowed down to the RX 6000 series on Windows 10 with the 3276.6 (PAL,LC) OpenCL driver, and maybe the 3240.6 (PAL,LC) driver. |
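To check a host against that profile automatically, the "Running on platform" line from stderr can be split into its parts. A minimal sketch follows, assuming the line always has the shape seen in this thread. The `gfx103` prefix test for the RX 6000 family is my assumption, based only on the gfx1031 device string reported for the 6700 XT here. The set of failing builds comes from the driver list earlier in the thread, and all names are hypothetical.

```python
import re

# Regex modeled on the "Running on platform ..." line that genefer prints.
PLATFORM_RE = re.compile(
    r"device '(?P<device>[^']*)'.*?"
    r"version '(?P<version>[^']*)' and driver '(?P<driver>[^']*)'"
)
DRIVER_RE = re.compile(r"(?P<build>[\d.]+)\s*\((?P<tags>[^)]*)\)")

# Builds reported as failing on RX 6000 cards earlier in this thread.
FAILING_BUILDS = {"3276.6", "3302.6"}

def parse_platform_line(line):
    """Split the platform line into device, version, driver build and tags."""
    m = PLATFORM_RE.search(line)
    if m is None:
        return None
    info = m.groupdict()
    d = DRIVER_RE.match(info["driver"])
    info["build"] = d.group("build") if d else None
    info["tags"] = d.group("tags").split(",") if d else []
    return info

def looks_affected(info):
    """True if the host matches the profile narrowed down above."""
    return (
        info["device"].startswith("gfx103")  # assumption: RX 6000 family
        and info["tags"] == ["PAL", "LC"]
        and info["build"] in FAILING_BUILDS
    )

line = ("Running on platform 'AMD Accelerated Parallel Processing', "
        "device 'gfx1031', vendor 'Advanced Micro Devices, Inc.', "
        "version 'OpenCL 1.2 AMD-APP (3224.5)' and driver '3224.5 (PAL,LC)'.")
info = parse_platform_line(line)
print(info["device"], info["build"], info["tags"], looks_affected(info))
```

The sample line is the one from the successful 6700 XT result, so it is expected to come back as not affected.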
|
|
|
I can confirm that the 2020-21.4.1-win10-64bit-apr20 driver works.
This was the one I had when I ran tasks successfully.
I will test 21.5.2 as soon as I get home :)
|
|
|
|
I can confirm that the 2020-21.4.1-win10-64bit-apr20 driver works.
This was the one I had when I ran tasks successfully.
I will test 21.5.2 as soon as I get home :)
Your testimony is enough. No need to revert to 21.5.2, then.
OpenCL 1.2 AMD-APP 3240.6 (PAL,LC) from Radeon Software 21.4.1, 21.5.1, and 21.5.2 -- works
We found the culprit: it's the June update 21.6.1 and its OpenCL 3276.6 (PAL,LC) driver on the RX 6000 series cards. The RX 5000 series is fine.
PAL=Platform Abstraction Library
HSAIL=HSA (Heterogeneous System Architecture) Intermediate Language
LC=Lightning Compiler (AMD's LLVM-based OpenCL compiler)
I should also test Linux...
|
|
|
|
What is the next step, then?
Do we confront AMD with the problem?
You say that extracting the amdocl.dll from a working package is not going to work? |
|
|
|
You say that extracting the amdocl.dll from a working package is not going to work?
Not for my GPU (which didn't exist then). Chances are it will work for a 6700XT and higher, but the process is a little messy. Extract the dll from the older package. Find the installed copy under C:\Windows\System32\DriverStore and replace it (you need to change its ownership and permissions). If it's locked because some program is using it, go to Device Manager and disable the graphics device. Windows will fall back to the Microsoft Basic Display Adapter; now you can replace the dll, re-enable the device, and do the testing. |
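Before touching anything, it may help to see which copies of the OpenCL dll are actually sitting in the driver store. The sketch below only lists files and changes nothing. The DriverStore path and the two file names come from this thread, the function name is made up, and it assumes your account can read the folder.

```python
from pathlib import Path

# Location mentioned above; the installed copies live somewhere underneath it.
DRIVER_STORE = Path(r"C:\Windows\System32\DriverStore")

def find_opencl_dlls(root=DRIVER_STORE):
    """List amdocl.dll / amdocl64.dll copies under root (read-only search)."""
    names = {"amdocl.dll", "amdocl64.dll"}
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(p for p in root.rglob("*.dll") if p.name.lower() in names)

# Print each copy with its size so old and new files can be told apart.
for dll in find_opencl_dlls():
    print(dll, dll.stat().st_size)
```

Replacing the file still has to be done by hand as described above, since it needs ownership and permission changes that this sketch deliberately does not attempt.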
|
|
|
Ubuntu 20.04.3 with the "amdgpu-pro-21.30-1290604-ubuntu-20.04" driver
works very well.
https://www.primegrid.com/result.php?resultid=1263537497
<core_client_version>7.16.6</core_client_version>
<![CDATA[
<stderr_txt>
geneferocl 3.3.3-2 (Linux/OpenCL/64-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Running on platform 'AMD Accelerated Parallel Processing', device 'gfx1032', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 2.0 ' and driver '3314.0 (HSA1.1,LC)'.
16 computeUnits @ 2900MHz, memSize=8176MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=64kB, maxWorkGroupSize=256.
Supported transform implementations: ocl ocl2 ocl3 ocl4 ocl5
Command line: ../../projects/www.primegrid.com/geneferocl_linux64_3.3.3-2 -boinc -q 134438730^65536+1 --device 0
Normal priority change failed (needs superuser privileges.
Checking available transform implementations...
OCL transform is past its b limit.
OCL3 transform is past its b limit.
OCL4 transform is past its b limit.
OCL5 transform is past its b limit.
Using OCL2 transform
Starting initialization...
Initialization complete (0.060 seconds).
Testing 134438730^65536+1...
Estimated time for 134438730^65536+1 is 0:01:46
134438730^65536+1 is complete. (532711 digits) (err = 0.0000) (time = 0:01:48) 17:14:47
17:14:47 (39062): called boinc_finish
</stderr_txt>
]]> |
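A quick sanity check on a log like this is to recompute the digit count that genefer prints on the completion line. The sketch below uses logarithms so the huge number is never built. The regular expression is modeled on the one completion line shown above, and the helper names are my own.

```python
import math
import re

# Modeled on the completion line in the log above.
COMPLETE_RE = re.compile(
    r"(?P<base>\d+)\^(?P<exp>\d+)\+1 is complete\. "
    r"\((?P<digits>\d+) digits\) \(err = (?P<err>[\d.]+)\)"
)

def decimal_digits(base, exp):
    """Decimal digits of base**exp + 1, computed via log10."""
    # The trailing +1 is ignored: it only adds a digit if base**exp is all nines.
    return math.floor(exp * math.log10(base)) + 1

line = ("134438730^65536+1 is complete. (532711 digits) "
        "(err = 0.0000) (time = 0:01:48) 17:14:47")
m = COMPLETE_RE.search(line)
reported = int(m.group("digits"))
computed = decimal_digits(int(m.group("base")), int(m.group("exp")))
print(reported, computed, float(m.group("err")))
```

If the two counts agree and err is well below the 0.45 limit seen in the failing Windows logs, the run looks healthy.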
|
|
|
@SeanHsu you installed a Linux driver and it works with your RX6600XT? Woooow
If I'm to attempt this then I'm afraid you will need to hold my hand and walk me through it step by step, please |
|
|
|
@SeanHsu Stupid question from my side.... Of course you have a Linux OS installed too :p |
|
|
|
I submitted a bug report to AMD from my Radeon Software.
@SeanHsu and everybody else with an RX6000 series GPU experiencing the same bug:
Please do the same, so that AMD gets as many bug reports about this issue as possible.
I'm also going to open a thread on the AMD forum about this bug. |
|
|
|
mail from AMD Community:
Hello Dennis_Menace,
Your topic recently received replies.
Topic: Error on RX6000 series cards AMDOCL.ddl drivers newer than 3240.6 from Radeon Software 21.5.2
Date: 10-14-2021 01:56 AM
Did it solve your problem?
Click here to view the replies and mark one as an Accepted Solution.
This helps others find helpful answers in the community too!
Thanks for being a AMD Community member.
Your AMD Community Team
AMD Community sent this message to dennismenace68@gmail.com.
Don't want to receive emails from us? Just click here.
No. It is you (AMD) who need to fix this problem in the next RX6000 series drivers. We posted the problem only to make you aware that the RX6000 series GPU drivers for Windows 10 are broken in every AMDOCL.dll newer than the one shipped with Radeon Software 21.5.2. That driver and older ones worked well. (See the post.)
The reply you say I received is another member confirming my findings. (Read the post.)
I have also replied to your mail in the hope that you will see it this time :)
Thanks in advance. We are looking forward to an updated, working driver. Dennis The Menace
(sent to AMD) |
|
|
|
The error still persists with the new drivers that come with Radeon Software Adrenalin 21.10.3, released 25 October 2021.
I submitted yet another bug report to AMD and I am going to post about it on their forum too |
|
|
mikey Send message
Joined: 17 Mar 09 Posts: 1910 ID: 37043 Credit: 834,776,745 RAC: 811,725
                     
|
The error still persists with the new drivers that come with Radeon Software Adrenalin 21.10.3, released 25 October 2021.
I submitted yet another bug report to AMD and I am going to post about it on their forum too
Is the file not replaceable? Meaning, can't you take the old one that works and paste it over the top of the new one? Or is that a task-by-task thing, which would be impossible? |
|
|
|
Posted on AMD's forum in my thread:
Re: Error on RX6000 series cards AMDOCL.ddl drivers newer than 3240.6 from Radeon Software 21.5.2
The error still persists with the new drivers that come with Radeon Software Adrenalin 21.10.3, released 25 October 2021.
My daughter's older and smaller NVIDIA GTX1650 runs all PrimeGrid tasks with NO issues! And on the few tasks my RX6700XT can actually run, her small GTX1650 is only a few seconds slower than my super expensive AMD RX6700XT!!!
AMD has neglected to keep its AMDOCL.dll drivers tuned and in working condition for its super expensive RX6000 series GPUs. It is embarrassing that my daughter can buy an NVIDIA GPU at 1/10 of the price of mine and run all the tasks at close to the same performance.
Just have a look at PrimeGrid's fastest GPU list... https://www.primegrid.com/gpu_list.php
AND GPUGRID on BOINC doesn't even bother making its tasks compatible with AMD GPUs, because AMD neglects to keep AMDOCL.dll up to date and tuned.
It is this kind of thing that makes it really difficult to keep supporting AMD, and it is making me regret spending so much money on an AMD RX6700XT.
(Does anybody want to buy a slightly used Sapphire Radeon RX6700XT Nitro+, so I can go back to NVIDIA?)
I do not think any of the admins there bother to read about the problems reported in their forum |
|
|
|
Is the file not replaceable? Meaning, can't you take the old one that works and paste it over the top of the new one? Or is that a task-by-task thing, which would be impossible?
@mikey chances are that it would work, but the process is a little messy. You would need to:
1. Boot your PC in safe mode and completely wipe all current display drivers with a program like DDU (Display Driver Uninstaller).
2. Install an old Radeon Software package with a working AMDOCL.dll, find the file in C:\Windows\System32\DriverStore, take ownership of it, and make a backup of it.
3. Boot in safe mode again and wipe ALL display drivers again, while preventing Windows Update from installing any drivers.
4. Install the newest Radeon Software package, reboot in safe mode, and overwrite its AMDOCL.dll with your backup.
(This could work.)
Ohhh, and then you would need to repeat the safe-mode boot and overwrite the AMDOCL.dll every time you update your GPU drivers..... |
|
|
mikey Send message
Joined: 17 Mar 09 Posts: 1910 ID: 37043 Credit: 834,776,745 RAC: 811,725
                     
|
Is the file not replaceable? Meaning, can't you take the old one that works and paste it over the top of the new one? Or is that a task-by-task thing, which would be impossible?
@mikey chances are that it would work, but the process is a little messy. You would need to:
1. Boot your PC in safe mode and completely wipe all current display drivers with a program like DDU (Display Driver Uninstaller).
2. Install an old Radeon Software package with a working AMDOCL.dll, find the file in C:\Windows\System32\DriverStore, take ownership of it, and make a backup of it.
3. Boot in safe mode again and wipe ALL display drivers again, while preventing Windows Update from installing any drivers.
4. Install the newest Radeon Software package, reboot in safe mode, and overwrite its AMDOCL.dll with your backup.
(This could work.)
Ohhh, and then you would need to repeat the safe-mode boot and overwrite the AMDOCL.dll every time you update your GPU drivers.....
Yeah, that's a lot to go through. I too would just crunch what works. |
|
|
|
I ran a residue test for OCL2 and got a different, wrong residue every time.
(6600XT, driver 21.10.2)
.\geneferocl_windows_3.3.3-2.exe -r -x ocl2
Output:
geneferocl 3.3.3-2 (Windows/OpenCL/32-bit)
Copyright 2001-2018, Yves Gallot
Copyright 2009, Mark Rodenkirch, David Underbakke
Copyright 2010-2012, Shoichiro Yamada, Ken Brazier
Copyright 2011-2014, Michael Goetz, Ronald Schneider
Copyright 2011-2018, Iain Bethune
Genefer is free source code, under the MIT license.
Command line: C:\boinc\projects\www.primegrid.com\geneferocl_windows_3.3.3-2.exe -r -x ocl2
Normal priority change succeeded.
Running tests for transform implementation "OCL2"
Testing 10000^32+1...
Using OCL2 transform
Running on platform 'AMD Accelerated Parallel Processing', device 'gfx1032', vendor 'Advanced Micro Devices, Inc.', version 'OpenCL 1.2 AMD-APP (3302.6)' and driver '3302.6 (PAL,LC)'.
16 computeUnits @ 2359MHz, memSize=3072MB, cacheSize=16kB, cacheLineSize=64B, localMemSize=64kB, maxWorkGroupSize=256.
Starting initialization...
Initialization complete (0.001 seconds).
Expected residue [8d38b1fa99437373] does not match actual residue [118b70e8f7b28b0f]
maxErr exceeded for 10000^32+1, 1.0000 > 0.4500 during final check
Second time:
Expected residue [8d38b1fa99437373] does not match actual residue [0005b8ef3b09d4a9]
Third time:
Expected residue [8d38b1fa99437373] does not match actual residue [31e2a6ac297e6c14]
What else can I test? The OCL, OCL3, OCL4, and OCL5 transforms all pass the residue tests for small Ns; I didn't wait for the larger Ns. |
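If anyone wants to repeat the residue test many times and summarize the outcome, something along these lines could work. It is a sketch only: the regular expression is modeled on the mismatch lines above and the function name is made up. It only tells you whether the wrong residues differ between runs; it says nothing about why.

```python
import re

# Modeled on the mismatch lines printed by the residue test above.
RESIDUE_RE = re.compile(
    r"Expected residue \[(?P<expected>[0-9a-f]{16})\] does not match "
    r"actual residue \[(?P<actual>[0-9a-f]{16})\]"
)

def summarize_residues(log_text):
    """Count mismatch lines and check whether the wrong residues vary."""
    pairs = [(m["expected"], m["actual"]) for m in RESIDUE_RE.finditer(log_text)]
    actuals = {actual for _, actual in pairs}
    return {
        "runs": len(pairs),
        "mismatches": sum(e != a for e, a in pairs),
        # Different wrong answers for identical input suggest the computation
        # is unstable, rather than consistently wrong.
        "nondeterministic": len(actuals) > 1,
    }

log = """Expected residue [8d38b1fa99437373] does not match actual residue [118b70e8f7b28b0f]
Expected residue [8d38b1fa99437373] does not match actual residue [0005b8ef3b09d4a9]
Expected residue [8d38b1fa99437373] does not match actual residue [31e2a6ac297e6c14]
"""
summary = summarize_residues(log)
print(summary)
```

Feed it the concatenated output of several `-r -x ocl2` runs; passing runs print no mismatch line, so they simply do not show up in the counts.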
|
|
|
Just created a bug report with AMD.
I'm not too optimistic, I must admit, as I could not select the "game" from their "affected application" list :-) |
|
|
|
Good news! Update 21.12.1 fixed the error.
https://www.amd.com/en/support/kb/release-notes/rn-rad-win-21-12-1
The new OpenCL driver (3354.13) that AMD released in December performs the OCL2 transform properly!
Say yes to previously blacklisted GFN tasks again!
... Or after the Geminids Shower Challenge. (XD) |
|
|
|
AMD finally FIXED the RX6000 series OpenCL driver (the OCL2 transform problem) in their new Adrenalin 21.12.1 Optional driver package, released 3 Dec 2021
____________
|
|
|
mikey Send message
Joined: 17 Mar 09 Posts: 1910 ID: 37043 Credit: 834,776,745 RAC: 811,725
                     
|
AMD finally FIXED the RX6000 series OpenCL driver (the OCL2 transform problem) in their new Adrenalin 21.12.1 Optional driver package, released 3 Dec 2021
WOO HOO!!! |
|
|
|
Indeed... works here, too.
Z |
|
|
|
The newest drivers, 22.3.1, have this same problem. Glad a friend found this thread for me. Going to try reverting to 21.12.1....
|
|
|
|
I'm running version 22.5.1 with an RX6800 and it appears the OCL bug has returned, or was never fixed?? Since I game on this machine I'd like to use the upscaling features... however, with these newest certified drivers the OCL bug exists and I am unable to run anything from GFN-15 to GFN-19. 20 and above is nice, but that's not always what I'd like to run.
Has anyone found any recent beta or other releases where OCL2 works properly with the 6000 series AMD cards?
|
|
|
|
Totally randomly I saw 22.6.1 listed on TechPowerUp. I installed it as an upgrade with no hassles and it appears to be working now on all the GFNs!
22.6.1 was not showing up in AMD's software package, so if anyone needs to download it, I grabbed it from here - https://www.techpowerup.com/download/amd-radeon-graphics-drivers/
I have not tried the latest beta 22.10.1. |
|
|