Skip to content

Commit 1d7ea33

Browse files
committed
Update Readme and ReleaseNotes
1 parent bddd95f commit 1d7ea33

File tree

2 files changed

+18
-11
lines changed

2 files changed

+18
-11
lines changed

README.md

Lines changed: 17 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ Prebuilt binaries can be downloaded from the Releases page: https://github.com/G
3131
* Provides access to some raw hardware counters. See [Raw Hardware Counters](#raw-hardware-counters) for more information.
3232

3333
## What's New
34-
## Version 3.10 (01/255555/22)
34+
## Version 3.10 (01/25/22)
3535
* Add support for additional GPUs and APUs.
3636
* Redefined derived counters on GCN (Vega), RDNA, and RDNA2 hardware.
3737
* New pipeline-based counters to better match hardware behavior.
@@ -80,6 +80,7 @@ Prebuilt binaries can be downloaded from the Releases page: https://github.com/G
8080
* New entrypoint added: GpaGetDeviceGeneration. Binary backwards compatibility is maintained.
8181
* OpenGL on Linux: Fixed hardware detection on MESA drivers.
8282
* OpenGL: Fixed hardware detection accuracy.
83+
* Setting GPA_OVERRIDE_LOG_LEVEL env var to an integer equal to a GpaLoggingType enum can be used to increase or decrease logging output.
8384
* DX11:
8485
* Fixed Adrenalin driver version detection.
8586
* Fixed setting the number of shader arrays based on client hardware.
@@ -147,18 +148,27 @@ It was discovered that the improvements introduced in Vega, RDNA, and RDNA2 arch
147148
| CS | | | | | | CS |
148149

149150
## Known Issues
151+
### Counter Validity on Specific Hardware
152+
There are some counters that are returning unexpected results on specific hardware with certain APIs.
153+
* AMD Radeon RX 6800, DX12: HiZ and PreZ counters may consistently report 33% higher than expected.
154+
* AMD Radeon RX 6800, DX11: CSThreadGroups may consistently report 33% higher than expected.
155+
* AMD Radeon RX 6700M, DX11: CSLDSBankConflict and CSLDSBankConflictCycles may consistently report as much as 30x higher than expected.
156+
* AMD Radeon RX 5000 Series, DX12: ScalarCacheHitCount may consistently report 0.
157+
* AMD Radeon RX 480, DX12: CulledPrims and PSPixelsOut may inconsistently report higher than expected.
158+
150159
### Counter Validation Errors in D3D12ColorCube Sample App
151160
Due to the extensive counter validation now being done in the D3D12ColorCube sample application, and some expected variation in nondeterministic counters across a wide range of systems, the sample app may report errors on some systems. Likewise, some counters are marked as known issues and we are investigating the underlying causes of the inconsistent results.
152161

153-
Additionally, the following deterministic performance counter values may not be accurate for the D3D12ColorCube sample application:
154-
* CulledPrims, PSPixelsOut on Radeon RX 480 hardware.
155-
156-
### Ubuntu 20.04 LTS Vulkan ICD Issue
157-
On Ubuntu 20.04 LTS, Vulkan ICD may not be set to use AMD Vulkan ICD. In this case, it needs to be explicitly set to use AMD Vulkan ICD before using the GPA. It can be done by setting the ```VK_ICD_FILENAMES``` environment variable to ```/etc/vulkan/icd.d/amd_icd64.json```.
162+
### OpenCL Performance Counter Accuracy For Radeon 6000 Series GPUs
163+
The following performance counter values may not be accurate for OpenCL applications running on Radeon 6000 Series GPUs:
164+
* Wavefronts, VALUInsts, SALUInsts, SALUBusy, VALUUtilization: These values should be representative of performance, but may not be 100% accurate.
158165

159166
### OpenGL FetchSize Counter on Radeon RX 6000 Series GPUs
160167
FetchSize counter will show an error when enabled on Radeon RX 6000 Series GPUs using OpenGL.
161168

169+
### Ubuntu 20.04 LTS Vulkan ICD Issue
170+
On Ubuntu 20.04 LTS, Vulkan ICD may not be set to use AMD Vulkan ICD. In this case, it needs to be explicitly set to use AMD Vulkan ICD before using the GPA. It can be done by setting the ```VK_ICD_FILENAMES``` environment variable to ```/etc/vulkan/icd.d/amd_icd64.json```.
171+
162172
### Adjusting Linux Clock Mode
163173
Adjusting the GPU clock mode on Linux is accomplished by writing to: ```/sys/class/drm/card\<N\>/device/power_dpm_force_performance_level```, where \<N\> is the index of the card in question.
164174

@@ -168,14 +178,10 @@ By default this file is only modifiable by root, so the application being profil
168178
* You may have to reboot the system for the change to take effect.
169179
* Setting the GPU clock mode is not working correctly for <b>Radeon 5700 Series GPUs</b>, potentially leading to some inconsistencies in counter values from one run to the next.
170180

171-
### OpenCL Performance Counter Accuracy For Radeon 6000 Series GPUs
172-
The following performance counter values may not be accurate for OpenCL applications running on Radeon 6000 Series GPUs:
173-
* Wavefronts, VALUInsts, SALUInsts, SALUBusy, VALUUtilization: These values should be representative of performance, but may not be 100% accurate.
174-
175181
### Profiling Bundles
176182
Profiling bundles in DirectX12 and Vulkan is not working properly. It is recommended to remove those GPA Samples from your application, or move the calls out of the bundle for profiling.
177183

178184
## Style and Format Change
179185
The source code of this product is being reformatted to follow the Google C++ Style Guide https://google.github.io/styleguide/cppguide.html.
180186
In the interim you may encounter a mix of both an older C++ coding style, as well as the newer Google C++ Style.
181-
Please refer to the _clang-format file in the root directory of the product for additional style information.
187+
Please refer to the .clang-format file in the root directory of the product for additional style information.

ReleaseNotes.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -50,6 +50,7 @@
5050
* New entrypoint added: GpaGetDeviceGeneration. Binary backwards compatibility is maintained.
5151
* OpenGL on Linux: Fixed hardware detection on MESA drivers.
5252
* OpenGL: Fixed hardware detection accuracy.
53+
* Setting GPA_OVERRIDE_LOG_LEVEL env var to an integer equal to a GpaLoggingType enum can be used to increase or decrease logging output.
5354
* DX11:
5455
* Fixed Adrenalin driver version detection.
5556
* Fixed setting the number of shader arrays based on client hardware.

0 commit comments

Comments
 (0)