APP Profiler
2015-10-23 19:07
169 查看
APP Profiler
AMD APP Profiler has been succeeded by AMD’s CodeXL. With the exception of critical
bug fixes, there will be no new versions of APP Profiler. For new features, continuous support and a unified developer tools suite, please visit AMD’s CodeXL web page and check out the CodeXL
Forum.
The AMD APP Profiler is a performance analysis tool that gathers data from the OpenCL™ run-time and AMD Radeon™ GPUs during the execution of an OpenCL™ application. We can then use this information to discover bottlenecks in an application and find ways to
optimize the application’s performance for AMD platforms.
Go to Downloads
OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos.
New in Version 2.5
Support for AMD APP SDK v2.7.Support for OpenCL™ 1.2.
Support for collecting performance counters on APU devices.
Full support for profiling with AMD Radeon™ HD7000 series GPUs based on Graphics Core Next/Southern Islands:
Added support for kernel occupancy analysis.
Added support for collecting performance counters for DirectCompute (DirectX 11) applications.
Addition of SALUBusy counter.
Fixed value reported for VALUBusy counter.
The values reported for LDSFetchInsts and LDSWriteInsts counters were inaccurate on AMD Radeon™ HD7000 series GPUs; thus for those GPUs, those two counters have been replaced by a single LDSInsts counter.
Fixed display of kernel ISA.
Improved OpenCL™ analysis module:
Added detection of deprecated OpenCL™ APIs.
Added support for showing source and destination location, as well as zero-copy status for memory transfers initiated using clEnqueueMapBuffer or clEnqueueMapImage. This information is displayed in the API Trace view.
Added support for Microsoft® Visual Studio® projects that use User-defined Macros in the project settings.
Fixed the –workingdirectory (-w) command line switch (set current directory) on Linux.
Fixed some problems with importing previously-generated profile results into Microsoft® Visual Studio®.
Changed the default installation directory on Windows to %PROGRAMFILES(X86)%\AMD\AMD APP Profiler to make it more consistent with other AMD tools (i.e gDEBugger and CodeAnalyst)
Getting Started
RequirementsMicrosoft Windows Vista or 7 (32 bit/64 bit) or Linux 32-bit/64-bit
[Optional] Microsoft Visual Studio 2008 and 2010 (Standard/Professional/Team System Edition)
To profile OpenCL™ applications:
AMD APP SDK v2.7 or later
[GPU device] AMD Catalyst with OpenCL™ GPU support (12.4 or newer)
[GPU device] AMD Radeon™ HD 4000 series or newer
To profile DirectCompute applications:
Microsoft DirectX run-time (June 2010 or later)
AMD Radeon™ HD 5000 series or newer
Features
Collect OpenCL™ Application TraceView and debug the input parameters and output results for all OpenCL™ API calls
Search the API calls
Navigate to the source code that called an OpenCL™ API
Specify which OpenCL™ APIs will be traced
Collect GPU Performance Counters of AMD Radeon™ graphics cards
Show kernel resource usages
Show the number of instructions executed by the GPU
Show the GPU utilization
Show the GPU memory access characteristics
Measure kernel execution time
OpenCL™ Timeline visualization
Visualize the application high level structure
Visualize kernel execution and data transfer operations
Visualize host code execution
Annotate host code in the timeline with performance markers using the included CLPerfMarkerAMD library
OpenCL™ Application Summary pages
Find incorrect or inefficient usage of the OpenCL™ API using the OpenCL™ analysis module
Find the API hotspots
Find the bottleneck between kernel execution and data transfer operations
Find the top 10 data transfer and kernel execution operations
OpenCL™ Kernel Occupancy Viewer
Calculates and displays a kernel occupancy number, which estimates the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support
Find out which kernel resource (GPR usage, LDS size, or Work-group size) is currently limiting the number of in-flight wavefronts
Displays graphs showing how kernel occupancy would be affected by changes in each kernel resource
Display the AMD IL and ISA (hardware disassembly) code of the kernel for OpenCL™ kernels and DXASM code for DirectCompute kernels.
Support
AMD APP Profiler Knowledge BaseGPU Developer Tool Forum
Resources
AMD Accelerated Parallel Processing OpenCL Programming GuideAMD APP Profiler’s article at Siggraph 2010
AMD OpenCL™ Zone
Download
File Name | Version | Size | Launch Date | OS | Bitness | Description |
---|---|---|---|---|---|---|
Linux® | ||||||
AMDAPPProfiler-v2.5.1804-lnx.tgz | 2.5 | 3 MB | 05/16/2012 | 32-bit/64-bit | AMD APP Profiler for OpenCL on Linux platforms | |
Windows® | ||||||
AMDAPPProfiler-v2.5.1777.msi | 2.5 | 8 MB | 05/16/2012 | 32-/64-bit | AMD APP Profiler for OpenCL on Windows platforms |
相关文章推荐
- Unity 分析器(仅专业版) Profiler (Pro only)
- Android从服务器获取.9图片失效处理
- Scrollview嵌套Listview出现的问题
- EventBus-再也不用什么Handler了
- iOS开发 -- 百度地图api的使用
- KeyBoard 相关小记
- KeyBoard 相关小记
- android访问数据库时报runtiameException,和空指针异常
- 4.Swift初探-2048小游戏中的问题
- [Unity]关于多角色自由切换主操作角色的问题
- iOS开发 -- 利用dispatch_once创建单例
- iOS开发 -- 懒加载
- iOS开发 -- main.m文件解析
- Android沉浸式状态栏、导航栏
- iOS开发 -- SVN的使用
- ViewPager图片轮播
- Android Notification详解——响应notification事件
- Android Studio 混淆出现can't find referenced method 的解决方法
- android 版本在线更新
- Android内存泄漏检测-LeakCanary