Explore code execution on various CPU and GPU cores on your platform, correlate CPU and GPU activity, and identify whether your application is GPU or CPU bound.
vtune -collect gpu-offload [-knob <knobName=knobValue>] -- <target> [target_options]
Knobs: collect-cpu-gpu-bandwidth, collect-programming-api, enable-stack-collection, enable-characterization-insights, target-gpu.
For the most current information on available knobs (configuration options) for the GPU Offload analysis, enter:
vtune -help collect gpu-offload
This example runs GPU Offload analysis with enabled tracing for GPU programming APIs on the specified Linux* application:
vtune -collect gpu-offload -knob collect-programming-api=true -- /home/test/myApplication
When the data collection is complete, do one of the following to view the result:
Use the -report action to view the data from command line.
Use the -report-output action to write report to a .txt or .csv file
Open the data collection result (*.vtune) in the VTune Profiler graphical interface.