Analyze performance of your application running on GPU and identify headroom for optimization:
- Explore the basic performance metrics and identify top hotspots for optimization using the
GPU Roofline Summary
- Visualize performance of your kernels against hardware-imposed performance ceilings and explore the relationships between your kernels and different memory levels using the
GPU Roofline chart
- Analyze performance and memory metrics for specific kernels, identify headroom for optimization, and get actionable recommendations helping you optimize your application performance using the
GPU Details tab
- Compare results of different optimization iterations using
Roofline Compare functionality