All of the Intel products above are available as part of the Intel oneAPI HPC Toolkit.
Set up the environment by running the setvars.bat script available in the installation directory (by default, Program Files (x86)\Intel\oneAPI\itac).
This sets the required environment variables for compilers, the Intel MPI Library, and Intel Trace Analyzer and Collector, and you are ready to trace your applications.
For more details, see the HPC Kit Get Started Guide. For system requirements, refer to the Release Notes.
Compile your application with the -trace option. For example:
> mpiicc -trace myApp.c
> mpiexec -n 4 myApp.exe
A trace file .stf is generated, for example myApp.stf. For more details on tracing, see Intel Trace Collector User and Reference Guide.
Open the generated .stf file in Intel Trace Analyzer:
> traceanalyzer myApp.stf
For detailed description of the available charts and diagrams, see Intel Trace Analyzer User and Reference Guide.
Intel Trace Analyzer and Collector has a full-color customizable GUI with charts and drill-down view options.
The analyzer is able to rapidly unwind the call stack and use debug information to map instruction addresses to source code.
With both command-line and GUI interfaces, you can additionally set up batch runs or do interactive debugging.
Trace Map represents a miniature view of the MPI functions activity over time. It also enables you to zoom into relevant subsets of a trace file on all open charts.
Hover over a GUI element for short description, or click for more details.
MPI Correctness Checking
The MPI Correctness Checker detects deadlocks, data corruption, and errors with MPI parameters, data types, buffers, communicators, point-to-point messages, and collective operations.
The Correctness Checker enables you to scale to large systems and detect errors even among a large number of processes.
Performance Assistant
Identify performance issues in your application using the Performance Assistant, which provides the following information:
Ideal Environment Simulation
Understand application imbalance and estimate a potential application speedup after MPI implementation tuning and/or network upgrades by using the Idealization capability.
User-Level Instrumentation
Control the profiling library and record user-defined activities, define groups of processes, define performance counters and record their values using the Intel® Trace Collector API.
Automatically instrument all your application functions with the Intel® compilers to analyze them with Intel Trace Analyzer.
Visit the Intel Trace Analyzer and Collector page on for training and documentation resources including user guides, videos, webinars, featured articles, and more.
Refer to the Release Notes for product updates and system requirements.