1
|
Select by Mouse Rect: Select one or more kernels by tracing a rectangle with your mouse.
Zoom by Mouse Rect: Zoom in and out by tracing a rectangle with your mouse. You can also zoom in and out using your mouse wheel.
Move View by Mouse: Move the chart left, right, up, and down.
Undo or
Redo: Undo or redo the previous zoom action.
|
2
|
Select the
Memory Level(s) to show for each kernel in the chart: CARM, L3, SLM, GTI, L3 + SLM.
|
3
|
Add visual
Guidance to the GPU Roofline chart to make the interpretation of data easier, including performance limits and whether kernels are memory bound, compute bound, or both.
Use the following checkboxes in the
Guidance drop-down toolbar:
Display roof rulers: Enable showing a vertical line from a kernel to the nearest and topmost performance ceilings. To view the ruler, hover the cursor over a kernel dot. Where the line intersects with each roof, labels display hardware performance limits for the kernel.
Show memory level relationships: Visually emphasize the relationships among displayed memory levels and roofs for a selected dot by enabling.
Show Roofline boundaries: Color the GPU Roofline zones to make it easier to see if enclosed kernels are fundamentally memory bound, compute bound, or bound by compute and memory roofs.
The preview picture is updated as you select guidance options, allowing you to see how changes will affect the GPU Roofline chart's appearance. Click
Apply to apply your changes or
Default to return the GPU Roofline chart to its original appearance.
|
4
|
Roofline View Settings: Change the default scale setting to show:
The
optimal scale (default), which adjusts to a chosen GPU Roofline chart view.
A
constant scale, which adjusts to the tallest or widest view and does not change when a different GPU Roofline chart view is chosen.
Roof Settings: Change the visibility and appearance of roofline representations (lines):
Click a
Visible checkbox to show/hide a roof line.
Click a
Selected checkbox to change a roof line appearance: display the roof line as a solid or a dashed line.
Manually fine-tune roof values in the
Value column to set hardware limits specific to your code.
Loop Weight Representation: Change the appearance of dots:
Point Weight Calculation: Change the
Base Value for a point weight calculation.
Note For a GPU Roofline chart, only
Self Elapsed Time is available as a base value.
Point Weight Ranges: Change the
Size,
Color, and weight
Range ® of a dot. Click the
+ button to split a point weight range in two. Click the
- button to merge a point weight range with the range below.
Point Colorization: Color dots by
weight ranges or by
type (vectorized or scalar). You can also change the color of loop with no self time.
|
5
|
Hover your mouse over a dot to display metrics and, if enabled, a roof ruler for it.
By default, Intel Advisor Beta generates a
GPU Cache-Aware Roofline Model (CARM), which reports memory traffic, in bytes, generated by all execution units.
If
Show memory level relationships is enabled: Double-click a dot or select a dot and press
SPACE or
ENTER to display labeled dots representing memory levels for the selected kernel. Lines connect the dots to indicate that they correspond to the selected kernel.
Note If you have chosen to display only
some memory levels in the chart using the
Memory Level toolbar, unselected memory levels are displayed with
X marks.
To hide the labeled dots, do one of the following:
Right-click a kernel dot or a blank area in the Roofline chart to perform more functions, such as:
|