Performance
Benchmarks for Gridwise primitives over a logarithmically spaced range of input sizes, plotting throughput in GB/s against input length. Both CPU and GPU timings are reported. See the timing strategy documentation for how primitives are timed and how to interpret the results.
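As a rough illustration of the two quantities on such a plot, the sketch below generates logarithmically spaced input lengths and converts a measured time into GB/s. The helper names `logSpacedLengths` and `throughputGBps` are hypothetical and are not part of Gridwise; which bytes are counted (input only, or input plus output) is also an assumption that depends on the benchmark.

```javascript
// Hypothetical helpers for illustration; not part of the Gridwise API.

// Input lengths spaced evenly on a log2 scale, from 2^minExp to 2^maxExp.
function logSpacedLengths(minExp, maxExp, stepsPerOctave = 1) {
  const lengths = [];
  for (let i = minExp * stepsPerOctave; i <= maxExp * stepsPerOctave; i++) {
    lengths.push(Math.round(2 ** (i / stepsPerOctave)));
  }
  return lengths;
}

// Throughput in GB/s from bytes moved and elapsed time in nanoseconds.
// Bytes per nanosecond is numerically the same as 1e9 bytes per second.
function throughputGBps(bytesMoved, elapsedNs) {
  return bytesMoved / elapsedNs;
}

// Example: 2^20 four-byte elements processed in 10 microseconds
// (assumes only the input bytes are counted).
const lengths = logSpacedLengths(10, 12); // [1024, 2048, 4096]
const exampleGBps = throughputGBps(4 * 2 ** 20, 10_000); // about 419.43
```

Dividing bytes by nanoseconds avoids a separate unit conversion, which is convenient when the timer reports nanoseconds.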
Scan and Sort Performance
Benchmarks scan, reduce, and sort over a configurable range of input sizes. Choose the primitive, datatype, and sort direction; results are plotted after each run. Note that sort overwrites its input, so trials after the first measure a partially sorted array.
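The caveat about sort overwriting its input can be seen in a CPU-side analogy. The sketch below uses `TypedArray.prototype.sort` as a stand-in for an in-place sort; it is not how the benchmark page is implemented, and how sorted a GPU buffer actually is between trials depends on the benchmark. It shows one possible mitigation, which is to restore the input from an untouched copy before each trial.

```javascript
// CPU-side analogy only; Uint32Array.prototype.sort stands in for an in-place sort.
function isSorted(a) {
  for (let i = 1; i < a.length; i++) {
    if (a[i - 1] > a[i]) return false;
  }
  return true;
}

const pristine = Uint32Array.from([5, 3, 9, 1, 7, 2]);

// Without a reset: the second trial receives the first trial's output.
const reused = pristine.slice();
reused.sort(); // trial 1 sorts in place
const secondTrialSeesSortedInput = isSorted(reused);

// With a reset: every trial starts from a fresh copy of the unsorted data.
const inputWasSortedPerTrial = [];
for (let trial = 0; trial < 3; trial++) {
  const work = pristine.slice();
  inputWasSortedPerTrial.push(isSorted(work));
  work.sort();
}
```

The copy itself takes time, so it would need to stay outside the timed region.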
Reduce Performance
Benchmarks the reduce primitive over a configurable range of input sizes. Explains the warmup-then-trials timing strategy with inline code excerpts showing how to call getTimingResult after execution.
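For readers unfamiliar with the warmup-then-trials pattern, here is a generic sketch of the idea. It does not use or reproduce the Gridwise API (in particular, it is not `getTimingResult`); the function `timeWithWarmup` and its options are hypothetical, and it times a synchronous callback with `performance.now()` purely for illustration.

```javascript
// Generic sketch of warmup-then-trials timing; not the Gridwise implementation.
function timeWithWarmup(
  fn,
  { warmupCount = 3, trialCount = 10, now = () => performance.now() } = {}
) {
  // Warmup runs are executed but not timed, so one-time costs
  // (compilation, cache population) do not distort the measurement.
  for (let i = 0; i < warmupCount; i++) fn();

  // Time all trials together and report the mean per trial.
  const start = now();
  for (let i = 0; i < trialCount; i++) fn();
  const totalMs = now() - start;

  return { totalMs, meanMs: totalMs / trialCount };
}

// Usage: time a small CPU-side reduction.
const data = new Float32Array(1 << 16).fill(1);
const result = timeWithWarmup(() => data.reduce((acc, x) => acc + x, 0));
```

Timing the trials as one batch amortizes timer overhead across runs, at the cost of hiding per-trial variance.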