Performance

Benchmarks for Gridwise primitives over a range of input sizes (logarithmically spaced), plotting throughput in GB/s versus input length. Both CPU and GPU timing are reported. See the timing strategy documentation for how primitives are timed and how to interpret results.

Scan and Sort Performance

Benchmarks scan, reduce, and sort over a configurable range of input sizes. Choose the primitive, datatype, and sort direction; results are plotted after each run. Note that sort overwrites its input, so repeated trials measure a partially-sorted array.

Open Example Source

Reduce Performance

Benchmarks the reduce primitive over a configurable range of input sizes. Explains the warmup-then-trials timing strategy with inline code excerpts showing how to call getTimingResult after execution.

Open Example Source