Benchmark results

Benchmark results#

Note

The results shown here are based on quite naive, straight forward implementations to use the GPU as a calculation backend. The best possible performance is depending on many things, such as the type of the GPU, driver versions, library versions and bandwidth between CPU and GPU contexts for example.

To achieve the best possible performance, care should be taken to keep the amount of data transfer between CPU and GPU contexts as low as possible.

view_names = ["base", "slight01", "medium01"]
host_name = "NUC"
engine_names = ["naive", "cupy", "numba-cuda", "torch-cuda"]

We have 120 results to analyze.