Histogram: cuda backend
Description
This MR adds a cuda backend for nabu.misc.histogram
(PartialHistogram
).
Computing a (sub)volume histogram with numpy can be very slow (20 mins on 16 GB of data), so a cuda implementation will be beneficial even with Host-to-Device / Device-to-Host copies.
Close #172 (closed).
To do
-
Implement CudaPartialHistogram
-
Unit tests -
Integrate in full-field pipeline -
Test on "big"
Notes
The "fixed_bins_width" is not implemented in this backend.
Edited by Pierre Paleo