Addition of CuPy as an Accelerated Computing Option #499

kian1377 · 2022-03-28T20:04:44Z

This is a bit more of an extensive PR as it seeks to add CuPy as an accelerated math option for many computations including FFTs, MFTs, and exponentials for propagation with the OpticalSystem or FresnelOpticalSystem classes. To start with, CuPy is a package for GPU accelerated computing. While POPPY has GPU computing options available with PyOpenCL and with pyculib (which has been deprecated by Numba in favor of CuPy), this implementation seeks to perform all calculations on the GPU until the end of propagation. This reduces time for calculations as arrays no longer need to be transferred between GPU memory and standard memory when performing different calculations.

CuPy has been designed to be very similar to Numpy and has many of the same features requiring the same syntax to use. An example is numpy.fft.fft2, which with CuPy is cupy.fft.fft2. The catch is that CuPy functions can not be used on Numpy arrays since Numpy arrays are stored on standard memory. As such, the implementation strategy was to use the statement “import cupy as np” when POPPY’s Config has been set to use CuPy. This makes performing all calculations on GPU more seamless as wavefront arrays for Wavefront or FresnelWavefront objects along with optical element phasors are automatically calculated as CuPy arrays.

In addition to using CuPy for basic computations of FFTs and exponentials, CuPy also enables the use of many SciPy functions through its cupyx functions. The cupyx functions can be imported much like many scipy functions with a statement such as “import cupyx.scipy.ndimage as ndimage”. So many functions for computing array rotations or interpolations are also imported through cupyx or scipy based on POPPY’s Config setup. The same method is also used to compute Bessel functions.

Currently, many optics have been tested for use with CuPy. A table with these optics is provided below with some comments regarding the functionality of some optics.

Optical Element Class Compatible with CuPy Comments

CircularAperture Yes

MultiCircularAperture Yes

SquareAperture Yes

RectangularAperture Yes

HexagonAperture Yes

MultiHexagonAperture Yes

NgonAperture Yes

SecondaryObscuration Yes

AsymmetricSecondaryObscuration Yes

CompoundAnalyticOptic Yes

ThinLens Yes

GaussianAperture Yes

KnifeEdge Yes

SquareFieldStop Yes These image plane elements have only been tested in Fraunhofer systems since it is difficult to use these elements in Fresnel systems anyway. This is because of the issue in converting to angular units in a Fresnel system.

RectagularFieldStop Yes

AnnularFieldStop Yes

HexagonFieldStop Yes

CircularOcculter Yes

BarOcculter Yes

BandLimitedCoronagraph Yes

IdealFQPM Yes

ScalarTransmission Yes

InverseTransmission Yes

ScalarOpticalPathDifference Yes

ZernikeWFE Yes

KolmogorovWFE Yes Does not pass test when using Tatarski power spectrum.

SineWaveWFE Yes

StatisticalPSDWFE Yes Had to implement hack found in Issue #452, pre-existing bug not related to CuPyNote: slight differences in numpy vs cupy random generators so does not produce the exact same result with the same seed value.

PowerSpectrumWFE Yes Had to assume opd is calculated in nanometers and then rescale it to meters, there should be a better solution to this. Note: slight differences in numpy vs cupy random generators so does not produce exact same result

ContinuousDeformableMirror Yes Influence function must be provided. When using .set_surface(), you should still provide a numpy.ndarray instead of a cupy.ndarray.

HexSegmentedDeformableMirror Yes These segmented optics are functional but when converted to an ArrayOpticalElement with fixed_sampling_optic(), the OPD is only 0. Still not 100% sure why but it does not seem to be an issue with CuPy because get_opd() functions as expected.

CircularSegmentedDeformableMirror Yes

ArrayOpticalElement Yes

FITSOpticalElement Yes

FixedSamplingImagePlaneElement Yes

All tests in the test suite for POPPY were also run. Currently, there is only an issue with the KolmogorovWFE test not passing due to a units issue in an exponential. This only comes up when using a Tatarski power spectrum. The tests have only been run with standard CPU computations to make sure POPPY is not running into critical errors because of the addition of CuPy even if a user isn’t using the CuPy feature.

Computation comparisons have been performed to illustrate the benefit of this accelerated computing feature. Below are comparisons of the times required for a PSF to be calculated for varying array sizes using the MKL FFT option versus the CuPy calculations. The optical systems tested had 5 different surfaces/optics. The system used for these comparisons was the University of Arizona’s HPC Puma nodes. The node utilized 32 AMD EPYC 7642 CPUs and the NVIDIA Tesla V100S GPU.

Propagation Type	Array Size	MKL Method Times [s]	CuPy Method Times [s]	Speed Up Factor
Fraunhofer	1024	0.218	0.0261	8.35
Fraunhofer	2048	0.755	0.0294	25.7
Fraunhofer	4096	3.36	0.0423	79.4
Fresnel	1024	0.714	0.0438	16.3
Fresnel	2048	4.16	0.0845	49.2
Fresnel	4096	17.5	0.225	77.8

One catch with this is none of POPPY’s display functionality is compatible with CuPy. This is because matplotlib cannot plot CuPy arrays since they are only on GPU memory. In order to obtain a Numpy array from a CuPy array, the “cupy.ndarray.get()” method can be used. So users can obtain intensity and phase arrays by adding .get().

It should be noted that the following POPPY features have not been tested for functionality with CuPy:

PhysicalFresnelWavefront
Active optics such as the TipTiltStage (found in active_optics.py)
The Instrument class
Special propagation features such as SemiAnalyticCoronagraph (found in special_prop.py)
Sub-sampled optics such as ShackHartmannWavefrontSensor
Floating-window centroid calculations

@douglase was also involved in the addition so I am including him in this PR.

Let me know your thoughts and what changes need to be made.

…to cupyx

…snel

…rent kinds

codecov · 2022-03-29T17:24:49Z

Codecov Report

Patch coverage: 79.57% and project coverage change: -0.78 ⚠️

Comparison is base (7b8d44a) 74.74% compared to head (c2f925b) 73.97%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #499      +/-   ##
===========================================
- Coverage    74.74%   73.97%   -0.78%     
===========================================
  Files           18       18              
  Lines         6502     6612     +110     
===========================================
+ Hits          4860     4891      +31     
- Misses        1642     1721      +79

Impacted Files	Coverage Δ
poppy/dms.py	`46.04% <36.53%> (-0.36%)`	⬇️
poppy/accel_math.py	`39.84% <60.00%> (+2.15%)`	⬆️
poppy/poppy_core.py	`79.72% <71.42%> (-0.95%)`	⬇️
poppy/utils.py	`53.56% <85.71%> (+0.27%)`	⬆️
poppy/optics.py	`81.81% <86.48%> (-0.11%)`	⬇️
poppy/wfe.py	`74.63% <87.50%> (-6.65%)`	⬇️
poppy/physical_wavefront.py	`92.59% <88.46%> (-0.92%)`	⬇️
poppy/fresnel.py	`84.95% <94.44%> (+0.02%)`	⬆️
poppy/zernike.py	`82.91% <94.64%> (+0.17%)`	⬆️
poppy/__init__.py	`98.00% <100.00%> (+0.08%)`	⬆️
... and 3 more

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

Pulling changes made to update POPPY support to 3.10

…and GPU (CuPy)

…mpatible

…nor fixed tilt() with cupy

…eWFE

kian1377 · 2023-03-03T17:27:10Z

I am working through it to see what I changed, but I dont recall ever changing anything related to that test. I remember adding a test called test_inwave_fresnel in #402, but dont remember touching any other tests.

Also, there are a couple minor changes I also made since yesterday which I will be pushing shortly.

BradleySappington · 2023-03-09T18:53:51Z

@kian1377 - FYI that development has been updated to settle failed tests. Fetch/Pull required.

kian1377 · 2023-03-10T17:08:15Z

@BradleySappington , Are you talking about the issue with the scikit-image test failing? I haven't seen any commits or changes to that test so I am a little confused as to what development you are talking about.

BradleySappington · 2023-03-10T17:15:56Z

@kian1377 I was speaking to the CI tests that have been historically failing for reasons unrelated to your branch. Looks like you just pushed develop so some of the CI tests should now pass.

kian1377 · 2023-03-10T17:24:25Z

Ok, I saw there was a conflict so I just resolved that and updated my repo by pulling some of the other commits.

kian1377 · 2023-03-10T18:07:17Z

@mperrin , I installed scikit-image onto my desktop again and I am recreating the issue with the scikit-image test. Because this is is an issue for both CPU and GPU versions of the code, is it necessary to resolve this within this PR? I was thinking this issue should be resolved in a separate PR so there is a more clear record of its problem and the solution.

Here is the current status of the tests on my desktop running with GPU.

mperrin · 2023-03-13T14:08:08Z

Bravo @kian1377 on getting the remainder of the tests passing! It's very nice to get the entire test suite passing again on GPU, and that gives a lot of confidence in the robustness of a PR this complicated.

I completely concur with splitting out the failing Fresnel text (which doesn't even get ran by the Github Actions CI) to a separate task in #552.

Let me give this one more review/reread in the next day or two (I was out last week), but I think we are very very close to pushing the merge button on this one :-)

kian1377 · 2023-03-13T16:18:52Z

Sounds good to me. I put some comments in the code so it is easy to understand why I made those changes, but feel free to delete those comments to clean up the code as you review the changes.

One of your initial concerns that I have not addressed are those repeated lines of code that update the accel_math settings in the initialization of objects because I continued switching between GPU and CPU while I was debugging the tests, so that was a feature I kept using.

Let me know if there are aspects of the code that I should change.

kian1377 · 2023-03-15T21:24:36Z

@mperrin One change that I just thought of that may make the code a little cleaner is instead of import cupy or numpy as _ncp, I could change it to just by xp. This way, anyone reading the code knows that xp is a stand in for np or cp. Just thought this may be a better practice to use so let me know if you want this to be changed.

mperrin · 2023-03-17T14:08:44Z

Hi @kian1377, yes, good suggestion. I read some and found that yes using xp is a common/recommended idiom for "this could be numpy or cupy", and I agree it's good practice to use common conventions like that when possible.

(Remind me which code editor you're using? I remember you had asked me about PyCharm which I use. That sort of global rename refactoring is super straightforward to do in PyCharm, with automated code refactoring tools that are much more sophisticated than just a string search-and-replace. One of the many reasons I like to use PyCharm, FYI)

mperrin · 2023-03-17T14:42:25Z

@kian1377 I'm doing a full reread/review now. FYI, I assume this will be OK with you, but for minor cleanup like removing commented-out lines, I'm just going to do that myself and push to this branch; it's as easy for me to do that as it would be to flag those lines here in GitHub.

To answer your earlier question about the repeated lines of code in the initialization of objects. I do think that's less than desirable to have blocks of repeated code. And isn't it a performance hit to have many many calls to update_math_settings which are most of the time going to be unnecessary? I don't want to get hung up on this, but I wonder if the use case of switching back and forth is rare enough or only needed in special cases for debugging, and so maybe it's not worth trying to add the extra complications to support switching arbitrarily in general?

kian1377 · 2023-03-17T17:47:11Z

I just went ahead and replaced all _ncp instances with xp and fixed a couple bugs that came up after doing so. I agree that we should remove those repeated blocks of code that are for switching between packages, but I kept using them for debugging purposes so if we wait until everything else in the PR is ready to be merged, then I can delete those lines and do a final check to make sure it didn't break anything else.

And feel free to delete some lines/comments that I had put in and push them directly. I just left those so it would be easier for you during code review.

…_settings, even if that makes it harder to toggle between GPU and CPU calculation backends.

mperrin

After lots of excellent work by @kian1377, plus several passes of close edits and review by me, I'm going to declare this good to go! A major enhancement to poppy with substantial speed improvements using GPU hardware.

The entire test suite passes using GPU hardware on my laptop. And likewise all tests pass using CPU, locally on my laptop and also on the Github Actions CI.

Comments below are just some minor notes and comments on places we could further tune in subsequent PRs later.

mperrin · 2023-03-17T14:30:46Z

poppy/dms.py

+                if isinstance(self.rotation, u.Quantity) and self.rotation.unit==u.degree:
+                    angle = -np.deg2rad(self.rotation).value


Does this change have anything to do with the GPU code? Or is it an unrelated bug fix? I guess the point is that you want the variable angle to be a bare float rather than a Quantity in the end, yes?

In any case this could be generalized & simplified to

if isinstance(self.rotation, u.Quantity): angle = -np.deg2rad(self.rotation.to_value(u.degree))

That will work for any input rotation unit.

mperrin · 2023-03-17T16:25:13Z

poppy/accel_math.py

+    elif _USE_CUPY:
+        return cp.fft.ifftshift(x)
    else:
        return np.fft.ifftshift(x)


Couldn't we simplify this whole part to use _ncp, or xp after that switch, to call the appropriate CPU/GPU code? In other words like:

else: return xp.fft.ifftshift(x)

This is true for several places here in accel_math.py.

For now I'm choosing to leave this as-is - we want to get this PR merged in rather than keep polishing indefinitely :-)

mperrin · 2023-03-17T16:27:01Z

poppy/accel_math.py

+    if _USE_CUPY: #########################################################################
+        do_fft = cp.fft.fft2 if forward else cp.fft.ifft2
+        if normalization is None:
+            normalization = 1./wavefront.shape[0] if forward else wavefront.shape[0]
+        wavefront = do_fft(wavefront)


Another place where we could simplify - I think we don't even need an _USE_CUPY branch of the if statement. The same case could handle plain numpy and cupy using the xp approach.

poppy/dms.py

mperrin · 2023-03-24T19:54:22Z

Everything passes (including local tests on GPU hardware).

For the record the one 'failing' CI check is the code coverage, which has a slight decrease in coverage. This is not surprising since there's no way currently to test GPU code on Github Actions. (See https://github.com/orgs/github/projects/4247/views/1?filterQuery=+GPU&pane=issue&itemId=4967370; this is a "future" feature for Github Actions, with no particular timeline announced)

Good to merge!

kian1377 · 2023-03-24T21:38:59Z

Thank you for doing a thorough review! Happy to have helped and contributed to this package once more. I can certainly address bugs or more general code updates in future pull requests. Have a great day!

…

On Fri, Mar 24, 2023 at 12:54 PM Marshall Perrin ***@***.***> wrote: *External Email* Merged #499 <#499> into develop. — Reply to this email directly, view it on GitHub <#499 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AMCLT2W2L6MYRZOK3MKUDUTW5X3YFANCNFSM5R4HJSKA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

Kian Milani and others added 13 commits February 22, 2022 13:42

modifications to use CuPy in Fresnel propagation

8f159a9

fixed issue with pad_or_crop_to_shape returning zeros with cupy

8890980

working on more elegant implementation of cupy

53e4e91

cupy working with FITSOpticalElements, changed resampling techniques …

9fac1b5

…to cupyx

Fraunhofer systems and DM functionality with Cupy

8a4a5f2

ContinuousDeformableMirror works with CuPy when given influence_func

07fd452

cupy integrated for more optics: particularly for WFE optics

f79bcfb

minor modifications for a couple optics to work with cupy

03e40ed

Merge branch 'spacetelescope:develop' into develop

d680b3e

cupy now functional with most optical elements for Fraunhofer and Fre…

c365e22

…snel

minor update for SegmentedDeformableMirrors to be CuPy compatible

b6d71d3

making sure tests pass, currently only Kolmogorov test does not pass

5f604e7

modified KolmogorovWFE.power_spectrum and test_wfe to work with diffe…

c091d60

…rent kinds

kian1377 and others added 2 commits April 4, 2022 10:51

Merge branch 'spacetelescope:develop' into develop

eaae159

minor update to rotation of analytic optics

2128f71

kian1377 mentioned this pull request May 24, 2022

Addition of CuPy as an Accelerated Computing Option uasal/poppy#1

Merged

Kian Milani and others added 13 commits May 26, 2022 15:09

minor update to check if a GPU exists when CuPy available

f42e8ea

Merge branch 'spacetelescope:develop' into develop

6961c19

changes to allow for switching between CPU and GPU with CuPy

ddd9306

Merge branch 'develop' of https://github.com/kian1377/poppy into develop

4c6588a

Pulling changes made to update POPPY support to 3.10

most functionality restored along with ability to switch between CPU …

70442dc

…and GPU (CuPy)

a few minor changes for scipy ndimage and special functions

1b9b131

minor update for accel_math test and cleaning up code

9457f8c

CuPy update for matrixDFT and FITSOpticalElement resample

fcc8ccd

FITSOpticalElement change for when only OPD is supplied to be CuPy co…

8ba5a6c

…mpatible

implemneted map_coordinates for wavefront resampling with cupy and mi…

34afa79

…nor fixed tilt() with cupy

Merge branch 'spacetelescope:develop' into develop

d6059af

updated zernike.py so switching between CPU and CuPy works for Zernik…

b697e07

…eWFE

minor fixes for BaseWavefront and zernike.arbitrary_basis

298f1cf

minor changes to the wfe and test_wfe files

b63bc3c

Merge branch 'develop' into develop

5affeec

making sure everything is up to date with local repo

d2edb25

douglase mentioned this pull request Mar 12, 2023

fresnel propagation test test_fresnel.test_Circular_Aperture_PTP_short fails for recent skimage versions #552

Open

renamed _ncp to xp to be consistent with common practices

e5c02e7

mperrin added 8 commits March 24, 2023 12:20

minor: cleanup comments and whitespace

02e3889

clean up / simplify imports. Avoid proliferating calls to update_math…

8cb000d

…_settings, even if that makes it harder to toggle between GPU and CPU calculation backends.

more whitespace and comment cleanup

4ea0d07

switch to xp convention for numpy/cupy in all files, including tests

4699058

Merge branch 'develop' into develop

105b83a

fix syntax error typo from prior commit

c71e2d2

fix to run update_math_settings once on initial import

4c38bd8

one more round of comment/whitespace cleanup for GPU code

c2f925b

mperrin approved these changes Mar 24, 2023

View reviewed changes

mperrin merged commit a8e641d into spacetelescope:develop Mar 24, 2023

JTBakerAO mentioned this pull request Mar 21, 2024

cupy support is not quite there #614

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Addition of CuPy as an Accelerated Computing Option #499

Addition of CuPy as an Accelerated Computing Option #499

kian1377 commented Mar 28, 2022

codecov bot commented Mar 29, 2022 •

edited

Loading

kian1377 commented Mar 3, 2023

BradleySappington commented Mar 9, 2023

kian1377 commented Mar 10, 2023

BradleySappington commented Mar 10, 2023

kian1377 commented Mar 10, 2023

kian1377 commented Mar 10, 2023

mperrin commented Mar 13, 2023

kian1377 commented Mar 13, 2023

kian1377 commented Mar 15, 2023

mperrin commented Mar 17, 2023

mperrin commented Mar 17, 2023

kian1377 commented Mar 17, 2023

mperrin left a comment

mperrin Mar 17, 2023

mperrin Mar 17, 2023

mperrin Mar 17, 2023

mperrin commented Mar 24, 2023

kian1377 commented Mar 24, 2023 via email

Optical Element Class	Compatible with CuPy	Comments
CircularAperture	Yes
MultiCircularAperture	Yes
SquareAperture	Yes
RectangularAperture	Yes
HexagonAperture	Yes
MultiHexagonAperture	Yes
NgonAperture	Yes
SecondaryObscuration	Yes
AsymmetricSecondaryObscuration	Yes
CompoundAnalyticOptic	Yes
ThinLens	Yes
GaussianAperture	Yes
KnifeEdge	Yes
SquareFieldStop	Yes	These image plane elements have only been tested in Fraunhofer systems since it is difficult to use these elements in Fresnel systems anyway. This is because of the issue in converting to angular units in a Fresnel system.
RectagularFieldStop	Yes
AnnularFieldStop	Yes
HexagonFieldStop	Yes
CircularOcculter	Yes
BarOcculter	Yes
BandLimitedCoronagraph	Yes
IdealFQPM	Yes
ScalarTransmission	Yes
InverseTransmission	Yes
ScalarOpticalPathDifference	Yes
ZernikeWFE	Yes
KolmogorovWFE	Yes	Does not pass test when using Tatarski power spectrum.
SineWaveWFE	Yes
StatisticalPSDWFE	Yes	Had to implement hack found in Issue #452, pre-existing bug not related to CuPyNote: slight differences in numpy vs cupy random generators so does not produce the exact same result with the same seed value.
PowerSpectrumWFE	Yes	Had to assume opd is calculated in nanometers and then rescale it to meters, there should be a better solution to this. Note: slight differences in numpy vs cupy random generators so does not produce exact same result
ContinuousDeformableMirror	Yes	Influence function must be provided. When using .set_surface(), you should still provide a numpy.ndarray instead of a cupy.ndarray.
HexSegmentedDeformableMirror	Yes	These segmented optics are functional but when converted to an ArrayOpticalElement with fixed_sampling_optic(), the OPD is only 0. Still not 100% sure why but it does not seem to be an issue with CuPy because get_opd() functions as expected.
CircularSegmentedDeformableMirror	Yes
ArrayOpticalElement	Yes
FITSOpticalElement	Yes
FixedSamplingImagePlaneElement	Yes

		if isinstance(self.rotation, u.Quantity) and self.rotation.unit==u.degree:
		angle = -np.deg2rad(self.rotation).value

Addition of CuPy as an Accelerated Computing Option #499

Addition of CuPy as an Accelerated Computing Option #499

Conversation

kian1377 commented Mar 28, 2022

codecov bot commented Mar 29, 2022 • edited Loading

Codecov Report

kian1377 commented Mar 3, 2023

BradleySappington commented Mar 9, 2023

kian1377 commented Mar 10, 2023

BradleySappington commented Mar 10, 2023

kian1377 commented Mar 10, 2023

kian1377 commented Mar 10, 2023

mperrin commented Mar 13, 2023

kian1377 commented Mar 13, 2023

kian1377 commented Mar 15, 2023

mperrin commented Mar 17, 2023

mperrin commented Mar 17, 2023

kian1377 commented Mar 17, 2023

mperrin left a comment

Choose a reason for hiding this comment

mperrin Mar 17, 2023

Choose a reason for hiding this comment

mperrin Mar 17, 2023

Choose a reason for hiding this comment

mperrin Mar 17, 2023

Choose a reason for hiding this comment

mperrin commented Mar 24, 2023

kian1377 commented Mar 24, 2023 via email

codecov bot commented Mar 29, 2022 •

edited

Loading