power spectrum module optimization and parallelization by lgarrison · Pull Request #102 · abacusorg/abacusutils

lgarrison · 2023-07-12T17:37:34Z

Some optimization, parallelization, and Numba-fication of various parts of the power spectrum module. Benchmark scripts used to produce the timings in the ZCV paper are included.

…the (k,mu) counts

… calc_pk_from_deltak. Refactor handling of poles argument as numba workaround.

lgarrison · 2023-07-12T22:05:07Z

@boryanah Here are the changes we talked about. It's about half optimization and half refactoring. I tried to rename some things that were confusing to me, but let me know if I misunderstood anything. I also tried to cut down on the number of arguments and return values in some places. For example, I changed calc_power() to return an Astropy Table; let me know if you think that makes sense.

boryanah · 2023-07-14T01:47:09Z

That looks great to me! The optimizations make sense (I am sorry I didn't implement the normalization one myself). The astropy table for the power spectrum with the meta data is great, and I think it's a good solution for outputting a single object that contains some useful information about the simulation. I think it makes sense why you got rid of some of the del X; gc.collect() (they probably weren't doing anything as the variables were passed externally rather than locally defined). I also like the variable and function name changes, and I think they make more sense.

lgarrison · 2023-07-14T14:08:43Z

Yeah, that's exactly right about the gc.collect(). The other reason was that garbage collection was making the timings really noisy; there might be one or two that could go back in, but for the most part I think they weren't doing anything. And if we really wanted to save memory, there are other ways: using an in-place FFT (pyfftw supports this, not sure if scipy.fft does), and adding on-the-fly offsets for the interlacing calculation (right now it makes a whole copy of the input data).

lgarrison added 2 commits July 12, 2023 12:35

power: add benchmark scripts

cffade3

power: optimization, parallelization, numba-fication, formatting

402db70

lgarrison force-pushed the pk_bench branch from 43cdd99 to 402db70 Compare July 12, 2023 17:39

lgarrison marked this pull request as draft July 12, 2023 17:39

lgarrison added 5 commits July 12, 2023 13:40

power: doc

ef843d2

Merge branch 'main' into pk_bench

1567c59

power: don't count modes for poles directly; it can be computed from …

5b4e91c

…the (k,mu) counts

power: refactor calc_power to return astropy Table. Rename calc_pk ->…

e9fece0

… calc_pk_from_deltak. Refactor handling of poles argument as numba workaround.

power: update tutorials and scripts for refactored code

900e4ff

lgarrison marked this pull request as ready for review July 12, 2023 22:05

lgarrison requested a review from boryanah July 12, 2023 22:07

changelog

66aa855

lgarrison merged commit b0ae7b7 into main Jul 14, 2023

lgarrison deleted the pk_bench branch July 14, 2023 14:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

power spectrum module optimization and parallelization#102

power spectrum module optimization and parallelization#102
lgarrison merged 8 commits intomainfrom
pk_bench

lgarrison commented Jul 12, 2023

Uh oh!

lgarrison commented Jul 12, 2023

Uh oh!

boryanah commented Jul 14, 2023

Uh oh!

lgarrison commented Jul 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lgarrison commented Jul 12, 2023

Uh oh!

lgarrison commented Jul 12, 2023

Uh oh!

boryanah commented Jul 14, 2023

Uh oh!

lgarrison commented Jul 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants