
13 proper handling of batches #17


Merged: 27 commits into master from 13-proper-handling-of-batches on Nov 9, 2021

Conversation

@pzimbrod (Contributor) commented on Nov 2, 2021

Resolves #13

@pzimbrod (Contributor, Author) commented on Nov 9, 2021

The following things had to be changed:

  • For the sake of extensibility, NNlib's batched routines have been ditched in favor of OMEinsum. Looping over 3-dimensional arrays would suffice for a 1D problem, but certainly not for higher-dimensional ones (see the first sketch after this list).
  • Pre-allocation of arrays had to be dropped for now. In combination with OMEinsum, gains were marginal at best anyway.
  • Fixed a problem where the constructor and subsequent training wouldn't work when bias was set to false.
  • Typed every part of the data structure. This should allow the LLVM compiler to produce more optimized code.
  • Removed the batchsize argument from the initialization function, as it's no longer needed. This increases the flexibility of the layer considerably.
  • Had to add some permutations at the beginning and end of the layer pass. Otherwise it's not possible to satisfy both CuFFT's requirement that batch dimensions be sequential and the requirements of Flux.DataLoader at the same time (see the second sketch after this list).
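
A minimal sketch of the kind of batched contraction this refers to, assuming a 1D spectral multiplication; the shapes, index labels, and variable names below are illustrative, not necessarily the ones used in the layer:

```julia
using OMEinsum

# Hypothetical shapes for a 1D spectral multiplication:
#   x_ft : (in_channels, modes, batch)        – Fourier coefficients of the input
#   W    : (in_channels, out_channels, modes) – learnable spectral weights
in_ch, out_ch, modes, batch = 4, 8, 16, 32
x_ft = randn(ComplexF32, in_ch, modes, batch)
W    = randn(ComplexF32, in_ch, out_ch, modes)

# NNlib.batched_mul only handles 3-dimensional arrays with the batch dimension last,
# which covers the 1D case but not higher-dimensional problems. With OMEinsum the
# contraction is a single index expression that extends to 2D/3D simply by adding
# spatial indices (e.g. ein"ioxy,ixyb->oxyb"):
y_ft = ein"iox,ixb->oxb"(W, x_ft)   # (out_channels, modes, batch)
@assert size(y_ft) == (out_ch, modes, batch)
```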
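
And a rough sketch of the permutation mentioned in the last point, again assuming the 1D case and Flux's (channels, spatial, batch) data layout; the function name and dimension ordering here are illustrative only:

```julia
using FFTW  # on the GPU, the same fft/ifft calls dispatch to CuFFT via CUDA.jl

# Flux.DataLoader delivers batches as (channels, spatial, batch), while the
# batched FFT wants the transformed (spatial) dimension leading and all
# non-transformed dimensions trailing and contiguous.
function spectral_pass(x)
    xp   = permutedims(x, (2, 1, 3))   # -> (spatial, channels, batch)
    x_ft = fft(xp, 1)                  # batched FFT over the spatial dimension
    # ... the spectral weight multiplication on the retained modes would go here ...
    yp   = real(ifft(x_ft, 1))         # back to physical space
    return permutedims(yp, (2, 1, 3))  # restore (channels, spatial, batch) for Flux
end

x = randn(Float32, 4, 64, 32)          # (channels, spatial, batch)
@assert size(spectral_pass(x)) == size(x)
```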

@pzimbrod merged commit cb41f77 into master on Nov 9, 2021
@pzimbrod deleted the 13-proper-handling-of-batches branch on January 31, 2022