Hi, a few things are not fully clear to me about Table 1. It says the convolution has LH parameters. How can that be, if the only learnable matrix, A, is of shape LxL? Maybe it is because A is diagonalizable plus low-rank, and we learn only the diagonal and neglect the low-rank part?
In Section 3.1, it says:
Shouldn't the time complexity be O(N^3L)?
In Table 1, why is S4's number of parameters H^2 and not LH? After all, Section 3.4 says the number of parameters is L == N, and we need H dimensions, which makes it LH.
The convolution column of the table is not an SSM convolution, but directly parameterizing the convolution's kernel elements (like a standard convolution). (This is mentioned in the footnote.) See this work for an example of people attempting this in practice: https://hazyresearch.stanford.edu/blog/2023-02-15-long-convs
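For concreteness, here is a minimal sketch of such a directly parameterized long convolution (the shapes and variable names are illustrative assumptions, not taken from the paper): every kernel element is a free parameter, so H channels with a length-L filter each give the LH count in Table 1.

```python
import numpy as np

H, L = 4, 16  # H channels, sequence length L
rng = np.random.default_rng(0)

# Directly learned kernel: one length-L filter per channel -> L*H parameters.
# An SSM instead *generates* its kernel from O(N) state-space parameters.
kernel = rng.standard_normal((H, L))
u = rng.standard_normal((H, L))  # input sequence, one row per channel

# Causal depthwise convolution: each channel convolved with its own filter
y = np.stack([np.convolve(u[h], kernel[h])[:L] for h in range(H)])

assert kernel.size == L * H  # the LH parameter count from Table 1
```

The key contrast with an SSM convolution is that here nothing constrains the filter; the blog post linked above studies what regularization is needed to make such unconstrained long kernels train well.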
It's a matrix-vector multiplication, not matrix-matrix, so each of the $L$ iterations costs $O(N^2)$, giving $O(N^2 L)$ total.
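A quick sketch of the recurrent view makes the cost visible (generic shapes and names, assumed for illustration): the state update applies the N x N matrix A to the length-N state vector, never to another matrix.

```python
import numpy as np

N, L = 8, 32  # state size N, sequence length L
rng = np.random.default_rng(0)
A = 0.1 * rng.standard_normal((N, N))  # state matrix, N x N
B = rng.standard_normal(N)             # input vector
u = rng.standard_normal(L)             # scalar input sequence

x = np.zeros(N)
for k in range(L):            # L iterations...
    x = A @ x + B * u[k]      # ...each an O(N^2) matrix-VECTOR product

# Total cost O(N^2 L): A is never multiplied by another N x N matrix,
# which is where an O(N^3 L) estimate would come from.
```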
I think you have misread something. S4's parameterization does not depend on the sequence length, and I don't see anything in Section 3.4 that implies it does.