How to handle hidden state reset? #577

Babylonehy · 2024-09-28T14:37:48Z

I have a BLD tensor as input to Mamba, and the output is also BLD. This is for training. I want to know: each time a new batch is given, is the hidden state reset?
For inference, I pass through one image at a time. How do I keep the hidden state until the end of the sequence? And for a new sequence, how do I manually reset the hidden state?

gkianfar · 2024-10-18T15:03:50Z

@Babylonehy I have the same question about how the hidden state reset is handled. Have you found any answers?

Hprairie · 2024-11-05T01:54:12Z

You can pass it in as different samples in a batch and the hidden state for each one will be kept different. You can think of it as essentially just doing a different scan for each sample in B for BLD, starting with a hidden state initialized to 0 for each of the samples in the batch.
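
To make the point above concrete, here is a minimal sketch in plain Python. It uses a toy scalar recurrence (`h_t = a*h_{t-1} + b*x_t`, `y_t = c*h_t`) as a stand-in for the selective scan; the names `scan_one` and `scan_batch` and the constants are made up for illustration and are not part of mamba_ssm. It only shows the behaviour described: each sample gets its own scan starting from a zero state, so nothing leaks between samples or between batches.

```python
def scan_one(xs, a=0.9, b=1.0, c=1.0):
    """Toy recurrence over one sequence, always starting from h = 0."""
    h = 0.0  # fresh hidden state for every sequence
    ys = []
    for x in xs:
        h = a * h + b * x   # state update
        ys.append(c * h)    # readout
    return ys


def scan_batch(batch):
    """Run an independent scan per sample (the B dimension of BLD)."""
    return [scan_one(xs) for xs in batch]


# Two different batches that share the same first sample.
batch_a = [[1.0, 2.0, 3.0], [10.0, 20.0, 30.0]]
batch_b = [[1.0, 2.0, 3.0], [-5.0, 0.0, 5.0]]

out_a = scan_batch(batch_a)
out_b = scan_batch(batch_b)

# The shared sample gives the same output in both batches, because its
# state never sees the other sample or the previous batch.
print(out_a[0] == out_b[0])
```

In this sketch the "reset" is implicit: the state is a local variable that is re-created on every call, which mirrors a training forward pass where no state is carried over.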

Babylonehy · 2024-11-08T02:41:08Z

> You can pass it in as different samples in a batch and the hidden state for each one will be kept different. You can think of it as essentially just doing a different scan for each sample in B for BLD, starting with a hidden state initialized to 0 for each of the samples in the batch.

The training is fine, but the issue is that during online inference with a video sequence, I can’t obtain the full data for dimension B; I only have 1. However, I still want the hidden state to be passed across different frames within the same video sequence.

Hprairie · 2024-11-08T15:35:23Z

They have an inference mode, which will cache the hidden state for inference. I would take a look at InferenceParams, which will call this function in Mamba.
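
The idea of caching the state between calls can be sketched with the same kind of toy scalar recurrence. This is not the mamba_ssm API: `StreamingScan`, `step`, `reset`, and `full_scan` are hypothetical names for illustration, and the real cache lives in the `InferenceParams` object mentioned above. The sketch only shows the pattern: keep the state across frames of one video, and clear it explicitly when a new sequence starts.

```python
class StreamingScan:
    """Toy stateful recurrence: h_t = a*h_{t-1} + b*x_t, y_t = c*h_t."""

    def __init__(self, a=0.9, b=1.0, c=1.0):
        self.a, self.b, self.c = a, b, c
        self.h = 0.0  # cached hidden state, kept between calls

    def reset(self):
        """Call at the start of every new sequence."""
        self.h = 0.0

    def step(self, x):
        """Consume one frame and update the cached state."""
        self.h = self.a * self.h + self.b * x
        return self.c * self.h


def full_scan(xs, a=0.9, b=1.0, c=1.0):
    """Offline reference: process the whole sequence at once from h = 0."""
    h, ys = 0.0, []
    for x in xs:
        h = a * h + b * x
        ys.append(c * h)
    return ys


video = [1.0, 2.0, 3.0, 4.0]

model = StreamingScan()
streamed = [model.step(frame) for frame in video]  # one frame per call
offline = full_scan(video)

# Without a reset, the next sequence would continue from the old state.
leaked_first = model.step(video[0])

model.reset()  # new video: start again from a zero state
second = [model.step(frame) for frame in video]

print(streamed == offline, second == offline, leaked_first == offline[0])
```

Stepping frame by frame with a cached state reproduces the full-sequence scan, and forgetting the reset makes the first frame of the next video depend on the previous one.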
