Zhang Zhiming's Personal Blog

Reversible structures let backpropagation reconstruct each block's input from its forward-pass output, so intermediate activations do not need to be stored and GPU memory usage drops significantly.
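To make the reconstruction idea concrete, here is a minimal sketch of one reversible block. The post does not say which reversible structure it has in mind, so this assumes an additive coupling of the kind used in RevNet-style blocks. The functions `f` and `g` are hypothetical stand-ins for learned sub-layers, and the input is assumed to be split into two halves `x1` and `x2`.

```python
import math

def f(v):
    # Hypothetical sub-layer F; it does not need to be invertible itself.
    return [math.tanh(2.0 * a) for a in v]

def g(v):
    # Hypothetical sub-layer G; again, any deterministic function works.
    return [0.5 * a * a for a in v]

def forward(x1, x2):
    # Additive coupling: each half is updated using only the other half.
    y1 = [a + b for a, b in zip(x1, f(x2))]
    y2 = [a + b for a, b in zip(x2, g(y1))]
    return y1, y2

def inverse(y1, y2):
    # Undo the two updates in reverse order to recover the inputs
    # from the outputs alone, without any stored activations.
    x2 = [a - b for a, b in zip(y2, g(y1))]
    x1 = [a - b for a, b in zip(y1, f(x2))]
    return x1, x2

x1, x2 = [0.3, -1.2], [0.7, 2.0]
y1, y2 = forward(x1, x2)
r1, r2 = inverse(y1, y2)

# Largest absolute difference between original and reconstructed inputs.
max_error = max(abs(a - b) for a, b in zip(x1 + x2, r1 + r2))
print(max_error < 1e-9)
```

In a training framework, the backward pass would call something like `inverse` to rebuild `x1` and `x2` and then recompute `f` and `g` to get gradients. That trades extra computation for the memory saved, and the reconstruction is exact only up to floating-point rounding.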
