Something about model.py: is something wrong between the encoder and decoder? #163

glorymu · 2020-09-11T09:01:50Z

model.py, in train() (lines 140-141 of the file):
    memory, sents1, src_masks = self.encode(xs)
    logits, preds, y, sents2 = self.decode(ys, memory, src_masks)

We know that memory is the output of the encoder's last block, but the author sends it directly into the decoder,
so every block in the decoder uses that same final memory as K and V.
Obviously this is wrong: we should collect every encoder block's output in a list, then send each one to the corresponding decoder block as
its memory.
Friends, can anyone tell me, am I right?

GuoshenLi · 2021-07-18T04:03:40Z

Omg, you are totally wrong... the author is right.

glorymu · 2021-07-25T09:45:07Z

Yes, I know. I later tried both approaches: the author's gives somewhat better results, while mine runs somewhat faster.
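
For anyone comparing the two wirings discussed in this thread, here is a toy sketch in plain Python. It is not the repository's code: the blocks are stand-in labels rather than real attention layers, and every name in it (`run_encoder`, `run_decoder`, `shared_memory_wiring`, `per_block_wiring`) is hypothetical. It only records which encoder output each decoder block would receive as K and V. The shared-memory version corresponds to passing a single `memory` into `decode`, as in the two lines quoted above, and it matches the original Transformer design, where the encoder-decoder attention in every decoder block takes its keys and values from the final encoder output.

```python
# Toy sketch: which encoder output does each decoder block attend to?
# Labels stand in for tensors; no real attention is computed here.

def run_encoder(num_blocks):
    """Return a label for the output of every encoder block, in order."""
    return [f"enc_out_{i + 1}" for i in range(num_blocks)]


def run_decoder(memories):
    """Run one stand-in decoder block per entry in `memories`.

    Returns the list of memories the blocks used as K and V, in block order.
    """
    used_as_kv = []
    for memory in memories:
        # A real block would do: masked self-attention, then attention
        # with queries from the decoder and keys/values from `memory`.
        used_as_kv.append(memory)
    return used_as_kv


def shared_memory_wiring(num_blocks):
    """Every decoder block reads the LAST encoder block's output."""
    enc_outputs = run_encoder(num_blocks)
    memory = enc_outputs[-1]
    return run_decoder([memory] * num_blocks)


def per_block_wiring(num_blocks):
    """Decoder block i reads encoder block i's output (the proposed variant)."""
    enc_outputs = run_encoder(num_blocks)
    return run_decoder(enc_outputs)


if __name__ == "__main__":
    print("shared:   ", shared_memory_wiring(3))
    print("per-block:", per_block_wiring(3))
```

With the shared wiring, the first decoder block already sees the fully processed source representation. With the per-block wiring, the early decoder blocks only see the shallow encoder outputs.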

