'Incorporating Copying Mechanism in Sequence-to-Sequence Learning'
This paper proposes COPYNET, a sequence-to-sequence model that incorporates an important 'copy mechanism' into the decoding process.
In human language communication there are many situations where we 'copy' parts of the input directly into our response, for example echoing a name or an entity mentioned by the other speaker in a dialogue.
To make a machine generate such replies, two things are needed: deciding which segments of the input should be copied, and placing them properly in the output.
Popular approaches include the basic seq2seq model and seq2seq with an attention mechanism. COPYNET is also an encoder-decoder model, but it takes a different strategy among neural-network-based models: RNN generation with attention relies more on 'understanding' the input, while copying demands high 'literal fidelity' to it.
COPYNET makes three main changes in the decoder. Prediction: the output distribution is a mixture of two probabilistic modes, a generate mode and a copy mode, so the model can copy the proper subsequence from the source and even produce out-of-vocabulary (OOV) words.
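As a rough sketch of this mixture (not the authors' code; the function name, variable names, and shapes here are assumptions), the two sets of unnormalized scores share a single normalizer, and the copy scores are scattered onto an extended vocabulary that also covers source-only OOV words:

```python
import numpy as np

def copynet_output_dist(gen_scores, copy_scores, src_ids, extended_vocab_size):
    """Mix generate-mode and copy-mode scores into one distribution over
    an extended vocabulary (known words plus source-only OOV words).

    gen_scores  : (V,) unnormalized generate-mode scores over the vocabulary
    copy_scores : (T,) unnormalized copy-mode scores over source positions
    src_ids     : (T,) extended-vocabulary id of each source token
    """
    exp_gen = np.exp(gen_scores)
    exp_copy = np.exp(copy_scores)
    Z = exp_gen.sum() + exp_copy.sum()      # one shared normalizer for both modes

    prob = np.zeros(extended_vocab_size)
    prob[: len(gen_scores)] += exp_gen / Z  # generate mode over the vocabulary
    np.add.at(prob, src_ids, exp_copy / Z)  # copy mode: repeated source tokens add up
    return prob
```

Because copy-mode probability mass attaches to source tokens rather than vocabulary entries, a word that appears only in the source can still receive non-zero probability, which is how OOV words get generated.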
State Update: a selective read is designed for the copy mode, so that the decoder's hidden-state update also carries the location information of the copied source word.
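A minimal sketch of such a selective read, assuming the copy-mode probabilities from the previous step and the encoder states are available (names and shapes are illustrative):

```python
import numpy as np

def selective_read(prev_token_id, src_ids, copy_probs, enc_states):
    """Weight the encoder states at positions where the source token equals
    the previously emitted token y_{t-1}, so location information reaches
    the decoder.

    prev_token_id : id of y_{t-1}
    src_ids       : (T,)   ids of the source tokens
    copy_probs    : (T,)   copy-mode probabilities from the previous step
    enc_states    : (T, d) encoder hidden states (the memory M)
    """
    rho = (src_ids == prev_token_id).astype(float) * copy_probs
    if rho.sum() > 0:                  # y_{t-1} appeared in the source
        rho = rho / rho.sum()          # renormalize over the matching positions
    zeta = rho @ enc_states            # weighted sum of encoder states, shape (d,)
    # The decoder input at step t would then concatenate [embedding(y_{t-1}); zeta].
    return zeta
```

If y_{t-1} was generated rather than copied from the source, rho is all zeros and zeta is a zero vector, so the update falls back to ordinary generation.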
Reading M: when reading the encoder memory M, the model combines content-based addressing (the attentive read) with location-based addressing (the selective read), giving a hybrid of the two.
In the experiments, the model performs very well on tasks such as text summarization.