A stable and effective learning strategy for trainable greedy decoding

Chen, Y; Li, VOK; Cho, K; Bowman, SR

Publisher Website: 10.18653/v1/D18-1035

Appears in Collections:
- Electrical & Electronic Engineering: Conference papers

Conference Paper: A stable and effective learning strategy for trainable greedy decoding

Title	A stable and effective learning strategy for trainable greedy decoding
Authors	Chen, Y; Li, VOK; Cho, K; Bowman, SR
Issue Date	2018
Publisher	Association for Computational Linguistics.
Citation	Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Brussels, Belgium, October 31-November 4, 2018, p. 380-390. DOI: http://dx.doi.org/10.18653/v1/D18-1035
Abstract	Beam search is a widely used approximate search strategy for neural network decoders, and it generally outperforms simple greedy decoding on tasks like machine translation. However, this improvement comes at substantial computational cost. In this paper, we propose a flexible new method that allows us to reap nearly the full benefits of beam search with nearly no additional computational cost. The method revolves around a small neural network actor that is trained to observe and manipulate the hidden state of a previously-trained decoder. To train this actor network, we introduce the use of a pseudo-parallel corpus built using the output of beam search on a base model, ranked by a target quality metric like BLEU. Our method is inspired by earlier work on this problem, but requires no reinforcement learning, and can be trained reliably on a range of models. Experiments on three parallel corpora and three architectures show that the method yields substantial improvements in translation quality and speed over each base system.
Persistent Identifier	http://hdl.handle.net/10722/278333
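
The pseudo-parallel corpus described in the abstract can be illustrated with a minimal sketch. It assumes that each source sentence already has a list of beam-search candidates from the base model and that candidates are ranked against a reference translation. The names `unigram_f1` and `build_pseudo_parallel` and the toy data are hypothetical, and the unigram F1 score is only a simple stand-in for a metric like BLEU, not the paper's implementation.

```python
from collections import Counter


def unigram_f1(hypothesis, reference):
    """Toy quality metric (an assumed stand-in for BLEU): unigram F1 overlap."""
    hyp, ref = hypothesis.split(), reference.split()
    if not hyp or not ref:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(hyp) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(hyp)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def build_pseudo_parallel(sources, beam_outputs, references, metric=unigram_f1):
    """Pair each source with the beam candidate that scores highest on `metric`."""
    corpus = []
    for src, candidates, ref in zip(sources, beam_outputs, references):
        best = max(candidates, key=lambda cand: metric(cand, ref))
        corpus.append((src, best))
    return corpus


# Hypothetical toy data: one source sentence with three beam candidates.
sources = ["das ist ein test"]
beam_outputs = [["this is test", "this is a test", "that is a exam"]]
references = ["this is a test"]

pairs = build_pseudo_parallel(sources, beam_outputs, references)
```

In this sketch, the resulting (source, best candidate) pairs would serve as training targets for the actor network, which is how the approach can avoid reinforcement learning; the actor and its training loop are not shown here.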

DC Field	Value	Language
dc.contributor.author	Chen, Y	-
dc.contributor.author	Li, VOK	-
dc.contributor.author	Cho, K	-
dc.contributor.author	Bowman, SR	-
dc.date.accessioned	2019-10-04T08:11:58Z	-
dc.date.available	2019-10-04T08:11:58Z	-
dc.date.issued	2018	-
dc.identifier.citation	Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Brussels, Belgium, October 31-November 4, 2018, p. 380-390	-
dc.identifier.uri	http://hdl.handle.net/10722/278333	-
dc.description.abstract	Beam search is a widely used approximate search strategy for neural network decoders, and it generally outperforms simple greedy decoding on tasks like machine translation. However, this improvement comes at substantial computational cost. In this paper, we propose a flexible new method that allows us to reap nearly the full benefits of beam search with nearly no additional computational cost. The method revolves around a small neural network actor that is trained to observe and manipulate the hidden state of a previously-trained decoder. To train this actor network, we introduce the use of a pseudo-parallel corpus built using the output of beam search on a base model, ranked by a target quality metric like BLEU. Our method is inspired by earlier work on this problem, but requires no reinforcement learning, and can be trained reliably on a range of models. Experiments on three parallel corpora and three architectures show that the method yields substantial improvements in translation quality and speed over each base system.	-
dc.language	eng	-
dc.publisher	Association for Computational Linguistics.	-
dc.relation.ispartof	Conference on Empirical Methods in Natural Language Processing (EMNLP) Proceedings	-
dc.title	A stable and effective learning strategy for trainable greedy decoding	-
dc.type	Conference_Paper	-
dc.identifier.email	Li, VOK: vli@eee.hku.hk	-
dc.identifier.authority	Li, VOK=rp00150	-
dc.description.nature	link_to_OA_fulltext	-
dc.identifier.doi	10.18653/v1/D18-1035	-
dc.identifier.hkuros	306535	-
dc.identifier.spage	380	-
dc.identifier.epage	390	-
