File Download

There are no files associated with this item.

Supplementary

Conference Paper: Neural machine translation with Gumbel-Greedy Decoding

TitleNeural machine translation with Gumbel-Greedy Decoding
Authors
KeywordsMachine Translation
Gumbel Softmax
Greedy Decoding
Generator Discriminator
Issue Date2018
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI) Press.
Citation
Proceedings of the Thirty-Second Association for the Advancement of Artificial Intelligence (AAAI) Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, 2-7 February 2018, p. 5125-5132 How to Cite?
AbstractPrevious neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test phase. In this paper, we propose the extit{Gumbel-Greedy Decoding} which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterization, which makes our generative network differentiable and trainable through standard stochastic gradient methods. We empirically demonstrate that our proposed model is effective for generating sequences of discrete words.
DescriptionSession: AAAI18 - NLP and Machine Learning
Persistent Identifierhttp://hdl.handle.net/10722/262422

 

DC FieldValueLanguage
dc.contributor.authorGu, J-
dc.contributor.authorIm, DJ-
dc.contributor.authorLi, VOK-
dc.date.accessioned2018-09-28T04:59:04Z-
dc.date.available2018-09-28T04:59:04Z-
dc.date.issued2018-
dc.identifier.citationProceedings of the Thirty-Second Association for the Advancement of Artificial Intelligence (AAAI) Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, 2-7 February 2018, p. 5125-5132-
dc.identifier.urihttp://hdl.handle.net/10722/262422-
dc.descriptionSession: AAAI18 - NLP and Machine Learning-
dc.description.abstractPrevious neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test phase. In this paper, we propose the extit{Gumbel-Greedy Decoding} which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterization, which makes our generative network differentiable and trainable through standard stochastic gradient methods. We empirically demonstrate that our proposed model is effective for generating sequences of discrete words.-
dc.languageeng-
dc.publisherAssociation for the Advancement of Artificial Intelligence (AAAI) Press. -
dc.relation.ispartofAAAI Conference on Artificial Intelligence, AAAI-18-
dc.subjectMachine Translation-
dc.subjectGumbel Softmax-
dc.subjectGreedy Decoding-
dc.subjectGenerator Discriminator-
dc.titleNeural machine translation with Gumbel-Greedy Decoding-
dc.typeConference_Paper-
dc.identifier.emailLi, VOK: vli@eee.hku.hk-
dc.identifier.authorityLi, VOK=rp00150-
dc.identifier.hkuros292177-
dc.identifier.spage5125-
dc.identifier.epage5132-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats