Conference Paper: Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet

Title: Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet
Authors: He, X; Li, VOK
Keywords: Generation; Applications; Language Models
Issue Date: 2021
Publisher: AAAI Press. The Journal's web site is located at https://aaai.org/Library/AAAI/aaai-library.php
Citation: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-21), Virtual Conference, USA, 2-9 February 2021, v. 35, n. 14, p. 12989-12997
Abstract: Lexically constrained sentence generation allows the incorporation of prior knowledge, such as lexical constraints, into the output. This technique has been applied to machine translation and dialog response generation. Previous work usually used Markov Chain Monte Carlo (MCMC) sampling to generate lexically constrained sentences, but randomly determined the position to be edited and the action to be taken, resulting in many invalid refinements. To overcome this challenge, we used a classifier to instruct the MCMC-based models where and how to refine the candidate sentences. First, we developed two methods to create synthetic data on which the pre-trained model is fine-tuned to obtain a reliable classifier. Next, we proposed a two-step approach, “Predict and Revise”, for constrained sentence generation. During the predict step, we leveraged the classifier to compute the learned prior for the candidate sentence. During the revise step, we resorted to MCMC sampling to revise the candidate sentence by conducting a sampled action at a sampled position drawn from the learned prior. We compared our proposed models with many strong baselines on two tasks: generating sentences with lexical constraints and text infilling. Experimental results demonstrate that our proposed model performs much better than previous work in terms of sentence fluency and diversity. Our code, pre-trained models, and Appendix are available at https://github.com/NLPCode/MCMCXLNet.
Description: AAAI-21 Technical Tracks 14 / AAAI Technical Track on Speech and Natural Language Processing I
Persistent Identifier: http://hdl.handle.net/10722/305497
ISSN: 2159-5399
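
The abstract above describes a two-step, classifier-guided MCMC procedure. As a rough illustration only, the Python sketch below shows one way such a loop could look; classifier, lm_score, and apply_action are hypothetical placeholders, not names from the paper, and the authors' actual implementation lives at https://github.com/NLPCode/MCMCXLNet.

    import math
    import random

    def predict_and_revise(tokens, classifier, lm_score, apply_action, steps=100):
        """Refine a candidate sentence (a list of tokens) with classifier-guided MCMC.

        Hypothetical interfaces, assumed for illustration:
          classifier(tokens)  -> {(position, action): probability}, the learned
              prior over where to edit and which action ("replace", "insert",
              "delete") to take.
          lm_score(tokens)    -> log-probability of the sentence under a language model.
          apply_action(tokens, position, action) -> edited copy of the token list.
        """
        current, current_score = list(tokens), lm_score(tokens)
        for _ in range(steps):
            # Predict step: draw the edit position and action from the learned
            # prior rather than uniformly at random.
            prior = classifier(current)
            moves = list(prior)
            pos, action = random.choices(moves, weights=[prior[m] for m in moves])[0]

            # Revise step: propose the sampled edit and accept or reject it with
            # a Metropolis-style test on the language-model score. (The paper's
            # actual acceptance ratio also involves the proposal distribution;
            # that correction is omitted here for brevity.)
            proposal = apply_action(current, pos, action)
            proposal_score = lm_score(proposal)
            if random.random() < math.exp(min(0.0, proposal_score - current_score)):
                current, current_score = proposal, proposal_score
        return current

Sampling the position and action from a learned prior, rather than uniformly, is what the abstract credits with reducing invalid refinements.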

 

DC Field: Value
dc.contributor.author: He, X
dc.contributor.author: Li, VOK
dc.date.accessioned: 2021-10-20T10:10:14Z
dc.date.available: 2021-10-20T10:10:14Z
dc.date.issued: 2021
dc.identifier.citation: Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI-21), Virtual Conference, USA, 2-9 February 2021, v. 35, n. 14, p. 12989-12997
dc.identifier.issn: 2159-5399
dc.identifier.uri: http://hdl.handle.net/10722/305497
dc.description: AAAI-21 Technical Tracks 14 / AAAI Technical Track on Speech and Natural Language Processing I
dc.description.abstract: Lexically constrained sentence generation allows the incorporation of prior knowledge, such as lexical constraints, into the output. This technique has been applied to machine translation and dialog response generation. Previous work usually used Markov Chain Monte Carlo (MCMC) sampling to generate lexically constrained sentences, but randomly determined the position to be edited and the action to be taken, resulting in many invalid refinements. To overcome this challenge, we used a classifier to instruct the MCMC-based models where and how to refine the candidate sentences. First, we developed two methods to create synthetic data on which the pre-trained model is fine-tuned to obtain a reliable classifier. Next, we proposed a two-step approach, “Predict and Revise”, for constrained sentence generation. During the predict step, we leveraged the classifier to compute the learned prior for the candidate sentence. During the revise step, we resorted to MCMC sampling to revise the candidate sentence by conducting a sampled action at a sampled position drawn from the learned prior. We compared our proposed models with many strong baselines on two tasks: generating sentences with lexical constraints and text infilling. Experimental results demonstrate that our proposed model performs much better than previous work in terms of sentence fluency and diversity. Our code, pre-trained models, and Appendix are available at https://github.com/NLPCode/MCMCXLNet.
dc.language: eng
dc.publisher: AAAI Press. The Journal's web site is located at https://aaai.org/Library/AAAI/aaai-library.php
dc.relation.ispartof: Proceedings of the AAAI Conference on Artificial Intelligence
dc.subject: Generation
dc.subject: Applications
dc.subject: Language Models
dc.title: Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet
dc.type: Conference_Paper
dc.identifier.email: Li, VOK: vli@eee.hku.hk
dc.identifier.authority: Li, VOK=rp00150
dc.identifier.hkuros: 327679
dc.identifier.volume: 35
dc.identifier.issue: 14
dc.identifier.spage: 12989
dc.identifier.epage: 12997
dc.publisher.place: United States
