File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Graphmax for Text Generation

TitleGraphmax for Text Generation
Authors
Issue Date27-Nov-2023
PublisherAI Access Foundation
Citation
Journal of Artificial Intelligence Research, 2023, v. 78, p. 823-848 How to Cite?
AbstractIn text generation, a large language model (LM) makes a choice of each new word based only on the former selection of its context using the softmax function. Nevertheless, the link statistics information of concurrent words based on a scene-specific corpus is valuable in choosing the next word, which can help to ensure the topic of the generated text to be aligned with the current task. To fully explore the co-occurrence information, we propose a graphmax function for task-specific text generation. Using the graph-based regularization, graphmax enables the final word choice to be determined by both the global knowledge from the LM and the local knowledge from the scene-specific corpus. The traditional softmax function is regularized with a graph total variation (GTV) term, which incorporates the local knowledge into the LM and encourages the model to consider the statistical relationships between words in a scene-specific corpus. The proposed graphmax is versatile and can be readily plugged into any large pre-trained LM for text generation and machine translation. Through extensive experiments, we demonstrate that the new GTV-based regularization can improve performances in various natural language processing (NLP) tasks in comparison with existing methods. Moreover, through human experiments, we observe that participants can easily distinguish the text generated by graphmax or softmax.
Persistent Identifierhttp://hdl.handle.net/10722/347132
ISSN
2023 Impact Factor: 4.5
2023 SCImago Journal Rankings: 1.614

 

DC FieldValueLanguage
dc.contributor.authorLiu, Bin-
dc.contributor.authorYin, Guosheng-
dc.date.accessioned2024-09-18T00:30:32Z-
dc.date.available2024-09-18T00:30:32Z-
dc.date.issued2023-11-27-
dc.identifier.citationJournal of Artificial Intelligence Research, 2023, v. 78, p. 823-848-
dc.identifier.issn1076-9757-
dc.identifier.urihttp://hdl.handle.net/10722/347132-
dc.description.abstractIn text generation, a large language model (LM) makes a choice of each new word based only on the former selection of its context using the softmax function. Nevertheless, the link statistics information of concurrent words based on a scene-specific corpus is valuable in choosing the next word, which can help to ensure the topic of the generated text to be aligned with the current task. To fully explore the co-occurrence information, we propose a graphmax function for task-specific text generation. Using the graph-based regularization, graphmax enables the final word choice to be determined by both the global knowledge from the LM and the local knowledge from the scene-specific corpus. The traditional softmax function is regularized with a graph total variation (GTV) term, which incorporates the local knowledge into the LM and encourages the model to consider the statistical relationships between words in a scene-specific corpus. The proposed graphmax is versatile and can be readily plugged into any large pre-trained LM for text generation and machine translation. Through extensive experiments, we demonstrate that the new GTV-based regularization can improve performances in various natural language processing (NLP) tasks in comparison with existing methods. Moreover, through human experiments, we observe that participants can easily distinguish the text generated by graphmax or softmax.-
dc.languageeng-
dc.publisherAI Access Foundation-
dc.relation.ispartofJournal of Artificial Intelligence Research-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleGraphmax for Text Generation-
dc.typeArticle-
dc.identifier.doi10.1613/JAIR.1.15280-
dc.identifier.scopuseid_2-s2.0-85179132521-
dc.identifier.volume78-
dc.identifier.spage823-
dc.identifier.epage848-
dc.identifier.eissn1943-5037-
dc.identifier.issnl1076-9757-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats