File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: A la carte embedding: Cheap but effective induction of semantic feature vectors

TitleA la carte embedding: Cheap but effective induction of semantic feature vectors
Authors
Issue Date2018
Citation
ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 2018, v. 1, p. 12-22 How to Cite?
AbstractMotivations like domain adaptation, transfer learning, and feature learning have fueled interest in inducing embeddings for rare or unseen words, n-grams, synsets, and other textual features. This paper introduces à la carte embedding, a simple and general alternative to the usual word2vec-based approaches for building such representations that is based upon recent theoretical results for GloVe-like embeddings. Our method relies mainly on a linear transformation that is efficiently learnable using pretrained word vectors and linear regression. This transform is applicable “on the fly” in the future when a new text feature or rare word is encountered, even if only a single usage example is available. We introduce a new dataset showing how the à la carte method requires fewer examples of words in context to learn high-quality embeddings and we obtain state-of-the-art results on a nonce task and some unsupervised document classification tasks.
Persistent Identifierhttp://hdl.handle.net/10722/341242

 

DC FieldValueLanguage
dc.contributor.authorKhodak, Mikhail-
dc.contributor.authorSaunshi, Nikunj-
dc.contributor.authorLiang, Yingyu-
dc.contributor.authorMa, Tengyu-
dc.contributor.authorStewart, Brandon-
dc.contributor.authorArora, Sanjeev-
dc.date.accessioned2024-03-13T08:41:17Z-
dc.date.available2024-03-13T08:41:17Z-
dc.date.issued2018-
dc.identifier.citationACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 2018, v. 1, p. 12-22-
dc.identifier.urihttp://hdl.handle.net/10722/341242-
dc.description.abstractMotivations like domain adaptation, transfer learning, and feature learning have fueled interest in inducing embeddings for rare or unseen words, n-grams, synsets, and other textual features. This paper introduces à la carte embedding, a simple and general alternative to the usual word2vec-based approaches for building such representations that is based upon recent theoretical results for GloVe-like embeddings. Our method relies mainly on a linear transformation that is efficiently learnable using pretrained word vectors and linear regression. This transform is applicable “on the fly” in the future when a new text feature or rare word is encountered, even if only a single usage example is available. We introduce a new dataset showing how the à la carte method requires fewer examples of words in context to learn high-quality embeddings and we obtain state-of-the-art results on a nonce task and some unsupervised document classification tasks.-
dc.languageeng-
dc.relation.ispartofACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)-
dc.titleA la carte embedding: Cheap but effective induction of semantic feature vectors-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.18653/v1/p18-1002-
dc.identifier.scopuseid_2-s2.0-85063088505-
dc.identifier.volume1-
dc.identifier.spage12-
dc.identifier.epage22-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats