Conference Paper: Domain adapted word embeddings for improved sentiment classification

Title: Domain adapted word embeddings for improved sentiment classification
Authors: Sarma, Prathusha K.; Liang, Yingyu; Sethares, William A.
Issue Date: 2018
Citation: ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 2018, v. 2, p. 37-42
Abstract: Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA. Evaluation results on sentiment classification tasks show that the DA embeddings substantially outperform both generic and DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification.
Persistent Identifier: http://hdl.handle.net/10722/341243
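
The abstract describes aligning generic and domain-specific vectors for the same words with CCA. The following is a minimal sketch of that idea using NumPy on synthetic data, not the authors' implementation. The function names `inv_sqrt` and `cca_domain_adapt`, the regularization term `reg`, and the equal-weight average of the two projected views are assumptions made for illustration; this record does not specify how the aligned vectors are combined.

```python
import numpy as np


def inv_sqrt(c):
    """Inverse matrix square root of a symmetric positive-definite matrix."""
    vals, vecs = np.linalg.eigh(c)
    return vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T


def cca_domain_adapt(w_generic, w_ds, k, reg=1e-8):
    """Align two embedding matrices (rows = the same words) with linear CCA.

    Returns the averaged k-dimensional projections and the top-k
    canonical correlations. The 0.5/0.5 average is an illustrative
    assumption, not something stated in the record above.
    """
    # Center each view.
    x = w_generic - w_generic.mean(axis=0)
    y = w_ds - w_ds.mean(axis=0)
    n = x.shape[0]

    # Regularized covariance and cross-covariance matrices.
    cxx = x.T @ x / (n - 1) + reg * np.eye(x.shape[1])
    cyy = y.T @ y / (n - 1) + reg * np.eye(y.shape[1])
    cxy = x.T @ y / (n - 1)

    # Whiten both views, then take the SVD of the whitened cross-covariance.
    kx, ky = inv_sqrt(cxx), inv_sqrt(cyy)
    u, s, vt = np.linalg.svd(kx @ cxy @ ky)

    # Project each view onto its top-k canonical directions.
    x_proj = x @ kx @ u[:, :k]
    y_proj = y @ ky @ vt.T[:, :k]
    return 0.5 * (x_proj + y_proj), s[:k]


# Synthetic stand-ins for real embeddings: 200 shared words,
# 6-dimensional "generic" and 4-dimensional "domain-specific" vectors.
rng = np.random.default_rng(0)
shared = rng.normal(size=(200, 3))
generic = shared @ rng.normal(size=(3, 6)) + 0.1 * rng.normal(size=(200, 6))
ds = shared @ rng.normal(size=(3, 4)) + 0.1 * rng.normal(size=(200, 4))

da, corr = cca_domain_adapt(generic, ds, k=3)
print(da.shape, corr)
```

With real data, the rows of both matrices would be restricted to the vocabulary shared by the generic and domain-specific embeddings, and `k` can be at most the smaller of the two embedding dimensions.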

 

DC Field: Value
dc.contributor.author: Sarma, Prathusha K.
dc.contributor.author: Liang, Yingyu
dc.contributor.author: Sethares, William A.
dc.date.accessioned: 2024-03-13T08:41:17Z
dc.date.available: 2024-03-13T08:41:17Z
dc.date.issued: 2018
dc.identifier.citation: ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 2018, v. 2, p. 37-42
dc.identifier.uri: http://hdl.handle.net/10722/341243
dc.description.abstract: Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA. Evaluation results on sentiment classification tasks show that the DA embeddings substantially outperform both generic and DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification.
dc.language: eng
dc.relation.ispartof: ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
dc.title: Domain adapted word embeddings for improved sentiment classification
dc.type: Conference_Paper
dc.description.nature: link_to_subscribed_fulltext
dc.identifier.scopus: eid_2-s2.0-85063150754
dc.identifier.volume: 2
dc.identifier.spage: 37
dc.identifier.epage: 42
