Conference Paper: Domain Adapted Word Embeddings for Improved Sentiment Classification

Title: Domain Adapted Word Embeddings for Improved Sentiment Classification
Authors: Sarma, Prathusha K.; Liang, Yingyu; Sethares, William A.
Issue Date: 2018
Citation: Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2018, p. 51-59
Abstract: Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by first aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA (KCCA) and then combining them via convex optimization. Results from evaluation on sentiment classification tasks show that the DA embeddings substantially outperform both generic and DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification.
Persistent Identifier: http://hdl.handle.net/10722/341283
ISSN: 0736-587X
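
For readers who want to experiment with the construction described in the abstract, below is a minimal sketch of the alignment-and-combination idea. It uses scikit-learn's CCA for the alignment step and a fixed convex weight `alpha` as a stand-in for the paper's convex-optimization step; the function name, inputs, and equal-weight default are illustrative assumptions, not the authors' released implementation.

```python
# Sketch: Domain Adapted (DA) embeddings by aligning generic and Domain
# Specific (DS) word vectors with CCA, then taking a convex combination.
# Assumptions (not from the paper's code): scikit-learn CCA as the aligner,
# and a fixed weight `alpha` in place of the paper's convex optimization.
import numpy as np
from sklearn.cross_decomposition import CCA

def domain_adapted_embeddings(generic_vecs, ds_vecs, n_components=100, alpha=0.5):
    """generic_vecs, ds_vecs: dicts mapping each word to its embedding vector."""
    vocab = sorted(set(generic_vecs) & set(ds_vecs))       # words present in both spaces
    X = np.vstack([generic_vecs[w] for w in vocab])        # generic embedding matrix
    Y = np.vstack([ds_vecs[w] for w in vocab])             # domain-specific embedding matrix

    # n_components must not exceed min(len(vocab), X.shape[1], Y.shape[1]).
    cca = CCA(n_components=n_components, max_iter=1000)
    X_c, Y_c = cca.fit_transform(X, Y)                     # aligned (projected) views

    # Convex combination of the two aligned views yields the DA embedding.
    da = alpha * X_c + (1.0 - alpha) * Y_c
    return {w: da[i] for i, w in enumerate(vocab)}
```

A nonlinear variant would replace the CCA projection with Kernel CCA, as the paper also considers; the combination step is unchanged.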


DC Field: Value
dc.contributor.author: Sarma, Prathusha K.
dc.contributor.author: Liang, Yingyu
dc.contributor.author: Sethares, William A.
dc.date.accessioned: 2024-03-13T08:41:36Z
dc.date.available: 2024-03-13T08:41:36Z
dc.date.issued: 2018
dc.identifier.citation: Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2018, p. 51-59
dc.identifier.issn: 0736-587X
dc.identifier.uri: http://hdl.handle.net/10722/341283
dc.description.abstract: Generic word embeddings are trained on large-scale generic corpora; Domain Specific (DS) word embeddings are trained only on data from a domain of interest. This paper proposes a method to combine the breadth of generic embeddings with the specificity of domain specific embeddings. The resulting embeddings, called Domain Adapted (DA) word embeddings, are formed by first aligning corresponding word vectors using Canonical Correlation Analysis (CCA) or the related nonlinear Kernel CCA (KCCA) and then combining them via convex optimization. Results from evaluation on sentiment classification tasks show that the DA embeddings substantially outperform both generic and DS embeddings when used as input features to standard or state-of-the-art sentence encoding algorithms for classification.
dc.language: eng
dc.relation.ispartof: Proceedings of the Annual Meeting of the Association for Computational Linguistics
dc.title: Domain Adapted Word Embeddings for Improved Sentiment Classification
dc.type: Conference_Paper
dc.description.nature: link_to_subscribed_fulltext
dc.identifier.scopus: eid_2-s2.0-85089289412
dc.identifier.spage: 51
dc.identifier.epage: 59
