File Download

There are no files associated with this item.

Supplementary

Article: A deceptive reviews detection model: Separated training of multi-feature learning and classification

TitleA deceptive reviews detection model: Separated training of multi-feature learning and classification
Authors
Issue Date2022
Citation
Expert Systems with Applications, 2022, v. 187, p. 115977 How to Cite?
AbstractThe increasing online reviews play an essential role in the e-commerce platform, which profoundly affects the purchase decisions of consumers. However, rampant dishonest sellers manipulate other buyers or robots to post deceptive reviews for profit. Recently, the detection of deceptive reviews has attracted general research attention, which mainly comprises two directions, traditional methods based on statistics and intelligent methods based on neural networks. These methods use a single feature or multiple features for classifier design. To make full use of different features for better feature representation of detecting deceptive reviews, this paper proposes a new feature fusion strategy and verifies its performance by comparing it with other feature fusion strategies. First, we utilize three independent models for feature extraction: the TextCNN, the Bidirectional Gated Recurrent Unit (GRU), and the Self-Attention are used to learn local semantic features, temporal semantic features, and weighted semantic features of reviews, respectively. Secondly, after obtaining different feature representations from the fully connected layers of these three models, we concatenate them together to form the final documental representation. Finally, we use a full connection layer and the sigmoid function to further learn and complete deceptive review detection. Experiments on three balanced and unbalanced in-domain small datasets (hotel, restaurant, doctor) and mixed-domain datasets show that our model is superior to baselines. Experiments on large-scale data with various imbalanced proportions verify the effectiveness of our method. We also analyze the results of different datasets from the perspective of part of speech to improve the model's interpretability.
Persistent Identifierhttp://hdl.handle.net/10722/307938

 

DC FieldValueLanguage
dc.contributor.authorCao, N-
dc.contributor.authorJi, S-
dc.contributor.authorChiu, KWD-
dc.contributor.authorGong, M-
dc.date.accessioned2021-11-12T13:40:07Z-
dc.date.available2021-11-12T13:40:07Z-
dc.date.issued2022-
dc.identifier.citationExpert Systems with Applications, 2022, v. 187, p. 115977-
dc.identifier.urihttp://hdl.handle.net/10722/307938-
dc.description.abstractThe increasing online reviews play an essential role in the e-commerce platform, which profoundly affects the purchase decisions of consumers. However, rampant dishonest sellers manipulate other buyers or robots to post deceptive reviews for profit. Recently, the detection of deceptive reviews has attracted general research attention, which mainly comprises two directions, traditional methods based on statistics and intelligent methods based on neural networks. These methods use a single feature or multiple features for classifier design. To make full use of different features for better feature representation of detecting deceptive reviews, this paper proposes a new feature fusion strategy and verifies its performance by comparing it with other feature fusion strategies. First, we utilize three independent models for feature extraction: the TextCNN, the Bidirectional Gated Recurrent Unit (GRU), and the Self-Attention are used to learn local semantic features, temporal semantic features, and weighted semantic features of reviews, respectively. Secondly, after obtaining different feature representations from the fully connected layers of these three models, we concatenate them together to form the final documental representation. Finally, we use a full connection layer and the sigmoid function to further learn and complete deceptive review detection. Experiments on three balanced and unbalanced in-domain small datasets (hotel, restaurant, doctor) and mixed-domain datasets show that our model is superior to baselines. Experiments on large-scale data with various imbalanced proportions verify the effectiveness of our method. We also analyze the results of different datasets from the perspective of part of speech to improve the model's interpretability.-
dc.languageeng-
dc.relation.ispartofExpert Systems with Applications-
dc.titleA deceptive reviews detection model: Separated training of multi-feature learning and classification-
dc.typeArticle-
dc.identifier.emailChiu, KWD: dchiu88@hku.hk-
dc.identifier.hkuros329525-
dc.identifier.volume187-
dc.identifier.spage115977-
dc.identifier.epage115977-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats