File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Book Chapter: Stylometry and mathematical study of authorship

TitleStylometry and mathematical study of authorship
Authors
KeywordsFeature ranking
Feature subset
Function character
Sentence length
Training data
Issue Date2016
Citation
Applied and Numerical Harmonic Analysis, 2016, n. 9783319278711, p. 281-300 How to Cite?
AbstractInspired by various authorship attribution problems in the history of literature and the application of machine learning in the study of literary stylometry, we develop a rigorous new method for the mathematical analysis of authorship by testing for a so-called chrono-divide in writing styles. Our method incorporates some of the latest advances in the study of authorship attribution, particularly techniques from support vector machines. By introducing the notion of relative frequency as a feature ranking metric our method proves to be highly effective and robust. Applying our method to the Cheng-Gao version of Dream of the Red Chamber has led to convincing if not irrefutable evidence that the first 80 chapters and the last 40 chapters of the book were written by two different authors. Applying our method to the novel Micro, we are able to confirm the existence of the chrono-divide and identify its location so that we can differentiate the contribution of Michael Crichton and Richard Preston. We have also tested our method to the other three Great Classical Novels in Chinese. As expected no chrono-divides have been found. This provides further evidence of the robustness of our method.
Persistent Identifierhttp://hdl.handle.net/10722/363284
ISSN
2020 SCImago Journal Rankings: 0.125

 

DC FieldValueLanguage
dc.contributor.authorHu, Xianfeng-
dc.contributor.authorWang, Yang-
dc.contributor.authorWu, Qiang-
dc.date.accessioned2025-10-10T07:45:49Z-
dc.date.available2025-10-10T07:45:49Z-
dc.date.issued2016-
dc.identifier.citationApplied and Numerical Harmonic Analysis, 2016, n. 9783319278711, p. 281-300-
dc.identifier.issn2296-5009-
dc.identifier.urihttp://hdl.handle.net/10722/363284-
dc.description.abstractInspired by various authorship attribution problems in the history of literature and the application of machine learning in the study of literary stylometry, we develop a rigorous new method for the mathematical analysis of authorship by testing for a so-called chrono-divide in writing styles. Our method incorporates some of the latest advances in the study of authorship attribution, particularly techniques from support vector machines. By introducing the notion of relative frequency as a feature ranking metric our method proves to be highly effective and robust. Applying our method to the Cheng-Gao version of Dream of the Red Chamber has led to convincing if not irrefutable evidence that the first 80 chapters and the last 40 chapters of the book were written by two different authors. Applying our method to the novel Micro, we are able to confirm the existence of the chrono-divide and identify its location so that we can differentiate the contribution of Michael Crichton and Richard Preston. We have also tested our method to the other three Great Classical Novels in Chinese. As expected no chrono-divides have been found. This provides further evidence of the robustness of our method.-
dc.languageeng-
dc.relation.ispartofApplied and Numerical Harmonic Analysis-
dc.subjectFeature ranking-
dc.subjectFeature subset-
dc.subjectFunction character-
dc.subjectSentence length-
dc.subjectTraining data-
dc.titleStylometry and mathematical study of authorship-
dc.typeBook_Chapter-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1007/978-3-319-27873-5_8-
dc.identifier.scopuseid_2-s2.0-85047225259-
dc.identifier.issue9783319278711-
dc.identifier.spage281-
dc.identifier.epage300-
dc.identifier.eissn2296-5017-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats