postgraduate thesis: Automatic identification of hot topics and user clusters from online discussion forums
Title | Automatic identification of hot topics and user clusters from online discussion forums |
---|---|
Authors | Lai, Yiu-ming (黎耀明) |
Advisors | Chow, KP; Hui, CK |
Issue Date | 2011 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Lai, Y. [黎耀明]. (2011). Automatic identification of hot topics and user clusters from online discussion forums. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b4784995 |
Abstract | With the advancement of Internet technology and changes in the mode of communication, much first-hand news is discussed in Internet forums well before it is reported in traditional mass media. These forums also provide an effective channel for illegal activities such as the dissemination of copyrighted movies, threatening messages and online gambling. Law enforcement agencies are looking for solutions to monitor these discussion forums for possible criminal activities and to download suspected postings as evidence for investigation. The volume of postings is huge: for 10 popular forums in Hong Kong, we found that there are 300,000 new messages every day. In this thesis, we propose an automatic system that tackles this problem. Our proposed system downloads postings from selected discussion forums continuously and employs data mining techniques to identify hot topics and to cluster authors into different groups using word-based user profiles. Using these data, we try to locate useful trends and detect crime. The results are then discussed, including the advantages and limitations of the different approaches, and the thesis concludes with ways to solve these problems and directions for future research. |
Degree | Master of Philosophy |
Subject | Data mining. Cluster analysis. |
Dept/Program | Computer Science |
Persistent Identifier | http://hdl.handle.net/10722/174553 |
HKU Library Item ID | b4784995 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Chow, KP | - |
dc.contributor.advisor | Hui, CK | - |
dc.contributor.author | Lai, Yiu-ming. | - |
dc.contributor.author | 黎耀明. | - |
dc.date.issued | 2011 | - |
dc.identifier.citation | Lai, Y. [黎耀明]. (2011). Automatic identification of hot topics and user clusters from online discussion forums. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b4784995 | - |
dc.identifier.uri | http://hdl.handle.net/10722/174553 | - |
dc.description.abstract | With the advancement of Internet technology and changes in the mode of communication, much first-hand news is discussed in Internet forums well before it is reported in traditional mass media. These forums also provide an effective channel for illegal activities such as the dissemination of copyrighted movies, threatening messages and online gambling. Law enforcement agencies are looking for solutions to monitor these discussion forums for possible criminal activities and to download suspected postings as evidence for investigation. The volume of postings is huge: for 10 popular forums in Hong Kong, we found that there are 300,000 new messages every day. In this thesis, we propose an automatic system that tackles this problem. Our proposed system downloads postings from selected discussion forums continuously and employs data mining techniques to identify hot topics and to cluster authors into different groups using word-based user profiles. Using these data, we try to locate useful trends and detect crime. The results are then discussed, including the advantages and limitations of the different approaches, and the thesis concludes with ways to solve these problems and directions for future research. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.source.uri | http://hub.hku.hk/bib/B47849952 | - |
dc.subject.lcsh | Data mining. | - |
dc.subject.lcsh | Cluster analysis. | - |
dc.title | Automatic identification of hot topics and user clusters from online discussion forums | - |
dc.type | PG_Thesis | - |
dc.identifier.hkul | b4784995 | - |
dc.description.thesisname | Master of Philosophy | - |
dc.description.thesislevel | Master | - |
dc.description.thesisdiscipline | Computer Science | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.5353/th_b4784995 | - |
dc.date.hkucongregation | 2012 | - |
dc.identifier.mmsid | 991033487679703414 | - |