File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICDE.2011.5767863
- Scopus: eid_2-s2.0-79957801787
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: CubeLSI: an effective and efficient method for searching resources in social tagging systems
Title | CubeLSI: an effective and efficient method for searching resources in social tagging systems |
---|---|
Authors | |
Keywords | Efficient method Feature space IR techniques Key word matching Resource retrieval |
Issue Date | 2011 |
Publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000178 |
Citation | The IEEE 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In International Conference on Data Engineering Proceedings, 2011, p. 27-38 How to Cite? |
Abstract | In a social tagging system, resources (such as photos, video and web pages) are associated with tags. These tags allow the resources to be effectively searched through tag-based keyword matching using traditional IR techniques. We note that in many such systems, tags of a resource are often assigned by a diverse audience of causal users (taggers). This leads to two issues that gravely affect the effectiveness of resource retrieval: (1) Noise: tags are picked from an uncontrolled vocabulary and are assigned by untrained taggers. The tags are thus noisy features in resource retrieval. (2) A multitude of aspects: different taggers focus on different aspects of a resource. Representing a resource using a flattened bag of tags ignores this important diversity of taggers. To improve the effectiveness of resource retrieval in social tagging systems, we propose CubeLSI a technique that extends traditional LSI to include taggers as another dimension of feature space of resources. We compare CubeLSI against a number of other tag-based retrieval models and show that CubeLSI significantly outperforms the other models in terms of retrieval accuracy. We also prove two interesting theorems that allow CubeLSI to be very efficiently computed despite the much enlarged feature space it employs. © 2011 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/135695 |
ISBN | |
ISSN | 2023 SCImago Journal Rankings: 1.306 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Bi, B | en_HK |
dc.contributor.author | Lee, SD | en_HK |
dc.contributor.author | Kao, B | en_HK |
dc.contributor.author | Cheng, R | en_HK |
dc.date.accessioned | 2011-07-27T01:39:53Z | - |
dc.date.available | 2011-07-27T01:39:53Z | - |
dc.date.issued | 2011 | en_HK |
dc.identifier.citation | The IEEE 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In International Conference on Data Engineering Proceedings, 2011, p. 27-38 | en_HK |
dc.identifier.isbn | 978-1-4244-8960-2 | - |
dc.identifier.issn | 1084-4627 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/135695 | - |
dc.description.abstract | In a social tagging system, resources (such as photos, video and web pages) are associated with tags. These tags allow the resources to be effectively searched through tag-based keyword matching using traditional IR techniques. We note that in many such systems, tags of a resource are often assigned by a diverse audience of causal users (taggers). This leads to two issues that gravely affect the effectiveness of resource retrieval: (1) Noise: tags are picked from an uncontrolled vocabulary and are assigned by untrained taggers. The tags are thus noisy features in resource retrieval. (2) A multitude of aspects: different taggers focus on different aspects of a resource. Representing a resource using a flattened bag of tags ignores this important diversity of taggers. To improve the effectiveness of resource retrieval in social tagging systems, we propose CubeLSI a technique that extends traditional LSI to include taggers as another dimension of feature space of resources. We compare CubeLSI against a number of other tag-based retrieval models and show that CubeLSI significantly outperforms the other models in terms of retrieval accuracy. We also prove two interesting theorems that allow CubeLSI to be very efficiently computed despite the much enlarged feature space it employs. © 2011 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000178 | - |
dc.relation.ispartof | International Conference on Data Engineering Proceedings | en_HK |
dc.subject | Efficient method | - |
dc.subject | Feature space | - |
dc.subject | IR techniques | - |
dc.subject | Key word matching | - |
dc.subject | Resource retrieval | - |
dc.title | CubeLSI: an effective and efficient method for searching resources in social tagging systems | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Bi, B: bbi@cs.hku.hk | en_HK |
dc.identifier.email | Lee, SD: sdlee@cs.hku.hk | en_HK |
dc.identifier.email | Kao, B: kao@cs.hku.hk | - |
dc.identifier.email | Cheng, R: ckcheng@cs.hku.hk | - |
dc.identifier.authority | Kao, B=rp00123 | en_HK |
dc.identifier.authority | Cheng, R=rp00074 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/ICDE.2011.5767863 | en_HK |
dc.identifier.scopus | eid_2-s2.0-79957801787 | en_HK |
dc.identifier.hkuros | 186894 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-79957801787&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 27 | en_HK |
dc.identifier.epage | 38 | en_HK |
dc.publisher.place | United States | en_HK |
dc.description.other | The IEEE 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011. In International Conference on Data Engineering Proceedings, 2011, p. 27-38 | - |
dc.identifier.scopusauthorid | Cheng, R=7201955416 | en_HK |
dc.identifier.scopusauthorid | Kao, B=35221592600 | en_HK |
dc.identifier.scopusauthorid | Lee, SD=7601400741 | en_HK |
dc.identifier.scopusauthorid | Bi, B=24558571800 | en_HK |
dc.identifier.issnl | 1084-4627 | - |