File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: A Semantic DOM Approach For Webpage Information Extraction

TitleA Semantic DOM Approach For Webpage Information Extraction
Authors
KeywordsInformation extraction
SDOM
Semi-structured
Issue Date2009
PublisherIEEE.
Citation
International Conference on Management and Service Science (MASS '09), Wuhan, China, 20-22 September 2009, p. 1-5 How to Cite?
AbstractWith the development of electronic technology and e-commerce, technology for Web pages has attracted a lot of research efforts which becomes one of the hottest topics recently. This paper has proposed a semantic DOM (SDOM) approach for information extraction of e-commerce Web pages. With the combination of content and structure information, the precision and recall can achieve a good result which is shown in our experiments on listpage and tablepage data sets.
Persistent Identifierhttp://hdl.handle.net/10722/223762
ISBN

 

DC FieldValueLanguage
dc.contributor.authorFei, Y-
dc.contributor.authorLuo, Z-
dc.contributor.authorXu, Y-
dc.contributor.authorZhang, W-
dc.date.accessioned2016-03-14T08:16:36Z-
dc.date.available2016-03-14T08:16:36Z-
dc.date.issued2009-
dc.identifier.citationInternational Conference on Management and Service Science (MASS '09), Wuhan, China, 20-22 September 2009, p. 1-5-
dc.identifier.isbn978-1-4244-4638-4-
dc.identifier.urihttp://hdl.handle.net/10722/223762-
dc.description.abstractWith the development of electronic technology and e-commerce, technology for Web pages has attracted a lot of research efforts which becomes one of the hottest topics recently. This paper has proposed a semantic DOM (SDOM) approach for information extraction of e-commerce Web pages. With the combination of content and structure information, the precision and recall can achieve a good result which is shown in our experiments on listpage and tablepage data sets.-
dc.languageeng-
dc.publisherIEEE.-
dc.relation.ispartofInternational Conference on Management and Service Science (MASS)-
dc.rights©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.-
dc.subjectInformation extraction-
dc.subjectSDOM-
dc.subjectSemi-structured-
dc.titleA Semantic DOM Approach For Webpage Information Extraction-
dc.typeConference_Paper-
dc.identifier.emailLuo, Z: zwluo@eti.hku.hk-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1109/ICMSS.2009.5302541-
dc.identifier.scopuseid_2-s2.0-73849097404-
dc.identifier.hkuros164908-
dc.identifier.spage1-
dc.identifier.epage5-
dc.publisher.placeWuhan, China-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats