File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases

TitleMULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases
Authors
Issue Date2020
PublisherSpringer.
Citation
21st International Conference on Web Information Systems Engineering (WISE 2020), Amsterdam and Leiden, Netherlands, October 20–24, 2020. In Web Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I, p. 315-327 How to Cite?
AbstractAn open knowledge base (OKB) is a repository of facts, which are typically represented in the form of ⟨subject; relation; object⟩ triples. The problem of canonicalizing OKB triples is to map different names mentioned in the triples that refer to the same entity into a basic canonical form. We propose the algorithm Multi-Level Canonicalization with Embeddings (MULCE) to perform canonicalization. MULCE executes in two steps. The first step performs word-level canonicalization to coarsely group subject names based on their GloVe vectors into semantically similar clusters. The second step performs sentence-level canonicalization to refine the clusters by employing BERT embedding to model relation and object information. Our experimental results show that MULCE outperforms state-of-the-art methods.
Persistent Identifierhttp://hdl.handle.net/10722/320033
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorWu, TH-
dc.contributor.authorKao, CM-
dc.contributor.authorWu, Z-
dc.contributor.authorFeng, F-
dc.contributor.authorSong, Q-
dc.contributor.authorChen, C-
dc.date.accessioned2022-10-14T05:24:13Z-
dc.date.available2022-10-14T05:24:13Z-
dc.date.issued2020-
dc.identifier.citation21st International Conference on Web Information Systems Engineering (WISE 2020), Amsterdam and Leiden, Netherlands, October 20–24, 2020. In Web Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I, p. 315-327-
dc.identifier.urihttp://hdl.handle.net/10722/320033-
dc.description.abstractAn open knowledge base (OKB) is a repository of facts, which are typically represented in the form of ⟨subject; relation; object⟩ triples. The problem of canonicalizing OKB triples is to map different names mentioned in the triples that refer to the same entity into a basic canonical form. We propose the algorithm Multi-Level Canonicalization with Embeddings (MULCE) to perform canonicalization. MULCE executes in two steps. The first step performs word-level canonicalization to coarsely group subject names based on their GloVe vectors into semantically similar clusters. The second step performs sentence-level canonicalization to refine the clusters by employing BERT embedding to model relation and object information. Our experimental results show that MULCE outperforms state-of-the-art methods.-
dc.languageeng-
dc.publisherSpringer.-
dc.relation.ispartofWeb Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I-
dc.rightsThis version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/[insert DOI]-
dc.titleMULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases-
dc.typeConference_Paper-
dc.identifier.emailKao, CM: kao@cs.hku.hk-
dc.identifier.authorityKao, CM=rp00123-
dc.identifier.doi10.1007/978-3-030-62005-9_23-
dc.identifier.hkuros339384-
dc.identifier.spage315-
dc.identifier.epage327-
dc.identifier.isiWOS:000739662100023-
dc.publisher.placeCham, Switzerland-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats