File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/978-3-030-62005-9_23
- WOS: WOS:000739662100023
Supplementary
-
Citations:
- Web of Science: 0
- Appears in Collections:
Conference Paper: MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases
Title | MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases |
---|---|
Authors | |
Issue Date | 2020 |
Publisher | Springer. |
Citation | 21st International Conference on Web Information Systems Engineering (WISE 2020), Amsterdam and Leiden, Netherlands, October 20–24, 2020. In Web Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I, p. 315-327 How to Cite? |
Abstract | An open knowledge base (OKB) is a repository of facts, which are typically represented in the form of ⟨subject; relation; object⟩ triples. The problem of canonicalizing OKB triples is to map different names mentioned in the triples that refer to the same entity into a basic canonical form. We propose the algorithm Multi-Level Canonicalization with Embeddings (MULCE) to perform canonicalization. MULCE executes in two steps. The first step performs word-level canonicalization to coarsely group subject names based on their GloVe vectors into semantically similar clusters. The second step performs sentence-level canonicalization to refine the clusters by employing BERT embedding to model relation and object information. Our experimental results show that MULCE outperforms state-of-the-art methods. |
Persistent Identifier | http://hdl.handle.net/10722/320033 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wu, TH | - |
dc.contributor.author | Kao, CM | - |
dc.contributor.author | Wu, Z | - |
dc.contributor.author | Feng, F | - |
dc.contributor.author | Song, Q | - |
dc.contributor.author | Chen, C | - |
dc.date.accessioned | 2022-10-14T05:24:13Z | - |
dc.date.available | 2022-10-14T05:24:13Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | 21st International Conference on Web Information Systems Engineering (WISE 2020), Amsterdam and Leiden, Netherlands, October 20–24, 2020. In Web Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I, p. 315-327 | - |
dc.identifier.uri | http://hdl.handle.net/10722/320033 | - |
dc.description.abstract | An open knowledge base (OKB) is a repository of facts, which are typically represented in the form of ⟨subject; relation; object⟩ triples. The problem of canonicalizing OKB triples is to map different names mentioned in the triples that refer to the same entity into a basic canonical form. We propose the algorithm Multi-Level Canonicalization with Embeddings (MULCE) to perform canonicalization. MULCE executes in two steps. The first step performs word-level canonicalization to coarsely group subject names based on their GloVe vectors into semantically similar clusters. The second step performs sentence-level canonicalization to refine the clusters by employing BERT embedding to model relation and object information. Our experimental results show that MULCE outperforms state-of-the-art methods. | - |
dc.language | eng | - |
dc.publisher | Springer. | - |
dc.relation.ispartof | Web Information Systems Engineering – WISE 2020: 21st International Conference, Amsterdam, The Netherlands, October 20–24, 2020, Proceedings, Part I | - |
dc.rights | This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/[insert DOI] | - |
dc.title | MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Kao, CM: kao@cs.hku.hk | - |
dc.identifier.authority | Kao, CM=rp00123 | - |
dc.identifier.doi | 10.1007/978-3-030-62005-9_23 | - |
dc.identifier.hkuros | 339384 | - |
dc.identifier.spage | 315 | - |
dc.identifier.epage | 327 | - |
dc.identifier.isi | WOS:000739662100023 | - |
dc.publisher.place | Cham, Switzerland | - |