Conference Paper: Compressed index for dynamic text
Title | Compressed index for dynamic text |
---|---|
Authors | Hon, WK; Lam, TW; Sadakane, K; Sung, WK; Yiu, SM |
Issue Date | 2004 |
Citation | Data Compression Conference Proceedings, 2004, p. 102-111 |
Abstract | This paper investigates how to index a text which is subject to updates. The best solution in the literature is based on suffix tree using $O(n \log n)$ bits of storage, where $n$ is the length of the text. It supports finding all occurrences of a pattern $P$ in $O(|P| + \mathrm{occ})$ time, where $\mathrm{occ}$ is the number of occurrences. Each text update consists of inserting or deleting a substring of length $y$ and can be supported in $O(y + \sqrt{n})$ time. In this paper, we initiate the study of compressed index using only $O(n \log |\Sigma|)$ bits of space, where $\Sigma$ denotes the alphabet. Our solution supports finding all occurrences of a pattern $P$ in $O(|P| \log^{2} n (\log^{\epsilon} n + \log |\Sigma|) + \mathrm{occ} \log^{1+\epsilon} n)$ time, while insertion or deletion of a substring of length $y$ can be done in $O((y + \sqrt{n}) \log^{2+\epsilon} n)$ amortized time, where $0 < \epsilon \le 1$. The core part of our data structure is based on the recent work on Compressed Suffix Trees (CST) and Compressed Suffix Arrays (CSA). |
Persistent Identifier | http://hdl.handle.net/10722/151868 |
ISSN | 1068-0314 (2023 SCImago Journal Rankings: 0.371) |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hon, WK | en_US |
dc.contributor.author | Lam, TW | en_US |
dc.contributor.author | Sadakane, K | en_US |
dc.contributor.author | Sung, WK | en_US |
dc.contributor.author | Yiu, SM | en_US |
dc.date.accessioned | 2012-06-26T06:30:14Z | - |
dc.date.available | 2012-06-26T06:30:14Z | - |
dc.date.issued | 2004 | en_US |
dc.identifier.citation | Data Compression Conference Proceedings, 2004, p. 102-111 | en_US |
dc.identifier.issn | 1068-0314 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/151868 | - |
dc.description.abstract | This paper investigates how to index a text which is subject to updates. The best solution in the literature is based on suffix tree using $O(n \log n)$ bits of storage, where $n$ is the length of the text. It supports finding all occurrences of a pattern $P$ in $O(|P| + \mathrm{occ})$ time, where $\mathrm{occ}$ is the number of occurrences. Each text update consists of inserting or deleting a substring of length $y$ and can be supported in $O(y + \sqrt{n})$ time. In this paper, we initiate the study of compressed index using only $O(n \log |\Sigma|)$ bits of space, where $\Sigma$ denotes the alphabet. Our solution supports finding all occurrences of a pattern $P$ in $O(|P| \log^{2} n (\log^{\epsilon} n + \log |\Sigma|) + \mathrm{occ} \log^{1+\epsilon} n)$ time, while insertion or deletion of a substring of length $y$ can be done in $O((y + \sqrt{n}) \log^{2+\epsilon} n)$ amortized time, where $0 < \epsilon \le 1$. The core part of our data structure is based on the recent work on Compressed Suffix Trees (CST) and Compressed Suffix Arrays (CSA). | en_US |
dc.language | eng | en_US |
dc.relation.ispartof | Data Compression Conference Proceedings | en_US |
dc.title | Compressed index for dynamic text | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Lam, TW:twlam@cs.hku.hk | en_US |
dc.identifier.email | Yiu, SM:smyiu@cs.hku.hk | en_US |
dc.identifier.authority | Lam, TW=rp00135 | en_US |
dc.identifier.authority | Yiu, SM=rp00207 | en_US |
dc.description.nature | link_to_subscribed_fulltext | en_US |
dc.identifier.scopus | eid_2-s2.0-2642533893 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-2642533893&selection=ref&src=s&origin=recordpage | en_US |
dc.identifier.spage | 102 | en_US |
dc.identifier.epage | 111 | en_US |
dc.identifier.scopusauthorid | Hon, WK=7004282818 | en_US |
dc.identifier.scopusauthorid | Lam, TW=7202523165 | en_US |
dc.identifier.scopusauthorid | Sadakane, K=7005716583 | en_US |
dc.identifier.scopusauthorid | Sung, WK=13310059700 | en_US |
dc.identifier.scopusauthorid | Yiu, SM=7003282240 | en_US |
dc.identifier.issnl | 1068-0314 | - |