Abstract
We consider the problem of indexing a given text T[0...n − 1] of n characters over an alphabet set Σ of size σ, in order to answer the position-restricted substring searching queries. The query input consists of a pattern P (of length p) and two indices ℓ and r and the output is the set of all occ ℓ,r occurrences of P in T[ℓ...r]. In this paper, we propose an O(nlogσ)-word space index with O(p + occ ℓ,r loglogn) query time. Our solution is interesting when the alphabet size is small. For example, when the alphabet set is of constant size, we achieve exponential time improvement over the previously best-known linear space index by Navarro and Nekrich [SWAT 2012] with O(p + occ ℓ,r logε n) query time, where ε > 0 is any positive constant. We also study the property matching problem and provide an improved index for handling semi-dynamic (only insertions) properties, where we use position-restricted substring queries as the main technique.
This work is supported in part by US NSF Grant CCF–1017623 (R. Shah and J. S. Vitter) and CCF–1218904 (R. Shah).
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-3-319-02432-5_33
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amir, A., Chencinski, E., Iliopoulos, C.S., Kopelowitz, T., Zhang, H.: Property matching and weighted matching. Theoretical Computer Science 395, 298–310 (2008)
Amir, A., Farach, M., Idury, R.M., La Poutré, J.A., Schäffer, A.A.: Improved Dynamic Dictionary Matching. Information and Computation 119(2), 258–282 (1995)
Bille, P., Gørtz, I.L.: Substring Range Reporting. In: Giancarlo, R., Manzini, G. (eds.) CPM 2011. LNCS, vol. 6661, pp. 299–308. Springer, Heidelberg (2011)
Chan, T.M., Larsen, K.G., Patrascu, M.: Orthogonal range searching on the RAM, revisited. In: SoCG, pp. 1–10 (2011)
Chien, Y.-F., Hon, W.K., Shah, R., Thankachan, S.V., Vitter, J.S.: Geometric BWT: Compressed Text Indexing via Sparse Suffixes and Range Searching. Algorithmica, 1–21 (2013)
Crochemore, M., Iliopoulos, C.S., Kubica, M., Rahman, M.S., Walen, T.: Improved Algorithms for the Range Next Value Problem and Applications. In: STACS, pp. 205–216 (2008)
Gagie, T., Gawrychowski, P.: Linear-Space Substring Range Counting over Polylogarithmic Alphabets. CoRR, arXiv: 1202.3208 (2012)
Golynski, A., Munro, J.I., Rao, S.S.: Rank/Select Operations on Large Alphabets: A Tool for Text Indexing. In: SODA, pp. 368–373 (2006)
Hon, W.K., Patil, M., Shah, R., Thankachan, S.V.: Compressed Property Suffix Tree. In: IEEE Data Compression Conference, pp. 123–132 (2011)
Hon, W.K., Shah, R., Thankachan, S.V., Vitter, J.S.: On position restricted substring searching in succinct space. Journal of Discrete Algorithms (2012); Hon, W.-K., Ku, T.-H., Shah, R., Thankachan, S.V., Vitter, J.S.: Compressed text indexing with wildcards. In: Grossi, R., Sebastiani, F., Silvestri, F. (eds.) SPIRE 2011. LNCS, vol. 7024, pp. 267–277. Springer, Heidelberg (2011)
Juan, M.T., Liu, J.J., Wang, Y.L.: Errata for “Faster index for property matching”. Information Processing Letter 109(18), 1027–1029 (2009)
Kopelowitz, T.: The Property Suffix Tree with Dynamic Properties. In: Amir, A., Parida, L. (eds.) CPM 2010. LNCS, vol. 6129, pp. 63–75. Springer, Heidelberg (2010)
Kopelowitz, T., Lewenstein, M., Porat, E.: Persistency in Suffix Trees with Applications to String Interval Problems. In: Grossi, R., Sebastiani, F., Silvestri, F. (eds.) SPIRE 2011. LNCS, vol. 7024, pp. 67–80. Springer, Heidelberg (2011)
Mäkinen, V., Navarro, G.: Position-Restricted Substring Searching. In: Correa, J.R., Hevia, A., Kiwi, M. (eds.) LATIN 2006. LNCS, vol. 3887, pp. 703–714. Springer, Heidelberg (2006)
Manber, U., Myers, G.: Suffix Arrays: A New Method for On-Line String Searches. SIAM Journal on Computing 22(5), 935–948 (1993)
McCreight, E.M.: A Space-Economical Suffix Tree Construction Algorithm. Journal of the ACM 23(2), 262–272 (1976)
Nekrich, Y., Navarro, G.: Sorted Range Reporting. In: Fomin, F.V., Kaski, P. (eds.) SWAT 2012. LNCS, vol. 7357, pp. 271–282. Springer, Heidelberg (2012)
Weiner, P.: Linear Pattern Matching Algorithms. In: SWAT (1973)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Biswas, S., Ku, TH., Shah, R., Thankachan, S.V. (2013). Position-Restricted Substring Searching over Small Alphabets. In: Kurland, O., Lewenstein, M., Porat, E. (eds) String Processing and Information Retrieval. SPIRE 2013. Lecture Notes in Computer Science, vol 8214. Springer, Cham. https://doi.org/10.1007/978-3-319-02432-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-02432-5_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02431-8
Online ISBN: 978-3-319-02432-5
eBook Packages: Computer ScienceComputer Science (R0)