Representing a bilingual lexicon with suffix trees
Pages 1164 - 1165
Abstract
This paper presents a system based on generalized suffix trees that efficiently implements a set of operations over a bilingual lexicon. Besides the basic operations of adding and removing translations from the lexicon, the system provides two unique query functions that we refer to as monolingual and bilingual coverage. These two functions lay the foundation for higher-level mining operations, such as the identification of translation patterns, which are the subject of ongoing research. Nevertheless, the system presented here is interesting in its own right, both for the novelty of the coverage functions and for the potential of the data structure as a whole. We compare the performance of two implementations, one based on suffix trees and the other on suffix arrays.
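To make the basic operations concrete, the following is a minimal sketch of a bilingual lexicon that supports adding and removing translations and looking up entries by substring. It is not the authors' system: the class and method names (`NaiveSuffixIndex`, `BilingualLexicon`, `translations_containing`) are hypothetical, the index is a naive sorted list of suffixes rebuilt on every update rather than a generalized suffix tree or an efficient suffix array, and the monolingual and bilingual coverage functions are not implemented because the abstract does not define them.

```python
from bisect import bisect_left


class NaiveSuffixIndex:
    """Naive generalized suffix array: a sorted list of (suffix, term) pairs.

    Assumption: terms are short, so rebuilding on every update is acceptable.
    """

    def __init__(self):
        self._terms = set()
        self._suffixes = []

    def add(self, term):
        if term not in self._terms:
            self._terms.add(term)
            self._rebuild()

    def remove(self, term):
        if term in self._terms:
            self._terms.discard(term)
            self._rebuild()

    def _rebuild(self):
        # Every suffix of every term, sorted lexicographically.
        self._suffixes = sorted(
            (term[i:], term) for term in self._terms for i in range(len(term))
        )

    def containing(self, pattern):
        """Return the set of stored terms that contain `pattern` as a substring."""
        if not pattern:
            return set(self._terms)
        # Suffixes that start with `pattern` form one contiguous sorted range.
        start = bisect_left(self._suffixes, (pattern, ""))
        found = set()
        for suffix, term in self._suffixes[start:]:
            if not suffix.startswith(pattern):
                break
            found.add(term)
        return found


class BilingualLexicon:
    """Hypothetical lexicon of (source, target) pairs, indexed on each side."""

    def __init__(self):
        self.pairs = set()
        self.source_index = NaiveSuffixIndex()
        self.target_index = NaiveSuffixIndex()

    def add(self, source, target):
        self.pairs.add((source, target))
        self.source_index.add(source)
        self.target_index.add(target)

    def remove(self, source, target):
        self.pairs.discard((source, target))
        # Drop a term from an index only when no remaining pair still uses it.
        if not any(s == source for s, _ in self.pairs):
            self.source_index.remove(source)
        if not any(t == target for _, t in self.pairs):
            self.target_index.remove(target)

    def translations_containing(self, source_pattern):
        """Return sorted pairs whose source side contains `source_pattern`."""
        terms = self.source_index.containing(source_pattern)
        return sorted((s, t) for s, t in self.pairs if s in terms)


lexicon = BilingualLexicon()
lexicon.add("red wine", "vinho tinto")
lexicon.add("white wine", "vinho branco")
lexicon.add("red car", "carro vermelho")
print(lexicon.translations_containing("wine"))
```

A suffix tree or a properly constructed suffix array would answer the same substring queries without the repeated sorting used here; the sketch only illustrates the kind of interface such a lexicon might expose.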
Published In
March 2011
1868 pages
ISBN:9781450301138
DOI:10.1145/1982185
- Conference Chairs: William Chu, W. Eric Wong
- Program Chairs: Mathew J. Palakal, Chih-Cheng Hung
Copyright © 2011 Authors.
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Published: 21 March 2011
Qualifiers
- Poster
Conference
SAC'11
Acceptance Rates
Overall Acceptance Rate 1,650 of 6,669 submissions, 25%